It is because the level of attainable phrase sequences increases, along with the styles that advise outcomes grow to be weaker. By weighting words within a nonlinear, distributed way, this model can "understand" to approximate words and never be misled by any not known values. Its "being familiar with" of a presented word is just not as tightly tet