Monday, 15 April 2019

Investigation into Language: The Evolution of Language

The evolution of language told through mathematical and statistical theory
  • Making infinite use of finite means 
  • Early in the evolution of language, errors in singling and perception would be common
  • Grammar originated as a simplified rule system that evolved by natural selection, to reduce mistakes in communication

  1. Primate social behaviours
  2. Diversity of existing human languages
  3. The development of language in children
  4. The genetic and anatomical correlates of language competence 
  5. The theoretical studies of cultural evolution
  6. The learning of lexicon formation 
  7. The studies of bees, birds and mammals have show that complex communication can evolve without the need for a human grammar or for large vocabularies of symbols

  • The lack of obvious formal similarities between human language and animal communication has led some to propose that human language is not a product of evolution but a side-effect of a large and complex brain evolved for nonlinguistic purposes 
  • The origin of life has been described as a passage from limited to unlimited hereditary replicators, whereas the origin of language as a transition from limited to unlimited semantic representation 
  • Combining sounds into words leads to an essentially unlimited potential for different words. this step in language can be seen as a transition from an analogue to a digital system. The repertoire is not increased by adding more sounds, but by combining s set of easily distinguishable sounds into words. In all existing human languages, only a small subset of the sounds producible by the vocal apparatus are employed to generate a large number of words. These words are then used to construct an unlimited number of sentences. The crucial different between word and sentence formation is that the first consists essentially of memorising all (relevant) words of a language, whereas the second is based on grammatical rules. We do not memorise a list of possible sentences. 
  • By combining sounds into words, the protolanguage achieve an almost limitless potential for generating words with the power of describing a large number of objects of actions. Grammar emerges in the attempt to convey more information by combining these words into phrases or sentences. Simply naming an object will be less valuable than naming it and describing its action 
  • Grammar can be seen as a simplified rule system that reduces the chances of mistakes in implementation and comprehension and is therefore favoured by natural selection in a world where mistakes are possible 
  • Objects may be associated with properties, giving rise to adjectives. Events can have similar associations, giving rise to adverbs. The essential result is that a grammatical language that has words for each component of an event receives a higher payoff in the evolutionary language game than a non-grammatical language that has words (or a string of words) for a whole event. In this context, the grammar of human languages evolved to reflect the "grammar of the real world" that is, the underlying logic of how objects relate to actions and other objects.
  • Signal-object associations form only when information transfer is beneficial to both speaker and listener. Otherwise, the evolution of communication requires cooperation between individuals. 
  • Cooperation may represent an important prerequisite for the evolution of language
  • Animal communication may have came to a stop due to error limit - larger variety of sounds needed, or to organise these sounds into words 
  • The need for grammar arises only if communication about many different events is required: a language must have more relevant sentences than words. It is likely that for most animal communication systems the inequality is not fulfilled (the error limit)





Coding of language - from typewriters to keyboards
Chinese Vs English


  • Tom Mullaney is a historian of modern China exhibiting Chinese typewriters and keyboards, the curation of which has led Mullaney to the conclusion that China is rising ahead technologically while the West falls behind, clinging to its QWERTY keyboard.
  • Now this was and still is an unusual view because Chinese—with its 75,000 individual characters rather than an alphabet—had historically been the language considered incompatible with modern technology.
  • How do you send a telegram or use a typewriter with all those characters? How do you even communicate with the modern world?
  • Mullaney argues the invention of the computer could turn China’s enormous catalogue of characters into an advantage.
  • Our relationship to computers, not just as physical objects but as conduits to intangible software.
  • Typing English on a QWERTY computer keyboard, he says, “is about the most basic rudimentary way you can use a keyboard.” You press the “a” key and “a” appears on your screen. “It doesn't make use of a computer’s processing power and memory and the cheapening thereof.” Type “a” on a QWERTY keyboard hooked up to a Chinese computer, on the other hand, and the computer is off anticipating the next characters. Typing in Chinese requires mediation from a layer of software that is obvious to the user.
  • To type a Chinese character is essentially to punch in a set of instructions—a code to retrieve a specific character.
  • Mullaney calls Chinese typists “code conscious.”
  • Dozens of ways to input Chinese now exist, but the Western world mostly remains stuck typing letter-by-letter on a computer keyboard, without taking full advantage of software-augmented shortcuts. Because, he asks, “How do you convince a person who's been told for a century and a half that their alphabet is the greatest thing since sliced bread?”
  • It’s China’s awkward history with the telegraph and the typewriter that primed Chinese speakers to take full advantage of software when it came along—to the point where it’s now faster to input Chinese than English.
  • When the telegraph came to China in 1871, the Chinese first had to bend their language to Western technology.
  • The solution, devised by a Dutch astronomer and a French customs officer, was to assign a four-digit code to each character, which was then translated into the dots and dashes of Morse. This worked, but it put Chinese at a disadvantage. Numbers in Morse code contain five dots or dashes and letters only one to three, which made Chinese telegrams both more expensive and less efficient.
  • The Chinese typewriter had a tray bed of more than 2,000 common characters. A typist selected characters by manoeuvring a chassis on top of the tray bed, pushing a lever that struck the chosen character against the page. If you wanted to type an uncommon character, you had to go hunting for it among thousands in a secondary tray bed.
  • At the same time, dozens of inventors tried to find better ways to send telegrams or build typewriters. To do so, they had to come up with new ways of indexing Chinese characters, breaking them into subunits. Take, for example, the “four corner method,” which notes the shape in each corner. Ten different shapes are assigned a number 0 through 9; going around the corners in a clockwise direction gives you a four-digit code to send telegrams or to organize characters in a typewriter. If you don’t write Chinese, this might not seem particularly profound. But in fact, it is a complete rethinking of the Chinese character.
  • It would be like, if instead of spelling an English word letter by letter, you represent it by noting the number of letters that are ascenders (d b l h), descenders (p y g j), or neither. The idea of choosing characters by inputting an abstract code was part of Chinese technology from the start.
  • So when the computer comes along, the number of ways to input Chinese just exploded in the “input wars of the 70s and 80s,” says Mullaney.
  • Different input methods require different ways of thinking about Chinese characters. You might do it based on the four corners or three corners or radicals (subunits of Chinese characters) or stroke order. Others experimented with pronunciation-based systems that used the QWERTY keyboard, taking advantage of software to translate letters into characters. And in a real breakthrough, these systems were now predictive. You, might, for example, input a string of characters by typing just the first letter corresponding with each character. In other words, it’s predictive text.
  • The Chinese way of inputting text—the software-mediated way—will win out.
  • Our mobile phones now have predictive text and autocomplete. It took the constraint of mobile to get Westerners to realise the limits of the simple what-you-type-is-what-you-get keyboard.
  • The introduction of T9—the predictive texting system on early cell phones—illuminates that cultural gap.
  • When the Seattle-based Tegic company first developed T9, it created a new letter arrangement on cellphones. The standard had always been 2 = abc, 3 = def, 4 = ghi, and so on. Several different words often match the same set of numbers, so you might hit “4663” and have to key through “good,” “home”, and “hoof”, before finally arriving at “hone.” But Tegic had initially developed a new way of assigned numbers to letter—not QWERTY or alphabet-based—optimized to prevent overlaps.
  • “One of our early stage investors said, ‘You are not going to change the letter arrangement on mobile keyboards,’” recalls Tegic cofounder William Valenti. “We had to accept a less efficient input system because we had to be in the constraints of the existing letter arrangement.”
  • In China, it was easier to convince users to try something new because there was no established default to fight.
  • Chinese T9 is a stroke-based system, and different keys represented different strokes. With predictive text anticipating the next character, a typist on T9 averaged just 1.4 strokes per character, says Valenti. For reference, common Chinese characters are made up of nine strokes on average. This is a huge leap in efficiency
  • “In China,” says Christina Xu, an independent design researcher and ethnographer, “the proliferation of mobile over the last ten years has meant an explosion of new users learning to type for the first time on a mobile device—they don't have any attachment or experience in QWERTY, so getting them to adopt new input methods is way easier.
  • The input method families include stroke-based ones like T9, pronunciation-based typing on QWERTY keyboard, or writing on a touchscreen—with numerous variations therein.
  • Alternative, faster typing methods in English, like ShapeWriter or Swype that let you swipe through the letters of the word in one motion, have struggled to catch on outside of early adopters. Plain old QWERTY is good enough. Who wants to completely relearn how to interact with their phone just to type a little bit faster?
  • “There is an incredibly self-satisfied culture in alphabetic world.” Personally, I’d identify less as an active alphabet worshipper and more as a passive lazy person who does not want to learn a coded, if more efficient, way of typing.
  • That language is also jarring because the West as default has gone unchallenged for so long.
  • The telegraph was developed with the alphabet in mind. So was the typewriter. And the computer. And internet protocols. And yes, Chinese speakers spent a century conforming their language to those technologies until computing power transcended them, resulting in a relationship with technology richer and more complicated than in the alphabetic world.

No comments:

Post a Comment