Work on Star Trek universal translator is beginning in a few more years
From Uzbek to Klingon, the Machine Cracks the CodeBy CHRISTOPHER JOHN FARAH
…Statistical machine translation - in which computers essentially learn new languages on their own instead of being "taught" the languages by bilingual human programmers - has taken off. The new technology allows scientists to develop machine translation systems for a wide number of obscure languages at a pace that experts once thought impossible.
Dr. Knight and others said the progress and accuracy of statistical machine translation had recently surpassed that of the traditional machine translation programs used by Web sites like Yahoo and BabelFish. In the past, such programs were able to compile extensive databanks of foreign languages that allowed them to outperform statistics-based systems.
Traditional machine translation relies on painstaking efforts by bilingual programmers to enter the vast wealth of information on vocabulary and syntax that the computer needs to translate one language into another. But in the early 1990's, a team of researchers at I.B.M. devised another way to do things: feeding a computer an English text and its translation in a different language. The computer then uses statistical analysis to "learn" the second language.
Compare two simple phrases in Arabic: "rajl kabir'' and "rajl tawil.'' If a computer knows that the first phrase means "big man," and the second means "tall man," the machine can compare the two and deduce that rajl means "man," while kabir and tawil mean "big" and "tall," respectively. Phrases like these, called "N-grams" (with N representing the number of terms in a given phrase) are the basic building blocks of statistical machine translation.
posted by Prometheus 6 at 8/2/2003 06:40:15 AM |
Posted by P6 at August 2, 2003 06:40 AM
| Trackback URL: http://www.prometheus6.org/mt/mt-tb.cgi/1327