Shodhan Sheth
22.m@Pune
single n lookin
shodhan[dot]sheth[at]gmail[dot]com
This is the Rosetta Stone approach of translation. Let’s take a simple example: if a book is titled “Thus Spoke Zarathustra” in English, and the German title is “Also sprach Zarathustra”, the system can begin to understand that “thus spoke” can be translated with “also sprach”. (This approach would even work for metaphors – surely, Google researchers will take the longest available phrase which has high statistical matches across different works.) All it needs is someone to feed the system the two books and to teach it the two are translations from language A to language B, and the translator can create what Franz Och called a “language model.” I suspect it’s crucial that the body of text is immensely large, or else the system in its task of translating would stumble upon too many unlearned phrases. Google used the United Nations Documents to train their machine, and all in all fed 200 billion words. This is brute force AI, if you want – it works on statistical learning theory only and has not much real “understanding” of anything but patterns.
2 Comments:
Hmmm You have a very groovy blog.... and a groovier name..Shodhan
By Ad astra per aspera, at 9:46 AM
Groovy huh? now thats what i kall a sexy kompliment
By z, at 12:16 PM
Post a Comment
<< Home