Contemporary translation modeling overview

2.2 Contemporary Translation Modeling Overview

Machine translation is a hard problem in part because there is a trade-off between the methods with which we would like to translate and those that we can readily compute. There are several main strategies for attacking the translation problem, but most of them are still out of reach.

Warren Weaver viewed languages as:

“[…] tall closed towers, all erected over a common foundation. Thus it may be true that the way to translate from Chinese to Arabic […] is not to attempt the direct route […]. Perhaps the way is to descend, from each language, down to the common base of human communication – the real
but as yet undiscovered universal language […].” (Weaver 1949/1955)

Weaver’s notion of decoding linguistic utterances using the fundamental knowledge representation formalism of an interlingua, while ideal, is utterly impractical with the current state of the fields of Linguistics and Natural Language Processing. Other ambitious methods include semantic transfer approaches, where the meaning of the source sentence is derived via semantic parsing and the translation is then generated from it. There are cases where the translation is too literal to be clear, but the main obstacle is the lack of semantically annotated parallel corpora.

Next are syntactic transfer approaches, such as Yamada and Knight (2001), where the source sentence is initially parsed and then transformed into a syntactic tree in the target language. The translated sentence is then generated from this tree. Syntax-based approaches produce correctly ordered translations, but may miss the semantics of the sentence. However, the main obstacle is again the requirement of parsing tools, or more precisely the money to fund their research, a requirement that is currently not yet met for many languages.

Word-level translation models adopt a simple approach. They translate the source sentence word for word into the target language and then reorder the words until they make the most sense. This is the essence of the IBM translation models. It has the disadvantage of failing to capture semantic or syntactic information from the source sentence, thus degrading the translation quality. The great advantage of word-level translation models, however, is that they are readily trained from available data and do not require further linguistic tools that may not be available. Because of this, wordlevel translation models form the basis of statistical machine translation.

(No Ratings Yet)

Похожие топики по английскому:

The 20 greatest historical myths It is said that those who don’t know history are condemned to repeat it and as any history buff can tell you, much of history...
The effects of nonsymmetric matrix permutations and scalings in semiconductor device and circuit simulation Abstract – The solution of large sparse unsymmetric linear systems is a critical and challenging component of semiconductor device and circuit simulations. The time for...
How device drivers work Device drivers consist of software code that allows your PC’s operating system to interact with a hardware device. Every device driver performs a different function...
New source for generating ‘green’ electricity University of Minnesota engineering researchers discover new source for generating ‘green’ electricity Contacts: Rhonda Zurn, College of Science and Engineering, rzurn@umn. edu, (612) 626-7959 Preston...
50 facts about russians 1: Russians distrust anything cheap. 2: The English word “bargain” can not be adequately translated into Russian. 3: Although Russians distrust anything with a cheap...
How much for your parking space, bud Living in a car-crazed (crazy about; very interested in) city like Los Angeles, I find one of the most stressful parts of my week is...
Doctor who chat part 6 Name: Doctor Who Chat Part: 6 Writer: Andrey Lysenkov – vkontakte. ru/id105176267 Warning: there is a couple mistakes – – Chat restoring – – –...
Parable of love A man and woman had been married for more than 60 years. They had shared everything and talked about everything. They had no secrets from...
The results of tqc-company The coating industry suffered severely from the crisis that started in 2009. There was a recovery in 2010, however the industry has not fully recovered...
Любовь изнутри / The Nature Of Love What is love for people living alone? Why have they decided once to exist on their own? Maybe they are arrogant, ugly or too freedom-loving?...
To seduce a man Seducing a man doesn’t necessarily mean stealing him from someone else, or convincing him to do something he shouldn’t be doing. It can simply mean...
How michael portillo became a single mum I always thought of Michael Portillo, the polician, as an arrogant and self-important man, but in this programme, Portillo comes across as being very different....
Usmle – tests 1. A 69-year-old male with a 45 pack-year smoking history presents with hemoptysis, 20 lb. weight loss, and proximal muscle weakness that improves throughout the...
The tree of life subtitles Where are you when I did the foundation of the earth? When the morning started singing and all God songs were filled with joy? It...
Software testing terms and definitions Precision and Accuracy As a software tester, it’s important to know the difference between precision and accuracy. Suppose that you’re testing a calculator. Should you...
How price action trading will cure emotional trading problems Price action forex trading strategies provide much more than just high probability entry signals because they also work to influence the proper trading mindset. Many...
Shopaholic takes manhattan – part 16 I DON’T DECIDE straight away. It takes me about two weeks of pacing around the flat, drinking Endless cups of coffee, talking to my parents,...
Problems reveal genius. robin sharma’s best articles Problems are servants. They help you grow and lead to better things, both within your organization and in your life. To resist them is to...
Suzanne collins – the hunger games i. part 2. “the games”/11 11 Sixty seconds. That’s how long we’re required to stand on our metal circles before the sound of a gong releases us. Step off before...
Twilight part 2-11-14 “Dad?” I asked when he was almost done. “Yeah, Bella?” “Um, I just wanted to let you know that I’m going to Seattle for the...