Summary Ch2 - External Book Huma and Machine-The+History+of+Machine+Translation
Summary Ch2 - External Book Huma and Machine-The+History+of+Machine+Translation
what is translation?
what is text ?
Translation what are the criteria of text?
• Sometime translation might say more than its source text which means our
translation could add more information that does not exist in source text. For
example, the English pronoun "I" could be translated into French "née" for
female or "né" for male that means in this translation we add the information
of gender which is not mentioned in the source text. what is equivalence?
could the target text have the same meaning of the source text?
• Many scholars are reluctant to say that a source and target text have the
same meaning because languages have a lot of differences that sometime
cannot be conveyed to the target text. The right term should be used for this
relationship is " equivalence". Equivalence is the relationship that emerges
from the decision-making of a translator and arises between two text
Converting one language (SL) to another (TL) so that the TL could convey the intended message in
SL
snippets because the translator has deemed them to be of equal value in
their respective co-texts and contexts.
• The main thing is that most professional translators will realize when they
have a gap in their knowledge, or need inspiration, and they will conduct
conscientious research to address that gap, solve the translation problem and
move on.
What is the role human plays in machine translation process?
1. Human translation sets the standard by which machine translation is judged,
and anything that contributes to the maintenance of high quality in human
translation is ultimately of relevance to machine translation. Likewise, human
translation processes can help to put into sharp relief occasional deficits in
machine translation.
2. Most contemporary machine translation relies on translations completed by
humans to learn how to translate in the first place.
Translation memory
When did translation memory start?
In the 1990s translators working in the burgeoning software localization industry
found themselves translating texts that were either extremely repetitive in
themselves or that repeated verbatim whole sections of earlier versions of a
document. This was the case, for example, with software manuals that had to be
updated any time there was a new release of the software. Rather than translate
each sentence from scratch, as if it had never been translated before, they invented a
tool that would store previous translations in a so-called translation memory.
what is SMT?
Statistical Machine Translation (SMT) systems basically build two types of statistical
models based on the training data, the mathematical representation of observed
data:
1- The first model is known as translation model. This type of model
is bilingual one which divides the source text into segments then
compares them and puts them in a table, which is called phrase
table, alongside their translation, using statistical evidence and
distortion probabilities to choose the most appropriate
translation. The word "phrase" in "phrase table" is inaccurate as
the strings don’t necessarily correspond to phrases as commonly
understood in linguistics. However, they are 𝑛-grams which is one
way to help machine understand a word in its context. N-grams
are strings of one, two, three or 𝑛 words that appear adjacent to
each other in the training data, for example, In the sentence,
“appear adjacent” is a bigram, and “appear adjacent to” is a
trigram. The table below contains an example of an excerpt from
such a phrase table.
Why is NMT so much better that SMT, if it is simply learning from data? Is that not
what SMT was already doing?
The answer lies in the kind of representations that NMT systems use and in the kind
of models they learn.
• Advantages of NMT:
1. NMT can build up very rich representations of words as they appear in
a given source text, taking the full source sentence into account,
rather than mere 𝑛-grams.
2. Since NMT deals with full sentences, it is better at dealing with tricky
linguistic features like discontinuous dependencies and it handles all
sorts of agreement phenomena better than SMT.
• Disadvantages of NMT:
1. NMT systems are restricted to only sentence-level not the full text
which means they do not look beyond the current sentence. So, they
face difficulties to find information from a previous sentence in order
to figure out what a pronoun like “it” refers to in the current
sentence.
2. NMT can also output words that don’t actually exist in the target
language. its output can be fluent but inaccurate.
3. Like other technologies trained on large quantities of existing text, it
can also amplify biases encountered in the training data.
4. NMT systems take much longer and much more computing power to
train than their predecessors and use up vast quantities of energy in
the process. They usually require expensive hardware and massive
quantities of training data, which are not available for every language
pair.
Conclusions
Some people might claim that there is no need for learning foreign language or
training human translators, however, they forget the fact that NMT still depends on
human translations or at least translations validated by humans as training data.
NMT, just like other machine translation, needs human intervention as its output
sometime should be evaluated and improved by excellent translators.