|
tranfree issue 11 - 15 February 2000
Translation Technology
By Bert Esselink
This is my second contribution in a series of four articles about
software localisation. The previous article discussed the
differences between translation and localisation, and below I
will cover the types of translation tools that are used by
translators working for localisation service providers.
First of all, a distinction needs to be made between machine
translation (MT) tools and computer aided translation (CAT)
tools. Where machine translation tries to replace a translator to
a certain extent, computer aided translation tools support the
translator by preventing repetitive work, automating terminology
lookup activities, and recycling previously translated texts.
Machine translation has not been applied much in the software
localisation industry mainly because, unlike in the automotive
and aerospace industries, software publishers never really
created their documentation in a structured way that would make
machine translation successful. Even though this seems to be
gradually changing, I will focus on computer aided translation
tools in order to reflect current practices in the localization
industry.
Computer aided translation tools, also called computer assisted
translation tools, can be categorized as follows...
- Translation Memory tools
- Terminology tools
- Software Localization tools
The first two types, translation memory and terminology tools,
are typically combined for translation of documentation, online
help, or html text. Software localization tools are used to
translate and test software user interfaces, i.e. dialog boxes,
menus, and messages.
Translation Memory tools
Basically, a translation memory system is no more than a database
which stores translated sentences.
When a source text is imported into a translation memory tool,
the text is segmented. Usually segmentation is performed at
sentence level, where segments are separated by colons, commas,
question marks, etc. However, it is also possible to segment
texts on a paragraph basis, where segments are separated by
paragraph marks. Each segment is a record in the translation
memory database, and each record can store several fields, such
as source text segment, translated segment, language, name of
translator, date of translation, or type of text. The number of
possible data fields in records differs per translation memory
tool.
When text segmented by a translation memory tool is translated,
all translations are automatically stored in the records
containing the source segments. If identical or similar sentences
occur in the source text, the translations are automatically...
...
retrieved from the database and inserted into the target text. An
identical segment that is automatically translated is called a
full match; a similar sentence that is automatically translated
is called a fuzzy match. Obviously, fuzzy matches need to be
post-edited to make them correspond to the source text. A fuzzy
match is for example a sentence where only one word has changed.
On large projects, translation memory databases can be shared
amongst a team of translators. This means that if translator A
has translated a sentence which also occurs in the text that
translator B is translating, A's translation will automatically
be retrieved from the translation memory database and inserted in
B's target text.
Naturally, translation memory tools are especially useful on
large volume texts, which contain a lot of repetitive text and
where translations can be created on a one-to-one sentence basis.
Using translation memory tools to translate marketing text or
adverts is not a good idea, simply because those types of texts
often require many adjustments, rewrites, and other modifications.
In the software localisation industry, translation memory tools
have always been very popular because of the short life cycle of
software products. Most software products are updated at least
once a year, and re-using translations of previous versions will
decrease time to market of localised versions drastically.
Examples of translation memory tools are Trados Translator's
Workbench (www.trados.com), Atril Déjà Vu (www.atril.com), and
STAR Transit (www.star-ag.ch).
Terminology tools
In localisation, terminology management is usually dealt with in
a very basic manner. Localisers typically don't create or use
large multilingual terminology databases with term definitions,
context, grammatical information, source, etc. Instead, in most
cases only bilingual glossaries of translated terms or phrases
are used, for example all translated terms from the software user
interface, e.g. menu options, dialog box items, etc. For this
reason, not only are professional terminology management tools
used, but also basic glossary tools with search features.
Most translation memory tools come with terminology management
applications which can be linked to the translation memory for
automatic terminology lookup. Automatic terminology lookup means
that terms in the source text which are found in the dictionary
or terminology database are automatically displayed with their
translations.
Examples of terminology tools are Trados Multiterm
(www.trados.com), Atril TermWatch (www.atril.com) or STAR
TermStar (www.star-ag.ch).
Software Localisation tools
Special tools have been developed to translate graphical user
interfaces of software applications, i.e. the dialog boxes,
menus, and messages that are displayed on a computer screen.
These tools allow translators to view their translations in
context. For example translations can be entered directly in a
dialog box and then saved.
Software localisation tools also contain features for
automatically translating updated software with previously
translated versions, and for running basic tests on localized
software, for example checking if no translated text has been
truncated in the screens because of space restrictions.
Examples of software localisation tools are Corel Catalyst
(catalyst.corel.ie) and Passolo (www.passolo.com).
The Next Generation
Even though many translators still need to get...
...
acquainted with traditional translation technology such as translation memory,
the next generation translation tools have already been
introduced. Companies like e-Translate and Uniscape offer
automated internet-based translation workflow solutions that
automate many steps in translation projects. Texts are not only
transferred automatically through each translation and review
phase, but databases containing the source text are linked to
translation technology that detects changed content and then
first pre-translates it using a combination of translation memory
and machine translation before it is forwarded to a human
translator for post-editing.
These so-called translation portals and internet-based
translation management solutions are especially designed to deal
with frequently changing content, such as text published on daily
updated web sites. For more information about these types of
translation technology, visit www.uniscape.com or
www.e-translate.com.
Technology and the web will totally change the way translations
have been done for many centuries. Frequently updated content,
geographically distributed resources, and pressure to keep prices
down will result in further integration of technology and
workflow automation in translation processes. Undoubtedly the
tools can be developed quickly, but the real challenge will be to
keep producing quality translations and to still enjoy
translating.
Bert Esselink has been active in localization since 1990.
After graduating in technical translation and doing coursework in
programming and computational linguistics, he worked for several
years as a software translator, localization engineer, and
technical manager. In 1996 he joined ALPNET in Amsterdam as
localization manager, training new engineers and localization
specialists and coordinating software localization projects and
since early 1999 as globalization manager developing production
standards. As of 1 January 2000 he is employed by INT'L.com
in the Netherlands, managing consultancy services. His first
book, A Practical Guide to Software Localization, was published
in 1998 by John Benjamins Publishing Company (www.benjamins.com). The
second edition is now out and further information is available at
http://www.locguide.com/

Click here to read the next article
Click here to return to tranfree
11 main page
|