Welcome to MultiLing's Newsletter section, use the navigation to the right to view current and past issues.


2006
   February

2005

January  February  March
June August November
 
2004
January   February   April
  July   August 
November

December

2003

April
   May   June

2002

July
   September   October

2001

January
   March   August

2000

January
   February   March
  April   May   June
   July   August   October

 

 

 

 

 

MULTILING CORPORATION NEWS (FEBRUARY 2006)
THE TRANSLATION TIMES


NEWS: Will I get replaced by a computer program?

Localization for Central and Eastern Europe

DTP TIP: Making PDF Files Without Garbled Characters

FORTIS TIP: Inserting Common Extended Characters and Symbols

NEWS: WILL I GET REPLACED BY A COMPUTER PROGRAM?


Anybody who's played around with translation software knows how bad the technology can be. Everyone in my office knows the hoary classic in which "The spirit is willing, but the flesh is weak," translated into Russian and back, comes out "The vodka is good, but the steak is lousy." We all knew, or thought we knew, that computer translation – also known as machine translation (MT) – could never replace a human translator, with his vast cultural and linguistic experience, his ear for nuance, and his superior multilingual education. We all slept very well in the certainty of our indispensability.

Still, machine translation has been in development for almost 60 years, since it was conceived as an offshoot of the cryptographic technology developed during World War II. Grandiose prophecies of its perfectibility have been made ever since. Had MT evolved while I hadn't been paying attention? Had it really improved to the extent that it could be a viable alternative to the human touch?

For more information, please visit http://www.slate.com/id/2133922/

TOP

LOCALIZATION FOR CENTRAL AND EASTERN EUROPE


The countries of the former Soviet Bloc are becoming more and more attractive destinations for U.S. and Western European companies looking to expand their international sales. These markets are growing in overall economic power and importance, and thus represent strategic markets for long-term growth for companies that can build brand loyalty early on.

Although they are part of Europe, these countries present their own unique challenges that separate them from the Western European markets that most international companies consider the “easy” localization target.

Technical issues

Languages spoken in the former Soviet Bloc are predominantly written in two different scripts: Cyrillic and Roman. The division is rooted in the religious history of the region (Cyrillic is used in predominantly Orthodox Christian areas and Roman in Catholic or Protestant areas) and affects the availability of linguistic resources for these regions. Furthermore, the Roman-script languages of the region utilize a number of characters not found in most Western fonts that make use of ISO 8859-1 encoding, such as o, ?, l, ?, and so forth. Because of the differences in character sets, applications localized for this region must support ISO 8859-2 (Eastern European Roman), ISO 8859-4 (Lithuanian) and ISO 8859-5 (Cyrillic), and should also support Unicode. However, even when Unicode support is available, applications must support these additional code pages – as well as fonts encoded using them – since there is a considerable installed base of legacy fonts and data encoded using these various standards.

While there is an increasing number of Unicode fonts and Unicode-capable applications, many fonts and applications in common use by localizers (including Quark 4, which is still widely used) do not support Unicode or all of the languages in the area. At present the most robust support for Eastern Europe in a mainstream desktop publishing package is Adobe’s Creative Suite 2, although the forthcoming release of Quark 7 should level the playing field somewhat. As a rule, when localizing documentation for any Eastern European country, it is best to consult early on with your localization service provider to determine any problems that might arise and to allow the necessary time to find appropriate solutions.

In addition to the need to support a variety of fonts and encodings, the languages spoken in the region tend to be grammatically complex. Many traditional coding practices that would work in English (and to a lesser extent in other languages), such as “concatenating” text strings, will not work and can lead to tremendous problems. This difficulty is particularly evident in a language such as Hungarian, which has more than two hundred distinct forms for each noun. In Russian, the form a number takes depends on the number, gender and case (grammatical function) of the noun it precedes, which means that a number cannot be easily combined with a noun in a computer program to generate grammatical output. As a result of the grammatical complexity of languages in this region, computer code must be carefully internationalized from the ground up to prevent costly and time-consuming problems.

Legal issues

Recent expansion of the European Union to much of Eastern Europe has improved the legal framework for companies doing business in the region in some respects, but it has also added a whole new level of compliance requirements that have come with E.U. mandates. In addition to facing new regulatory requirements, these countries may also interpret some E.U. laws and regulations differently than their Western counterparts. As a result, it is wise to find good in-country legal counsel to help meet local requirements that might otherwise pose a problem for a lawyer trained in the West.

Though many people recognize that localization is a legal requirement to do business in most of Eastern Europe, enforcement of such requirements varies from jurisdiction to jurisdiction. Comprehensive knowledge of the English language is lower in this region than in many other countries. For example, a lack of localization that might be tolerated in Germany or Sweden will not work in Poland or Estonia. Hence, even if a legal requirement for localization would not be enforced, it makes good business sense to fully localize products.

Cultural and business issues

There are few generalizations that can be made about Eastern Europe because of its large geographic size and its wide collection of differing histories and customs. Nevertheless, certain common characteristics can affect businesses. One of the most important is Internet penetration. While Internet access is in fact growing, it is still not as extensive as in the United States or Western Europe and, therefore, customers cannot yet be expected to have ready or inexpensive online access. Additionally, many will not have credit cards either so locally acceptable alternatives must be sought (in much of the region, postal money orders are a common way to send payments). Any Web site sales or localization efforts must account for these factors.

Software and media piracy are also significant issues in the region. A visit to almost any open market in a large Eastern European city will reveal copies of major software packages being sold for as little as a few Euro. Although piracy is illegal, enforcement of anti-piracy laws is spotty and ineffective. Piracy, coupled with the decreased buying power in Eastern European countries, may force companies to think about changing their pricing for these markets: a reasonable price in Boston will be exorbitant in Kraków (Cracow), Poland. If pricing is different in these markets, companies also need to consider how these price differences across borders may affect sales in neighboring markets and how such “gray market” sales may impact overall sales and development strategy.

Despite the technical and business-related issues that can complicate localization for Eastern Europe, these obstacles can be overcome with proper planning and a little bit of care. Working closely with your localization provider(s) and your local partners will help you identify and resolve issues before they become a problem. With careful attention and foresight, businesses can gain a foothold in a region that is set to play a larger and larger role in the European and global economies.

TOP

 

DTP TIP: MAKING PDF FILES WITHOUT GARBLED CHARACTERS


Question:

When using Microsoft Word to make PDF files on a Macintosh computer, some of the characters become "garbled" in the PDF document. For example, the bullet characters now appear as Yen symbols. What can I do to fix this problem?

Description:

Microsoft Word and other Microsoft Office programs often use Microsoft-specific fonts, which incorrectly encode some character information in the PostScript file from which the PDF file is made. The result is that "upper ASCII" characters are displayed improperly in the PDF document. The easy solution to this problem is to use non-Microsoft fonts in your file. Microsoft fonts include Arial, certain variations of Times, Verdana and Wingdings, among others. If you replace these fonts with Macintosh standard fonts or with fonts made by a non-Microsoft vendor, the garbled characters should appear correctly in the PDF file. We recommend using PostScript fonts from a reliable font foundry/vendor, such as Adobe or ITC, since these fonts rarely cause this problem.

TOP

 

FORTIS TIP: INSERTING COMMON EXTENDED CHARACTERS AND SYMBOLS


Question:

I am using Fortis to translate from English to Spanish. What is the easiest and best way to insert the extended characters, such as "ñ" or "á"?

Answer:

The easiest way to insert the extended ANSI characters (in English Windows) is by using the "Alt" key combination sequences. These characters can be inserted by first holding down the "Alt" key on the keyboard, and while continuing to hold down the "Alt" key, pressing the corresponding number keys individually. The following is a list of commonly used extended characters and symbols and some of their corresponding "Alt" key combinations:

¡ 0161
£ 0163
¥ 0165
§ 0167
© 0169
« 0171
® 0174
1/4 0188
1/2 0189
3/4 0190
¿ 0191
À 0192
Á 0193
 0194
à 0195
Ä 0196
Å 0197
Æ 0198
Ç 0199
È 0200
É 0201
Ê 0202
Ë 0203
Ì 0204
Í 0205
Î 0206
Ï 0207
Ð 0208
Ñ 0209
Ò 0210
Ó 0211
Ô 0212
Õ 0213
Ù 0214
Ø 0216
Ù 0217
Ú 0218
Û 0219
Ü 0220
Ý 0221
Þ 0222
ß 0223
à 0224
á 0225
â 0226
ã 0227
ä 0228
å 0229
æ 0230
ç 0231
è 0232
é 0233
ê 0234
ë 0235
ì 0236
í 0237
î 0238
ï 0239
ð 0240
ñ 0241
ò 0242
ó 0243
ô 0244
õ 0245
ö 0246
ø 0248
ù 0249
ú 0250
û 0251
ü 0252
ý 0253
þ 0254
ÿ 0255

TOP

 

 

 

Home |Company |Translation |Localization |Technology |Clients |Press |Contact |Bid Request |Site Map
|Terms of Use|

Copyright © 1994-2007 MultiLing Corporation. All Right Reserved.
MultiLing®, Fortis®, and Semantis® are registered trademarks of MultiLing Corporation.
All other trademarks and service marks are the property of their respective owners.