Localization, Localisation

Practical and concise answers to common questions in G11N, I18N and L10N


Wordfast Pro 3.1: Solid Contender

Posted by Nick Peris on April 16, 2013

The Wordfast Editor (Wordfast 3.1.5)

The translators' network ProZ.com recently published an article about the use of CAT tools in the industry. It was based on a survey run within their community, which received over 3,000 responses.

Apart from the perennial dominance of Trados, which about 75% of respondents use in some shape or form, three facts caught my attention.

First, the translators' preference: while 43% said Trados is the CAT tool they use the most, only 36% cited it as their favourite tool. By comparison, Wordfast, second in this survey, showed the same proportion of primary users and supporters. memoQ seems even more popular, with substantially more people citing it as their favourite than using it as their primary tool.

The second point was about the real deciding factor in the choice of CAT tool: the main driver, listed by over 45% of respondents, was customer requirements, with market research second at about 36%. Pricing was at the bottom of the list.

It seems fair to conclude already that translators often use the CAT tool they have to, rather than the CAT tool they choose to. There are several reasons for this:

  • Translators usually work with handoffs or translation kits which have been prepared for them by their clients. When they don't start from the raw source documents, they have a more limited choice of translation technology.
  • They also quite commonly download packages from Translation Management Systems, and are tied into the CAT tools supported by the workflow.
  • Finally, in some cases they are tied by business requirements to the technology of the LSP they are affiliated with.

The third and last point I took away from the ProZ.com post was that Wordfast and memoQ are the most common CAT tools after Trados. We have talked about Trados many times in these pages, and have covered memoQ on several occasions as well. However, Wordfast, which is also in the Top 3 of our own never-ending poll in the right sidebar, had never yet been covered on Localization, Localisation.

This article will begin to remedy that.

Editions

There are 4 separate versions in the Wordfast offering:

  1. Wordfast Anywhere: a web-based CAT tool with Translation Memory, Glossary and Machine Translation functionality. It is available free of charge to translators.
  2. Wordfast Classic: a well-established, Microsoft Word-based translation tool. For readers more familiar with Trados, this is the equivalent of using Workbench in Word instead of translating in TagEditor.
  3. Wordfast Server: an enterprise Translation Memory server compatible with the other 3 Wordfast applications.
  4. Wordfast Pro: the professional, full-featured CAT tool and flagship of the Wordfast family. One of its main attributes is its extensive platform compatibility: it supports Mac OS and Linux as well as Windows.

Wordfast Pro is the application I will talk about in the rest of this post.

Wordfast Install Wizard - Component Selection

Installation

The latest version of Wordfast Pro (3.1.5 at the time of writing) is available for download from their website. The trial version has no feature limitation other than a 500 Translation Unit cap in the TM.

The installation itself is very fast and requires minimal user input. There is one screen in the wizard which lets you select optional components like the Aligner or PDF Support and choose the Hunspell spell checker languages to install. Wordfast can also use the Microsoft Office spell checker dictionaries if they are installed.

On my Windows system, the whole installation process took about 2 minutes.

Getting started

Once that’s done, you can immediately get started with your first translation by following these steps:

  1. Create a Project (File – Create Project…)
  2. Set the source and target language (only 1 language pair per Project)
  3. Click OK
  4. The Preferences dialog opens
  5. Under General – Translation Memory – TM List
  6. Click Create TM
  7. Enter a location, file name, language pair and click OK
  8. To add a Glossary, go to General – Terminology – Glossary List
  9. Click Create Glossary
  10. Enter a location, file name, language pair and click OK
  11. In the Active column of the TM and Glossary lists, select the TM or TMs to use. The language pairs of the TM and Glossary must match those of the Project.
  12. If you have multiple active TMs and/or Glossaries, set the order of priority in the corresponding Priority table
  13. When ready, click OK to close the Preferences dialog. You can access and edit these and other options (see details later in this article) at any point by clicking Edit – Preferences
  14. Open the document to translate by pressing CTRL + O and browsing to its location.
  15. The document is immediately converted to a bilingual format (.txml) and displayed in a familiar segmented, two-column table

You are now ready to start translating. Type your translations in the target column for each of the segments. If your TM already contains matches, the best way to proceed is to use the Translate until Fuzzy button (CTRL + Alt + F) to move from segment to segment.

Wordfast - Translate Until Fuzzy

With the translation completed, save your Project (CTRL + S) and generate your translated file (CTRL + ALT + S).

To add your translations to the primary TM, select Commit all segments to TM (CTRL + ALT + END) from the Translation Memory menu.

Advanced Options

Wordfast offers a wide choice of features to enhance translators' productivity and improve translation quality and consistency.

Wordfast - Filtered Preferences

Most of these options can be accessed by clicking one of the icons in the toolbar and can be configured from the Preferences dialog (Edit > Preferences). This dialog box and some of its views have a very practical filter text box which lets you hide any feature settings you are not currently interested in.

For example, to see the quality control settings, simply type Transcheck in the type filter text field and press Enter. All other Preferences will be hidden from view and you will be able to access the Transcheck options without having to browse to them (see screencap).

Some of the most useful UI options are the configurable keyboard shortcuts found under General > Keys. The optional automatic software updates are also a neat, non-intrusive way of making new versions available.

But the really powerful stuff can be found in the Translation folder of the Preferences:

  1. Auto-propagation copies your new translation to any duplicates within the project. This can be fine-tuned to apply only to certain segment types.
  2. Auto-suggest, not to be confused with the previous feature, works much like predictive text in mobile phones. Some like to use it, some don’t. Of course it can be switched on or off.
  3. Filters list all supported file types. File filters can be duplicated to contain variations of their settings. This works fine, but the way to add support for new file types is not as easy as in other systems.

    Wordfast - Auto-propagation

    Auto-propagation settings

  4. Machine Translation is one of the highlights. Wordfast can be connected to an existing Google Translate account, a Microsoft Translate account, to WorldLingo, or to all at once. MT can then be used to provide suggestions when no TM match is found.
  5. Terminology supports sequencing, blacklists and even automatic fuzzy term recognition. The supported Glossary formats are Tab-delimited (.txt) and TBX.
  6. Transcheck is Wordfast’s automatic quality control tool. It comes with an array of options shown in the screencap above.
  7. Translation Memories also has a vast number of settings relating to Sequencing Priority, Penalties, TM Update behaviors etc. By default Wordfast does not pre-populate fuzzy matches, but it can be configured to by editing the minimum threshold. Wordfast TMs can be exported to a simple txt format or to TMX.
Wordfast - Auto-suggest
Auto-suggest settings

Overall, the features available and the amount of flexibility in their configuration are on par with the most modern CAT tools around. The only significant limitation in my opinion is the lack of real-time preview. In order to preview your work you will need to generate the translated file (CTRL + ALT + S) and open it in its native application. This may not sound like a big deal, but if you've been using a CAT tool which does have real-time preview you won't want to give it up.

A Different Perspective

Apart from the TXML Editor we've been looking at until now, Wordfast has a different view called the PM perspective.

This can be opened by clicking the PM perspective icon below the File menu, and it gives access to a number of batch activities useful for pre- and post-production.

  • Analyze can be used to calculate the leveraging of file sets against Translation Memories and output reports.
  • Clean-up generates target files, updates TMs, passes on Attributes and reports on the results.
  • Pseudotranslation is a good pre-production tool used to test the content exposed to translation before a project is sent out for translation.
  • Split/merge divides big projects into smaller, more manageable pieces according to the number of Translation Units or words found in TXMLs.
  • Bilingual export lets you export the bilingual file to a Word document (with optional tracked changes) and reimport it, so linguistic review can be performed by Subject Matter Experts in MS Word and automatically incorporated back into the TXML by the language team.
  • Show/Hide 100% lets the pre-production team exclude 100% matches from the handoff.
  • Transcheck creates QA reports based on the same options available in the TXML Editor.
  • Swap Source/Target does just that.

The user interface here is easy to get used to, but maybe a bit outdated. The shortcuts to Preferences in each screen are a good idea, and the Bilingual Export sounds very practical.

Wordfast - PM perspective


SDL WorldServer: Getting Started with Custom Reports

Posted by Nick Peris on October 22, 2012

The Report Center was completely upgraded with the last major release of WorldServer. Overall the new offering was very good, with a more modern interface and a more powerful underlying technology.

Yet upgrading the Report Center and starting to make the most of its full potential required a certain amount of effort. This is mostly because, while it is accessible through WorldServer, the Report Center is in fact a completely separate application. Moreover, it is made up of three distinct elements, for which no integrated documentation exists: the queries, the report layouts and the repository site.

This post reviews the basic functionality of the current version and suggests ideas to improve it in the future.

Managing the Report Center

Adding Reports

Adding Jasper Reports

WorldServer Reports are designed offline (see next section). Once ready, they need to be imported:

  1. Open the Report Center (Tools > Report Center)
  2. Click View > Repository
  3. Right-click on the folder where you want to add a report
  4. Click Add Resource > JasperReport
  5. Enter a name for your report and upload the JasperReport file (.jrxml)

At first glance this works well and certainly is easy. There are however a few ways I think it could be made more efficient:

  • Add Version Control and Roll-back functions for successive uploads of the same JRXML. This is essential since the queries and layout cannot be edited via the Report Center.
  • Automatically read the Report Name and Resource ID from the JRXML file, to save manual steps and prevent typos.
  • Batch JRXML upload: this would be very useful to support upgrade effort, as well as to transfer reports from a Test server to a Production environment.
  • JRXML Download would help with future migrations and simplify back-up processes.
  • Finally some editable JRXML samples should be provided to show users how JasperReports can be used in WorldServer.

Data Sources

Creating a Report

Next, each Report has to be connected to a database. The setup steps may differ slightly depending on whether WorldServer uses Oracle or SQL Server, and drivers may need to be installed.

First, create a Data Source:

  1. Right-click on the folder where the reports are located
  2. Click Add Resource > Data Source
  3. Enter your Database details and test the connection before clicking Submit

You can now connect Reports to the Data Source:

  1. Right-click on a Report
  2. Choose Data Source > Select data source from repository
  3. Browse to the database you just connected

Here again, there is room for improvement in my opinion. The possibility to connect a whole folder within the Report Center to a database would be helpful. Instead we have to connect each report one at a time.

Permissions

User access can be managed at user or role level. You can also set up different access for each report separately, or for a folder within the repository. This is in keeping with one of WorldServer's strengths: permissions are extremely flexible, and relatively easy to fine-tune.

You could for instance have some reports visible only to Project Managers and others to Language Coordinators. You could show linguists reports where the data relates only to their own work, or create Customer- or Business Unit-specific reports and grant access to them only to people in selected groups.

Permissions can be edited by right-clicking on a report and choosing Permissions. Roles and Users are accessed via the Manage menu.

Input Controls

Search Parameters

A dialog box can easily be created to allow users to filter their searches or, more precisely, to set the value of the parameters used when the report runs. In an SQL-based setup, percentage signs can be used as wildcards. A parameter added to the report during layout design is associated with each Input Control set up in the Report Center.

To create an Input Control:

  1. Right-click on Input Controls in the Repository
  2. Click Add Resource > Input Control and follow the steps on-screen

Note: the parameter name must match that from the Jasper Report (case-sensitive).
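
To make the wildcard and parameter mechanics concrete, here is a minimal sketch of a report query. The table and column names are hypothetical; only the $P{} syntax is standard JasperReports:

    -- The $P{ProjectName} parameter receives its value from the Input Control
    -- with the same, case-sensitive name.
    SELECT project_name, language, status
    FROM projects                          -- illustrative table name
    WHERE project_name LIKE $P{ProjectName}
    -- At run time the user can type e.g. %Acme% into the Input Control,
    -- since percentage signs act as wildcards in an SQL-based setup.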

Once an Input Control is created it can be re-used for any number of reports:

  1. Right-click on the Report
  2. Click Edit > Controls & Resources > Add Input Controls and follow the steps on-screen

A Report can have several Input Controls, so the user could for example set values for Project Name, Language and Workgroup themselves before running the Report. The Input Controls dialog also lets you save commonly used search parameters.

Overall this too works very well and is relatively easy to set up. My only criticism is the lack of documentation: there is no Online Help or Report creation guide apart from the Samples in the View menu.

Designing the Reports Layout

JasperSoft

The layout of WorldServer Reports cannot be designed or changed from WorldServer. The best way to do that is to embed your SQL query within a Jasper Report (.jrxml) using the JasperSoft iReport Designer. There is a free version available for download, which provides everything needed to design a WorldServer Report. Once again though, there doesn’t seem to be any WorldServer-related documentation available.

Jaspersoft iReport Designer

Here are a few pointers to get you started:

  1. Connect iReport to your WorldServer database
  2. Create a new report (File – New)
  3. Copy your query into the Query viewer (via the Edit query button); the traces this leaves in the .jrxml are sketched after this list
  4. Click the Read Fields button
  5. Go back to the Designer and drag one or more fields from the Report Inspector into the Detail area. This will automatically create headers which you can then rename, align etc.
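
Steps 3 and 4 map directly onto elements of the .jrxml itself: the query is embedded in a queryString element, and Read Fields generates one field element per column. A minimal sketch, with illustrative names throughout:

    <!-- Skeleton of the relevant .jrxml elements; all names are illustrative. -->
    <jasperReport name="TaskOwnersByLanguage">
      <parameter name="ProjectName" class="java.lang.String"/>
      <queryString>
        <![CDATA[SELECT project_name, language
                 FROM projects
                 WHERE project_name LIKE $P{ProjectName}]]>
      </queryString>
      <field name="project_name" class="java.lang.String"/>
      <field name="language" class="java.lang.String"/>
    </jasperReport>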

Online vs. Offline

It can be time-consuming to pretty up reports in iReport. The first way to gain efficiency is to make a choice between how the reports look online and how clean the exports are. If you expect your users to consult the reports online, you may want to spend time making them look good and load fast, for example by breaking the output into pages. By contrast, if you expect the reports to be downloaded and their data further manipulated in Excel, you should instead make sure that the output doesn't have empty lines or columns.

Re-using layouts

If you are creating several variations of a report, or migrating a number of reports between successive versions of the Report Center, it is worth trying to re-use some of this tedious layout work:

  1. Open an existing JRXML in iReport
  2. Save it under an alternate name
  3. Overwrite or edit the query
  4. Click the Read Fields button to update the list of fields available in the Designer
  5. Edit the Fields, Descriptions, and Parameters which need to be changed in the layout.

Editing Queries

SQL Server Management Studio


The query viewer in iReport is useful up to a point, but it doesn't provide much feedback regarding syntax errors or other issues in queries. Another big limitation is that it doesn't give any visibility into which Tables and Views are available in the database.

If your WorldServer uses an SQL Server database, you should consider using SQL Server Management Studio when writing the queries. You can create and test your queries there before copying them to iReport, and browse through the database to get familiar with how the data is structured.

One thing to remember is that active and completed projects live in two separate locations. Just as WorldServer has a view for Active projects and another for Completed and Cancelled projects under Assignments, the database keeps the latter in dbo.archive tables and the former in dbo.active views. Performance is much better when querying active projects, and the way the data is structured can also differ between the two locations.
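
As a hedged illustration of the split (the exact object names vary by installation, so treat these as placeholders):

    -- Running projects: query the dbo.active views (much faster).
    SELECT project_id, project_name
    FROM dbo.active_projects;              -- illustrative view name

    -- Completed and cancelled projects: query the dbo.archive tables instead.
    SELECT project_id, project_name
    FROM dbo.archive_projects;             -- illustrative table name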

You must have a very clear understanding of the following three WorldServer concepts:

  1. Project Group: all files, all languages, 1 file submission
  2. Project: all files, 1 language, 1 file submission
  3. Task: 1 file, 1 language

Their ID numbers are essential within queries because they link the information associated with each of them. For example, the language name is Project data, but the current owner is Task information. The two will need to be joined in order to create a report on current Task owners which lists the languages.

Lastly, any search parameters are better created directly in iReport once the query is finalised. Just replace them with arbitrary values for testing purposes while working in Management Studio.
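
Putting the last few points together, a report on current Task owners listing the languages might be prototyped along these lines (all object and column names are placeholders, not the actual WorldServer schema):

    -- Join Project-level data (language) to Task-level data (current owner)
    -- through their ID numbers.
    SELECT p.language_name,
           t.task_id,
           t.current_owner
    FROM dbo.active_projects p
    JOIN dbo.active_tasks t
      ON t.project_id = p.project_id
    WHERE p.project_name = 'Sample Project';  -- hard-coded while testing in
    -- Management Studio; replace with a $P{ProjectName} parameter in iReport.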

Learning resources

There are plenty of tutorials available for beginners and less-experienced database users, and a lot of them are free. I found SQLCourse.com very clear and concise; its interactive SQL interpreter is great for practicing and experimenting safely. W3Schools is another very good resource.


MT, TMs & TMSs: Interview with Wayne Bourland, Global Localization Director, Dell

Posted by Nick Peris on March 27, 2012

Transcreation is used in high visibility content on Dell.com. In this example, the French banner has a more seductive tone, and replaces "Shop Now" with "Discover More"

Last summer Wayne Bourland, Director of Global Localization at Dell, spoke about Machine Translation at the LocWorld Conference in Barcelona. He raised some very interesting points, which were later echoed in an article in MultiLingual (July-August 2011). The central idea was that MT was failing to gain traction for three reasons: clients not knowing how to buy it, providers not knowing how to sell it, and translators being reluctant to support it.

Wayne is an active member of the Global Localisation community. He has been involved in developing best practices in the industry, sharing experiences with other Localisation consumers and developing sophisticated partnerships with providers.

He has now agreed to revisit these ideas with us and discuss the outlook for MT. We'll also take this chance to talk about other aspects of Localisation, such as Translation Management Systems and Translation Technology in general.

[Nick] Hi Wayne, thanks very much for agreeing to give some of your time to talk to Localization, Localisation. Could you start by giving us an overview of your career?

[Wayne] I came into this industry in an unconventional way. After a decade in the US Army I joined Dell, starting as a phone tech support agent. After moving into management I helped to establish call centers in India and South America before making a jump over to managing writers for support content. We had a small translation operation supporting tech support content in 7 languages. After being asked to take over the translation team we grew the team rapidly, moving into online content, then marketing, to where we are today supporting more than 90 different groups across Dell.

Machine Translation

Now let’s start with MT. Does MT still get more talk than action or have you observed an evolution in the last year? Has your team been driving changes in this area?

I think we are certainly seeing a groundswell. Jaap van der Meer of TAUS used to talk about 1,000 MT engines; now he talks about tens or hundreds of thousands of them, trained very quickly and supporting a multitude of languages and domains. Every major client-side translation leader I talk to is using MT in some way. Some are still toying with it, but many are investing heavily. Vendors have caught on to the growing demand and are either building their own capabilities or forging partnerships with MT providers. We are seeing groups from academia starting to see the value in commercializing their efforts. Soon we may have the problem of too much choice, but that is on the whole a positive change for buyers. As far as the role my team is playing, we are doing what we have done for years: representing the enterprise client voice and discussing our perspective wherever we can (like here).

Dell Store Germany

"If you go to Dell.com to purchase a laptop in France, or Germany for instance, the content you see is Post-Edited Machine Translation"

I know Dell has been using MT for a long time for Support content. Are you now able to use it for higher visibility content? Is MT effective for translating technical marketing material such as product specs and product webpages? Are more Localisation consumers ready to trust it?

Since May of last year we have been using MT with Post Edit in the purchase path of Dell.com. Meaning if you go to Dell.com to purchase a laptop in France, or Germany for instance, the content you see is PEMT. As of February of this year we are supporting 19 languages with PEMT. Yes, MT can be used for something other than support content. That’s not to say we have cracked the code, it still requires extensive Post Edit, we haven’t seen the level of productivity gains we had hoped yet, but we are making progress. Being on the cutting edge means dealing with setbacks.

I don't think it's a question of consumer trust. I think if you're doing a good job of understanding the consumer need for your domain, and you measure your MT program against quality KPIs that mirror those expectations (vs. relying simply on BLEU scores and the like to tell you how you are doing), then consumer trust won't be an issue.

Which post-editing strategy produces the optimum results? Presumably it depends on the content type, but do you favour Post-Editing alone, Post-Editing plus Review sampling, or Post-Editing plus Full Review? What are the Quality Assurance challenges specific to using MT?

I favour all of the above, each has their place. Following on to my previous answer, it’s about understanding the desired outcome. MT will be ubiquitous some day and people need to get used to it. You don’t start with picking the right process, you start with picking the right outcome, the appropriate balance of cost, time and quality, and you work backwards to the right process. If you’re supporting a large operation like I am, or just about any large enterprise client translation team, you’re going to need a number of different processes tuned to the desired outcomes for each content type. You build a suite of services and then pull in the right ones for each workflow. What we are doing on Dell.com today is PEMT with quality sampling. We made a decision that full QA (which we are moving away from for all translation streams) didn’t make sense when you factored in the cost and velocity goals we had. Of course, we have a great set of vendors and translators that make the PE work. Our quality standard has not changed.

Are LSPs learning how to sell it? Is it finding its way into the standard offering of most providers, or does it remain a specialists' area, only available for example in very big volume programs?

Wayne Bourland, Dell

I think some of them are. There are many LSPs out there who are still shying away from it, but the majority of your larger suppliers are getting the hang of it. They see the trends; they know that services, not words, are what will drive margin dollars, and MT is a big part of that service play. I wouldn't say it's standard yet though; it's still handled as a completely separate conversation from traditional translation in many cases, but that is changing too. The more savvy LSPs are talking to clients about desired outcomes and how they can support them across the enterprise. The key is, at least for selling into large enterprises, you can't be a speciality shop. Companies are increasingly moving to preferred supplier lists, narrowing down the number of companies they buy services from. So going out and picking 2 or 3 translation companies, and 2 or 3 MT providers, and a couple of transcreation specialists is happening less and less. Clients are looking for a handful of vendors who can bring all of these services, either organically or through partnerships.

You also expressed the opinion that the work of Translators would tend to polarise with high-end Transcreation type of work on one hand, and high-volume post-editing on the other. Are you observing further signs in that direction? How does the prospect of localising ever-expanding user-generated content such as blogs and social media fit into this view?

I think this still holds true; we can argue about when it happens, but at some point MT will be a part of nearly every translation workflow. Traditional translation work may not decrease, but the growth will be in MT and MT-related services. I think user-generated content is the domain of raw MT, or even real-time raw MT. Investing dollars and resources to translate content that you didn't invest in creating in the first place really doesn't make sense. Either the community will translate it, or interested parties can integrate MT to support their regional customers, but I can't see a business case for any other form of translation for this domain of content.

Support sites were one of the earlier content types to adopt Machine Translation

Is the distinction between MT and TM losing relevance? In mature setups, all content is highly leveraged. Often TM sequencing is used to maximise re-use even across content types, while taking into account the different levels of relevance. Post-editing and Review have to ensure any leveraged translation is the right translation, in context and at the time, regardless of its origin. In other words, once a match is fuzzy, does it matter whether it comes from human or machine translation?

It shouldn't matter, and I think eventually it won't, but it still does today, to my frustration. Translators still dislike MT, even in case studies where it has been shown that the MT output was performing better than TM fuzzy matching. And of course MT still has its challenges. We just aren't there yet. I see them co-existing for some time to come, but eventually they will be one and the same for all practical purposes.

Translation Memory Technologies

What are the main advances you have observed in TM Technology over the last few years? Which are the most significant from the point of view of a Localisation consumer? Translator productivity tools such as predictive text? In-context live preview? The deployment of more TMS’s? The variety of file formats supported? Or perhaps the ability to integrate with CMS and Business Intelligence tools?

I won't claim to be an expert on translation technology, but I really like in-context live preview, and more TMSs are starting to support it. Nothing beats seeing content as it's going to be seen by the consumer for ensuring the most contextually accurate translation. I think all of the mentioned technologies have a place, but I am most interested in tools that assist the translator. We have this crazy paradox in our industry where we have spent years trying to make human translators more machine-like (increased productivity) and machines more human-like (human-quality MT). I think to a large degree we have neglected to innovate for the translator community. Too much time was spent trying to squeeze rates down and word counts up without really investing in the translator and their tools to facilitate this.


Conversely, are there pain points you have been aware of for some time and are surprised are still a problem in 2012?

There are a number of them. TM cleaning is way more difficult than it should be, and good tools to help are sparse. The differences in word counts between different tool sets are also challenging: a quote generated by one vendor can vary widely from another vendor's for the same job, even with similar rates, due to large deltas in word count.

The ability to leverage from many Translation Memories and prioritise between them is in my opinion a must-have. Do you see any negative side to TM sequencing? Is the cost of managing linguistic assets a concern to customers?

I think one potential negative to TM sequencing is that it allows people to get lazy with TM management. Simply adding more TMs to the sequence doesn't ensure success. The cost of managing linguistic assets is a concern, although I think we don't always realize how big of a concern it should be. As mentioned above, TM cleaning is costly and time-consuming, but necessary. Clients and SLVs alike should put TM maintenance on the annual calendar and ensure at least some time is devoted to reviewing the strategy. There is a lot of lost cost and quality opportunity in good TM management. It's something I don't think we do nearly well enough.

How about TM Penalties? Do you see a use for them as a part of Quality Management strategy, or are they a cost factor with little practical use to the customer?

I think they have a purpose: if you know one set of TMs is more appropriate for your domain, you want to ensure it is used first for segment matching. However, penalties should be used cautiously. We penalized a TM when we shouldn't have, and it cost us a large amount of money before we figured it out. Hence the need to review your TM strategy periodically, and also to watch leverage trending!

I see source Content Management, or Quality control during the creation of the source content, as a key to quality and cost control in Translation. Can you tell us about what you have observed? How is source quality controlled at Dell? Do you have any insight into the process of choosing and setting up Source Content tools with Localization in mind?

I agree there is huge potential in controlling the upstream content creation process. It’s also, for many of us, very difficult to do. You’re starting to see a lot of clients and LSPs do more here. It’s another one of those services that SLVs can build into their suite to derive revenue from non-word sources. It’s also an area where translation teams can show innovation and have a larger impact on company objectives. We are in the process of implementing Acrolinx with several of our content contributors. I think the key is getting buy-in from the authoring teams and their management. You have to be able to demonstrate how this helps them and the bottom line.

Are Terminology and Search Optimization Keywords managed in an integrated manner, from the source content creation to the Localised content deployment?


You’re kidding right? I know you’re not, and it’s a really important topic, but no, we don’t do it in an integrated manner today and I think many of us are struggling to figure this one out. We are piloting an Acrolinx add-on that targets localized SEO, but I think a lot of education is needed for companies to understand the potential here.

Translation Management Systems

Your team uses a Translation Management System to collaborate with vendors and their linguists. What is your overall level of satisfaction with the technology?

I haven't talked to a single large enterprise client who is "satisfied" with their TMS. That's not to say that everyone is unhappy, but many of us have had to invest a great deal of time and money into fitting the TMS into our ecosystems. The lack of observed standards exacerbates the problem. I don't know what the solution is here; more competition would help, but it isn't a silver bullet. Perhaps more interest from major CMS players would help drive innovation here. The CMS industry is much larger than the TMS industry, and integrations are becoming more and more commonplace. We will have to wait and see. I do know that user communities have formed around many of the larger TMS offerings, and I think the shared voice of many customers will help to push for the right changes. If you're not participating in one of these now, I would encourage you to do so!

When purchasing enterprise solutions it can be difficult to accurately estimate the financial benefits. Providers will often help potential buyers put together an ROI. With the benefit of hindsight, would you be able to share with us how long it took for your Translation Management System to pay for itself in cost savings? How did that compare to the ROI estimated at the time of the original investment?

I wasn’t a party to the purchase and implementation of our current solution. I am aware of the cost, but not the promised ROI. However, I can say that it probably paid for itself in year 2, due more to the volume ramp than anything else. I would certainly say utilizing our TMS solution more than pays for the on-going maintenance. I do know that moving between TMS’s, when you consider the level of integration we have, would be daunting and the ROI would have to be very large and very attainable.

Online Sales sites re-use large amounts of existing translations thanks to TMs

Which would be your top 5 criteria if you were to purchase a new Workflow System today?

1- It would have to support multiple vendors
2- Have a robust API for integrating with varied CMSs
3- Support all modern document formats (CS 5.x, Office 2010, etc.)
4- Cloud based and scalable
5- Easy client-side administration
There are probably 100 more factors….

I've come across a number of relatively new TMSs recently. They often have some nice new features, and friendlier, more modern user interfaces. But I find they tend to lack features users of the more established systems would take for granted: TM sequencing, the ability to work offline or even to download word count reports are not necessarily part of the package. Have you had opportunities to compare different systems? If so, what was your impression?

We are so tightly integrated with a number of CMSs that we have not been in the position to look at other options. I think that is the key challenge for companies selling TMSs, how do you break the lock-in.

The upgrade process for TMS systems is sometimes difficult because of the vast number of users involved or the automation and development effort which may have been done to connect to Content Management Systems, Financial Systems, Portals etc. Is that also your experience? Can you tell us about your process for minor and major upgrades?

We feel this pain often. We have rarely had an upgrade that didn’t spawn additional issues or downtime. We have worked with IT and the tool supplier to setup regression testing, testing environments, upgrade procedures, failure protocols, etc. but it still seems we can’t pull off a seamless launch, primarily due to a failure of some sort on the supplier side. It’s frustrating, and many of my peers are having the same experience.

In the domain of Quality Control, the availability of QA Models in TMSs seemed like a major development one or two years ago. Yet I find they are not actively rolled out, and offline spreadsheet-based Quality Reports have proven resilient. Is that also your experience? And do you think the trend towards more flexible and content-specific quality measurement systems like that of TAUS, particularly in the area of gaming, makes online LISA-type QA models more or less adequate?


We championed the inclusion of a QA system in our current TMS and don't use it. We found that it just wasn't robust enough to handle all of the different scenarios. We still use spreadsheets; that has worked for years and probably will for many more. We are participating with TAUS on their newly proposed quality model and I am anxious to see where it goes. I think the use of the content and the audience play a big role and are ignored in quality models today that just look at linguistics. Customers don't care about linguistics; they care about readability and whether the content talks to them or not.

Do you know the proportions of Translators and Reviewers, respectively, working online and offline in your production chain? Is this proportion changing or stable? What do you think would be the deciding factor to finally get all linguists to work online?

I think it is about 50/50 right now, but that’s really more a difference in how our different vendors work than tools or process. I don’t see it changing in the near term, but I would like to see more move online, I think there is opportunity for quicker leverage realization and other enhancements that make a completely online environment look attractive.

Conclusion

As you probably know, Ireland has had a pretty rough ride in recent years, but the Localisation industry is doing comparatively well. What are the main factors explaining Ireland's prominent place in the Localisation industry? Many companies have their decision centres and supplier partnerships set up from Dublin when it comes to Localisation. Do you think this will continue in the future?

Now we are really going outside my area of expertise. I think Ireland’s location (in Europe), its influx of immigrants with language skills, the strong presence of language and MT in Academia, and of course, the resilience and work ethic of the Irish all serve to make Ireland a great place for the language services industry. I don’t see that changing anytime soon. Hopefully not, I do love my bi-annual trips to Dublin! Coincidentally, I am typing these answers on the plane to Dublin. I can taste the Guinness now. 🙂

Wayne will participate in two discussions at this year's LocWorld Conference in Paris, June 4-6: one about Dell's Self Certification program for Translators and one about Multilingual Terminology and Search Optimisation. Self Certification is a concept implemented by Dell where, instead of having Translation and then QA, Translators perform a few additional QA steps to certify their own work. This removes any bottleneck due to a full-on QA step. Independent weekly sampling and scoring are used to monitor the system, but are not part of the actual production chain.


Happy Anniversary: 3 Years, 100,000 Visits!

Posted by Nick Peris on March 25, 2012

100,000 visits in 3 years! Thank you all so very much. Those clicks are what fuels our posts.

Thank you most particularly to those of you who come back, tweet, retweet, linkin, repost, comment and generally contribute to sharing information and opinions about our industry.

Localization, Localisation remains a non-commercial haven, a place to think about our profession without the pressure of daily sales targets, project deadlines and cost reduction plans. I think this is valuable to you; it certainly is to me.

So, let’s make a deal: I’ll keep the ads and sponsored links at bay, and you keep coming back 🙂

If you have any interest, WordPress compiled a funny report about you: https://localizationlocalisation.wordpress.com/2011/annual-report/ It’s a bit corny and self-indulgent, but some of the details about readers by country are interesting.


memoQ roadshow – Dublin 2012

Posted by Nick Peris on January 30, 2012

Kilgray Translation Technologies started 2012 with their first visit to Ireland. The Localisation community here gathered in respectable numbers to greet the makers of memoQ at the Hilton Hotel, Dublin 2, Ireland on Jan. 25.

Peter Reynolds, Executive Director, spoke about the history of and vision for Kilgray as a company. István Lengyel, COO, presented memoQ and led an inspiring audience-driven workshop in the afternoon.

There were also two case studies by Martin Beuster from con[text] and Jonathan Young of PopCap Games.

Who are Kilgray?

Kilgray Translation Technologies was founded and is still owned by three passionate Localisation professionals. From 2004 to 2007, they concentrated mainly on developing the technology. Thanks to financial support from local grants, they were able to treat memoQ almost as a pet project while readying themselves for battle on the international markets. To some extent this initial dedication to the product remains what attracts a lot of their customers, and it justifies the enviable reputation of their Support and Development teams.

Over the next two to three years, they developed a more purposeful business strategy. They increased brand awareness, first in Germany, then throughout Europe and beyond. Since 2010, they have established a strong customer base, whose feedback is one of the main influences on their technological and strategic directions.

Kilgray's portfolio

memoQ 5.0 User Interface

memoQ

memoQ 5.0 added version tracking (for source update management), change tracking, Term extraction (built on MT technology, customisable), Cascading filters, and some Source content connectors (file management tools, CMS etc.)

More recently added:

  • terminal server support
  • regex based text filter
  • Asia Online MT integration (chosen for being less industry-specific than Google's MT)
  • pseudo-translation

qTerm

qTerm is a dedicated Terminology Management System which can be seamlessly integrated with memoQ or used in conjunction with other CAT tools. TBX-compatible, it offers a Portal and quality control.

memoQ WebTrans

WebTrans is a browser-based version of memoQ which allows Translators to use it without having to install anything. First released with memoQ 5.0, it offers full functionality, including the exact same User Interface, keyboard shortcuts, Concordance tool etc. as the desktop version of memoQ.

TM repository

TM repository is a CAT tool-independent, SQL-based application which offers a solution to many of the common problems linked with hosting and managing ever-growing amounts of Linguistic Assets.

Where to now?

memoQ is currently undergoing a springtime clean-up. Kilgray call it refactoring, which essentially means they are going through all the various pieces of code added over the years, looking for ways to streamline them and increase efficiency without actually changing any of the existing functionality. This is apparently necessary to allow memoQ to meet the demands of customers with much bigger projects, although memoQ has been proven to work fine with multimillion-word projects.

This reaffirms Kilgray’s dedication to the quality of their flagship product, and their ambition to ensure it fulfills its potential in terms of stability, performance and integration.

The memoQ server demo

Content-connected projects

Content connection is used to monitor a location (FTP, Subversion, CMS etc.). Armed with the required Content Connector License, the Project Manager or Engineer can program memoQ to automate certain Project creation behavior when new source files are detected.

Key Project Settings:

  1. Push connections are supported, but if the Service Provider or CMS doesn't support them, a Pull connection can be used
  2. LiveDocs lets you work from unaligned single-language documents in addition to regular TMs. It also enables Live alignment (more details below).
  3. All versions of the source are automatically saved to allow Roll-back and Diff analysis. Any new version is automatically pushed to a running Project
  4. Target files are automatically generated on completion of the translation.

    Screenshot 1: memoQ Create View - Status tab

LiveDocs

LiveDocs are a type of Reference, which can be used to create a corpus of material from aligned or unaligned documents.

The advantage of LiveDocs over TMs is that they make leveraging easier to control and generate no maintenance cost. There is no overhead for global TM changes when, for example, a Term is changed. All work is done on the content that will actually be used.

Conversely, the advantage of TMs is that they are lighter on resources, because little or no formatting is stored.

The File Filter supports Java Properties, RESX, Office 2007 and earlier, OpenOffice, SDLXLIFF (except Status information), InDesign INX, Star Transit, WorldServer XLZ packages etc.

File preview is available for doc, Excel, PPT, html and xml (with or without XSLT).

Here is how to create a Project using LiveDocs:

  1. Create Project
  2. Tick Record Version History for Translation Documents (may slow down on bigger projects)
  3. Add files
  4. Previews get created
  5. Tick Use context
  6. Set up the Termbase (if on a server it can be moderated: all users suggest, Terminologists approve)
  7. Create new LiveDocs corpus (i.e. file location)
  8. Add Alignment pair

The Audience-driven Project creation demo

The last time I attended a conference on Translation Technology I was asked if I had any suggestions to improve the way they are run. I suggested fewer marketing slides and more interactive demos. The workshop run by István Lengyel that afternoon delivered just that!

  1. Create a new project
  2. Click Add Document as (to edit Import settings)
  3. Select a Filter for the relevant File Type (the Excel filter can exclude text based on cell ranges, but not colour)
  4. Context matches (101%) are based on segment before and after, as well as context ID for structured files
  5. Filter configurations can be saved for re-use, including a set of 2 cascading filters
  6. Run QA is found in the Operations menu
  7. Run Regex Tagger is in the Format menu
  8. Use Create View to:
    1. Glue/Split file
    2. Extract Repetitions (Advanced – Minimum frequency = 1, untick Keep Duplicates to create a Repetitions file for Translation)
    3. Filter certain rows depending on their content
    4. Once created, Views can be used exactly like actual documents (see screenshot 1)

The icons in the memoQ UI help identify segments which have been populated using Search and Replace (red dot in a Search icon) or Auto-Propagated (green down arrow). These criteria can also be used to filter the segments displayed.

Projects can be linked to a forum for participants to interact using IM through the memoQ interface. Email notifications are also available, and the Online Project Management module has a complete audit trail, to the point of tracking who reassigned users.

Bilingual files

Screenshot 2: memoQ RTF for Review

Bilingual files can be exported from memoQ in the following formats:

  • .mbd binary (for memoQ only)
  • XLIFF offers wider compatibility; although the files are bigger, a compressed version is available
  • Trados doc files
  • Multicolumn RTFs can be used by reviewers to work offline. Comments added there can be fed back into the project on re-import.

Workflows

There are five workflow types in memoQ:

  1. Package based (offline)
  2. Bilingual document
  3. Online Project 1 (requires memoQ server) with server document
  4. Online Project 2 (requires memoQ server) with desktop document
  5. Online Project 3 (requires memoQ server) with web-based interface (translators can either work online or offline)

Related Links:

  • Check out Kilgray on VIMEO:

http://vimeo.com/search/videos/search:kilgray/st/d7dca8eb

  • Kilgray Articles on Localization, Localisation:

memoQ 5.0: Mr. Q Brings Change Management to the Localisation Continuum

Kilgray TM repository: a New Home for Translation Memories


The Value of Professional Linguistic Review

Posted by Nick Peris on December 19, 2011

Basic Translation and Review Workflow

All Translators I know are consummate professionals who take great pride in the quality of their work. They are well used to using various sources of reference material to ensure they meet the expectations of their customers, and they systematically proofread their work before delivery. Most of them use CAT tools, which allow them to maximise consistency and partly automate quality control.

Translation agencies and Language Service Providers all offer what is known as TEP (Translation, Editing and Proofreading) as their most basic level of service. TEP provides a systematic Quality Assurance process, often involving several linguists with various levels of seniority.

And yet independent linguistic review services are one of the most dynamic sectors in our industry. This article explains why it is so successful and what you should take into consideration if you are ready to take this particular plunge.

Scalability

I am not always a strong supporter of outsourcing, but in the case of linguistic review there are compelling arguments in its favour.

Let's first ask ourselves: who typically are the in-house reviewers? Two of the most common categories are linguists on one hand, and in-country Marketing and Brand staff on the other. It can be difficult for a company which purchases translation services to keep dedicated linguists in full-time employment. Product releases are often seasonal, or at least vary in pace from one month to the next, and the associated translation requirements follow the development cycles. Conversely, it may be difficult for in-country staff who are not linguists to commit to Localisation schedules. Review is a secondary task for them, and they cannot drop everything else when review activity peaks. Moreover, they are unlikely to have the tools and skills a professional linguist employs.

A third-party linguistic review partner can provide the best of both worlds:

Translation, Linguistic and SME Review Workflow

  • in-country linguists who will become familiar with your international and local brand identity,
  • dedicated resources who can develop expertise based on your existing content
  • flexible workload to meet your peaks in translation activity
  • staff working on multiple accounts so they are easily redeployed when you do not need them full-time.

Sectors like the Life Sciences or the heavy vehicle industries even require SMEs (Subject Matter Experts) as an alternative or additional Review step, to ensure translations are not only of the highest quality in linguistic terms, but also technically and legally accurate.

Error categorisation

Professional review services use customisable error categorisation. Often based on the LISA model, they are used to classify errors and better decide corrective and preventative actions.

Here are a few examples of categories and possible actions:

  • Terminology
    • Ensure Glossaries are used
    • Review the Terminology maintenance process (new Terms should be proposed continuously, approved periodically)
    • Root out the use of local copies by providing a Portal
    • Use a tool to automate Terminology checks
  • Style
    • Ensure Style guides are used
    • Review Style guides periodically (once or twice a year)
    • Root out the use of local copies by providing a Portal
    • Put in place a system to advertise Style guide updates
  • Consistency
    • Provide access to Global TMs for Concordance search
    • Provide a searchable linguistic query management tool (please see section on Query Management below)
    • Encourage communications between linguists during the translation process
  • Accuracy
    • Agree on linguistic references
    • Improve the translators' proofreading process
    • Use tools to automate grammar or spell checks

Error Ratings

Measuring quality requires clearly defined and pre-agreed criteria, independence of the rater, and historic data analysis so judgments can be made according to trends and not just levels.

As with categorisation, error rating is often based on industry-standard classifications like the LISA QA Model. The reviewer inputs the rating for each error found. This is mostly reported using QA report spreadsheets, but can also be fully integrated in Workflow technology such as WorldServer or SDL TMS. Each rating is associated with a number of points, which are often deducted from a starting score of 100%.

A score can then be calculated for a project, job or sample. A Pass/Fail rate can even be decided in advance, with the Fails prompting for different levels of corrective actions, especially if they are repeated.
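
As a minimal sketch of the arithmetic, assuming a hypothetical review_errors table where each logged error carries the penalty points from the agreed QA model:

    -- Deduct weighted error points from a starting score of 100%, then apply
    -- a pre-agreed Pass/Fail threshold (95 here is purely illustrative).
    SELECT job_id,
           100 - SUM(penalty_points) AS score,
           CASE WHEN 100 - SUM(penalty_points) >= 95
                THEN 'Pass' ELSE 'Fail'
           END AS result
    FROM review_errors
    GROUP BY job_id;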

Reviewer Implementation Workflow

Corrective actions

Implementation may be the responsibility of the Translator or the Reviewer. Letting the Translators implement the changes ensures they are aware of every change recommended by the Reviewer. On the other hand, allowing the Reviewer to implement their own changes speeds up the overall process, because the translation does not have to "change hands" again before it is delivered.

Whatever the choice is, a solid arbitration process must be in place. Translators must have an opportunity to discuss the Reviewer's recommendations, but it is advisable to set in advance the number of times this feedback loop is allowed to happen on a particular project, or the schedule will be affected by excessive discussions.

In the case of repeated concerns with one language or one set of Translators an escalation of the corrective actions may be needed. This may take the shape of closer collaboration between Translators and Reviewers, detailed training and improvement plans. Change in personnel or similar sanctions can occur as a last resort.

The proactive approach

Reviewers can bring a great amount of value to a translation process by taking part during the translation phase rather than only afterwards. Think of it as prevention instead of cure.

Query Management

An efficient Query process promotes communication between Reviewers and Translators, and enables the Translators to consult with the Reviewer during the translation process. The aim is to avoid their having to make decisions which may or may not be approved during Review. The challenge in setting this up is that the Reviewer's work becomes more difficult to measure and price. However, the use of a Query database should allow linguists to research previously answered Queries, and Reviewers to be compensated based on the number of Queries answered.

Integrated Query Management and Sampling Workflow

A slightly different process needs to be set up for Source Queries. Answering these questions about the source text may be an area where your in-country Brand and Marketing staff, as well as content creators and other stakeholders, remain involved with the Translation supply chain. Ideally this should happen through the same Query database as Linguistic Queries.

Linguistic Asset Management

Reviewers may also be the ideal people to take responsibility for maintaining Linguistic Assets such as Glossaries, Translation Memories or Style guides.

While Translators are the first linguists to be exposed to new content, the Reviewers should have a more global overview of your content, particularly if you use more than one LSP. A suggestion process is required for Translators to request new Terminology, global changes in legacy translations, or standardisation through Style guide updates. But the Reviewers are likely to be the only ones who can coordinate feedback from multiple sources. Professional Reviewers are experienced Translators, and they often double up as Terminologists.

For this to be successful, it is essential to have a central repository where everyone involved can access the latest version of each piece of reference material at any time. This can be a Translation Management System or a separate repository like SharePoint or eRoom. It should prevent the use of local copies as much as possible, and an email notification system can be used to advertise updates, at least for the more stable elements like the Style guides.

The update process may also need to be scheduled, with clear cut-off and publication dates, if failure to comply results in errors measurable during Review.

Cost effectiveness

Reviewers are usually experienced Translators and the hourly cost of a Reviewer can be substantially higher than that of a Translator.

This is easily offset by the value they bring if the process is set up correctly, even if you don’t move away from a setup where review was done by in-house staff.

Professional review will lower the volume, and therefore the cost, of error fixing. It will increase the quality and consistency of your content, and reinforce in-country brand integrity.

In more mature translation chains, the ratings are sometimes used to target languages where full review is required, versus those where sampling might be enough because quality has been observed to be consistently high. In such cases, the Reviewer’s role should transition to less review work and more production-support activity through Query and Asset Management.

Posted in Linguistic Review, Quality Management | Tagged: , , , , , , , , | 8 Comments »

Rookie Story: Where to Start with Localisation Management?

Posted by Nick Peris on October 11, 2011

Congratulations! You aced that interview a few weeks ago, and this morning you strolled into the office with a spring in your step! You had the HR induction and were introduced to your new colleagues. Now you’re logging onto the network, the company handbook reassuringly lying on the corner of your desk, or saved on your desktop.

Time to get started! The Company hired you to bring under control this thing almost mysteriously referred to as “Translations”. Your objectives are simple: reduce cost and improve quality. You are their first ever Localisation Manager, and you know the keys to your success will be the standardisation and centralisation of all Localisation activities.

So what do you need to consider from a technical and organisational point of view?

Flags, Nations, People

Getting to Know your Internal Customers

If there have been Translations in your Organisation, there are existing processes and linguistic assets you should be able to build on. You need to quickly learn about them by focussing on:

  1. Who are your allies? Each Department, Local Office etc. probably has at least one “Translation person”. Find out who they are and what they have been doing. Determine whether they will remain involved once you’ve established the new structure, or if they expect to be relieved of Localisation duties. All going well, you may be able to enroll some of them in an inter-departmental Localisation team, even if it’s only a virtual team.
  2. What is the inventory of current processes? Meet the current owners and document everything. No need for anything fancy since you are going to change these processes, but you need to have it all down so that when the inventory is finished you have an accurate and complete picture.
  3. What are the points common to all? Which of those processes work well and which don’t? The successful ones will be the building blocks for your future world.
  4. What are the specificities of each one? Which are worth keeping? Can they be used by other parts of the Organisation? Do they need to remain specific? Your new processes will need to achieve a balance between harmonisation and flexibility.
  5. Do any of those existing processes use technology such as CAT Tools, Content Management Systems, Translation Management Systems? If so should they be upscaled and shared across the Organization?
  6. Do any maintain linguistic assets like Glossaries, Style guides, Translation Memories or even just bilingual files which could be used to create TM’s?

Understanding your product lines

You need to understand thoroughly what you are going to localise before you can develop the processes. The questions to answer are:

  1. What types of content: marketing, commercial websites, Software, Help systems, self-service technical content, user-driven content like blogs etc. All these use very different registers, vocabulary and forms of address, and the choices made will differ again from one language to the next. Some content types require high volumes at low cost, such as Support content or product specifications. Some require high quality and creativity, like Copywriting and Transcreation, and you may even choose not to use TM’s for some of those. Some will be specific to parts of your Organisation while others will be global material. You will need to ensure a consistent Corporate identity across all these, in all languages.
  2. What are the fields: automotive, medical or IT content requires linguists with different backgrounds and specialisations. Make sure you know all the areas of expertise to cover during Translation and Review. For some you might need to add Subject Matter Expert (SME) review to the more common step of Linguistic Review. Review changes will need to be implemented, communicated to Translators and fed into the TM’s, and the process will need to let SME’s take part without having to learn CAT Tools.
  3. From a technical point of view you will also need to work with the content creators to determine the type of files you will receive from them and those they expect to receive back.
  4. Start a war on spreadsheets as soon as possible. You probably won’t win it, but the more you root out, the better. Teach your customers how parsing rules protect their code by exposing only localisable content during translation (see the sketch after this list). Promote Localisation awareness during Development and Content creation. Document best practices such as avoiding hard-coded strings, and providing enough space in the UI to accommodate translations which can be up to 30% longer than the source text, at least when the source is English.
  5. Your aim should be:
    • to receive files that can go straight to Translation with minimum pre-processing
    • to deliver files that your customers can drop into their build or repository for immediate use.
  6. No one should be doing any copy-paste engineering, manual renaming or file conversion.
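As mentioned in point 4 above, a parsing rule’s job is to expose only the translatable text and protect everything else. Here is a minimal sketch assuming a Java-style .properties file; real CAT tool filters are far more sophisticated, and the patterns here are simplified for illustration:

```python
import re

# Expose only the string values of a Java-style .properties file for
# translation; keys, comments and {0}-style placeholders are protected.
LINE = re.compile(r"^(?P<key>[^#=\s][^=]*)=(?P<value>.*)$")
PLACEHOLDER = re.compile(r"\{\d+\}")

def extract_translatables(text):
    for line in text.splitlines():
        m = LINE.match(line)
        if m:
            value = m.group("value")
            tokens = PLACEHOLDER.findall(value)  # must survive translation
            yield m.group("key").strip(), value, tokens

sample = "# comments stay hidden\nwelcome.msg=Hello {0}, you have {1} new messages\n"
for key, value, tokens in extract_translatables(sample):
    print(key, "->", value, "| protected:", tokens)
```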

Designing your Workflows

This can start with a pen and paper, a white board or whatever helps you think quicker, but it should end with a flowchart or set of flowcharts describing the process you’re setting up.

  1. Collaborate with your internal customers. You need to agree a signoff process, and avoid multiple source updates during or after the Translation process.
  2. Enumerate all the stages required and determine the following:
    • How many workflows do you need to describe all scenarios? Try to find the right balance: fewer workflows ensure efficiency, but too few will lead participants to implement their own sub-processes to achieve their goals, and you will lose control and visibility.
    • What stages do you need (a minimal sketch follows this list)? The most common are:
      1. Pre-processing
      2. Translation
      3. Linguistic Review
      4. Post-Processing
      5. Visual QA
  3. Who are the owners of each step? Are they internal or external (i.e. colleagues or service providers)? How will you monitor progress and status? How will you pay?
  4. Is there a feedback loop and approval attached to certain steps? Will they prevent the workflow from advancing if certain criteria are not met? Is there a limit to the number of iterations for certain loops?
  5. What automation can be put in place to remove human errors, bottlenecks and “middle men” handling transactions?
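A minimal sketch of such a workflow, expressed as a set of allowed stage transitions; the cap on review feedback loops is an assumption for the example:

```python
from enum import Enum, auto

class Stage(Enum):
    PRE_PROCESSING = auto()
    TRANSLATION = auto()
    LINGUISTIC_REVIEW = auto()
    POST_PROCESSING = auto()
    VISUAL_QA = auto()

# Allowed transitions, including the review feedback loop back to
# Translation; the loop cap below is an assumption for the sketch.
TRANSITIONS = {
    Stage.PRE_PROCESSING: [Stage.TRANSLATION],
    Stage.TRANSLATION: [Stage.LINGUISTIC_REVIEW],
    Stage.LINGUISTIC_REVIEW: [Stage.TRANSLATION, Stage.POST_PROCESSING],
    Stage.POST_PROCESSING: [Stage.VISUAL_QA],
    Stage.VISUAL_QA: [],
}
MAX_REVIEW_LOOPS = 2

def advance(current, target, loops_so_far=0):
    if target not in TRANSITIONS[current]:
        raise ValueError(f"{current.name} cannot go to {target.name}")
    if (current is Stage.LINGUISTIC_REVIEW and target is Stage.TRANSLATION
            and loops_so_far >= MAX_REVIEW_LOOPS):
        raise ValueError("Review feedback loop limit reached")
    return target

stage = advance(Stage.PRE_PROCESSING, Stage.TRANSLATION)
print(stage.name)  # TRANSLATION
```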

Choosing your Vendors

Once you’ve determined which of your workflow steps need to be outsourced, you will need to select your providers. Linguistic vendors will likely be your most important choice.

Translation

In-house translators are a luxury rarely afforded. When choosing Translation vendors, first decide between Freelancers and Language Service Providers (LSPs). Managing a pool of Translators is a job in itself, so most will hire the services of an LSP, which will also be able to provide relief in terms of Project Management, Technology changes, and staff fluctuations due to activity levels or holiday periods. Having more than one LSP can be a good strategic choice: it gives you more flexibility with scheduling and pricing. You can specialise your vendors according to content, region or strength. A certain amount of overlap is necessary for you to be able to compare their performance and benefit from a bit of healthy competition.

Linguistic review

Whichever setup you have for Translation, you will need linguistic review in order to ensure the integrity of the message is kept in the target languages. You will also need to ensure consistency between Translators or Agencies, check Terminology, maintain TM’s and Style guides.

Marketing and Local Sales Offices often get involved with that. However, using internal staff removes them from their core tasks, unless you are lucky enough to have dedicated Reviewers. More than likely, in-country colleagues will find it difficult to keep up with the volume and fluctuations of the Review work and will ultimately prove an unreliable resource. The solution is to hire the services of professional Reviewers. Many LSPs provide such services. Some ask their competing providers to review each other, but that often results in counter-productive arguments. A third-party dedicated review vendor will be best placed to enforce consistency, accurately measure quality, maintain linguistic assets, and even manage translator queries on your behalf.

Selecting Technology

Translation Memory technology is a must. Which one you go for may be determined or influenced by existing internal processes, particularly if there are linguistic assets (TM’s and Glossaries) in proprietary formats. Your vendors may also have a preferred technology or even propose to use their own. If you go down that road, make sure you own the linguistic assets. The file format is another choice that needs to be made carefully from the start. Open source formats may save you from being locked into one technology. However technology vendors often develop better functionalities for their proprietary formats. It can be a trade-off between productivity and compatibility.

The good news is that conversion between formats is almost always possible. This means migration between technologies is possible too, but avoid making conversion a routine part of the process. Even if it is automated, routinely having to output TMs in several formats will introduce inefficiencies and increase user-support requirements.
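For example, TMX, the open Translation Memory eXchange format, is plain XML and can be read with nothing more than a standard library. A minimal sketch, assuming a TMX 1.4 file (where each language variant carries an xml:lang attribute) and a hypothetical file name:

```python
import xml.etree.ElementTree as ET

XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

def read_tmx(path):
    """Yield one {language: text} dict per Translation Unit."""
    tree = ET.parse(path)
    for tu in tree.getroot().iter("tu"):
        variants = {}
        for tuv in tu.iter("tuv"):
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() flattens any inline tags inside the segment.
                variants[tuv.get(XML_LANG)] = "".join(seg.itertext())
        yield variants

# "legacy_export.tmx" is a hypothetical file name.
for variants in read_tmx("legacy_export.tmx"):
    print(variants)
```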

Translation Management Systems have become so common that some think they are on the way out. You will, at the very least, need a Portal to support file transactions and share your linguistic assets with all the participants in your supply chain. Emails, preferably automatic notifications, should be used to support the transactions, but they should be avoided when it comes to file swapping. FTP is a common option, easy to set up, learn and run cheaply, but it can soon turn into a mess and gives you zero Project Management visibility. In order to achieve efficient status monitoring, resource pooling and any type of automation, you should consider a Translation Management System.

Whether you go for the big guns like WorldServer or SDL TMS, or for something more agile like XTRF TMS, you will reduce the number of bottlenecks in your process: handoffs will go straight from one participant to the next. The Project Managers will still have visibility, but no one will have to wait on them to pass on the handoff before getting started. TM’s will be updated in real time and new content will become re-usable immediately.

A few things to look out for in your selection:

  1. Fewer clicks = shorter kickoff time. Setting up Projects in a TMS is an investment. It is always going to take longer than dumping files on an FTP and emailing people to go get them, if you look at an isolated Project. As soon as you start looking at a stream of Projects, a TMS makes complete sense. Still, a TMS’s worst enemy is the number of clicks it needs to get going.
  2. Scalability: you need the ability to start small and deploy further, without worrying about licenses or bandwidth.
  3. Workflow designer: demand a visual interface, easy to customise, which can be edited without having to hire the services of the technology provider. Don’t settle for anything that will leave you at the mercy of the landlord.
  4. Hosting: weigh your options carefully here again. In-house is good if you have the infrastructure and IT staff, but letting the Technology provider host the product may be a more reliable option. This is their business after all; maybe you don’t need to reinvent the wheel on that one.
  5. User support: the cost and responsiveness of the Support service are essential. No matter how skillful you and your team are, once you deploy a TMS to dozens of individual linguists there will be a non-negligible demand for training and support. Make sure this is provided for before it happens.

Once you’ve made all these decisions, you will be in good shape to start building and efficient Localisation process. Last but not least, don’t forget to decide whether to spell Localisation with an “s” or a “z”, and then stick to it! 🙂

 

Related articles:

Crowdsourcing in Localisation: Next Step or Major Faux Pas?
Globalization – The importance of thinking globally
SDL Trados 2007: Quick Guide for the Complete Beginner
Which comes first, Globalization or Internationalization?
Who’s responsible for Localization in your organization?

 

Posted in Beginner's Guide | Tagged: , , , , , , , , , , , , , , , | 3 Comments »

SDL Trados Studio 2011 Preview: Can It Convince Trados 2007 Faithfuls?

Posted by Nick Peris on September 20, 2011

SDL have been drumming up interest in SDL Trados Studio 2011 through the summer. Even though the successor to SDL Trados Studio 2009 is due to release at the end of September, I must admit that I have been slower to turn my attention to it than I was with Studio 2009.

This is in part due to my current occupation, which leads me to spend more time using Translation Management Systems than CAT tools. But it is also because SDL Trados Studio 2009 was such an exciting breakthrough: the idea of fully integrating SDLX, Trados and Synergy was a major shift. The technology behind the new Studio file formats (.sdlxliff bilingual files, .sdltm Translation Memories and .sdltb Term databases) was also quite promising. Lastly, the productivity improvements were many, thanks to the entirely new XML-based TM engine, which allows multiple-TM look-ups, AutoPropagation™, AutoSuggest™, QuickPlace™, Real-Time Preview etc.

Reading through those posts about SDL Trados Studio 2009 reminds me how attractive it seemed. But there was also a distinct possibility that this substantial innovation would not cause a mass migration of Trados 2007 users. Budgets were tight due to the worldwide recession. The prospect of migrating entire Localisation production chains seemed like an unnecessary overhead. Users would have to be re-trained, and Enterprise and LSP proprietary automation redesigned to work with the new file formats. Above all, SDL Trados 2007 was delivering perfectly acceptable services.

Sure enough, two years later, empirical evidence suggests Trados 2007 is alive and well. It is apparent in my daily interactions with Localisation professionals around the World. All Trados users are aware of Studio by now, but I’d venture to say all of them still have Trados 2007 installed, and that it probably even remains their SDL tool of choice. Assuming the hits on Localization, Localisation have any statistical value, it is a telling sign that SDL Trados 2007: Quick Guide for the Complete Beginner continues to be the most frequently visited post in these pages, 2.5 years after being posted. But then perhaps that’s my own fault, for not making a beginner’s guide to Studio 2009…

So let’s now turn to the future and look at SDL Trados Studio 2011’s prospects. New comers to the CAT tools market will inevitably consider Trados as one of their options; which new features it offers does not matter much. As for existing Studio 2009 users, I doubt any amount of innovation can make them upgrade if they haven’t already a budget or subscription plan which allows for systematic upgrades. The real measure of the impact of Studio 2011 will be whether it can convince the remaining Trados 2007 users.

What does SDL Trados Studio 2011 bring to the table to meet the needs of this demographic?

Some New Features

All the great advances made with Studio 2009 are of course still available, although some of them have matured. The main highlights in terms of novelty are the return of Perfect Match and the focus on productivity during review cycles.

Perfect match 2.0

Perfect Match makes a return to Trados: it existed in Trados 2007 but was absent from Studio until now. It co-exists with Context Match and, together with Terminology and Sub-Segment leveraging, makes up the concept of Total Leveraging.

The differences between Perfect and Context Matches are:

  • Perfect Match can run on a batch of files (right-click a bilingual file to pre-translate and select Batch Tasks > Perfect Match) and is good for Project rather than document updates.
  • SDLXLIFF, TTX and ITD are all supported.
  • Context Match runs on successive versions of the same file, file names have to match.
  • They are marked as PM and CM respectively in the resulting bilingual files. Both segment types are locked.

Track changes

Studio 2011 uses a change tracking technology which is fully compatible with Microsoft Word. Thanks to the SDL XLIFF Converter, an SDL Open Exchange add-on now included in Studio, changes and comments made in Trados can be viewed, accepted etc. in Microsoft Word and vice versa.

This makes it easy to collaborate with users who do not have Studio during the review process. Whether they are linguists using other CAT tools or Subject Matter Experts not familiar with any CAT tool, they will all be able to input their feedback using Word.

The versions of Word officially supported are 2007 and 2010; 2003 should work but this is unconfirmed for now. Track Changes can be turned on or off for different parts of the process such as Translation, Review or Signoff under Options > Tools.

Display Filters

SDL Trados Studio 2011 Display Filters

In Trados Studio, segments can be filtered to show only those relevant to the current task. The filters in this list are another way Studio 2011 helps productivity during review, with new options such as Segments with Comments or Segments with Track Changes. These filters can also be applied during export using the SDL XLIFF Converter.

Improved Spell Checkers

Trados Studio 2011 brings the Microsoft Spell Checker back. Hunspell is still available, but users can now configure which checker to use for each language. This resolves issues with the Studio 2009 Spell Checkers, which were not fully accurate for certain languages, notably Scandinavian ones.

SDL Trados Studio 2011 QA Checker 3

QA Checker 3.0

QA Checker 3’s claim to fame is the interactive dialog box which makes reviewing and implementing reported issues a much clearer process. It is reportedly also a first step in longer term plans of adding grammar checks.

Enhanced File Filters

Studio 2011 includes new filters for:

  • OpenOffice, LibreOffice, StarOffice and IBM Lotus Symphony.
  • INX and Java properties.
  • Improved FrameMaker MIF support.
  • Bilingual Word files, which can now be edited directly.

Other novelties to discover in Trados Studio 2011 include pseudo-translation, for testing parsing rules and settings before the launch of new Project Types. Character counts, and not just wordcounts, are now also available.
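For readers who have not used it, pseudo-translation mangles the source text in predictable ways so that hard-coded strings, encoding problems and truncations show up before any real translation is ordered. A minimal sketch of the idea; the accent map, bracket markers and 30% expansion factor are assumptions for the example, not Studio’s actual algorithm:

```python
# Accented substitutions reveal encoding problems, brackets reveal
# concatenation and truncation, and padding simulates the expansion
# of translations out of English (around 30%, as noted earlier).
ACCENTS = str.maketrans("aeiouAEIOU", "àéîõüÀÉÎÕÜ")

def pseudo_translate(text, expansion=0.3):
    accented = text.translate(ACCENTS)
    padding = "~" * max(1, int(len(text) * expansion))
    return f"[{accented}{padding}]"

print(pseudo_translate("Open file"))  # [Õpén fîlé~~]
```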

An Evolving Image

Lighter Ownership Experience

First impressions tend to last, and the installation and activation process are a big part of how a new application is experienced by users. In Studio 2011 the installation is made simpler. One single installer enables compatibility with Trados 2007 file formats (.ttx, .itd, TM upgrades and alignment tasks). With TTXit!, freely available on SDL Open Exchange, users should no longer need a copy of Trados 2007 in addition to Studio.

Because the user interface and technology in Studio 2011 are so similar to Studio 2009, no big learning curve is required. Any time and effort invested in learning to use Studio will just give users a head start in being proficient at the new version.

SDL Trados Studio 2011 MultiTerm Widget

Starting a project is itself a simpler process, with only 3 files needed (source, bilingual and TM), and no associated folder structure in the background.

The standalone License Manager has been replaced. Activation is now fully integrated into Studio, and license borrowing is supported.

Finally, the SDL MultiTerm Widget is being pushed into the limelight. This taskbar tool lets you browse Terminology from external applications like Microsoft Excel, PowerPoint etc. at the touch of a button. It also provides a handy shortcut to searches in Google or Wikipedia and is now included in Trados Studio.

Expanding the Trados Community

Technology webinars have been an SDL strength for a long time now. Call it free education or a carbon-conscious alternative to business trips, they are an efficient way for any technology vendor to showcase their goods.

There are other ways SDL share information about Trados, like the Studio 2011 Series on the SDL Blog or the SDL Trados YouTube channel. SDL are certainly not the only language technology provider to use new media, but I think it’s fair to mention their consistent effort to meet their user community and ensure information is widely available.

SDL OpenExchange is also used to promote this spirit of community with Developers (look out for prize competitions!) and has produced a number of commercial as well as free Apps which respond efficiently to very specific needs.

The connectivity with SDL’s Enterprise applications is also kept up to date. Studio 2011 can connect to WorldServer or TMS Translation Memories for Concordance just like it would with local TMs. An Express Edition of Studio 2011 will be released for users who need Studio only for WorldServer projects.

Posted in News, SDL Trados Studio 2011 | Tagged: , , , , , , , , , , , , , , , , , , , , , , | 9 Comments »

Offline TM Update Process for SDL TMS

Posted by Nick Peris on July 19, 2011

SDLX Dashboard

SDL TMS normally updates its online TMs automatically as translations progress through the workflow. While this is as efficient as one could expect, there are cases where “manual” updates of the online TMs must be performed. A Terminology change may have to be implemented globally into legacy TUs. A linguist may be asked to perform an offline clean-up of an overgrown or aging TM, and the resulting file may have to be imported back into the online TM. Audits may be conducted on live content and also require manual edits of the online TMs, etc.

In most cases, these edits will need to be performed by accessing the remote TMs using SDLX rather than SDL TMS 2007. This is because SDL TMS doesn’t let linguists directly edit TMs, as I previously explained. The present post describes the step-by-step process for updating TMs hosted on an SDL TMS 2007 server using SDLX 2007. It can be used by linguists such as Translators, Reviewers or Language Leads, or by Engineers, depending on who in the process is in charge of implementing manual edits such as global updates or imports.

Adding the SDLX Server Object

Prerequisites:

  1. SDLX 2007 Professional: no access to remote TM Servers is possible for SDLX Light or Freelancer users.
  2. TM Editing rights have to be granted to the users by the SDL TMS Administrator

Process:

Opening a TM located on an SDLX Server

  1. Go to Start – All Programs – SDL International – SDL Trados 2007 – SDLX and start SDLX
  2. In the SDLX Dashboard, click Maintain
  3. In SDL Maintain, go to Tools – Options – Advanced – Object Management, click SDLX server and OK. Click OK again to close the SDL Maintain options dialog (this step is only required the first time you connect to an SDLX server)
  4. In SDL Maintain, click TM – Open – SDLX Server
  5. In the Select SDLX Server dialog, click Add and enter your SDL TMS server connection details
  6. Once the connection is established, open the Translation Memories drop-down menu and select the TM to edit. Click OK twice to validate your choice and close all dialog boxes.
  7. Once the TM is loaded:
    1. Perform Text Searches by pressing F7 and edit as required (this is faster than using Find)
    2. Or import into the TM by clicking TM – Import
    3. Save and Close the TM when completed

Posted in SDL TMS, Translation Management Systems | Tagged: , , , , , , , , , , | 3 Comments »

Localization & Language Laws

Posted by Patrick Wheeler on July 12, 2011

Do you speak Eskimo?

I decided to compile the following list of language laws that may be relevant to consider when localizing for particular markets. Some are industry-specific (Medical Devices, Toys etc.) and some are fairly generic pieces of legislation. My personal favourite is the requirement to localize into Inuit in Canada. OK, fair enough, you’d have to be targeting localization towards a region of Canada (Nunavut) with a population of just over 33,000 people, and the scope is not all-inclusive, but hey, for some reason I find it amusing. 🙂 To clarify (post-feedback), my amusement stems from the consternation that can be caused by highlighting such lesser-known pieces of legislation to the wider organisation, not from any perception on my part that such laws are “trivial”. Otherwise I wouldn’t go to the effort of including them in the list below.

Doubtless there are many other pieces of relevant legislation in various regions, so this list is by no means comprehensive and will be subject to future edits. This is just a first attempt to create a list of the known language laws and policies that may impact localization considerations. So if you should know of others, please comment and I will update the list as appropriate.

Click here for the latest “LANGUAGE LAWS” list by country and region

Certain pieces of legislation may not be relevant to language, so I have not yet included them in the list. However, regional legislation can certainly have a global impact, for example data protection laws: European data protection directives need to be considered when sharing or hosting data internationally. Knowledge of these directives and their implementations at a local level is gradually coming into the mainstream as an increasing number of businesses move into the Cloud with SaaS offerings.

The European Commission provides standard contractual clauses that can be used when your company is dealing with data processors established in third countries (countries outside the EU that are deemed not to enjoy an “adequate” level of data protection). Basically, these are contractual templates that can be used to ensure that an “adequate level of data protection” exists for the end-user and to protect your organization in terms of due diligence. These clauses do not negate the requirement to make end-users aware of where their data will be held and for what purposes it will be used, so prior and explicit user consent is still required. In particular, following a decision by the so-called “Düsseldorf Group”, German data protection laws (Bundesdatenschutzgesetz) now add another layer of complexity and a further set of requirements.

Naturally there is a great deal of flux around such policies in different regions, as government bodies rush to catch up with a Web 2.0 world where a growing number of people and organisations are trading goods and services internationally using a range of different mediums and platforms. These policies can be subject to change on a nearly daily basis, making it hard for businesses to keep track and ensure compliance.

Market-focused rulings in certain regions can also have a broad impact on businesses. For instance, a sweeping statement from the Chinese General Administration of Press and Publication (GAPP) office back in late 2009 essentially put an end to the plans of western MMO Games producers intending to enter the Chinese market. The GAPP banned foreign investors from operating online games “in any form” within China. This decision came as a surprise even to the Chinese Ministry of Culture (MoC), who expressed shock upon hearing the news.

In summary, it is important for localization professionals not only to be focused on the technical aspects of their trade, but also to familiarize themselves with regional legislation relating to the delivery of software and services, both at a high level and pertaining to particular markets of interest, so that they can advise and provide direction on such matters within their organizations.


Posted in Globalization, Laws | Tagged: , , , , , , , | 5 Comments »

SDL Studio Online 2011: the New Face of TMS

Posted by Nick Peris on July 12, 2011

Hot on the heels of SDL TMS 2011 which was recently reviewed here, SDL Technologies released SDL Studio Online 2011.

In a nutshell, SDL Studio Online 2011 is an optional add-on exclusive to SDL TMS 2011. It is distributed as part of the SDL TMS 2011 Service Pack 1 and upgrades the SDL TMS Translator Interface. While SDL TMS 2011 introduced a new Carbon Theme, its Translator Interface was in fact still a slim version of SDLX 2007. SDL Studio Online 2011 replaces it with an SDL Trados Studio-inspired successor.

From a linguist’s point a view this is an important leap forward. Together with the performance improvements promised with the original release of SDL TMS 2011, this could significantly increase the proportion of linguists working online. Translators, and sometimes even Reviewers, have tended to choose to download SDL TMS packages rather than working online. This somewhat defeats the purpose of having an online translation environment with real-time TM and Terminology updates capabilities. Yet it has remained a popular choice mainly for two reasons:

  1. Combined server and user-side performance issues: this should already improve with an upgrade to the original SDL TMS 2011.
  2. Translation Interface too basic compared to desktop CAT tools: this is what this optional SDL TMS 2011 SP1 add-on proposes to address.

SDL Studio Online in SDL TMS 2011

Studio Online provides a number of tangible improvements over the standard Translation Interface:

  • Improved performance for linguists working online with the more modern interface featuring:
    • Segment-level lookup
    • Concordance lookup
    • Flexible tag display and editing
    • Find and replace functionality across the entire task
    • Integrated spell checking, with inline and batch modes, multilingual dictionaries and corrections, and user-specific dictionaries.
  • Reduced need for training and support for the growing number of linguists unfamiliar with SDLX
  • Reduced file management overhead thanks to a more attractive online environment
  • Increased proportion of linguists connecting to online linguistic assets rather than working from periodic downloads
  • Licensing and upgrade management owned by the client
  • Support for Microsoft input method editors (IMEs) for typing East Asian characters on non-matching language versions of Office.

Requirements and setup information

Studio Online is a Microsoft Silverlight plug-in. It requires Microsoft Silverlight 4 and the latest available Service Pack for Windows. Its installation or upgrade process is guided from within Studio Online.

Other requirements are Microsoft Internet Explorer 6.0, 7.0, or 8.0 and Mozilla Firefox 3.5 or 3.6.

Enabling SDL Studio Online

Studio Online Licenses have to be purchased, in addition to the SDL TMS licenses, for a specific number of concurrent users. Once installed and licensed, each user may choose to configure SDL Studio Online as their default editor in SDL TMS by going to Home – My Details – User Preferences and checking the option Use SDL Studio Online (see screenshot). This remains optional, so not all users have to make this choice. Task download for offline work also remains available in any case.

One limitation to note is that SDL Studio Online 2011 does not integrate with the QA Models. If you have made use of the online rating function added since SDL TMS 2007 SP4, you will have to choose between it and the new online interface. My inclination would be to let the Translators use Studio Online, to make it more acceptable for them to stay online throughout the translation process. On the other hand, Reviewers could continue using the old interface so that they can use the QA Model, and so that you save on Studio Online licenses.

Posted in News, SDL TMS, Translation Management Systems | Tagged: , , , , , , , , , , , , , , , , , , | 5 Comments »

Kilgray TM repository: a New Home for Translation Memories

Posted by Nick Peris on July 5, 2011

Kilgray TM repository

As Kilgray Technologies made the memoQ 5.0 Release Candidate available for download right on cue last week, there is another piece of Kilgray news I’d like to share with you.

The lesser-known but aptly named TM repository was launched recently by the makers of memoQ and offers an interesting, fresh approach to Translation Memory server products. This application apparently pre-dates memoQ but wasn’t launched commercially until this year. Since then, Kilgray have been gathering early adopters’ feedback, which they are planning to include in a version 2 sometime next year.

TM repository is made up of 3 components: the database, the business logic and the web-based interface. It is built on SQL technology and comes in 2 editions, depending on the number of users required.

TM repository Importing Sessions

The idea behind any TM server product is to provide a central location where all users in a supply chain can access the same, latest version of the Translation Memories. Different Localisation Managers have different TM strategies, which often depend on the CAT tools, or even the versions of the CAT tools, in use by the Assets owner and their LSPs. Important choices have to be made in terms of maintenance, most of which have to do with how best to archive TMs for re-use.

Working from project-specific TMs only gives smaller leveraging power and little version control ability. Yet it is sometimes the chosen path, simply because it seems more manageable. On the other hand, building and maintaining Master TMs containing all segments ever translated, or even chunks of them organised by Product lines, Business Units etc., requires a sustained management effort. For instance, when there are terminology updates, a linguist should implement global changes by batch editing Translation Units. They may spend time fixing old Translation Units (TUs) which will never be used again. It may also be difficult to find linguists with the skills to directly edit the TMs for all languages. More often than not, Master TMs which are not integrated with a Translation Management System will contain errors, deprecated terms, duplicate TUs with alternative translations etc., and will require clean-up. The Project TMs-only route will always underperform in terms of ability to re-use existing translations and ensure consistency, but Assets owners are still left to evaluate for themselves which option is best for them.

TM repository Maintenance Sessions

TM repository is a solution to a lot of these common problems:

    • It enables the Assets owner to create a single online TM database containing all TUs, for all projects and all language pairs.
    • The flexible descriptive fields (metadata) allow the TUs to be tagged precisely.
    • This metadata can then be used in Queries for smart filtering during Maintenance or Export (see the sketch after this list).
    • TMX Imports let users add to the database from virtually any system.
    • TMX Exports permit the extraction of Project TMs, which can be reimported after use and update.
    • Exports can be customised for the CAT tool in use through customisable Mapping: Query results (i.e. Project TMs) will contain metadata compatible with the target translation tool.
    • Refined Maintenance is enabled through features such as Search and Replace of text or metadata, or the use of deprecation settings by which older TUs can be hidden from search results.
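As a rough illustration of those metadata-driven Queries, here is a minimal sketch; the TU fields and the deprecation logic are assumptions for the example, not TM repository’s actual schema:

```python
from datetime import date

# Each TU carries metadata; the field names here are illustrative only.
tus = [
    {"src": "Save", "tgt": "Enregistrer", "lang": "fr-FR",
     "product": "AppX", "modified": date(2009, 3, 1)},
    {"src": "Save", "tgt": "Sauvegarder", "lang": "fr-FR",
     "product": "AppY", "modified": date(2011, 5, 20)},
]

def query(tus, deprecate_before=None, **attrs):
    """Filter TUs on metadata, optionally hiding TUs older than a
    deprecation cut-off, as the deprecation settings described above do."""
    for tu in tus:
        if deprecate_before and tu["modified"] < deprecate_before:
            continue
        if all(tu.get(k) == v for k, v in attrs.items()):
            yield tu

# Build a Project TM for AppY, hiding pre-2010 units.
for tu in query(tus, deprecate_before=date(2010, 1, 1), product="AppY"):
    print(tu["src"], "->", tu["tgt"])  # Save -> Sauvegarder
```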

Posted in Kilgray, News, TM repository | Tagged: , , , , , , , , , , , , , , , , | 1 Comment »

SDL TMS 2011: Inner Peace

Posted by Nick Peris on June 28, 2011

The pace of release of Enterprise Technology such as Workflow and Translation Management Systems is usually slower than that of end-user applications such as CAT tools.

The reasons for this are easy to understand:

First, the priority for Enterprise Applications is stability, not cutting-edge User Experience. Users, and especially customers, require proven and sturdy environments capable of consistently handling massive traffic. This cannot be compromised in favour of the latest UI bells and whistles, not even the newest linguistic asset formats or features.

Secondly, the licensing and pricing model for these applications is such that customers have to monitor their ROI more carefully. Purchasing decisions would not be influenced by yearly or even bi-yearly releases of brand new product lines. The expectation is that these Applications provide a permanent Solution which can be used for several years to come. For that reason, Support contracts tend to include free Patches and even Service Pack upgrades which take care of the more pressing updates.

Last but not least, the effort required to deploy these server-based technologies is again prohibitive of frequent upgrades. There are infrastructure implications, like matching SQL Server versions or the workload of multiple server roll-outs. The technology also needs to co-exist with a number of desktop applications in use in the supply chain.

From reading the SDL TMS 2011 Release Notes, I think the differences with its predecessor, SDL TMS 2007, are very much in line with these requirements. It seems to deliver relevant compatibility updates as well as promising improvements in usability and performance. If the announced increase in reliability delivers, then I think one of the keys to its success over SDL TMS 2007 will be whether it makes linguists more willing to work online, or whether they will continue to use it for file transfer only and perform the actual linguistic work in their desktop CAT tools.

SDL TMS 2011 Carbon Theme

Compatibility

This is the first major release of SDL TMS since the acquisition of Idiom by SDL back in 2008. Together with the recent release of SDL WorldServer 2011, this confirms that, in the short to medium term at least, these two Workflow systems will continue to coexist.

The SDL TMS offering features updated compatibility both in terms of CAT tools and infrastructure:

  • CAT: SDL Trados Studio 2009 SP3, SDL MultiTerm 2009 SP3/SP4, SDL Passolo 2011 (incl. word count accuracy, new dedicated Workflows) and SDL Trisoft
  • Infrastructure: LDAP enhancements, Windows Server 2008 and Microsoft SQL Server 2008 support

SDL TMS can be upgraded to version 2011, though only from SDL TMS 2007 SP4 or later. SP4 would have to be installed first, before upgrading from any older version.

One piece of good news is that no data migration is required when upgrading, and all Post-SP4 hotfixes are included in the Upgrader. Microsoft .NET Framework 4 is recommended. 

Usability and Performance

The User Interface has reportedly been made more responsive in several areas: Translation Interface, Job Authorisation, Configuration edits and more. The UI has been updated with a new colour theme, but apart from that the navigation appears to be unchanged. We will investigate in an upcoming article how this may be changed by the addition of SDL Studio Online, an optional web-based version of SDL Trados Studio 2011 exclusive to SDL TMS 2011 SP1.

SDL TMS 2011 Go to Dialog

The Search feature has also been improved, with increased speed for the main Search (results are now limited to 2,000 matches), and a new “Go to” feature lets users directly open specific Jobs or Tasks if they know the ID.

SDL Trados Studio can now access SDL TMS directly for TM Concordance and updates. This is achieved through an SDL Open Exchange plug-in. Once installed, users simply need to login using the SDL TMS Server Name, Username and Password, much like previously in SDLX’s SDL Maintain.

Unfortunately, Tageditor’s TTX files can’t be downloaded from SDL TMS 2011.  SDL recommend downloading Packages, which contain the ITD files for translation in either SDLX 2007 or SDL Trados Studio 2009. Eventhough SDLX is considered a part of SDL Trados 2007, this makes using Tageditor and Workbench more difficult and more-or-less means support for Trados 2007 in SDL TMS has been dropped.

Terminology imports have been enabled through a new functionality similar to the TM import added with SDL TMS 2007 SP4. This works using SDL MultiTerm .xml import files and a matching database definition. Passolo Terminology (sequences and TB updates) is also supported.

Here are a few other bug fixes and new features which caught my attention:

  • Users can reset their own passwords, which should improve the quality of life of many Workflow managers
  • Issues with the second and further pages of the Translation Interface have been fixed (comments, segment history and MultiTerm matches now work)
  • TM attributes can be edited from the Edit TM page
  • Ampersands (&) and quotes (') in ITD names are allowed
  • Job-level Project TM availability can be displayed in the Inbox
  • PowerPoint SmartArt is supported

Reliability

SDL claim that over 200 reported issues have been resolved, including a number reported by users through ideas.sdl.com.

Improvements in file format support and exception handling should limit the number of failed Jobs and Tasks.

Importantly, progress seems to have been made with Translation Memory exports. A new incremental method saves having to use server resources to repeatedly perform full exports. This Incremental TM Export option, which is unchecked by default after installation, functions as follows (a short sketch follows the list):

  • Only segments added or modified since the last export are exported.
  • They are added to the latest corresponding TMX export file.
  • All TMX export files can be downloaded at once.
  • Note: segments deleted from the TM are not removed from the export. A full export (by temporarily unchecking the Incremental Export box) is required to reflect any deletions
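The logic behind an incremental export is simple enough to sketch; the TU structure and timestamp field below are assumptions for the example, not SDL’s implementation:

```python
from datetime import datetime

def incremental_export(tm, last_export_time):
    """Export only TUs added or modified since the last export.
    Deletions are not reflected, matching the behaviour noted above;
    a full export is needed to pick those up."""
    return [tu for tu in tm if tu["modified"] > last_export_time]

tm = [
    {"id": 1, "modified": datetime(2011, 6, 1)},
    {"id": 2, "modified": datetime(2011, 6, 20)},
]
delta = incremental_export(tm, last_export_time=datetime(2011, 6, 10))
print([tu["id"] for tu in delta])  # [2] - appended to the latest TMX export
```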

Posted in News, SDL TMS, Translation Management Systems | Tagged: , , , , , , , , , , , , , , , , , , , , , , , , , , | 2 Comments »

memoQ 5.0: Mr. Q Brings Change Management to the Localisation Continuum

Posted by Nick Peris on June 21, 2011

 
Mr.Q presents: memoQ 5.0!

Kilgray Translation Technologies introduced memoQ 5.0 to the World last week by means of a twin event. Gábor Ugray, Head of Development, hosted a webinar from the Kilgray HQ in Budapest for the online enthusiasts, while István Lengyel, COO, demo’ed it live from the Localization World 2011 conference in Barcelona.

MemoQ 5.0 will be available as a public Release Candidate on June 30, 2011 and should reach Final Release within a few weeks of that.

The Release Candidate version can be installed side by side with memoQ 4.5 and various upgrade paths will be available to current memoQ users.

Following the strong focus on Project Management in memoQ 4, the philosophy behind memoQ 5.0 is Change Management. Changes in source files are better managed through X-translate, while segment changes are tracked through a sophisticated versioning system. Illustrated examples of these and other new features are detailed below.

memoQ 5.0 Version Tracking

X-translate

The implementation of Major/Minor version control is powerful because of the simplicity with which it responds to a real need. A Translator is working on a file and receives an update to the source file; thanks to memoQ 5.0’s Major versioning feature, he or she can immediately generate an updated version of the bilingual file and continue translating.

There is no need to leverage, which would require the more labour-intensive process of pre-translating again from Translation Memories. One can simply go straight from a partially translated copy of version 1.0 to a partially translated copy of version 2.0.

The screencaps below show how to xTranslate a single file from the previous Major version of the file, then how the xTranslated segments are marked, and finally how to save a snapshot of the resulting file.

Screenshots: xTranslate 1, xTranslate 2, xTranslate 3

It is also possible to export a 2-column file for comparison of 2 Major versions:

Screenshots: Export 2 Columns to HTML, Side-by-Side Compare

Change Tracking

Change tracking enables segment level access to previous versions. The following images show how to enable custom track changes from the Translation menu, how the changes are highlighted in a document, and a further 2 options for translators and reviewers to see changes made to a file since they last edited it.

Screenshots: Track Changes, Track Changes Against Base, Track Changes (Reviewers), Track Changes (Translators)

Terminology in memoQ 5.0

Terminology extraction

MemoQ 5.0 will allow a substantial amount of Terminology work without requiring the use of a dedicated application such as qTerm. Users will be able to extract candidate terms from a Project:

Screenshots: Extracting Candidate Terms, Term Extraction Progress

Stop Words

The use of Stop Word lists will ensure easy noise reduction by preventing words such as “and”, “the”, or any others listed by the user, from appearing as Candidate Terms:

Creating and Editing Stop Word Lists
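The principle is simple enough to sketch; the stop word list and frequency threshold below are illustrative assumptions, not memoQ’s actual extraction algorithm:

```python
from collections import Counter
import re

STOP_WORDS = {"and", "the", "of", "to", "a", "in"}  # user-extensible

def extract_candidates(text, min_count=2):
    """Count words, dropping stop words so that noise like 'and'
    or 'the' never appears among the Candidate Terms."""
    words = re.findall(r"[a-zA-Z']+", text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS)
    return [(term, n) for term, n in counts.most_common() if n >= min_count]

text = ("The translation memory stores segments and the translation "
        "memory returns matches.")
print(extract_candidates(text))  # [('translation', 2), ('memory', 2)]
```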

Reviewing Candidate Terms

Candidate Terms can then be reviewed in context and possibly against an existing Termbase:

Screenshots: Term Extraction Result, Merging Candidate Terms, Accepted Terms, Dropped Terms

Lexicon

The Lexicon option will let you work with a Terms list without having to go through the full process of creating a Termbase. It is meant as an easy-to-use, immediately rewarding tool to manage Terminology within a Project. This should encourage Linguists to run quick Term extractions before starting a job, especially where a Termbase is not available as part of the Handoff, in order to get an efficient overview of the Terms contained in a set of source files.

MemoQ 5.0’s Terminology feature does not support the TBX format, however Kilgray’s fully-fledged terminology tool qTerm, does.

memoQ 5.0 and nested file formats

Another very effective idea implemented in memoQ 5.0 is support for file formats containing code belonging to other file formats. An obvious application is the case where the handoff is a spreadsheet containing strings copied from an XML or software file. But there are other common cases, such as XML files containing HTML code.

The requirement here is to parse files twice, so that all code is recognised as such and the linguist can concentrate on translating, with full confidence that all tagging is managed by the CAT tool. Here are 2 examples (a simplified sketch of the principle follows them):

Cascading Filters

      1. Cascading Filters for a spreadsheet containing HTML.
        Screenshots: HTML code in XLS (Excel), HTML code in XLS (memoQ 5.0), Reimport As to Apply Second Filter, Adding a Cascading HTML Filter, Document Import Settings, Saving Filter Configuration for Re-Use, Fully Parsed File
      2. Cascading Filters with the Regex Tagger for a spreadsheet containing UI strings.
        Screenshots: Run Regex Tagger to Re-Parse XLS File, Regular Expression Patterns, Adding Patterns to Configuration
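As promised, here is a simplified sketch of the principle behind that second parsing pass: embedded markup and placeholders are converted into protected tags so that only translatable text remains editable. The patterns are deliberately simplified and are not memoQ’s actual filters:

```python
import re

# After the spreadsheet filter extracts the cell text, a second pass
# turns embedded HTML tags and UI placeholders into protected tokens.
EMBEDDED = re.compile(r"(<[^>]+>|%\d*\$?[sd]|\{\w+\})")

def tag_protected(cell_text):
    parts = EMBEDDED.split(cell_text)  # capturing group keeps the tags
    return [("TAG" if EMBEDDED.fullmatch(p) else "TEXT", p)
            for p in parts if p]

cell = "<b>Error %d:</b> file {name} not found"
for kind, chunk in tag_protected(cell):
    print(kind, repr(chunk))
# TAG '<b>' / TEXT 'Error ' / TAG '%d' / TEXT ':' / TAG '</b>' ...
```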

Source Content connectors

Finally, memoQ 5.0 will also, in time, be able to connect to repositories where content is dynamically added. It is designed with CMS integration in mind; however, the CMS connectors will only be released later this summer, as will the web-based editor webTranslate.

Posted in Kilgray, memoQ, News | Tagged: , , , , , , , , , , , , , , , , , , , , , , | 4 Comments »

Alchemy Catalyst 9.0: A Practical and Visual Guide

Posted by Nick Peris on November 15, 2010

I recently had the welcome surprise of finding an invite to a Catalyst webinar in my Inbox. It was with great anticipation and a touch of nostalgia for my Localisation Engineering days, that I clicked on the link and joined the meeting to discover what Alchemy had been up to.

I soon realised that a practical user’s guide would be the best way to cover this on Localization, Localisation. The Alchemy Software Development website already lists What’s New in this release, so rather than analysing the differences between Catalyst 8, for which we did complete Launch coverage, and Catalyst 9, I’ve put together a step-by-step tour based on the demo.

This article can be used by Localisation Engineers and Translators alike to preview the Catalyst 9 interface using the 30 or so screenshots included (see after the slideshow for full-screen versions), and also to read through some recommended processes and tips, adding to my past article on the Leverage and Update Experts.


Creating a Project

The User Interface remains the flexible and now very familiar .net window, with its various docked panels and tabs. It’s also a stable interface which will cause little or no navigation headache to even the most novice user.

The first operation when getting started with Catalyst is to create a Project file, or TTK file. This is easily done by using the File – New menu and following the basic steps.

You will notice in the screenshots that the example used includes varied sample files, such as compiled help (.chm), which requires no source or project files, and a WPF executable.

Locked strings

Preparing a Project

After the creation of the TTK, source files can be inserted either using the Insert Menu item or a context menu in the Navigator tab. Folder structures can also easily be used.

Once the files have been inserted into the TTK, it is time to prepare it for leveraging. This operation consists mostly of locking non-translatable strings and substrings. It can be tedious on a brand new Project, but the work done can be completely leveraged to the various language TTKs as well as any future versions of the project.

Translator Tool Bar Context Menu and Keyword Lock

The lock keywords functionality has been improved in Catalyst 9: the .txt file which holds the project’s keywords list is now automatically generated in the background as soon as the user locks a keyword.

Catalyst 9 UI Batch Keywords Locking

Once a keywords list has been created, it can in turn be used to automatically lock the listed keywords in the remainder of the project.

Another thing to note is that Maximum String Length can now be set on a batch of strings at once.

Leveraging previously translated content

Apart from Leveraging from the TTKs of previous projects, Catalyst supports leveraging from a variety of Translation Memory formats:

  • Translation Industry Open Standard (*.tmx)
  • SDL Trados 2007 (*.tmw)
  • Wordfast Pro (*.txml)
  • Tab-delimited (*.txt)
  • Alchemy Translation Memory (*.tm)
  • Alchemy Catalyst (*.ttk)
  • Alchemy Publisher (*.ppf)

Alchemy Translation Memory is a new proprietary format used to create Master TMs from completed TTK projects. This format can store Catalyst-specific context information (the Dialog box ID, Menu Item etc.), which can later improve the quality of leveraging by providing Perfect Matches. In Catalyst terms, a Perfect Match is a 100% match located in the same Dialog, Menu etc.
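A minimal sketch of the difference between a Perfect Match and a plain 100% match could look like this; the TM structure is an assumption for the example, not Alchemy’s format:

```python
# A Perfect Match requires both the source text and its location
# (dialog, menu...) to match; the structure below is illustrative.
tm = {
    ("DLG_PRINT", "Cancel"): "Abbrechen",
    (None, "Cancel"): "Abbrechen",  # plain 100% match, context unknown
}

def lookup(source, context=None):
    if (context, source) in tm:
        return tm[(context, source)], "Perfect Match"
    if (None, source) in tm:
        return tm[(None, source)], "100% match"
    return None, "No match"

print(lookup("Cancel", context="DLG_PRINT"))  # ('Abbrechen', 'Perfect Match')
print(lookup("Cancel"))                       # ('Abbrechen', '100% match')
```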

Alchemy Publisher, Wordfast Pro, Trados 2007 and the non-proprietary TMX formats provide compatibility with the other TM formats Catalyst might have to coexist with.

Noticeably, Trados Studio 2009 TMs (.sdltm) still do not appear to be supported.

Batch processing

The process recommended by Alchemy is to create an English-to-English Master TTK and then to automate its duplication and pre-translation for each target language in the Project.

This is an area where Catalyst 9.0 does seem to bring a good bit of novelty:

  • With Catalyst 7, engineers had to manually duplicate TTKs.
  • Catalyst 8 was a bit more helpful and created Project folders for target languages and project resources.
  • In Catalyst 9.0 however, the Job file and Scheduler take care of a lot of the repetitive tasks associated with preparing a new Project.

The Create Job Expert lets you use the Master TTK as a template to create project folder structure and corresponding target language TTKs.

Meanwhile, such tasks can also be added to the Scheduler. This new queuing system allows the user to start working on the next project while it processes queued tasks in the background.

Create Job Expert Batch Leverage

Automation

The Command line automation has been improved since Catalyst 8 to include Analysis. The complete Catalyst localisation process can now be automated.

Catalyst 9.0 Developer Edition also includes the Comm API, which lets advanced users script TTK operations all the way down to string level and output automation reports in txt or xml format.

Ensuring Quality and Consistency

In addition to Translation Memories, Catalyst 9 also supports several Glossary formats:

  • Text files, used in Catalyst since the beginning (.txt)
  • Terminology Exchange Open Standard (.tbx)
  • Translation Memory Exchange can also be used for Terminology (*.tmx)
  • SDL MultiTerm and MultiTerm Server

Catalyst 9 inline Validation

Validation still takes two forms: the Expert can be run to perform a global check, and inline validation can also be switched on as a non-intrusive, real-time quality control. If a potential error is found, a flag is raised through the bottom pane, but Translators are not interrupted. They can simply go back to the issue by clicking on the notification once they are ready to attend to it.

The Thumbnail view seems to be a great tool for engineers regressing bugs. It gives a preview of all dialogs in a TTK, lets you click the one which matches, for example, the screenshot in a QA report, and brings you automatically to the location of this dialog in the TTK file.

Catalyst 9 Thumbnails

Translating in Catalyst

The Concordance search and Translator toolbar do not appear to have been changed. Both were introduced with Catalyst 8, where there was a strong focus on improving the user experience from the Translator’s point of view, and they seem to have delivered.

The new Re-cycle button is a result of the same ambition. New translations can be propagated to the entire project by using the current project as an inline TM in the background. Layouts are not recycled, but fuzzy matches are supported.

Clean up Expert

Finally, the Clean up Expert has also received some improvements. As for all Experts, it is recommended to close the Project file before running it, and then to select the file(s) to process from the Expert’s General tab.

Clean up now creates a postproject.tm Translation Memory and generates supplied assemblies for .net.

Conclusion

In my opinion, this new generation of Catalyst still offers a great solution for visual localisation. Although the differences with Catalyst 8 may not make a bullet-proof case for immediate upgrade, the 25% discount currently on offer does represent decent value.

Posted in Beginner's Guide, Catalyst, News, Software Localisation | Tagged: , , , , , , , , , , , , , , , , , | 1 Comment »

Cheap Translation Tech: Who does What…and for How Much?

Posted by Nick Peris on August 24, 2010

Rolling out old tools

Recession-buster CAT tool prices? Low-cost TM Tech? Scrappage scheme on Translation tools older than 2 years?

Subscription-based software rental or money-mental discount on SaaS?

No, the marketing slogans in the Translation Software industry haven’t been quite that exuberant. Yet the cash-flow worries experienced by all the Translation Technology providers have generated a certain amount of creativity, especially when it comes to pricing. So if you’re on the market for a new CAT Tool, you should probably ask yourself: “Where is the best value for my discount?”

Clichés about the dark days we live in abound (including in this article…), and it is clear that no one would part lightly with hard-earned cash to buy a Translation Memory technology license. The truth is one can get such technology for as little as €0 or for about as much as one has to spend. This may always have been the case, but what I think has changed is that market leaders can no longer rely on reputation, exposure and existing market penetration to comfortably roll out the next generation of expensive technology.

Differentiating by offering compelling technological advances is no longer a bullet-proof strategy either. There are plenty of talented tool developers around who are ready to offer imaginative solutions for a modest fee. Features such as mobile phone-like predictive text will not prompt anyone to spend thousands, or even hundreds of Euro.

In fact, mainstream TM technology with all its bells and whistles is facing a problem similar to that of the automobile industry: the multitude of options and gadgets inflating the price of applications, combined with constant update and patch requirements, has left the market wide open for a good-value yet sturdy alternative.

Though it is not a complete answer, a low-cost market for TM tech is developing as a consequence. Freelancers, Agencies and Corporations alike are no longer willing to spend on expensive licenses to buy software which will be outdated within a year or two. So offers started appearing where the license itself has an expiry date. Pay for a year and then decide what to do: renew, upgrade or move on.

The concept of software rental was set to run further, of course: combined with the advances in software hosting and Cloud computing, where users connect to the application over the internet and do not need to install or set up anything on their own machine, it became SaaS: Software-as-a-Service. This trend is much bigger than the Translation software industry alone, and it offers many advantages, such as seamless updates and, crucially, regular cash flow for the provider. It also requires an important shift in mentalities: ownership of the tool is never transferred to the Translator, while ownership of the translations produced with it must remain with them.

All this put together means that we may have reached a fork in the road after which licensing models will be transformed: but which way will they go?


  • the unglamorous route of feature-reduced time-limited ownership
  • or the controversial path of rental, or Software-as-a-Service.

Both options at this point show serious limitations. The reaction of professional Translators could be described as lukewarm at best. On one hand, entry-level traditional licenses are too limiting for users who already own a fully-fledged copy of a previous version. On the other, software rental has not yet earned the trust of the user base, concerned with intellectual property questions and confusing price structures.

The comparison table (covering Starter Edition, Translation Workspace, memoQ, Deja Vu, Across and Wordfast) highlights the strengths and weaknesses of some of these subscription-based low-cost CAT tools.

One thing I hope is sure: the days of paying hundreds of Euro for entry-level licenses are over in our industry, and that has to be a good thing.

If you are due an upgrade, it is most likely that there are good deals to be had on your favourite software provider’s site. If you are looking to invest in your first entry-level CAT tool however, spend some time analysing your needs against what is on offer. Entry prices may be low, but the value and limitations vary widely from one tool to the next.

Posted in CAT Tools Comparison | 2 Comments »

SDL TMS 2007 SP4: Some Comments from SDL

Posted by Nick Peris on June 2, 2010

SDL TMS

Here are some interesting comments from Paul Harrap, Product Manager for TMS at SDL, in reply to my article on SDL TMS 2007 SP4. I’ve also included my own response afterwards.

“(…) I’m very pleased to see we’re getting some coverage in the blogosphere. I’d like to thank you for taking the time to write us up and your positive feedback on the product generally and our new SP4 specifically.

I accept that there’s still some work to do in the product with relation to TM maintenance. As your article accurately reflects, the contents of TMs are updated in TMS in very specific places in the workflow- typically after one or two cycles of review – and what content goes into which TM can be carefully controlled. This is very much by design. We see Translation Memory as the crown-jewel of the linguistic assets of the enterprise customer and so contents are tightly regulated by TMS.

However, we have to acknowledge that bad content can creep into TMs over time – there might be an error in review, or some customers might not review translations quite as thoroughly as others. The changes we made in SP4 to allow the import of files directly into TMs is a response to this requirement. The enterprise can now add/replace contents of a TM directly, without reference to a specific translation job or workflow, as an administrator-level function. This can allow people to quickly and painlessly correct known-bad TUs.

We’re considering including the ability to search through, browse and directly edit the TUs in the TMS browser environment in a future release. While I accept that this is a lacking feature, I wouldn’t concur that we should be putting such power in the hands of the vendor or the freelancer. Seeing the TM as a hugely valuable asset for the enterprise, I expect this is the sort of feature and capability that most enterprises would want to keep in-house.

On the integration with SDL MultiTerm, I very much see a distinction at the moment where TMS is a consumer of Terminology and MultiTerm is the owner of it. Over time we will see much tighter integrations between the SDL products, so the lines between TMS and MultiTerm will very much start to blur, and we have plans to introduce workflow capabilities for term lifecycle management.

On the issue of uploads and downloads and working offline, I think a lot of people would very much agree with you. The single largest corporate user of SDL TMS is… SDL! We have dozens of translation offices around the globe, all of whom deal with the upload and download of files to and from TMS servers based in our hosting centre in London on a daily basis. What tends to drive people offline is the featureset available in the desktop tools. SDL Trados Studio, and its predecessors SDL Trados TagEditor and SDLX, are very powerful productivity tools for the translator. Replicating these features in an online translation environment is a monumental task and it’s something we are investigating.”

First of all, I would like to thank Paul for this input. Since the ramp-up of Trados Studio over a year ago, SDL have made a sustained effort to listen to their user base. The TMS team proves here that they are keeping to this policy.

On the topic of TM Maintenance, which is very close to my heart, I think the business model Paul is presenting is either slightly outdated or, more likely, is missing on a part of their customer base.

From my experience, the outsourcing model has developed so much during this recession that at least in some cases, big enterprises (i.e. the TMS customers) no longer employ Translation Memory management experts. These positions are filled by technicians employed by the LSPs.

Another point is that while TMS customers use Review routinely, they also cannot afford to review all the content they output. Most of the big players have either implemented or are looking into models which allow them to reduce their review costs for languages where the quality is considered stable. This means that TMs may be updated in TMS with content which hasn’t been reviewed, and consequently that linguists must regularly inspect the TMs and fix any inconsistencies in legacy TUs to prevent recurring errors.

I maintain that this task must be assigned to a linguist, and the best placed to do so is a senior Reviewer. By no means are all of these in-house.

Posted in SDL TMS, Translation Management Systems | Leave a Comment »

SDL TMS 2007 Service Pack 4: Love and Hate

Posted by Nick Peris on June 1, 2010

SDL TMS 2007 - Localisation workflow

I always find it challenging to get a fair idea of what Enterprise tools can do before making a purchase decision. There is so much involved in setting them up that even if a trial version is available, the efforts required to perform meaningful testing are prohibitive.

Many such applications do not come ready out-of-the-box and require extensive customisation before they can be tailored to fit a specific business model.

This is why many purchase decisions are executive decisions, based on ROI reports and presentations showing what the software does. A demo might be set up for you on a dedicated server by the sales person, and you’ll be left thinking “hmm… surely it’s not that simple”. This is also why 10 times out of 10, these pieces of software come with a Support package which lets you install regular and much-needed updates and bug fixes.

It doesn’t have to be this way!

If you have the opportunity, go knock on a few doors and try to find a company nearby which uses the software in a production environment. Contact them, ask to visit, get an independent demo. From my experience (not based on TMS that time), most people will be more than happy to tell you how much effort it took to set up, how many features still don’t work, but also how much their productivity has really increased and perhaps even how many of their employees have done a thesis on the subject! Bottom line: get real-life advice!

SDL TMS, or Translation Management System, is one such behemoth application. Trying to find independent information about TMS on the web is a challenge. In fact, even finding official information can prove frustrating. As for Special Interest Groups… those I found were for customers only. It seems it’s buy first, we’ll talk later.

So what’s the big deal exactly? Well I’ve been working with TMS 2007 for about a year now and I have a few things to report: some good, some not so good.

What it does well

Let’s start with positive thoughts.

TMS is a workflow tool, designed to connect a customer directly to its localisation vendors and all their armies of sub-vendors. It handles big volumes and short turnarounds really well, and is reasonably good at supporting your Translation Memory and Terminology Management needs. It also offers the reporting facilities necessary for all members of your localisation ecosystem to invoice each other, and you.

TMS automates part of the role of the middlemen, and is ideal for localisation consumers with a constant stream of translation, especially if it comes in the shape of numerous small projects.

Multiple alternative workflows can be set up, depending on vendor selection, TMs to leverage against, TMs to update, need for Linguistic Review etc. Once the correct workflow is selected at the job creation stage, you can be sure it will go through all the steps required. There is little or no human error possible, at least not in scheduling and assigning tasks to the right participant.
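
To make the idea concrete, here is a toy model of such a configuration. This is a purely conceptual sketch in Python and bears no relation to TMS’s actual internals or API: each workflow names the TMs to leverage and update, plus an ordered list of steps with the role each task is auto-assigned to. Picking the workflow at job creation then determines everything downstream.

    from dataclasses import dataclass, field

    @dataclass
    class Step:
        name: str   # e.g. "Translate", "Review", "TM Update"
        role: str   # participant the task is automatically assigned to

    @dataclass
    class Workflow:
        name: str
        leverage_tms: list = field(default_factory=list)  # TMs to pre-translate against
        update_tms: list = field(default_factory=list)    # TMs written back at the end
        steps: list = field(default_factory=list)

    # Two alternative configurations; the right one is picked at job creation.
    fr_with_review = Workflow(
        name="fr-FR with linguistic review",
        leverage_tms=["Main_fr-FR"], update_tms=["Main_fr-FR"],
        steps=[Step("Translate", "SLV"),
               Step("Review", "In-country reviewer"),
               Step("TM Update", "system")])

    fr_no_review = Workflow(
        name="fr-FR no review",
        leverage_tms=["Main_fr-FR"], update_tms=["Main_fr-FR"],
        steps=[Step("Translate", "SLV"),
               Step("TM Update", "system")])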

TM updates are handled automatically, literally seconds after the last human input in the workflow.

Where it lacks

So are all the vendors really gathering orderly around the assembly line and localising thereafter like a happy family?

Not exactly. There are a few snags.

My main grievance is TM Maintenance, or the lack of it. Because TMS automatically updates the Translation Memories at whatever stage of your workflow you tell it to, manual editing of the TMs has been neglected. A user can perform a Concordance search, but it is impossible to edit the Translation Units found. One cannot use TMS to fix inherited inconsistencies or any error found in legacy TUs.

This makes implementing global changes a very untidy task: one needs to connect to the TM Server (hosted by SDL in most cases) using SDLX 2007 Professional. This, to me, is total nonsense, and here is why:

  1. increasingly, the business model in Localisation is outsourcing.
  2. once localisation is outsourced to agencies, these subcontract Single Language Vendors, who themselves might only be sub-contracting to freelancers.
  3. fewer and fewer Localisation consumers employ in-house linguists.
  4. their remaining in-country staff is Sales and Marketing, and has much more pressing matters to attend to than editing TMs.

Now which version are these freelancers more likely to have? SDLX 2007 Professional (€2,995) or SDLX 2007 Freelance (€760)? I think you probably guessed it. SDL’s licensing model prevents linguists from maintaining TMs in TMS and seemingly forces corporations which bought TMS to support their outsourcing setup, to fix TMs in-house!

There are some workarounds to this, but for a piece of software of this caliber, I think this is a pretty shocking limitation.

The integration with MultiTerm has similar issues: only some of the functionality is available through TMS; the rest, including editing Term entries, has to be done using MultiTerm Online or Desktop.

Performance issues also tend to drive a lot of linguists offline! Depending on their setup, a lot of them find it more efficient to download jobs, translate offline in SDLX and upload the finished work back into TMS. While there is technically no difference in the end result, this is a disappointing interruption of the workflow.

Service Pack 4: An End to the Suffering?

Squeezing under the gate at the last second, like Bruce Willis in a classic movie, TMS 2007 Service Pack 4 sneaks in before the long-awaited SDL TMS 2010 and comes to the rescue.

With TMS 2010 now possibly slipping into 2011, it is a welcome addition, particularly due to the improvements it brings. Here are the most significant end-user facing features:

Browser support: IE 8 support added (IE 6 support to be removed in a future release).

TM import: ITD, zipped ITDs, MDB (SDLX TMs). This is a partial solution to the lack of a TM Maintenance feature I’ve talked about in this article.

The continued lack of support for TMX is attributed to the fact that this open standard is implemented with too many proprietary variations.

Reporting formats added: CSV, Excel 2007, PDF, RTF, Word 2007.

Branding and Fonts are customisable (by Professional Services).

TMS 2010 is expected to have end-user customisable reports.

Segment level QA Model for Reviewer grading

QA Models

This all-new feature in SP4 is crucial if your workflow includes Linguistic Review. All changes made by Reviewers are now recorded, and Reviewers can tag them using customisable Error Ratings and Categories (a toy scoring example follows the list below).

  1. Error Ratings and Categories: support for LISA model, SAE J2450, TMS classic out-of-the-box.
  2. User-specific models can be created. Number of points deducted can also be specified in the QA Model.
  3. Records can be retained at segment (for feedback to translators) or project level
  4. Scoring methods: absolute or percentage
  5. To apply a QA Model: add it to a Configuration (i.e. workflow), and it will be available to Reviewers working on jobs passed through this config.
  6. Reviewer usage: click the Star at segment level to open the QA Model window and enter Category and Rating. Pass/Fail status does not prevent the reviewer from submitting or rejecting a job.
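
To see how such a model turns reviewer tags into a grade, here is a toy scoring function. The severity weights, word count and pass threshold below are illustrative only, not the actual LISA, SAE J2450 or TMS defaults:

    # Toy QA scoring in the spirit of segment-level QA models.
    SEVERITY_POINTS = {"minor": 1, "major": 5, "critical": 10}  # illustrative weights

    def qa_score(errors, word_count, pass_threshold=99.0):
        """errors: (category, severity) tuples recorded by the reviewer.
        Returns a percentage score and a Pass/Fail status."""
        penalty = sum(SEVERITY_POINTS[severity] for _category, severity in errors)
        score = max(0.0, 100.0 * (1 - penalty / word_count))  # 'percentage' method
        return score, ("Pass" if score >= pass_threshold else "Fail")

    errors = [("Terminology", "minor"), ("Accuracy", "major")]
    score, status = qa_score(errors, word_count=1500)
    print(f"{score:.2f}% -> {status}")   # 99.60% -> Pass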

Posted in News, SDL TMS, Translation Management Systems | 5 Comments »

memoQ 4: Interview with István Lengyel

Posted by Nick Peris on December 22, 2009

I have been trying to diversify the topics we cover on LocLoc, and especially the tools we talk about. It started recently with a QA tool and now continues with a CAT tool. I already know from the survey I’ve had on this page that a lot of you are familiar with Kilgray’s memoQ. This is a preview of what to expect from the forthcoming memoQ 4, from the mouth of Kilgray’s COO, István Lengyel.

[Nick Peris] Hi István, could you introduce Kilgray and your role within the company?

[István Lengyel] Hi Nick! Thanks for inviting me to do this interview. Kilgray Translation Technologies is an independent company dedicated to the development of clean and innovative tools for translation, but so far we are by far the best known for our memoQ translation environment. Though we are based in Hungary and all the founders are Hungarians, we became quite an international team in the last two years, opening up in Germany, Poland and now in the US. It’s really great to work in this team, as we have people coming from all sorts of companies such as Idiom, Passolo, SDL Trados, etc., and every addition to the team opens up new perspectives and shows new approaches – the company culture builds on respect and cooperation.

I am one of the architects of memoQ and also the chief operating officer at Kilgray, though in reality I’m mostly managing our sales and marketing team and our international expansion.

[Nick] Could you give a general overview of what memoQ is for readers who are not familiar with it?

[István] memoQ is an integrated translation environment that has a couple of focal points. First, it is easy to use, easy to learn. Second, we translate a lot in it and manage memoQ’s localization in memoQ itself, so we developed an eye for details – there are lots of smaller features that really make life easier. Third, from the very beginning we were concentrating on collaboration, and even the first version included an internet-enabled TM/TB server. Fourth, we don’t believe that we should lock in any of our customers – the entire system supports interoperability between tools to the maximum extent, meaning that you can process files prepared by virtually any major translation tool, and you can also prepare files for processing in other tools. There’s also a full set of documented APIs available for integration with other tools. Fifth, leverage, which means that we are trying to make the most of your resources. There were a couple of things where memoQ pioneered: we were the first to introduce real-time previews that change as you type, we were the first to introduce communication such as knowledge bases and instant messaging and offline synchronization into a translation memory server, we were the first to introduce the translation memory-based segmentation where pre-translation emulates the way your translators join and split segments, and we were the first to introduce the automated concordancing. But quite frankly, we are just as happy to take over things that work from other tools as we are to introduce new stuff.

[Nick] I know you are preparing to release a new version; could you give us a release date for memoQ 4?

[István] A few days ago we named January 31, 2010 for the release date, but I was reminded that it’s a weekend. So the first week of February. (Well, who cares about weekends? :))

[Nick] What are the main changes from memoQ 3.5 and main reasons to upgrade?

[István] There are so many changes that I can hardly list them! memoQ 4 is the first memoQ version that really focuses on project management. We like to build bottom-up and believe that an organization will only have a good experience deploying a tool if the translators like it, and we spent the last five years making the translators happy. So let’s start with the revolutionary feature: post-translation statistics. Imagine a situation where several people are working on the same set of similar documents, using a server-based translation memory. There can be a lot of fuzzy matches coming from the other translator’s translated entries, but so far there was no way in any tool to enumerate these matches, because the person who starts working later gets more matches than the person who is the first to start. memoQ 4.0’s post-translation statistics will solve this Gordian knot, and give you the actual fuzzy match analysis for every translator after the project. This way finally there is a business model for server-based translation.

Other than this, the biggest change is that we have upgraded the concept of translation memory servers to the concept of resource servers. So far you could share translation memories, term bases and documents between translators, and you could set up projects for them centrally. In the new version, you can share every other resource such as auto-translatables (for people used to Trados lingo: customizable placeables), non-translatables, segmentation rules, QA settings, keyboard shortcut settings, ignore lists for the spell checker and so on – 12 of them, all together. What’s more, sharing this happens in the background so you can start the publication of a big TM on the server and go on managing other projects in the meantime. These resources can all be exported into an XML-based format so clever project managers can prepare them also automatically.

memoQ 4 also brings finally the concept of multilingual projects. You can create handoff packages and receive delivery packages, or you can simply publish a project on the server. Those who receive the handoff package can in turn create new handoff packages (handy for a multi-tier enterprise-MLV-SLV-translator setup), and through delivery the files and reports are updated automatically. The handoff packages are just zipped containers of open-source format data – XLIFF for documents, TMX for TMs and CSV for terminology. You can process the packages in any tool, so the users are not locked in.

Compared to these improvements, the brand new text editor, the completely revamped user interface and the streamlined quality assurance seem small. Even the previous version of memoQ got quite a lot of credit for its good support of bidirectional and CCJK languages; memoQ 4 takes this further and also introduces support for Indic languages. We are introducing a very advanced multi-tier undo/redo logic, real-time spell checking and other minor improvements. The quality assurance checks have been dramatically improved, and the interface for fixing warnings has been fine-tuned.

And I failed to mention so many things! memoQ 4 is the single biggest upgrade memoQ ever received.

[Nick] For non-memoQ users, could you give us the main reasons to switch to memoQ 4?

[István] Because other people do and they are happy about it! 🙂 Just like every company, we make mistakes at times but there has not been any single case that anybody asked for a refund. Seriously, I think the main reasons to switch to memoQ are collaboration, interoperability and support. memoQ is a truly collaborative application, it is one of the few tools that enable simultaneous translation and proofreading on the same document, complete configuration of projects for your translators, or using several translation memories or term bases that can be local, remote — they can even be on different servers — or offline synchronized. The server is fast even on a HSDPA connection and it’s also very affordable – no wonder we have over 150 servers out there.

The other important aspect is interoperability. Our main market is language service providers, and an LSP can never say that they use only a single tool, period, otherwise they lose business and what’s more, they can also lose translators. With memoQ you can process documents and packages created by other tools, and you can prepare packages in industry-standard formats for other tools too. Therefore you don’t find yourself in a situation that you bought the tool because you liked it and then you have to fight with everyone around you to make it accepted.

And the third most important aspect is support. I think Kilgray’s support is just great – fast, focused and friendly.

[Nick] What is the pricing structure for memoQ 4?
What are the different Editions of memoQ 4?

[István] memoQ 4 comes in three client editions: translator standard, translator pro and project manager.

memoQ translator standard is for those translators who never work in teams. It does not enable access to servers and does not enable export of files into XLIFF or bilingual DOC, only memoQ’s proprietary MBD format. It also lacks the ContexTM (101%) matching which takes the context also into account, and comes without support. But the price tag is attractive: 99 euros a year.

The memoQ translator pro is the edition for professional translators and very small translation companies who don’t want to invest into a server solution. It costs 620 euros.

The memoQ project management edition comes with multilingual project management and reporting functionality and we charge around a thousand euros for that.

When it comes to server technology, we sell our solution with mobile (ELM or floating) licenses, meaning that companies can give away and take back licenses to translators over the internet. The initial package contains five mobile licenses, and we sell additional bundles of five licenses at very competitive prices. When it comes to servers, we prefer not to sell without a trial period of 30 days – we want everybody to use the tool, not just buy it for the drawer.

[Nick] How did you take into consideration user feedback during the development of memoQ 4?

[István] Oh I could name the people who contributed with their user feedback here! I think it’s worth mentioning how we work. Basically there are four people who decide on what gets into the next release, and every release has a theme. These themes are contained in our 5-year roadmap and we regularly come together for things that we call “walk in the woods” – creative sessions outside the office where we discuss the main ideas and concepts. We personally talk a lot with users and try to learn the rationale behind their feature requests. These talks shape the main themes/features a lot. On top of that, we have a system to archive all the threads on feature requests, and we go through these regularly. I could give you a rather precise list of features for the next three versions!

So basically the user feedback is taken into consideration on two levels: when we realize that a business problem is hard to solve with memoQ, we incorporate the solution into the high-level concepts. The other level is the feature level where for example users request amendments to file filters or suggest small usability improvements. If these are justified, these can go straight into the feature overview.

[Nick] How is Terminology Management undertaken in memoQ 4? What are the Termbase formats supported?

[István] Terminology management is one of the most controversial components in memoQ! So far we only support CSV and – surprise-surprise – TMX as import formats and can also export into Multiterm XML. Why TMX? Just think about software localization and then the help and you’ll understand. With memoQ we decided that this is a translation tool and not a terminology application, and therefore we gave a finite set of attributes but something that is pretty comprehensive: you can have synonyms, definitions, notes, grammatical information, contexts, project, domain, subject, client information, and a few other fields. You can also have images in the term base, and forbidden term variants can also be flagged. From the workflow point of view, memoQ has had a term base moderation feature since v2.0 in 2006, which means that terminologists may need to approve all terms suggested by translators before they become final. Terminology matching is really exciting: you can use wildcards to indicate the end of the invariable part of every word in a term, i.e. for a language like Spanish you can enter cinturón* de seguridad and that will also find cinturónes de seguridad. For translators of Slavic languages this is really crucial (fuzzy matching does not always work for terms). I can list quite a few pros for memoQ’s terminology management but I must say that it’s a very practical approach. However, we understand that corporate terminology management is not a subset of translation, and terminologists may need some more freedom.

Expect that freedom in a third-party tool based on the memoQ engine soon.

[Nick] Is there anything specific to memoQ in the way Translation Memories are created and maintained?

[István] Translation memories are by default context-enabled in memoQ, and memoQ supports two kinds of contexts: the segment before and after and context bound to structural information. This latter means that if you have for example the software strings in an XML or Excel file, with an attribute indicating where the text appears, you will get a 101% match if the attribute is the same to the attribute where you originally entered this translation – this way you can shuffle the translatable strings and still keep the context information. If you speak the Idiom lingo, this is very similar to ICE and SPICE matching.

As for maintenance, there are a couple of things that are quite unique. First, a 100% or 101% match for us is only a match that is identical both in content and formatting to the original. But we have a special bracket, 95-99% that contains segments where numbers, formatting, whitespaces, punctuation marks can be different. Any change in the text results in something lower than that. You can join and split segments wherever you want, and when you get an update to the document, the TM-driven segmentation will automatically join and split the segments according to your previous translation, as it looks into the translation memory for better matches through joining and splitting. During pre-translation, cases where you get multiple 100% matches (because you translated the segment differently in two contexts, and this third context is unknown so far) are flagged and they are very easy to locate. All these features fall under the umbrella term we use for design: “reproducibility”. I think it’s also worth mentioning that memoQ has a built-in TM editor and can work with as many TMs at a time as you wish. Oh, and yes, a minor nuance, just to make things elegant and please those who are really tech-savvy: our support for TMX also covers attributes, so if you import a TMX file coming from another tool that has attributes, even if the TMX attributes there cannot be displayed in memoQ, you can expect that the TMX export from memoQ will preserve and contain them – so memoQ does not swallow the information that it cannot process.

[Nick] Is there any new feature in memoQ 4 you are particularly fond of or proud of? Maybe some anecdote about features which took a lot of effort to achieve and which you are now very happy to bring to memoQ 4 users?

[István] Well, I’m a person who prefers the big picture to the small details, and for me the biggest achievement – and a big praise goes to Gábor Ugray, our head of development who designed these features – is that the tool did not get more complicated for translators according to the feedback of those users whom we showed the system. We always pay a lot of attention to the user interface, but when we started conceptualizing memoQ 4 about two years ago, keeping its simplicity seemed like a daunting task. The visual marker of the entire resource management and multilingual project management feature is now just two drop-down lists: the server selector and the language selector. And I am of course proud of the fact that the resource concept makes the entire system future-proof – no matter what sort of a linguistic resource comes into existence in the next years, we’ve got a place for it, and savvy users are also welcome to write third-party resource managers.

[Nick] We are seeing a merging trend where tools are less specific to either software or documentation. This is partly due to the evolution of content types, and partly to an effort by tool developers to become more all-encompassing. How does memoQ fit into this? How is your support for software localisation? Also XML and XLIFF?

[István] I saw this very much in 2005 when we started off but I don’t see it that much anymore. About a year ago or so we implemented visual localization support for RESX files and quite a few users are using it, but we have no plans to implement visual localization for other formats such as RC or binary files. On the other hand there are quite a few considerations in memoQ that make it a very good tool for localizing Help content. I already mentioned the TMX import into the term base and the support for context based on another column in the Excel file or an attribute in the XML file, I’d like to mention the automated concordancing feature that was inspired by one of our translation jobs – in our earlier lives as translators – where TM management (another issue I could talk about for hours) was virtually non-existent. I don’t want to name the end-client and the LSP we got this from (they are both very reputable and well-known in localization), but basically to translate the help of version 8 of a well-known application we only got a TM that contained version 2 to 7 of the same application. No terminology, no localized software strings for version 8, nothing. We spent hours to find out what screen caption has been translated before and what expressions did we have to coin, because – as it is with software – quite a few of them were 8-10 words long, and of course developers make changes to these every now and then, changing one or two words maximum, adding a few words to the end, etc. The automated concordance automates this manual process: it automatically gives you the longest multiword expressions that appear at least a given number of times in the translation memory. It does not give you the translation in most cases, but if you select it, it opens the concordance window with the right expressions. And yes, the concordance can look for a series of words. So basically we don’t want to take away business from the excellent software localization tools, but we definitely want to be the best technology for translating help and manuals.

[Nick] Do memoQ and Kilgray offer workflow technology allowing supplier and clients in the localisation chain to work together online?

[István] Our workflow is a linguistic one, and not a highly structured one. We coined two terms. For us, horizontal workflow means when people work together on the same task. Vertical workflow is the traditional workflow, passing along the files between different people doing different jobs. memoQ is excellent in helping people work together on the same task and has a lot of workflow tools such as moderated term bases, simultaneous translation and proofreading, different forms of review, communication and knowledge bases, etc. From the point of view of traditional workflows, we only cover translation and review – items that happen within the tool. There’s no way to integrate things like source text review, DTP or settlements into memoQ. However, the extensive set of APIs enable integration with workflow tools, and at this point I have to mention that both Beetext Flow and Plunet Business Manager do a great job when it comes to deep integration. They can both take care of the entire process, and generate and maintain the projects automatically in memoQ. One of the things we are putting a lot of emphasis on nowadays is client review. I think memoQ is one of the best tools for this, but there is still a lot of room for improvement.

[Nick] Could you say a few words about the memoQ support network? How can new users avail of the experience of other users and if necessary receive support from Kilgray directly?

[István] Here are a couple of interesting resources: http://rc.kilgray.com – the Resource Center that contains training videos, guides, filter configurations for XML-based file formats, but also interesting articles on general topics such as TM management, technology purchase pitfalls, etc. for people and companies not using memoQ.

The memoQ Yahoo! Group (http://tech.groups.yahoo.com/group/MemoQ/) offers the expertise of other users but we also contribute often, and hey, you have the best experts of the competition also there and they often contribute too.

There is a memoQ wikibook too, and the forums on proz.com and other sites can also be interesting.

If direct support is required, it’s primarily through our support email address – please don’t publish the address directly on your website, we don’t want more spam there, but it’s at kilgray.com.

[Nick] Is it too early to ask you about roadmap? What are your plans for memoQ?

[István] It’s not too early at all, but I’m afraid I can’t tell much about the big improvements at this point. One thing is for sure – after 4.0, we will relax a bit and iron out any rough edges that may have remained in this brand new tool. One of the things that many users asked for and will be there in 4.1 (or whatever the final version number will be) is the bilingual DOC table format for review with comments. But one thing is for sure, you can expect another major version with a huge new resource in 2010.

[Nick] This has been a very informative interview. I thank you for your time and detailed answers and look forward to reviewing memoQ4 in the new year!

Posted in Interviews, Kilgray, memoQ | 3 Comments »

Transcreation: Translation with Super-Powers!

Posted by Nick Peris on December 1, 2009

Transcreation is another concept which could easily be mistaken for a buzzword. In reality, it refers to vast areas of translation which have forever been adapting content rather than simply translating it.
Like “Localisation” itself however, it seems to have been appropriated and reinvented by the Information Technology industry (2). So what do we mean by it, and do methodologies really differ enough to warrant the use of this term?

Origins of the Concept

If you’ve grown up in an environment where English wasn’t the first language, chances are you have been exposed to transcreated content from a very young age. It may have been through entertainment, television, or advertising; most likely all of the above.

I never knew, nor did it matter to me, that Musclor was not He-Man’s real name. A more famous example of very liberal marketing translation is the story behind the Mitsubishi Pajero’s alternative name in Spanish-speaking countries. I’m also pretty sure that Smurf is not a literal schtroumpfation for Schtroumpf. Spider-Man: India seems a successful example of a multi-national company truly embracing a local culture.

This phenomenon does not only relate to the “Americanisation” of western-culture or even to the intense globalization of this century. Research (3) has shown that forms of Transcreation have been used in Indian poetry and religious writing, where form and content have always been adapted to some of the many cultures and languages of India.

Therein lies the key to Transcreation, in my opinion: recognising the need to become part of a local culture rather than simply communicate in its language.
While translators always aim to reach out to their audience, the software industry often binds them to the demands of technical content. Transcreation in its modern sense signals the release of these bonds, and gives the explicit brief to stray from the source message in favour of a better way to communicate the same idea to the target audience.

Videogames Localisation

The term Transcreation is often attributed to Carmen Mangiron and Minako O’Hagan (1). They were among the first to use it in the context of IT, more precisely of the gaming industry.

They recognised the fact that with most games developed in Japan or the U.S., yet targeting truly global markets, there was an inherent need to free translators from the source text in order to better connect with local gamers everywhere. In fact, even some of the functionalities of games are sometimes adapted to the local culture: the amount of violence, explicit language etc. is not only changed to meet age ratings, but in some cases to actually comply with the cultural and legal requirements of different regions of the world.

Countries such as Germany have laws which regulate video game content, and manufacturers are faced with the choice of adapting their games or not having them commercialised there.

Advertising, Copywriting and SEO

The localisation of advertising, or copywriting, is an area where the idea of Transcreation is also very apt.
While in a lot of cases translators are not copywriters themselves, they are given instructions to be creative with their work. Rather than just delivering the meaning in a grammatically correct manner, they have the task of also delivering it in a form which creates the same reaction in the potential customer.

SEO (Search Engine Optimisation) copywriting and translation are a further extension of this, where the translator even has to select the words in a very strategic manner. SEO is of course more than just selecting keywords, but even this part of optimisation has to be translated in ways which achieve the best search engine rankings in the target languages, not the source.

Measuring Quality

But is all this really that progressive an idea? Aren’t all translators always trying to come up with the best possible translation anyway?

Things get complicated when you try to measure or monitor the quality of translations where the translators have been asked to stray from the source in order to convey a marketing campaign’s message in the best possible way.

This becomes a highly subjective exercise where, chiefly, the client is right.

Here comes the next hurdle: localisation clients rarely have marketing staff in all the countries they market to. So vendors have to come up with processes which ensure that the product delivered meets those sometimes subjective requirements. This, in my mind, can only be achieved through a durable relationship between the clients and their translators/reviewers. Processes must transcend the limitations of the outsourcing model and recreate the fuzzy feeling of enlightened ownership once only common to the now endangered species of the in-house translator.

Such is the challenge of Transcreation: creative translation requires creative quality management.

References:

(1) Game Localisation: Unleashing Imagination with ‘Restricted’ Translation
Carmen Mangiron and Minako O’Hagan, Dublin City University, Ireland

(2) On the Translation of Video Games
Miguel Bernal Merino, Roehampton University, London

(3) Elena Di Giovanni “Translations, Transcreations and Transrepresentations of India in the Italian Media” (2008), in Klaus Kaindl and Riitta Oittinen (eds), The Verbal, the Visual, the Translator, special issue of META, 53: l. Les Presses de l’Université de Montreal, pp. 26-43.

Many thanks to Carmen for the tips.

Posted in Globalization, Transcreation | Tagged: , , , , , , , , , , , , | 13 Comments »

QA Distiller 7: Sanity Checks on Steroids

Posted by Nick Peris on November 17, 2009

QA Distiller is a great quality-control tool I came across when I was working on the Marketing project I already mentioned in an article about XML in Localisation.
Developed and distributed by Yamagata Europe, this tool has a lot to offer to client-side engineers, multilingual vendors and freelancers alike. In fact, I was even using it to enforce proper and consistent use of Terminology in source marketing content before sending it for localisation.

With the impending release of version 7 at the end of this month, I thought it was the perfect opportunity to talk about it on LocLoc. The purpose of QA Distiller is to batch process quality checks on bilingual files. Essentially, it performs similar tasks to the QA Checker in Trados’s TagEditor, but with some major differences.

The benefits

Multiple file processing: QA Distiller allows you to run a highly customizable list of checks on batches of files. There is no need to open each individual TTX file or run the QA Checker successively on each one. Just select the files to process and the settings to apply, and run the tool to output a comprehensive report for your follow-up. This is a great way to control and enforce consistency across entire handoffs or projects. Translation quality, Terminology consistency etc. are simultaneously audited across all the files selected.

Multi-lingual processing: better yet, this can also be done across all languages at once, which is particularly powerful for checking that Do Not Translate instructions have been adhered to, for example.

Interactive reporting: the report output is another great selling point. It rates and classifies errors, and lets you update it as you review and fix or discard candidate errors. It can be exported to a variety of formats where source, target and error details are summarised and categorised. This is very helpful to communicate with vendors on queries, as well as to measure the quality of deliveries. Finally, the report has hyperlinks not only to the file, but to the actual segment where the potential error was detected. This makes the implementation of fixes really quick and easy. No more straining your eyes to find typos, no more endless finger-cramping Ctrl+F sessions. If there is an error, QA Distiller will get you right there!
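
To give a feel for the kind of check being automated here (this is emphatically not QA Distiller’s implementation), the sketch below scans a batch of bilingual files for source segments translated in more than one way and prints a report pointing back to the offending files. It uses TMX as a stand-in for the bilingual formats, with made-up file names and language codes:

    import xml.etree.ElementTree as ET
    from collections import defaultdict
    from pathlib import Path

    XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"
    seen = defaultdict(set)   # source text -> {(file name, target text)}

    for path in Path("handoff").glob("*.tmx"):
        for tu in ET.parse(path).getroot().iter("tu"):
            segs = {}
            for tuv in tu.iter("tuv"):
                seg = tuv.find("seg")
                if seg is not None:
                    segs[tuv.get(XML_LANG)] = "".join(seg.itertext())
            if "en-US" in segs and "de-DE" in segs:
                seen[segs["en-US"]].add((path.name, segs["de-DE"]))

    for source, hits in seen.items():
        if len({target for _file, target in hits}) > 1:
            print(f"Inconsistent translations for: {source!r}")
            for file_name, target in sorted(hits):
                print(f"  {file_name}: {target}")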

Software stability: my experience (version 6 on Windows XP) has shown very solid performance and compatibility, and certainly far fewer crashes than SDL’s QA Checker.

Some rare shortcomings


One of the limitations I found in the current version was that the Translation Consistency check did not work when running QA Distiller across several languages. Instead of reading the language code of each file and filtering the comparison, it reported the fact that translations differed from one language to the next. Not particularly helpful.

Secondly, although the pricing structure offers good choice, the full version seems a bit steep at €1000, especially since it also requires Trados to function on TTX files.

Additional Technical Information

QA Distiller supports all languages and a variety of file formats: TRADOStag documents (TTX), FrameMaker RTF (STF) and Translation Memory eXchange (TMX).
Terminology can be checked against proprietary-format dictionaries (DICT) or the industry-standard TermBase eXchange (TBX).

The upcoming version 7 introduces:

  • Tag and ID-aware terminology checks
  • New Wrench icon functionalities: batch correction of quotation-mark and number-formatting issues
  • Fine-grained ignore option for improved noise filtering
  • Tag and case-independent consistency check
  • Full support for Georgian, Malay (Rumi and Jawi), Serbian (Latin and Cyrillic)

The little green man also told me that there are plans to add support for the many different XLIFF flavours like SDL XLIFF, MemoQ XLIFF, WorldServer XLIFF by the first quarter of next year.

For more details, check the cool demo at http://www.qa-distiller.com/movie/

Posted in QA Distiller, Quality Management | 8 Comments »

SDL Trados Studio 2009: The Compatibility Questions

Posted by Nick Peris on July 21, 2009

One by one, SDL continue to address obstacles to our upgrade decisions. Earlier today, one of their webinars tackled the critical topic of Compatibility in the Translation Supply Chain. A recording will be available at www.sdl.com, but here is a quick summary for your convenience.

Alignment

This feature is not included in the recent release of SDL Trados Studio 2009. It is planned as an upcoming update, but until then SDL Trados WinAlign or SDL Align from the Trados 2007 Suite must be installed to perform alignment work.
Once the alignment is performed in Trados 2007, export to Translator’s Workbench TXT, and import into a Workbench or SDL Maintain TM respectively.
That TM can then be upgraded to a Trados Studio 2009 TM (.sdltm). The export can also be imported directly into an sdltm but with implications regarding TM settings (see section below).

Translation Memories

Upgrading old TMs

Old TMs and bilingual files can be upgraded (File and Server-based supported).
TMX can be imported directly into sdltm, but if imported into tmw or mdb first, the TM settings can be imported into the sdltm.
Third-party TMX files may not support this path fully, as they may contain settings specific to the CAT tool used to create them.
The Upgrade Translation Memories wizard in Trados Studio 2009 can batch process various TM formats for various language pairs simultaneously. A Custom option lets you rename the TM output files if required. Segmentation rules can also be imported. Translation Units can be filtered out by field (e.g. you can choose not to include TUs tagged as “Not approved”).
You can also choose to output as many TMs as you input, or merge same language pairs independently of their TM formats.
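
Outside the wizard, the same kind of field filtering can be approximated on a TMX export before import. Here is a rough sketch; the property name and value are hypothetical, so match them to whatever your TM export actually contains:

    import xml.etree.ElementTree as ET

    def filter_tmx(src_path, dst_path, prop_type="x-status", excluded="Not approved"):
        """Copy a TMX file, dropping TUs whose <prop> marks them as excluded."""
        tree = ET.parse(src_path)
        body = tree.getroot().find("body")
        for tu in list(body.findall("tu")):
            for prop in tu.findall("prop"):
                if prop.get("type") == prop_type and (prop.text or "").strip() == excluded:
                    body.remove(tu)
                    break
        tree.write(dst_path, encoding="utf-8", xml_declaration=True)

    filter_tmx("legacy_export.tmx", "upgrade_ready.tmx")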

Publishing new TMs to Trados 2007 users

Exporting to TMX ensures vendors and colleagues who are using Trados 2007 can use your TMs. But this removes support for features like Context Match, because information on the previous segment, style etc. is only carried in sdltm.

Creating new TMs from old bilingual files

Importing TTX or ITD directly into sdltm (batch) allows Context information to be added (this process however will not carry over any TM settings). It’s a case of either/or.

Termbase compatibility

MultiTerm 2007 Termbases can be opened directly and automatically upgraded by MultiTerm Desktop 2009 (it’s a one-time operation, obviously).
Termbases from earlier versions of MultiTerm have to be converted into XML first, using SDL MultiTerm Convert, before they’re imported into a MultiTerm Desktop 2009 Termbase (.sdltb).
Selecting Catalog > Export allows you to export to XML using the Default Export Definition (the resulting file can be used by a user with MultiTerm 2007).

Bilingual files

TTX and ITD can be opened directly. They will be converted to sdlxliff (e.g. filename.doc.ttx.sdlxliff).
At the end of the project they can be saved back to TTX or ITD using Save Target As: select TRADOStag Document (rather than Original) to save as TTX (requires SDL Trados 2007 Suite).

Tag settings files

SDLX and Trados settings files can be upgraded. To do so, go to Tools > Options > File Types, select the file type and click Copy to duplicate the default settings file in Trados Studio 2009. Browse to the save location and move your new settings file up or down the list to set its priority against the original one. Then import the legacy settings into the new File type setting you created.

TMS and MTO

Compatibility with SDL TMS will be implemented as soon as integration is available. The date is unconfirmed, and SDL advise sticking to SDL Trados 2007 Suite TM, Bilingual and TermBase formats for the time being.

Compatibility with MultiTerm Online will only occur with the upcoming release of MultiTerm Server 2009.

Posted in Beginner's Guide, SDL Trados, SDL Trados Studio 2009 | 1 Comment »

Crowdsourcing in Localisation: Next Step or Major Faux Pas?

Posted by Nick Peris on June 23, 2009

Crowdsourcing: Together Everyone Achieves More

As the Information Technology industry continues to evolve, so does the Localisation industry. Often in reaction to the former, the evolution of the latter is always the response to a specific need, supported by either advances in technology, processes or both.

Crowdsourcing, far from being only a buzz word, is a tangible trend born of the so-called Web 2.0 era. It has shown signs of spilling over into Localisation for some time and the first stages of this process have been somewhat less than successful. While user-generated content, web-based applications and social networking products/websites are flourishing, crowdsourcing seems to consistently yield controversy.

So what makes Web 2.0 hip and Crowdsourcing, especially in Localisation, decidedly uncool? It is partly the age-old debate on whether the internet should be used for mercantile purposes. But it is also the very nature of Localisation and our struggle to get recognized as an integral part of the Product Development Life Cycle. We are, despite our best efforts, still seen as an unfortunate cost which gets in the way of product-to-market efforts.

Some definitions

Web 2.0 was once an empty buzz word for whatever comes next. “It is quite simply today’s internet (…) the one you and I use every day,” said a member of the French parliament early this year (2009!), only weeks before he was expected to become State Secretary for the Digital Economy! Also used and abused as a fresh marketing slogan, Web 2.0 seems to have now gained respectability as a description of the combination of Rich Internet Applications (RIAs) and user-generated content. Importantly, ideas reminiscent of the Open Internet ethos and a stronger sense of community also feature in most definitions of Web 2.0.

Crowdsourcing describes the act of outsourcing a task to an undefined, generally large group of people. It also carries the idea of bypassing the professionals in favor of a strength-in-numbers effort.

Localisation 2.0 is a newer concept yet, partly championed by one specific LSP, which attempts to describe current trends in Localisation tools and processes, designed to respond to the exponential rate at which localisable content is generated in the Web 2.0 paradigm.

Wikipedia: a success story

The free encyclopedia that anyone can edit was created to “distribute a free encyclopedia of the highest possible quality to every single person on the planet in their own language”. Launched in January 2001 by Jimmy Wales and Larry Sanger, it has 265 localised editions with a total of over 13 million articles.

The recipe is simple: Wikipedia is a non-profit, non-ad-supported site, where users can publish their own articles and add to or correct existing ones. Articles often differ from one language to the next, so Wikipedia is a true example of an internationalised rather than just translated website. For example, the article about Wikipedia contains a statistics table by language in its French version which does not appear in the English version.

The model of Wikipedia creates a community with a feeling of shared-ownership and allows it to get the most out of its user-base without ever appearing to be exploiting anyone. This flavor of user-generated content, of which Wikipedia is only one example, should probably not be called crowdsourcing at all, although I put it to you that it may be the only viable way to use a “crowd” as a resource: for its own interest!

Online Translators: the first signs of trouble

Most everyone uses an online dictionary. Everyone who uses a bilingual online dictionary thinks they’re great. Once you double-check your results, and are familiar enough with the languages to navigate your way through synonyms, grammatical rules etc, they do the job. From that point of view, they are no different from their paper ancestors. Just a little more… portable.

But already a line was crossed with online translators: they created the illusion that linguistic skills are no longer required. They created the possibility for non-linguists to type a sentence in their source language and output a “translation”. While this may well be useful to a qualified translator as a reference, it should never be used to replace a translator.

An esteemed colleague of mine, well versed in internet searches and other smart ways to get what he wants, recently contacted me to translate “Plastical Surgery at Home” into French (I never asked why and never will…). Out of simple curiosity, I typed it into an online translator and received the suggestion “Plastical Chirurgie à l’Accueil”. This not only differed greatly from the translation I was about to suggest, it also gave me a good example of why it just doesn’t work. Because of a small error in the source text, the online translator reverted to guessing a word-by-word translation and used “Accueil”, which is an IT translation for “Home”. The suggested target translation really means that someone is offering to surgically alter your appearance behind the receptionist’s desk. Not very inviting.

Every time I ask someone “Which Translation Memory system do you use?” and they reply “Google Translate” or “Babel Fish” etc., it gives me the shivers!

Facebook: crossing the Rubicon

Facebook has been the center of one or two controversies of late, and its localisation strategy could easily have become one. Whether it was taken out of focus by other issues, such as facebook’s Terms of Use changes, or whether it was a smart and creative move, remains debatable.

Facebook is available in 63 languages, which is considerably more than their main competitor MySpace. Upcoming languages are expected to be Persian, Arabic, Hebrew, Syriac, Urdu, Yiddish and Divehi. It seems clear that the collaborative and benevolent effort behind this allowed faster localisation and opened it up to an array of languages which would most likely not have been deemed economically viable to localise the traditional way. And this is an important point: one of the challenges in localising Web 2.0 is keeping up with the exponentially increasing rate of content creation and the growing expectation for localised products. With the number of languages spoken in the world estimated in the thousands, how could anyone claim to have a Global strategy and only localise their product into FIGS or even L17?

The methodology employed by Facebook also seems to hold up. A web-based application (Facebook Translations) is provided, and a staged plan is rolled out, beginning with Glossary Translation, continuing with Strings Translation, and including post-release Error Reporting and New Features Translation. Community votes decide between alternative translations and consistency checks are run. This doesn’t sound all that unprofessional.

But the fact remains: having asked their users to translate the Facebook UI for free, Facebook is now deriving new users, and therefore new advertising revenue, from work which was donated not to them but to the Facebook community.

LinkedIn: crossing the line

Attempting to emulate projects such as Facebook’s, it would seem LinkedIn have managed to create a pretty big stir before they even got started. By all accounts, a survey was circulated to LinkedIn members who are translators, and offended most of them by the wording of its enquiries regarding alternative compensation for translation work.

The survey has now been closed, but some results have been published by Nico Posner, the project manager responsible for LinkedIn’s internationalization efforts. The fact of the matter is that thousands of responses came through, and only a minority selected the category Other, which was the only outlet for translators who considered that the only suitable compensation was direct remuneration.

So what does that tell us? The professional translator community is not amused, and this survey is not a PR stunt LinkedIn will be looking to duplicate. However, even through the controversy and the claims of bias in the way questions were asked, there is still substantial interest in collaborative and benevolent efforts within the linguistic community. The question now is how to liberate this potential in an ethically acceptable fashion.

Google Translate Toolkit

The Google Translator Toolkit is a newcomer, still in beta at the time of writing. It is a free, web-based translation application which uses Machine Translation and includes TM (.tmx) and Terminology (.csv) management tools. In their own words, it is an attempt to bring the human touch back into Machine Translation.
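
For readers who have never looked inside one, the TMX files such tools exchange are plain XML. Here is a minimal sketch, in Python, of building a tiny two-language TMX 1.4 file of the kind the Toolkit can import; the sample segments are invented, and the glossary .csv format is simpler still (one source term, target term and optional description per row).

    # Build a minimal two-language TMX 1.4 file. Element and attribute
    # names follow the TMX specification; the segments are made up.
    import xml.etree.ElementTree as ET

    tmx = ET.Element("tmx", version="1.4")
    ET.SubElement(tmx, "header", {
        "creationtool": "demo", "creationtoolversion": "1.0",
        "datatype": "plaintext", "segtype": "sentence",
        "adminlang": "en", "srclang": "en", "o-tmf": "none"})
    body = ET.SubElement(tmx, "body")
    tu = ET.SubElement(body, "tu")
    for lang, text in [("en", "Open the file."), ("fr", "Ouvrez le fichier.")]:
        tuv = ET.SubElement(tu, "tuv", {"xml:lang": lang})
        ET.SubElement(tuv, "seg").text = text

    ET.ElementTree(tmx).write("sample.tmx", encoding="utf-8",
                              xml_declaration=True)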

So does it work? This tool appears to bring the facebook model one step further in the right direction: it is not designed to help translate Google for free. It is designed to help amateur and professional translators alike to collaborate, share resources, and use a TM and Terminology enabled tool for free.

While it is not comparable to the more powerful native CAT tools, it does offer a viable solution: the TM sharing potential is huge, the built-in collaborative tools are the right idea, and the limited file format compatibility remains functional (export to TMX, create Terminology Databases without expensive tools etc.).

Google Translation Toolkit
But there is always a catch: in this case, the fact that Machine Translation remains Machine Translation. The screencap included here shows the raw output from English into French of one of our articles. It wouldn’t take long for a French translator to recognize the tortured prose which time and time again comes out of such systems. If quality rather than quantity is a concern in a translation job, and if the content to translate is in any way wordy, I find it hard to believe that a translator would do better work righting such blurb than they would translating in a TM + Terminology environment!

As a parting note, I will not provide any pearl of wisdom. First, because the wheels are still in motion and we’ll only fully understand what is happening to the Localisation industry once it has happened. Second, because I would like to end by inviting you to translate this article into a language of your choice and email it to LocalizationLocalisation@gmail.com, including an SAE if you would like to receive a limited edition Localization, Localisation pen.
Pen

Posted in Crowdsourcing, Globalization | 16 Comments »

SDL Trados Studio 2009: Installation Guide

Posted by Nick Peris on June 9, 2009

SDL Trados Studio 2009 now available!

As previously announced, SDL released the new version of Trados late last week. To support the launch, the SDL Trados Support team have published some information about installation which I’m relaying here.

System Requirements

Pentium IV Dual Core
2GB RAM
Large screen resolution (1280×1024)
Windows XP SP2, Windows Vista SP1 (32-bit only for both)

(more details on the SDL Support site)

Preparation

  1. Return your SDL Trados 2007 Suite Freelance (not Professional) activation code from the License Manager (view Activated License, select it, click Return License). This is a prerequisite before the Trados Studio 2009 upgrade becomes available from your SDL account (due to the upgrade discount).
  2. Uninstall any beta/RC version (some files may need to be removed manually).
  3. Uninstall any previous version of MultiTerm.
  4. Ensure you have installed SDL Trados 2007 Suite (version 8.2.863 or 8.3.863). This is a prerequisite to allow TM upgrade and support for TTX and ITD in SDL Trados Studio 2009. A full and permanent version of Trados 2007 Suite is available from your SDL account if you have purchased or upgraded to SDL Trados Studio 2009. More info on this is available on the SDL Knowledge Base and ProZ.

Installation

  1. Run the SDL Trados Studio 2009 installation (the required .NET 3.5 platform is included).
  2. After completion, the SDL Product Activation wizard appears. It lets the user select between Buy, Purchased and Trial, and then the license type (incl. License Server configuration).
  3. Run the SDL MultiTerm 2009 Desktop installation (bundled with all versions of Studio 2009 – SDL MultiTerm 2009 Extract is sold separately).

Configuration

At first run you can select a Profile depending on your keyboard shortcut preference:

  • Default: for new users
  • SDL Trados: for users of previous Trados versions
  • SDLX: for users of previous SDLX versions

The rest of the configuration is automatic, and subsequent starts will be faster.

Notes

WinAlign 2009 has not been released yet. It will be provided free of charge to Studio 2009 license holders, who in the meantime can use the version bundled with Trados 2007 Suite.

The License Server has not been updated; the existing version works with Trados Studio 2009.

AutoSuggest is included with every license of Studio 2009, but creating dictionaries requires a paid add-on (currently bundled free of charge for a limited time).

Posted in Beginner's Guide, SDL Trados, SDL Trados Studio 2009 | Leave a Comment »

Globalization – The importance of thinking globally

Posted by Patrick Wheeler on May 21, 2009

Crouching Tiger, Hidden Dragon…

In essence, Globalization (Internationalization in MS speak) is your Kung Fu. Bear with me, I have a point here, either that or this is a thinly veiled attempt on my part to get you to read further. 🙂

Globalization represents more than just an all-embracing term used to describe the sub-processes of Internationalization and Localization; it is in fact both an ethos and a strategy that describes how your organization needs to position and prepare every facet of its being.

Those familiar with Chinese martial arts or who have spent too much time watching Kung Fu movies will understand the fundamental difference between the Tiger fighting style and the Dragon fighting style. The Tiger style relies on sheer strength and the memorization of moves, whereas the Dragon style is based on the principle of a deeper understanding of movement. It’s about anticipating more than simply acting upon and reacting to events.

Staying on the fortune cookie philosophy theme, if you adopt the Tiger approach to Globalization you may make all the right moves, correctly identify your target global markets, prepare and push forward with Internationalization of your product with vigour and determination, and skilfully and swiftly execute product localization, but even this is not sufficient if you want to ensure your business is ready to go global and prepared for the effects of going global.

You need to adopt the Dragon style. In addition to the above actions, you should seek a deeper understanding of the impact that these actions will have on your business, and anticipate this reaction. After all, every action has an equal and opposite reaction. Once you have decided to go global with your software offerings, you will have to consider how this decision will subsequently impact all areas of your business, such as Programme/Project Management, Development & QA, Sales & Marketing, Legal, Accounting, Distribution, Support, etc.

Thinking out loud – So who does what?

Product Management: will need to coordinate with all groups to ensure that localized releases are part of any global product roadmap and are approved by and communicated to all stakeholders.

All global product release schedules need to recognize that the Development and QA teams will have to work in “harmony” with Localization Engineering and QA, and therefore core Development and QA time and resources will have to be allocated to addressing I18n, Customizability and Localizability issues.

Failure to factor these tasks into any global project scope will mean that a simship will be impossible; Developers and QA alike will be frustrated by having to allocate additional, unplanned time to deal with I18n defects, Localization will be stalled until defects affecting Localizability and Customizability are addressed, and regional sales channels will suffer from late availability of localized product.

Development & QA: As mentioned above, these core groups, usually charged with domestic software releases, will now need to work in sync with their Localization counterparts. In practice (a toy pseudo-localization sketch follows this list):

  • the frequency and format of handoffs to the Localization team need to be agreed;
  • I18n exit criteria will need to be established for the design and development phases;
  • pseudo-localized software builds will need to be created for I18n testing;
  • code freeze dates will need to be agreed to allow for the extra volume of I18n defects that will be logged during I18n/L10n testing;
  • the workflow and management of I18n defects through the core defect tracking system will need to be established;
  • core Development and QA resources will need to be allocated to resolving and regressing I18n, Localizability, and Customizability defects.
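
To make the pseudo-localization point concrete, here is a toy sketch of the technique; real tooling is more sophisticated, and the substitution table and markers below are purely illustrative.

    # Toy pseudo-localization: accented look-alikes expose hard-coded
    # strings, padding simulates text expansion, and brackets make
    # truncation easy to spot. Illustrative only.
    ACCENTED = str.maketrans("aeiouAEIOU", "àéîöüÀÉÎÖÜ")

    def pseudo_localize(text: str, expansion: float = 0.3) -> str:
        accented = text.translate(ACCENTED)
        padding = "~" * max(1, int(len(text) * expansion))
        return f"[{accented}{padding}]"

    print(pseudo_localize("Save changes?"))  # [Sàvé chàngés?~~~]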

The Localization team will mainly be focussed on addressing L10n issues, so the majority of I18n and Localizability issues will need to be resolved by the core Development team.

Even prior to Internationalization, it is essential that those at senior levels within an organisation understand the impact of going global on their core Development and QA teams.

As highlighted in my first post, assuming that the creation of localized software releases is the sole responsibility of a single Localization team is imprudent and unrealistic. Globalization means a significant investment in core Development and QA time and resources and cannot happen in isolation of these groups or without their involvement.

Sales and Marketing: Sales and Marketing teams responsible for the target regions need to be made aware of strategic plans regarding localized releases. Often these groups will be the ones who identified the business case/requirement for a localized software release.

Regional Sales and Marketing teams will have an insight into the features that are important to their markets and any customer issues with in-market localized product that need addressing as a matter of priority for subsequent releases. They will also be able to advise on any region specific customization of software features that will be required. These customizations will need to be considered during design and development under the heading of “Customizability”. Furthermore, it is important for Programme Management to work closely with these teams when formulating the localised product roadmap, ensuring they are involved in any beta program review of the software and they have sign-off as part of the localized product review process. This may all seem fairly obvious and simply requires clear lines of communication, but I have often witnessed a certain disconnect between regional offices and global Programme Management.

The following excerpt from Beyond Borders – Web Globalization Strategies by John Yunker (2003) is a good example of how poor communication and planning within an organization can ensure a rather embarrassing false start on the journey to global domination:

“The marketing director of a professional society wanted to expand the subscriber base in other countries. The society already had many international members, but because none of the publications had been translated, members needed at least a moderate grasp of English to reap the benefits of joining. So the marketing director decided to translate the society’s membership form into Chinese, in the hopes that it would make joining the society much easier for Chinese speakers and increase membership.

Within a few weeks, the society received its first completed Chinese form by fax. The membership director, unaware of what the marketing director had been up to, looked at this form, filled out in Chinese, and said, “What the hell am I supposed to do with this?” The membership director didn’t understand Chinese. No one on her staff understood Chinese. Even if someone on her staff did understand Chinese, their membership database didn’t accept Chinese characters.

So this person in China completed the membership form and subscribed to a couple of publications and the organization could do nothing about it. The professional society didn’t even know what publications were selected because the publication names were translated to Chinese – and they had no English template to compare it against. It may seem obvious that you shouldn’t create marketing materials in a language your company can’t support, yet companies that jump into global markets too fast frequently repeat this scenario.” (Yunker, 2003, p.82).

Branding and cultural customization are also important considerations that require input from regional Sales and Marketing groups. Some may favour regional branding and cultural customization over global branding with a universally consistent user-experience. This allows regional Sales and Marketing the flexibility to better connect with their target audience. It is all too easy to alienate your customers if they get the impression that your organization’s software products, website, support etc. were not developed with their region in mind. However, others would argue that allowing such distinct and unique branding, combined with a high level of customization on a region-by-region basis, simply serves to dilute global brand power, resulting in a confusing and inconsistent user-experience. Additionally, by allowing diverse and inconsistent localized content per region, the global management of this content can be troublesome and costly.

The whole area of cultural customization is vast and there is a lot of information as well as misinformation offered on this topic, and it can be hard to discern urban legend from truth. On the theme of colour and cultural significance of colour in the global marketplace, one publication I read recently would lead you to believe that red cars are illegal in Brazil and Ecuador because of the perception that they cause more accidents. This is in fact absolute bunkum. So approach cultural customization with caution and seek the guidance of local contacts.

Legal: There are a variety of laws governing software being sold in different regions of the world, many of these laws pertain to language and support for the official languages in these regions; such as the Toubon law in France, GB18030 certification for China, and the charter of the French Language in Quebec (Bill 101).

For translation of End-User License Agreements (EULAs) and software warranties, your organization will require the services of legal translators and a review of the EULAs by your in-country operations centres/partners to ensure compliance with local legislation.

Legal regulation of the sale of software worldwide is unlikely to become any more lenient. On the contrary, with proposals such as the EU’s two-year guarantee for software (games), which would allow users who are unhappy with “buggy” software to return their purchase, the situation will only become more complex. This is another reason why a well thought-out Globalization strategy combined with a strong focus on I18n is of paramount importance.

With poor I18n, your localized software will inevitably contain more functional and cosmetic defects than the source release, and that could be a real headache when faced with a future where customers are within their rights to simply ask for their money back on the basis of these defects and are not compelled to wait for a hotfix as may currently be the case under the terms of existing EULAs.

Accounting: Your accounting team must be ready to provide pricing in the local currencies of the regions your software is to be sold into. Accordingly, they will also need to be ready to accept payment in these currencies. Ensure you have a clear understanding of how royalties and revenues from localized software sales are distributed throughout your organization.

Distribution: You will of course need to consider your distribution channels, competition, and how you will physically deploy your localized software to your customers. For hosted solutions, automatic updates etc., existing data centres serving your domestic customers may not offer sufficient connectivity/speed to customers in other regions.

Support: Before you have localized software in-market, your organization will need to be ready to support these target markets. It is an all too common mistake to simply expect that this will somehow take care of itself and that existing support channels for domestic product will be sufficient. This is yet another way to disaffect the customers in new markets you’ve worked so hard to beguile with your digital wares.

You need to consider the mechanisms for localized support: knowledge base, email, phone etc. What level of support can your in-country operations centres/partners offer, if any? How are support issues with localized software escalated? Do your call centre representatives have the necessary language skills and knowledge of the localized software to handle calls/emails from all the regions you sell your software in? Do you have a Content Management System (CMS) behind your existing website/knowledge-base? Does the functionality of this CMS lend itself to the management of global content in multiple languages?

Once the knowledge-base route has been exhausted, there is a common preconception that it is a good idea to herd customers towards email support, like cows being shoved into a cattle crush, as opposed to presenting them with the option of phone support. This is based on the logic that email support is far more cost-effective than phone support. Whilst it makes sense to encourage customers to avail of email support over phone support, I do not believe it is a good idea to completely eliminate phone support as an option.

Many organizations prefer to remove any reference to phone support from their site. For me, this represents a false economy: whilst you may be saving on call centre costs, you will probably be losing customers, and any chance of repeat business. This is a particularly flawed strategy in new markets where you are fighting for market share.

I have yet to experience an email support system where I have received a (useful) answer “within 24 hours” as promised. Besides, 24 hours may be a long wait depending on the nature of the issue. Even if there is a customer cost associated with phone support, it is better to offer this as an option than to lose customers who may prefer to simply return your software (see “Legal” above) and align themselves with your competitors rather than wait for a delayed response from support.

What happened to Localization??

You may have noticed that I have made no mention of the Localization team’s or department’s specific responsibilities in terms of Globalization. This is a deliberate omission. I will address aspects of Localization in various future posts (after all, the URL for this blog puts me under some pressure to do so!). For now, however, it is more beneficial to recognize that in the grand scheme of Globalization, Localization is actually one of the simplest components. Granted, as “Localization” experts, we are in fact required to be “Globalization” experts and provide guidance in relation to Globalization strategies, but if all other areas of your business are ready to go global, then Localization should be the least of your worries.

Once again, failure to take a holistic approach to Globalization will result in Localization being a tedious, costly, and protracted affair. Localized product quality will suffer and inevitably your organization’s performance in the target region will be poor. Additionally you will have filled the lives of your Localization team with a degree of despair! So for the sake of good Karma, get the fundamentals right and Localization will be a walk in the park.

The above are just some of the areas for consideration when formulating your Globalization strategy. One could certainly write a book on the topic, and a number have been. Globalization is the broadest and most subjective area when it comes to looking at G11n, I18n, and L10n, and is therefore open to the most debate.

What color/colour is the sky in your world?

The Sapir–Whorf hypothesis (roughly) states that through the medium of language, different cultures attempt to define their reality and enforce a structure on the world as they view it. This results in certain perspectives that are unique to particular cultures; this is why Localization and Globalization extend beyond simple translation.

This probably also goes some way towards explaining why a Chinese friend and work colleague of mine finds a particular Rice Krispies Squares TV commercial so amusing, whilst I simply perceive it to be mind-numbingly boring. Or maybe I just don’t get it! Whatever the case may be, to be truly successful in a particular regional market, your organization will not only have to speak the language of that region, but also understand the predominant cultural perspectives distinct to that region.

The important thing is to have a carefully considered Globalization strategy that would make Lex Luthor seem nonchalant in his scheming, and to execute the plan in a decisive and coherent manner throughout the organization and without procrastination. Understanding that Globalization is the responsibility of your entire organization and must permeate through every level is a good first step.

This is particularly important in the current economic climate. Whilst many organizations are running home for shelter and scaling back on their global operations, this presents opportunities for other organizations to get traction in emerging markets if their Globalization strategy is sound. It may be a long term investment, but if your competition is busy running for cover, these recessionary times could represent an opportunity to gain market share in valuable new markets. As Warren Buffett said, “Be fearful when others are greedy, and be greedy when others are fearful.” In other words, advance when your competition is retreating from global markets.

In conclusion, you could of course try the Tiger approach and see what happens, but as another icon of our times (Homer Simpson) once said, “Trying Is the First Step towards Failure”. 🙂 So instead I urge you to think like the Dragon and have a deeper appreciation of how Globalization will impact your own organization and how your organization as a whole will need to evolve to meet these challenges.

Posted in Globalization, Internationalization | 1 Comment »

SDL @ Guinness: Trados Studio 2009 Q&A

Posted by Nick Peris on May 15, 2009

SDL Trados Studio 2009

The SDL Roadshow was in Dublin yesterday.

The “cream” of Ireland’s Localisation community was treated to a big day of product demos and slideshows at the home of the black stuff: the Guinness Storehouse.

As I made my way through Guinness town under a refreshing morning drizzle, I wondered for a minute how the pungent aromas of the early brewing activity would agree with the power breakfast I had had not so long ago.

This was soon to be forgotten however, thanks to a flying start to the proceedings provided by SDL’s Internal Training Manager, Tracey Byrne. Her performance was followed by a few other SDL presentations, as well as a case study on TMS by LSP partner VistaTEC. By the time we reached the Gravity Bar (it must have been 17:59) for some last minute networking opportunities, I think it’s safe to say we were all satisfied by a great event and a fine venue.

There was a lot of information provided throughout the course of the day and I will be releasing separate articles on SDL Passolo 2009 and SDL MultiTerm 2009 soon. What follows below is more directly related to SDL Trados Studio 2009, and what is new or adds to my Preview article. I’m presenting it in a Q&A structure which I hope will be practical to anyone looking for information on specific features, and an easy read for anyone wishing to go through it all. Sláinte!

What is the release date for SDL Trados Studio 2009?

The development cycle has reached Release Candidate stage and SDL are working towards an end of June release target.

Have the development team taken user feedback into account?

Yes: 80 ideas for Trados and 16 for MultiTerm are a reflection of user suggestions made on ideas.sdltrados.com.

Are TagEditor and Workbench gone?

Yes, Trados Studio combines aspects of SDLX and Trados into a fully integrated User Interface. Even MultiTerm, which still installs separately even though it is bundled with Trados Studio, now offers full functionality from within the Studio UI. SDLX, Workbench and TagEditor simply do not exist anymore.

What are the system requirements?

Here’s what SDL Marketing are saying on the subject of System Requirements:

“SDL Trados Studio supports Microsoft Windows XP and Windows Vista. As minimum requirements, we recommend a Pentium IV-based computer with 1 GB RAM and a screen resolution of 1280×1024. For optimum performance, we recommend 2 GB RAM and a more recent Pentium or compatible processor with a higher screen resolution.”

Please note that this is still subject to change until closer to the launch in June.

What is RevleX™?

It is a new XML-based TM engine. SDL Trados Studio 2009 uses new file formats for bilingual files (.sdlxliff), translation memories (.sdltm) and termbases (.sdltb). It brings together a slew of new features such as Context Matches, AutoPropagation, AutoSuggest™ and Multiple TM support.
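
Since .sdlxliff builds on the XLIFF standard, its translation units can be pictured with a plain XLIFF 1.2 reader. The sketch below is a rough assumption rather than a description of SDL's actual schema: it ignores the SDL-specific namespaces and metadata the real format adds on top of XLIFF.

    # List source/target pairs from a plain XLIFF 1.2 file.
    import xml.etree.ElementTree as ET

    NS = {"x": "urn:oasis:names:tc:xliff:document:1.2"}

    def read_pairs(path):
        root = ET.parse(path).getroot()
        for tu in root.iterfind(".//x:trans-unit", NS):
            yield (tu.findtext("x:source", default="", namespaces=NS),
                   tu.findtext("x:target", default="", namespaces=NS))

    for src, tgt in read_pairs("sample.xlf"):
        print(f"{src!r} -> {tgt!r}")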

How does AutoSuggest work?

AutoSuggest is an inline, predictive-text-like feature which provides suggestions from the TM, Termbase or dictionaries as you type. Suggestions appear in a context menu, with an icon clearly indicating whether they come from the TM, the Termbase etc. The user can customize the maximum number of entries offered. Suggestions start appearing from the first letter typed and keep updating until you select one or finish typing the word.
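
The underlying idea, though certainly not SDL's actual engine, can be illustrated with a simple prefix search over phrases harvested from a TM or termbase:

    # Toy predictive lookup: offer phrases sharing the typed prefix.
    import bisect

    phrases = sorted(["menu contextuel", "mise en page",
                      "mise à jour", "mémoire de traduction"])

    def suggest(prefix: str, limit: int = 3):
        i = bisect.bisect_left(phrases, prefix)
        out = []
        while (i < len(phrases) and phrases[i].startswith(prefix)
               and len(out) < limit):
            out.append(phrases[i])
            i += 1
        return out

    print(suggest("mi"))  # ['mise en page', 'mise à jour']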

Can you turn AutoSuggest off?

I’ve also heard this question about Alchemy Catalyst 8.0‘s ezType™. Perhaps from the same person? The answer is Yes (in both cases), but developers have spent brain cells trying to make these features work in a non-intrusive yet efficient manner, so you should probably give them a fair go!

Are Multi-lingual XML files supported?

Bilingual XLIFF will be supported, but there seems to be a question mark over multilingual files; SDL said they’d follow up with me once it’s clarified.

What are the improvements to format filters?

The main progress has been with PDF, XML, FrameMaker and InDesign.

How does the Upgrade TM functionality work?

Trados Studio will convert your old TM into the new format. In the current implementation, this requires the version used to create these TMs to also be present on the same machine. The alternative is to export the TM on the machine that has the old version and import the content back into a Trados 2009 TM. I was also told that this may yet change, and that they may be able to include the components of the old version required for TM conversion in Studio 2009. Watch this space!

How is navigating big files in Trados Studio’s Editor improved compared to TagEditor, or even Trados-aided Word?

The left panel in the UI lists the headers and lets you click them to jump to a particular area in the document.

How does the Editor’s Real-Time preview work?

You need to manually generate the preview once. It uses a built-in stylesheet to simulate the end result. This does not work on DTP file formats.

Can I lock segments in the Editor?

Yes. Context Matches (CM) are locked by default, but the PM can also manually lock other segments.

How is XLIFF supported?

Standard XLIFF files are directly supported. The new default format for Trados bilingual files is .sdlxliff, a proprietary format developed from XLIFF with additional functionality relating to RevleX™.

How does QuickPlace work?

To apply formatting, highlight the word or group of words in the target segment and press CTRL + comma. Choose the required formatting from the inline dropdown list. If there is more than one to apply in a segment, QuickPlace will try to guess which is most likely required and offer it at the top of the list. Alternatively, you can hold CTRL, highlight the formatted text in the source segment, and then highlight the text to be formatted in the target segment. The same applies to placeables such as figures, measurements etc.

Is there Real-Time verification in the Editor?

Yes. If an error is detected, an icon will appear in the notification area between the source and target segments. The error message can be viewed in the tooltip of this icon or in a dedicated message panel. In the case of a false positive, simply dismiss the warning.

Does Trados Studio 2009 support TTX files?

Yes for editing, no for creating.

Is cross-files AutoPropagate available?

No, not in the first release. But there is a workaround: merge all project files into one. Cross-file repetitions are also taken into account when creating a package if the “Recompute” option is selected.

Does the Merge feature support all file types?

Yes, files of different formats can be merged together. Once merged, they can still be viewed and worked on relatively independently.

What is new with Term recognition?

The Editor allows direct access to full MultiTerm functionality. Terms can be cross-referenced by ID, so if a term is edited, any other term previously linking to it for its definition remains linked.

What is the workflow in a scenario where not all participants in a project have upgraded to Trados Studio 2009?

If the Project Manager has upgraded, the translators, reviewers etc. will have to upgrade in order to use the TMs, open the bilingual files or use the Termbase. The Project Manager will be able to work with Trados 2007 files (creating a .ttx.sdlxliff) but not create them.

The only alternative is to provide TMX translation memories and not to pre-translate the deliverables.

Can the PM upload project packages through FTP using the Project panel in Trados Studio?

No. Project packages can only be emailed through Outlook. This is however optional, and FTP transfer can always be done manually once Trados Studio has created said packages.

Can you import customer details?

Yes, but only from Outlook.

Can multiple TMs be used in a project?

Yes, multiple TMs and Termbases are supported. A priority order between TMs can be set, and there is also an option to “Always use the best match”.

What’s new with fuzzy matches?

The fuzzy band values and their number are now fully customizable.

What reference material can be included into a package?

Packages can contain global TM settings, Termbases, AutoSuggest dictionaries etc.

Does Perfect Match still exist?

No, it is replaced with Context Match (CM), but it may be added back in a later release.

What does Create Package do?

  • creates a folder structure
  • creates a package per target language if the option is selected
  • lets the user define tasks for individual packages
  • recomputes wordcount or analysis for cross-file repetitions.

Are files locked for updates while packages are out for translation?

No. It would be a good suggestion for ideas.sdltrados.com, to mirror functionality found in SDL Passolo 2009.

What is the LSP partner program?

52 Language Service Providers have entered various levels of partnership with SDL. The objective is to create value for translation buyers, help LSPs become experts in translation technology, and promote training and support.

When will training for Trados Studio 2009 be available?

Training for SDL Passolo 2009 is available now. Courses (including upgrade courses) for Trados Studio 2009 will be available at launch. There will be a split between a Translator path and a PM path. There will also be a separate SDL MultiTerm 2009 course.

When will certification exams for Trados Studio 2009 be available?

End of September 2009.

Which training and certification path will be on offer?

For Translators:

  • Getting Started
  • Intermediate
  • Advanced
  • MultiTerm

For Project Managers:

  • SDL Trados Studio 2009 for Project Managers
  • SDL MultiTerm 2009 for Project Managers

Posted in SDL Trados, SDL Trados Studio 2009 | 5 Comments »

Alchemy Catalyst 8.0: Official Launch

Posted by Nick Peris on May 4, 2009

Alchemy Catalyst 8.0

On Friday, May 1st 2009, Alchemy Software Development officially launched a new iteration of their visual localisation tool and flagship product: Catalyst 8.0.

The event was held at the Alexander Hotel in Dublin, Ireland, minutes away from Alchemy’s HQ. On offer were a feature-highlights demo by Director of Engineering and Chief Architect Enda McDonnell, an informal meet-the-developers opportunity, and client case studies by representatives of Citrix, Creative and Symantec.

This article reports and comments on some of what was said and shown.

A Total Visual Localization™ solution

Created mostly as a software localisation tool, Catalyst has now clearly outgrown this limiting description. The trademark visual editing capabilities now cover most aspects of localised content publishing:

  • Help
  • Web sites
  • Software applications

Reaching out to translators

But Catalyst is sometimes still seen as an engineer’s tool. Alchemy are aware of this and have been listening to feedback from professional translators. The result is a translating environment which undeniably seems more linguist-friendly. There is a convergence with the interactive translation environment in Trados, which is only one part of a general strategy to increase translators’ productivity by lowering the time needed to get accustomed to various tools.
The New Translator Toolbar

  • Translator tool bar:
    • live validation: flagged with non-intrusive warning symbols
    • keywords: locking and validation for in-segment non translatables
    • internal tag management
    • multiple matches displayed
  • Switch to the industry-standard terminology exchange format, TBX (see the sketch after this list)
  • Supplementary Glossary for translators to populate their own reference material
  • An unlimited number of TMs and a web-based Machine Translation (MT) service ensure there is always a match
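
TBX is itself an XML dialect (ISO 30042). As a rough sketch of what consuming such a glossary involves, assuming the common martif > termEntry > langSet > tig layout of TBX-Basic (real-world files may use richer structures):

    # Pull language/term pairs out of a TBX-style glossary.
    import xml.etree.ElementTree as ET

    XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

    def read_termbase(path):
        for entry in ET.parse(path).getroot().iterfind(".//termEntry"):
            yield {ls.get(XML_LANG): ls.findtext(".//term", default="")
                   for ls in entry.iterfind("langSet")}

    for terms in read_termbase("glossary.tbx"):
        print(terms)  # e.g. {'en': 'splash screen', 'fr': 'écran de démarrage'}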

Changes to ezParse

In order to keep up with the long-served ambition of providing support for the latest file formats, changes have been made to Catalyst’s parsing tool.

  • WPF (baml): full compatibility, including visual editing of WPF forms and parsing out of .NET 3.0 objects
    A WPF Form in Catalyst 8.0
  • Conditional XML: you can now set the value of an element (or one of its attributes) to be localisable only if the value of another of its attributes indicates it should be treated as such (similar to functionality added to the settings file in Trados 2007); a sketch of the idea follows this list.
    Conditional XML
  • Multilingual XML: supported by reading the source segment in one element but storing the translation entered into another. While this is a very up-to-date feature, there seem to be some limitations in terms of process. Translators will only deal with one language pair, so post-translation engineering will involve leveraging from multiple partially translated TTKs back into the “Master” TTK before a fully multilingual file can be extracted. This should however be made easier by the updates made to Experts such as Leverage.
    Multilingual XML
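
The conditional-XML idea is easy to picture outside Catalyst. In the sketch below the element and attribute names are hypothetical; Catalyst expresses such rules through its ezParse settings rather than through code.

    # Treat <string> elements as localisable only when a sibling
    # attribute says so. Element/attribute names are hypothetical.
    import xml.etree.ElementTree as ET

    doc = ET.fromstring("""
    <resources>
      <string id="title" translate="yes">Welcome</string>
      <string id="css_class" translate="no">main-header</string>
    </resources>""")

    for el in doc.iterfind("string[@translate='yes']"):
        print(el.get("id"), "->", el.text)  # title -> Welcome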

Updates to the Experts
The Leverage/Update Expert

  • Programmable APIs (COM and Event) are provided to encourage client-developed automation. This was a strong theme across both the Alchemy presentation and most of the guest speakers’. It has been a feature of Catalyst for some time but is now emerging as the area where Catalyst gets ahead of the CAT pack.
  • Multiple TTKs, multiple languages and multiple TMs to leverage from, all at once: this sounds like great news and is the feature I personally look forward to the most.
  • Target folders can be set and original TTKs preserved (necessary to achieve the previous point).
  • The leverage algorithm has been improved to search for a 100% match in all the TMs provided before searching for fuzzy matches.

Cutting-edge Technology Thumbnails

  • Improved navigation: thumbnails for Forms, Dialogs, WPF, HTML, graphics… are the latest addition to the visual features.
  • Improved validation: live and programmable (API). Catalyst 8.0 comes with an updated list of validation tests and also offers the ability to create your own: custom .NET objects can be called by Catalyst during Validation, but also during file insertion, extraction etc.
  • Underlying technology upgrades make Catalyst future-ready: the compiler has been upgraded to Visual Studio 8, which is relevant both to Windows 7 compatibility and to a future 64-bit Catalyst.

Screen caps courtesy of Alchemy

Posted in Catalyst, News, Software Localisation | 4 Comments »

SDL Trados Studio 2009: Preview

Posted by Nick Peris on April 22, 2009

Start View
Have you, like me, been slow to adopt Synergy? Do you maybe find it a little cumbersome or incomplete? Would you rather just open Workbench and TagEditor and get on with it? Or perhaps do you (or your clients) still find it easier to use Trados in conjunction with Microsoft Word?

Well, this may be about to change!

Earlier today, SDL conducted one of their very informative Webex meetings to announce a new version of Trados: SDL Trados Studio 2009. The release is due in June 2009, although “Trados 2009” is still in the last stages of development, so some of the features might yet change slightly.

The short 1-hour webinar comprised 2 parts: a features highlight and a rapid but enlightening desktop-sharing software demo. Here is what I thought was worth bringing to your attention:

Feature highlights

Integration

This is actually quite attractive, and the reason why I brought up Synergy above. SDL seem to have come up with a truly integrated environment for editing, reviewing, terminology management, project management and all other aspects of Trados-related work. No need to open a TM in Workbench, load a TermBase, open a TTX in TagEditor, a document in Word, or turn the coffee machine on.

Productivity

  • New TM engine: the xml-based RevleX™. Among other things, it revives contextual match by liberating it from comparing old and new TTX files. Context Match works live, within any new document, and between files within a project.
  • AutoPropagation™ immediately translates repeated strings within a document once you have translated the first occurrence.
  • AutoPropagate

  • Searches can easily be run on both source and target segments.
  • Multiple TMs lookup is available.
  • AutoSuggest™: predictive text which leverages phrases, rather than only segments, from your TM as you type.
  • AutoSuggest

  • Real-Time Preview: check the final look as you translate, without navigating to a different tab. This seems very good news for those translators who find that Trados tends to disconnect sentences from the whole document, and leads translated documents to become a collection of sentences rather than a cohesive piece of work.
  • QuickPlace™: improves the handling of text formatting, tags, placeables and variables by providing it in-line.
  • DTP application support has been updated and PDF can now be edited directly.

Open platform

  • New XLIFF-based default format for bilingual files (.sdlxliff). Yes, this does mean the end of TTX files!
  • Improved TMX and TBX support.
  • Easy access to API for 3rd party applications.
  • Customisable User Interface (UI).

 

Software Demo

As I mentioned before, SDL Trados Studio 2009 builds on Synergy. The interface has the now familiar Visual Studio .net feel which we’ve seen in Synergy as well as other CAT tools.

From the point of view of a Trados user, as in a Workbench + TagEditor user, the integrated aspect really becomes more prominent and inevitable, but in a good way!

Tab views

Task History
As expected in a Visual Studio.net application, a number of tabs are available at the bottom left of the UI. Some are familiar, some not:

Project Status

  • Start: provides the general overview.
  • Projects: has new project status and Task History panels.
  • Files: navigation pane has My Tasks and Sent Tasks folders to promote standardised filing.
  • Reports: segment status.
  • Editor: contains the entire interactive translation environment (more in the dedicated section below).
  • TMs: preview, maintenance, update string, search from within the Trados Studio UI.

Editor

Editor

  • A document can be opened from the main UI by simply clicking Open Document. But there is also a Windows Explorer context-menu shortcut, which seems very efficient compared to opening Workbench, then TagEditor, as you would most likely do with your current version of Trados.
  • The Editor panel now has the TM + bilingual file + TermBase + previews all open at once.
  • Source and target segments appear in a very clear and tag-free left-right panel view. This immediately seemed much more welcoming than TagEditor.
  • Context Matches are flagged with a CM icon – not dependent on having a matching old TTX; this also works live within new documents.
  • Formatting can still be copied from source to target.
  • Placeables and terms are offered in context (in a dropdown, like predictive text). No need to use the arrow icons at the top of the UI (keyboard shortcuts still work).
  • AutoPropagate seamlessly pre-translates further occurrences of strings you have just translated. They are marked as Unconfirmed 100% (orange instead of green).
  • Term detected and added

  • Full terminology functionality is also integrated, including adding to termbase.
  • A Review mode allows filtering by match type (e.g. displaying only Unconfirmed 100% matches within a document for batch review and sign-off).
  • Editor can edit PDFs (but deliverable output isn’t PDF).

Project view (for PMs)

  • Project templates can be saved with a high level of customisation.
  • QA Checker is now in version 3.0.
  • TM options can be edited from here.
  • Dictionaries for AutoSuggest can be added.
  • Tasks can be assigned to users during project creation. This information is then included when packages (i.e. translation kits) are created.
  • Files can be merged, which creates a single .sdlxliff file out of potentially several file types.
  • Merged Files

  • Batch processing: TM tasks are processed simultaneously (analysis, pre-translate etc.)
  • Project package contents:
    Create Project Package

    • Can include Main (or Master) TM.
    • Can include an existing Project TM in a main package or create separate Project TMs if multiple packages (.sdlppx) are distributed.
    • Can link up with Outlook to send automatically populated handoff emails.
    • Email Handoff

  • TMs view:
    • Can search through source and target.
    • Can upgrade existing TM.
  • Requires all participants to be using Studio 2009

Posted in News, SDL Trados, SDL Trados Studio 2009 | 15 Comments »

SDL Trados 2007: Quick Guide for the Complete Beginner

Posted by Nick Peris on April 14, 2009

This is a quick practical guide which was used when setting up the team of in-house translators I mentioned in my earlier post about Using Trados in Knowledge Base translation.

Everything in here is fairly low-level and is really designed to help someone get started immediately with their first translation, reviewing or bug fixing job in Trados.

SDL Trados 2007 consists of 3 modules

  1. Workbench is used to access the Translation Memory (TM), a database of existing translated sentences.
  2. TagEditor is the editing tool, where the translation is done.
  3. MultiTerm is an add-on (installed separately) which may be running in the background. It checks the segment currently being translated for English words or groups of words which may have a pre-approved translation.

Getting started

  1. Copy the TTX files (or English source files if TTX weren’t provided) and TM (5 files per language) to a folder on your local hard disk.
  2. Open the TM in Trados Workbench: double-click the file with extension .tmw or open Workbench and browse to it from the File-Open menu.
  3. Open the TTX (or source file) in Trados TagEditor: open TagEditor and browse to it from the File-Open menu or double-click the file if it’s already associated with TagEditor.
  4. Place your cursor in the English segment of the Translation Unit (TU) you want to translate.
  5. Click Open/Get in the TagEditor tool bar.
  6. Edit the target segment of the TU (i.e. translate the part highlighted in yellow).
  7. Click Set/Close to save your changes to this TU into both the TM and TTX.
  8. Save and close the TTX once it is fully translated.
  9. Start at point 3. above with the next TTX or source file.

Working with placeables

Most Placeables are tags contained within segments. Here is how Trados can help the translator with placeables:

  1. Open/Get a TU.
  2. In Workbench, Placeables are underlined in blue (2 in the example below):
     Placeable in Workbench
  3. In TagEditor, put your cursor where the Placeable needs to be inserted into the target (yellow) area.
  4. Click Get Current Placeable.
  5. If there is more than one, use the Get Previous Placeable and Get Next Placeable buttons as required.

Working with terms

If MultiTerm is running in the background, Trados is able to detect Terms listed in a dictionary and suggest their approved translation. Here is how to use this feature:

  1. Open/Get a TU.
  2. In Workbench, Terms are over-lined in red (2 in this example):
     Term in Workbench
  3. In TagEditor, put your cursor where the Term needs to be inserted.
  4. Click Get Current Term.
  5. If there is more than one, use the Get Previous Term and Get Next Term buttons as required.

Tip: for more information on the Current Term, double-click the book icon beside the Term on the right-hand side of Workbench. This will open a MultiTerm window where you can see more details about the Term (e.g. definition, product category etc., depending on how the TermBase was set up), and browse the TermBase for other Terms.

Other useful buttons

  • Open: opens the TU in TagEditor without searching for a match in the TM.
  • Get Translation: downloads a translation from the TM into the TU opened in TagEditor.
  • Restore Source: removes the target segment (i.e. translation) from the opened TU.
  • Copy Source: copies the source segment (i.e. English) into the target segment of the opened TU.
  • Set/Close next Open/Get: uploads the translation from the current TU to the TM, closes the TU, opens the next TU and downloads any matching translation from the TM.
  • Translate to fuzzy: translates all sentences in an English file opened in TagEditor, until it comes across a sentence with a match of less than 100% against the opened TM.
  • Close: closes a TU, saving changes made to the TTX, but without uploading the new translation to the TM.
  • Concordance: searches for an English word selected in a TTX throughout all the sentences in the opened TM.

Troubleshooting tips

Open/Get button is grayed out

Using the Open/Get button in TagEditor requires a TM loaded in Workbench. Here is what to do if it’s grayed out:

  1. Ensure only one instance of Workbench is open.
  2. Ensure it has a TM open.
  3. If so, click the Connect to Workbench button in TagEditor.
  4. If the issue is still not solved, close TagEditor and re-open it.

TM won’t open in Workbench

Translation Memories are made up of 5 files per language and can only be opened one at a time. Here are the main errors that can occur when opening a TM (a quick pre-flight check script follows this list):

  • Couldn’t obtain database lock: you are probably trying to re-open a TM in a second instance of Workbench.
    Solution:

    1. Ensure only one instance of Workbench is open.
    2. Go to its File menu.
    3. Choose Open.
    4. Browse to the TM you were trying to open.
    5. If this doesn’t resolve the issue, the TM may be corrupted.
  • The system cannot find the file specified: one of the 5 files is missing.
    Solution: ensure the .iix and .tmw files are present in the location where you copied the TM.
  • Matrix Error: (null), data file: one of the 5 files is missing.
    Solution: ensure the .mdf and .mtf files are present in the same location as the .tmw you are opening.
  • Database corrupt! Run export, create a new TM and reimport: one of the 5 files could also be missing.
    Solution: ensure the .mwf file is present in the same location as the .tmw you are opening.
  • While no valid license file is used or no dongle is connected, this application runs in demo mode: no available license
    Solution: ensure your Trados license is activated.
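
Since most of these errors come down to one of the five companion files going missing, a pre-flight check is easy to script. A minimal sketch (the TM path is of course hypothetical):

    # Verify that the five TM companion files sit next to the .tmw.
    from pathlib import Path

    EXTENSIONS = [".tmw", ".iix", ".mdf", ".mtf", ".mwf"]

    def check_tm(tmw_path: str) -> list:
        base = Path(tmw_path)
        return [base.with_suffix(ext).name
                for ext in EXTENSIONS if not base.with_suffix(ext).exists()]

    missing = check_tm(r"C:\TMs\Project_FR.tmw")
    print("Missing files:", missing or "none - TM set looks complete")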

Posted in Beginner's Guide, SDL Trados, SDL Trados 2007 | 19 Comments »

Which comes first, Globalization or Internationalization?

Posted by Patrick Wheeler on April 8, 2009

In my previous blog entry, I covered the limitations of “Localization” as a generic label to describe what we in the software “Localization” industry continually strive to achieve under the headings of G11n, I18n and L10n, as well as the dangers of this branding in terms of how “Localization” can often be perceived as the sole responsibility of a single “Localization” group or department within an organization.

To add to the confusion, there are two separate and somewhat contradictory models used to describe the relationships between G11n, I18n, and L10n: Microsoft’s model, and the model predominantly used by the rest of the industry! 🙂 Naturally, you will also encounter subtle variations on both these models within various organizations.

So before examining G11n, I18n, and L10n in more detail, it’s probably useful to familiarize yourself with the key differences and similarities between these two models.

Microsoft’s Internationalization Model

The graphic below (Fig. 1) represents Microsoft’s “Internationalization” Model.   

Microsoft's Internationalization Model

The main thing to be aware of, and where this model is at odds with the model used elsewhere in the industry, is the terminology. In Microsoft’s model, the terms “Internationalization” and “Globalization” are swapped: “Internationalization” is seen as the overall, high-level process, and “Globalization” is a sub-process that deals with the development of a culture-independent/world-ready application.

N.B. There is some inconsistency in terminology within Microsoft’s own documentation and content; “Globalization” and “Internationalization” are sometimes interchanged depending on the target audience, author, time of day, weather, etc.

The “Industry Standard” Globalization Model

On the other hand, the rest of the industry typically refers to “Globalization” when talking about the overall process, and “Internationalization” when describing the development of a culture-independent/world-ready application. See the more commonly accepted, “Industry standard” Globalization Model below (Fig. 2). 

The “Industry Standard” Globalization Model

The irony of this inconsistent terminology won’t be lost on anyone working in Localization. 🙂

At first glance you may assume that Microsoft’s model (Fig.1) provides a more comprehensive description of the whole workflow, as there is more detail provided in the high-level model. This is not strictly the case. Whilst the more commonplace model used by the rest of the industry (Fig. 2) is typically only represented by three neat little Globalization, Internationalization, and Localization boxes, there will of course be more detail under each of these headings, but the level of detail/terminology will once again vary from organization to organization. For example, if we expand the model in Fig. 2 further, we would see something similar to the following workflow (Fig. 3) emerging:

Expanded "Globalization" Model

Expanded "Globalization" Model

In Fig. 3, I have placed “Localizability” and “Customizability” under “Internationalization”. In my opinion, these are just a few of the more significant component parts of Internationalization. If we were to expand the I18n process still further, we would see the addition of other major I18n considerations such as Unicode.
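
The membership-database anecdote quoted earlier is, at heart, a Unicode problem. A minimal demonstration of why it belongs under I18n (the name below is invented):

    # A legacy single-byte encoding cannot store a Chinese form entry.
    name = "王小明"  # hypothetical member name

    print(name.encode("utf-8"))  # b'\xe7\x8e\x8b\xe5\xb0\x8f\xe6\x98\x8e'
    try:
        name.encode("latin-1")   # the "non-Unicode database" scenario
    except UnicodeEncodeError as err:
        print("Rejected:", err)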

Resistance is (sometimes) Futile

There is no right or wrong model to adopt or champion within your organization. Essentially, both models describe the same overall process. However, it is useful to be aware of both models, especially if you have the misfortune of having to delve into Microsoft documentation relating to Internationalization or the Globalization namespace. Similarly, when talking to people from the Microsoft/.NET universe, I’ve found it can be easier to simply give up trying to stick to the more widely accepted G11n model and speak in Microsoft terms. Otherwise it can be rather like trying to convince the Borg there is an alternative to assimilation (I’m already sorry for that reference!) and you may find yourself viewed with the same skepticism as a zoologist who has just suggested polar bears and penguins could peacefully coexist. 🙂 Apologies to my ex-Microsoft colleagues, but you know it’s true! 🙂

In my next few posts (and as previously promised!), I will endeavor to work around the (at times) conflicting terminology and take a look at the commonality in what these process models are seeking to describe under the headings of Globalization, Internationalization, and Localization.

Posted in Globalization, Internationalization | 1 Comment »

XML in Localisation: What can it really do for us?

Posted by Nick Peris on April 8, 2009

Have you ever wondered how XML could possibly be relevant to our needs? Localising XML files is pretty much straightforward. But what of using XML to localise?
From English XML to Localised RTF, HTML, PDF ... and XML

As localisation professionals we’ve all known about XML for quite some time now. We understand that, as a Markup Language, it is closely related to HTML. We also know that it is Extensible, meaning that the tags and structure are user-defined. This gives us the picture of a very powerful and flexible language.

But I’m sure we have also all come across an XML-based document (a “.xml file”) which we have launched in our favourite browser, only to be treated to a pretty unattractive page of… XML code!

So what can that powerful and yet somewhat undefinable animal really do for us?

This article shows a practical example of XML technology applied to a specific localisation process. In doing so, it also illustrates some of the advantages of having a dedicated Localisation Team or Department, rather than allowing various departments in an organisation to manage their own localisation. In this case, a simple handover of responsibilities from a Marketing team to a Localisation team generated a major leap forward in process, efficiency and quality control. Here is how:

Original setup

In this organisation, the process for creating and localising marketing and web content was the following:

  • 1 master document – the product sheet – was created for each new product released.
  • The product sheet was localised into 13 languages.
  • Relevant sections were pasted individually into the website for each language.
  • Relevant sections were also pasted individually into a printable version, which was in turn converted to PDF for each language.
  • The localised .doc files were also circulated.

There were 2 major issues with this:

  1. Copying and pasting made the process extremely time-consuming and error-prone.
  2. No translation memory system was used, making leveraging impossible and leaving quality control of the localised content solely reliant on proofreaders.

Solution implemented

The Localisation team was handed responsibility for localising this content, mainly to free up Marketing resources. Rather than simply taking over, they identified opportunities for improvement and initiated an R&D effort in XML Single Source Publishing. The goal now was to automate as much of the process as possible, and to free up time within the agreed standard turnaround for systematic quality control.

The new process ended up as follows:

  • Product sheet created in XML by the authors, using the free WYSIWYG XML authoring tool Altova Authentic®.
  • The XML schema was designed to be compatible with the web content management system used to create localised product pages.
  • A Trados ini file was created to parse out all non-localisable content in the XML code.
  • XSL Transformation and Apache FOP were used to automatically generate all localised XML, HTML, RTF and PDF copies after post-translation processing in Trados (see the sketch after this list).
  • A VB Developer created a tool to manage all Altova StyleVision®-based automation from one single UI.
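To give a flavour of what the XSLT step involves, here is a minimal, self-contained sketch in Python using the third-party lxml library. The element names and content are invented for illustration and are not the actual schema used; it simply shows one localised XML product sheet being transformed into HTML. The same source file could equally be run through an XSL-FO stylesheet and Apache FOP to produce the PDF output.

    # Single Source Publishing in miniature: one localised XML product
    # sheet + one XSLT stylesheet = one HTML page. Assumes the third-party
    # lxml library (pip install lxml); element names are invented.
    from lxml import etree

    product_sheet = etree.XML(
        u'<productSheet lang="fr-FR">'
        u'<name>Exemple de produit</name>'
        u'<tagline>Une description localisée.</tagline>'
        u'</productSheet>'
    )

    stylesheet = etree.XML("""
    <xsl:stylesheet version="1.0"
        xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
      <xsl:template match="/productSheet">
        <html lang="{@lang}">
          <body>
            <h1><xsl:value-of select="name"/></h1>
            <p><xsl:value-of select="tagline"/></p>
          </body>
        </html>
      </xsl:template>
    </xsl:stylesheet>""")

    transform = etree.XSLT(stylesheet)
    print(str(transform(product_sheet)))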

Result

  • Complete XML product sheets were uploaded to the website for each language, rather than independent fields being copied and pasted (unfortunately, batch upload was not permitted by the web content management system).
  • The Internet team saved 75% on the time required for localised product webpages to go live.
  • Other content types were all published simultaneously.
  • Use of Translation Memories and pro-active Terminology Management cut cost and increased consistency.
  • Thorough quality checks were also processed in batch using QA Distiller™, which helped catch multiple terminology and value errors before publication.

The key to the success of this new setup, apart from the choice of XML itself, was the ability to revise the process from beginning to end. Because the Localisation team was allowed a say in the authoring process, efficiencies were generated across the whole span of Marketing and Web content creation, and XML Single Source Publishing was successfully implemented.

Posted in XSLT and FOP | 1 Comment »

Alchemy Catalyst Experts: Leverage vs. Update

Posted by Nick Peris on March 30, 2009

Leverage and Update Expert buttons

When I started using Catalyst, I felt Leverage Expert was more of a batch tool and Update Expert was only for small updates.

A little bit more planning taught me how this apparently simple choice can increase efficiency. According to Alchemy, Leverage maximises the reuse of existing translations, while Update is used to replace a small number of files, using Leverage in the background. The right choice of course depends on the type of handoff you are dealing with:

  • is it a new project or an update?
  • are there many files to update within each TTK?
  • are these files Win32 executables?
  • are the changes functional or do they impact a lot of localisable strings?

Let’s look into typical workflows to see how they best respond to our needs.

Leverage Expert

  • Create a TTK.
  • Import all the localisable source files.
  • Duplicate it once per target language.
  • Rename them using language codes.
  • Set each file’s target language.
  • Leverage from all the relevant repositories of previous translations.
  • Update the status of every string in the TTK to Signoff or For Review as required.

Update Expert

  • Copy previous version of each localised TTK.
  • Update the name by incrementing the version number (e.g. from __.ttk to __.ttk).
  • Update the application file(s) which have been changed in each localised TTK.
  • Use Leverage Expert if required to reuse translations from sources other than the previous TTK.
  • Update the status of the strings marked for review only.

When working with several target languages, each TTK containing a number of files, with regular updates, and provided you keep your string statuses tidy, I think Update Expert turns out to be more efficient in the majority of cases. I would estimate that if fewer than 10% of the files in a TTK need to be updated, there is a lot to gain in ensuring that every string which was signed off in the previous version does not have to be signed off, or even reviewed, again. It is also much easier to tell which files have been updated when using the Update Expert.
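Purely as an illustration, that rule of thumb boils down to something like this Python sketch of my own heuristic; the 10% threshold is my estimate above, not an Alchemy recommendation:

    # Encodes the rule of thumb discussed above. Illustrative only:
    # the 10% threshold is my own estimate, not an Alchemy figure.
    def pick_expert(files_changed, files_in_ttk, new_project=False):
        """Suggest which Catalyst Expert suits a given handoff."""
        if new_project:
            return "Leverage Expert"  # build new TTKs and batch-leverage
        if files_changed / float(files_in_ttk) < 0.10:
            return "Update Expert"    # small update: signed-off strings stay intact
        return "Leverage Expert"

    print(pick_expert(2, 40))   # -> Update Expert
    print(pick_expert(25, 40))  # -> Leverage Expert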

To be specific, by favouring the Update Expert you will save on:

  • Importing application files into the English TTK.
  • Setting target language in TTK and each application file.
  • Signing off untouched strings.

The icing on the cake would be, if anyone from Alchemy is reading this, to add one or more TM Sources to the leveraging that goes on in the background when running Update Expert:

Update Expert with TM Source (mockup)

Posted in Beginner's Guide, Catalyst, Software Localisation | 2 Comments »

SDL Trados 2007: License Server Setup

Posted by Nick Peris on March 27, 2009

Using Technical Support Advisors (TSAs) to produce Knowledge Base content is a logical choice: they are in-house native speakers with unbeatable product knowledge, who can produce source and localised articles at minimal additional cost if they can work around their other duties.

What can be overlooked in such a seemingly efficient setup is Translation Memory (TM) management. This case shows an example of such a setup being integrated with Trados TM technology.

Original setup

  • English articles were produced at a sustained pace by a team of dedicated technical writers.
  • TSAs were involved with their review and the creation of some English articles.
  • Translation would be undertaken by native speakers in TSA, when time allowed.
  • The percentage of translated articles was low and losing ground.
  • A lack of version tracking meant English articles might be updated several times before translation work started.
  • Reusing existing translations and updating existing articles was tedious, and sometimes led to the re-translation of entire sections or documents.
  • There was no terminology control and references to UI terms (e.g. OS or software strings, firmware messages etc.) were entirely ad hoc.

Solution implemented

  • The corporation set up a Trados Network License Server as part of the Trados 2007 update.
  • Thanks to the different time zones involved, a sufficient number of Trados licenses was available to equip the Technical Support translators.
  • Initial training and a reference manual were provided.
  • A Termbase was loaded into Workbench to provide integrated reference across content types.
  • Some WinAlign work was also done to start populating the Knowledge Base Master TMs before Trados-based translation even started.
  • An engineer was assigned to run TagEditor Verifiers and QA Distiller checks on the new translated content to help increase overall quality.

Result

  • The gap between the English and localised Knowledge Base narrowed, especially for the most viewed cases.
  • The quality and consistency of the articles increased.
  • The pace of translation increased.
  • The setup was used beyond its original scope, supporting updates to the parent corporation website.
  • Substantial costs were saved, and projects were delivered which would not have received the cost approval necessary for outsourcing.

One of the things which made this project a big success was its negligible cost. The investment was virtually nil, since the whole setup was based on better utilising existing resources. In such a scenario, the cost of a full-blown Global CMS would have been impossible to justify.

Posted in SDL Trados, SDL Trados 2007 | 2 Comments »

SDL Trados 2007: Translation Memory Strategies

Posted by Nick Peris on March 27, 2009

What is the best way to organise and maintain Translation Memories?

I currently maintain TMs using two features of Trados (Attributes and the Master/Project TM dichotomy) and Alchemy’s Trados component.

Master TMs

  • a single, exhaustive repository for each field and language pair (e.g. EN-FR Medical).
  • used to analyse all new projects and generate Project TMs.
  • the content of Project TMs is only added to it once the full project cycle has ended (including review, QA etc.); a sketch of this promotion step follows this list.
  • because of their exhaustive nature, Master TMs tend to grow rapidly and would not be practical for inclusion in translation kits.
  • even when outsourcing all or most of the localisation process, Master TMs should always be held by the client: they are a valuable asset which the client owns, regardless of whether TM Management itself is outsourced.
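Since Workbench can export and import TMX, that end-of-cycle promotion from Project TM to Master TM can even be scripted against TMX exports. Here is a minimal Python sketch; the file names are illustrative, and a real merge would also deduplicate and reconcile attributes:

    # Sketch: fold a finished Project TM (TMX export) back into the
    # Master TM once the full project cycle (review, QA etc.) has ended.
    # File names are illustrative; deduplication is omitted for brevity.
    import xml.etree.ElementTree as ET

    def merge_project_into_master(master_path, project_path, out_path):
        """Append all <tu> elements of the project TMX to the master TMX."""
        master = ET.parse(master_path)
        project = ET.parse(project_path)
        body = master.getroot().find("body")
        for tu in project.getroot().find("body").findall("tu"):
            body.append(tu)
        master.write(out_path, encoding="utf-8", xml_declaration=True)

    merge_project_into_master("EN-FR_Medical_Master.tmx",
                              "EN-FR_ProjectXXX.tmx",
                              "EN-FR_Medical_Master_updated.tmx")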

Project TMs

  • specific to a project or project stage (i.e. successive handoffs of the same project often have their own Project TMs).
  • used to pre-translate the handoff (i.e. generate the TTX files to send to the vendor).
  • passed on to translation vendors for analysis and use during interactive translation.
  • used during post-translation engineering (bugs are fixed in Workbench + TagEditor + MultiTerm interactive translation environment by the localisation engineer).

Software TMs

  • single and exhaustive repository for each field and language pair, generated bi-yearly from Catalyst TTKs.
  • added to Master TM of their field and/or used as Concordance reference during translation of help, documentation, knowledge base articles etc.
  • also used as leverage source for software through Catalyst.

Use of attributes

  • every time a new project is analysed, custom attributes are added and set (e.g. Vendor=AAA, Project=XXX, Field=FFF).
  • these can be used to filter searches and analyses (see the sketch after this list).
  • they are also useful for tracing errors back to their source, or for arbitrating between duplicate translations.
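Once a TM is exported to TMX, these attributes travel with each translation unit as <prop> elements, which makes this kind of filtering easy to script. A minimal Python sketch follows; the file name and prop type names are illustrative, and the exact prop types will depend on how the attribute fields were defined in Workbench:

    # Filter the translation units of an exported TMX file by a custom
    # attribute. Illustrative sketch: the file name and prop type names
    # are examples; actual prop types depend on the Workbench setup.
    import xml.etree.ElementTree as ET

    def filter_tus(tmx_path, attribute, value):
        """Yield <tu> elements whose <prop type=attribute> text equals value."""
        tree = ET.parse(tmx_path)
        for tu in tree.iter("tu"):
            for prop in tu.findall("prop"):
                if prop.get("type") == attribute and prop.text == value:
                    yield tu
                    break

    # e.g. pull everything translated for one project:
    for tu in filter_tus("EN-FR_Medical.tmx", "Project", "XXX"):
        print([seg.text for seg in tu.iter("seg")])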

Posted in SDL Trados, SDL Trados 2007 | 2 Comments »

Who’s responsible for Localization in your organization?

Posted by Patrick Wheeler on March 27, 2009

Who’s responsible for Localization in your organization?

Seems like a simple question with a simple answer, right? However, whether they are aware of it or not, most people use the term “Localization” when they may well be referring to areas under the broader headings of Globalization, Internationalization, Localization & Translation (GILT).

There are historical reasons for this anomaly, of course; once upon a time Localization was only considered an afterthought to product development and had no real place in the SDLC (Software Development Life Cycle). GILT is certainly a more accurate and all-encompassing acronym, but even as industry experts in “Localization” we do not typically embrace such broad terminology. Personally, I find GILT a somewhat clumsy and uncomfortable acronym. After all, who in an organization would want to say they work in GILT, or are Head of GILT? Even if we were to adopt the term within our organizations, I can foresee many blank stares when discussing GILT with those to whom the discipline is traditionally known as “Localization”. So naturally we default to “Localization” as an all-encompassing term, to avoid having to give every person we interact with a brief (and most probably unwelcome) history of the field.

The problem is that, by accepting our moniker of “Localization”, we are also endorsing the view that Localization is still just an afterthought to development, and is solely the responsibility of a single department within an organization. I still work as part of a Localization team, as Localization Engineering Manager. Some of you who work in the industry probably have a sign hanging over your little farm of desks that says “Localization”.

In my experience, this tends to result in those in senior management, in charge of strategic decision-making, and those in regional sales offices, believing that by having a Localization department, Localization is taken care of. It’s a black box. It’s possibly even viewed as a glorified term for translation. Consequently, should any issues arise with a Localized product, it’s clear to these groups where the responsibility lies.

So, in response to the initial question I posed: who’s responsible for Localization in your organization? The truth is that, in the broadest sense of the term, everyone at every level of your organization is responsible for Localization (if we take it that by “Localization” we are in fact referring to GILT).

Just because a Quality or Quality Assurance department exists within an organization, it does not mean that quality is the sole responsibility of that department and no longer a concern for the rest of the organization. Similarly, Localization, or more accurately Globalization, must be a discrete function of every individual within your organization. If not, there will be an inevitable adverse impact on Internationalization, and subsequently the quality of the localized end-product will suffer, as will sales in the target region for that localized product.

Each step within the Globalization, Internationalization, Localization chain will have an exponential impact on the next. If you don’t take your Globalization strategy seriously enough, then, in the absence of a firm mandate from the highest levels of your organization, Internationalization will suffer because there will be no development impetus to properly Internationalize your software. If the Internationalization effort is poor, Localization will be painful, perhaps even impossible within certain software features, and you will be looking at a lengthy delta between your domestic software release and your localized releases.

Conversely, if you start with a solid and coherent Globalization strategy that is communicated, in a relevant and contextual manner, to all levels within the organization, then Internationalization will be an integral part of the SDLC, Localization should be a straightforward, finite task, and you will be in a better position to achieve a Sim-Ship of domestic and localized software releases.

Some people may prefer to use the acronym GILT; some may prefer “glocalization”. For me, the answer to this conundrum, and to addressing people’s sometimes limited awareness of what Localization entails, does not lie in changing terms or in the invention of new terms and pseudo-techno-babble. It’s too late. The horse has bolted on that one. It would be comparable to Apple insisting that people stop using “iPod” as a brand name and adopt another title for their pre-existing portable media players. Instead, I believe the answer lies in educating all the relevant stakeholders within an organization on the importance of G11n, I18n and L10n, and how these relate to them and to various groups throughout the organization in terms of responsibilities.

So with this in mind, in upcoming posts I will take a look at the terms Globalization, Internationalization and Localization in more detail, their inter-dependent relationship, who owns what in terms of responsibilities, what they mean to your organization, and what you should know when endeavouring to sell software in a global marketplace.

Posted in Globalization, Internationalization | 5 Comments »