• On GameSpot: Wii Fit tells 10-year-old she's fat
August 4, 2008 9:40 AM PDT

Google's translation center: Language lessons for the Googlebot?

Posted by Stephen Shankland
  • Print

Updated 1:17 p.m. to correct that Google ranked first in the machine-translation accuracy evaluation. Updated 10:50 a.m. PDT with Google's no-comment.

Google looks set to launch a beta test of a document translation service, a new move in the company's efforts to break down language barriers.

With the service, the company will connect people who need documents translated with humans who will be paid to do so, according to the Google Translation Center information page. The site was spotted by sharp eyes at the Google Blogoscoped blog.

(Credit: Google)

"Google Translation Center is the fast and easy way to get translations for your content. Simply upload your document, choose your translation language, and choose from our registry of professional and volunteer translators. If a translator accepts, you should receive your translated content back as soon as it's ready," the site said.

Google prefers to rely on computer algorithms rather than humans, so at first glance the Google Translation Center looks somewhat anomalous, even though Google is only playing a middleman role. But it's possible that the human translators might be gradually improving Google's machine translation technology as they work, in effect helping to put themselves out of a job.

That's because Google's translation system uses a statistical model that works better the more it can compare the same text in two different languages. And Google evidently will track translation work in its database; according to the center's introduction for translators, "our translation search feature matches your current translation with previous translations, so you don't have to translate over and over again."

Google is fervently interested in better machine translation. With it, it can use its search technology to link people with data around the world, regardless of language barriers, making its search engine significantly more powerful.

Wanted: More Rosetta Stones
Google's translation technique essentially relies on having as many Rosetta Stone-like documents as possible. The more documents it has in two languages, the better able it is to match words and phrases from one language to another, according to a recent speech by Jeff Dean, a Google fellow who works Google's computing infrastructure.

"By computing statistics over all words and phrases, you...get a model of word-by-word and phrase-by-phrase replacements," Dean said. Machine translation often produces awkward results today, but "the impact of having a really large language model makes the sentences flow a lot more easily."

The screenshot below, from Google, shows the online interface a Google translator apparently will see. It shows text in two languages, with the passage broken down into chunks of text. It also suggests a previous translation of one chunk, offering a "use suggestion" button to employ it. It's not clear if the previous translation draws just on that individual translator's work or a larger collection.

Google Translation Center offers tools to speed translation.

Google Translation Center offers tools to speed translation.

(Credit: Google)

Based on the Bilingual Evaluation Understudy method for rating translation accuracy, Google scored first place in a 2005 evaluation by the National Institute of Standards and Technology evaluation.

Google was mum about the project. "We're always looking at new ways of providing tools for users to connect with each other, share information, and improve access to information on the internet, but we don't have any new details to share at this time," the company said in a statement.

Paying the middleman
It's a time-tested business to be the middleman who connects customers to those willing to pay for a product or service, but the Internet has taken the role to new heights by more easily enabling that process on a national and sometimes global scale. For example, Amazon.com's Mechanical Turk, Serebra Connect, and Elance can help companies that need tasks done find people who can do them.

But the Google Translation Center seems to have a different approach. Translators get access to free Google tools, and it appears Google isn't involved in any payment transactions, according to the site.

"Google Translation Center provides a venue for you to enter into and complete translation transactions. Except when you use Google Translation Center as provided in Section 4, Google is not involved in any transactions in Google Translation Center. Your interaction with any third party participant(s) or user(s) within Google Translation Center, including payment and delivery of goods and services...are solely between you and such third party participant(s) or user(s) and Google is not involved in such dealings," according to the terms of service. Section 4, titled "Google Participation," says just that "Google and/or its subsidiaries and affiliates may use Google Translation Center from time to time."

So what's in it for Google?
Of course, Google has a strong search-ad business that it uses to subsidize any number of efforts that may not be profitable for years, if indeed ever. After all, Google's mission is "to organize the world's information and make it universally accessible and useful."

But even if Google doesn't charge a percentage, improving automated translation could be a powerful incentive as Google tries to keep its core product, the search engine, competitive.

Google's translation technology is available through the Google Translate site, but the company also has technology called Cross Language Information Retrieval (CLIR) that builds translation into its search engine.

This Google screenshot shows the interface to the Google Translate Service.

This Google screenshot shows the interface to the Google Translate Service.

(Credit: Google)

Search increasingly is the gateway by which people discover what's on the Internet, so building automated two-way translation into the process could open up the very parts of the Internet that today are available but effectively hidden by language barriers.

CLIR can translate a search query into a foreign tongue then translate the answer back into the search results. Clicking a link produces the translated version of that page.

For example, a search in Russian for Tony Blair's biography will present an option, in Russian and presented at the bottom of the search results page, to search pages written in English. Clicking on a link then translates the English page into Russian.

Google executives have given indications recently about just how grand the company's ambitions are for the automated language translation. The company wants people from any major language to understand any other.

"We will eventually do 100 by 100 languages, to take this set of languages and convert to another," Google Chief Executive Eric Schmidt said in a June talk. "That alone will have a phenomenal impact on an open society," he said, a reference to concerns many have expressed about Google's censored search results in countries such as China.

Coming soon, perhaps: Google Translation Center

Coming soon, perhaps: Google Translation Center

(Credit: Google)

Stephen Shankland covers Google, Yahoo, search, online advertising, portals, digital photography, and related subjects. He joined CNET News in 1998 and since then also has covered servers, supercomputing, open-source software, and science. E-mail Stephen.
Recent posts from Digital Media
Yahoo stock rises on new acquisition report
EFF to court: Don't shield telecoms from illegal-spying suits
No escaping Britney Spears: 2008's top searches
Netflix's Web site suffers brief outage
Roku Netflix Player suffers mystery glitch
Add a Comment (Log in or register) 18 comments
by mkimura1971 August 4, 2008 11:22 AM PDT
There's a typo on the 4th paragraph: "But it it's possible..." should be "But is it's possible..."?
Reply to this comment
by mkimura1971 August 4, 2008 11:24 AM PDT
Ha! There's a typo on my last comment! Sorry!
by mkimura1971 August 4, 2008 11:29 AM PDT
Sorry there's a typo in my first comment too!!
by mkimura1971 August 4, 2008 11:24 AM PDT
Ha!
Reply to this comment
by kareldriesen August 4, 2008 11:45 AM PDT
Google scored *first* place in NIST.
Google is only fifth in an *alphabetical list of participants*.
Look at the actual scores lower on the page http://www.nist.gov/speech/tests/mt/2005/doc/mt05eval_official_results_release_20050801_v3.html
and you see that Google had the *highest* score in all four tracks (2 for Arabic to English and 2 for Chinese to English).
Reply to this comment
by jrodadlv2007 August 4, 2008 12:43 PM PDT
I hope that actually using real people to translate would be more successful that what's available now. All that I know is that when you hit "translate this page," be ready for a horrible sounding translation that doesn't sound good in English. The question is how good the translators are.
Reply to this comment
by nisteval August 4, 2008 3:55 PM PDT
The latest 2008 NIST results can be found here: http://www.nist.gov/speech/tests/mt/2008/doc/mt08_official_results_v0.html. It should be noted that while Google remained number 1 in Arabic to English, Chinese to English and English to Chinese, the spread in BLEU scores is much smaller now. A difference in 1-2 BLEU points is typically not perceivable to a reader. It all depends if the subject matter used to train the MT system matches the subject matter (or domain) of the document or text being translated.

What's more interesting is why Google is even bothering to launch this site. I suspect it's because the company is finding it difficult to collect parallel data to improve their machine translation systems (which are statistical based). Use their Translation Center... upload your data.. have the translators translate the documents.. upload it to the center.. and presto... Google acquires even more data for their own purposes. Just my 2 cents
Reply to this comment
by nistcorrect August 11, 2008 5:28 PM PDT
Google lost the number 1 in Chinese to English category in the 2008 results. They lost it to Microsoft Research (collaborating with others), a team which had not participated in 2005 or in the other 2008 categories.
by zextron August 5, 2008 4:07 AM PDT
Looks like in Europe the URL http://translate.google.com/center/learn-more.html just redirects users to their national google page :(
Reply to this comment
by benjaminstraight August 5, 2008 4:11 AM PDT
Cool.
Reply to this comment
by yuri34 August 5, 2008 8:34 AM PDT
I don't think that Google will be able to keep itself outside customer service in this service. If I'm not happy with the translation or the translator ? Decent translators will not work for free. I prefer services like http://www.onehourtranslation.com or proz.com
Reply to this comment
by mdrick August 7, 2008 11:58 PM PDT
I kind of agree with the offer of this article that google will gradually put many translators and translation services vendors out of business.
http://www.polilingua.com
Reply to this comment
by Anyword_Guillaume September 2, 2008 8:43 AM PDT
Up until now, there has been, on one hand the unsatisfactory but free automatic translation programs, and on the other hand the option to pay for a professional translation service either provided by independent translators or by a translation company. It is already possible, for the more inquisitive buyer, to recruit on line their own human translator but generally speaking companies have more of a tendency to confide their projects to a translation agency.

Translation businesses bring access to professional translators in every language and in several fields of expertise. They analyse and prepare the work prior to translation, control turnaround times, serve as a point of contact both to the clients and the translators, sometimes acting as the voice of the customer to the translator and sometimes that of the translator to the client. They assure correct delivery of the final project to the client and also handle payment issues controlling client creditworthiness and assuring timely payment for themselves as well as the translator. At least this is the way we work at Anyword and how most of our competitors currently operate.

In this context is Google Translation Centre a threat to translation agencies? It is possible (and even probable) that initially a number of clients seeking to make savings will contact the translators directly. Over time these clients will either settle for the minimal solution and remain loyal to Google Translation Centre or they will wish to benefit from a veritable intermediary service and return to the agencies. I do hope so.

It is however necessary to clarify an important point: the service launched by Google is not a novelty. There are already existing human translation on line recruitment systems which are extensively used the translation companies. What is new is that Google authorise the ?volunteers? to register themselves as translators on their service. By ?volunteer? one must understand ?voluntary? and therefore in most cases ?inexperienced?. This means that the customers utilising this service are running an important risk that Google refuse to endorse in their general conditions of use.
This is what makes me think that businesses, always reluctant to take unnecessary risks, will not adopt Google Translation Centre. This debate is, of course open on the Translators Observers blog run by Anyword (http://blog.anyword.fr).
Reply to this comment
by Eurominuteman September 15, 2008 7:21 AM PDT
Google Translation Center is a mechanical model. It doesn't perform collaboration.
Collaborative Translation remains to be organical:

http://collaborative-translation.ning.com/group/googletranslationcenter
Reply to this comment
by janice_a October 21, 2008 8:35 AM PDT
I be in awe if Google?s determination to truly be concerned or provide any concentration at all to customer service or to excellence of the translation. There?s more translation services similar to this which focus on translators such as <a href="http://www.tomedes.com/"> http://www.tomedes.com</a>
Reply to this comment
by janice_a October 21, 2008 8:37 AM PDT
I be in awe if Google?s determination to truly be concerned or provide any concentration at all to customer service or to excellence of the translation. There?s more translation services similar to this which focus on translators such as http://www.tomedes.com
Reply to this comment
by 008victory November 20, 2008 3:18 AM PST
<a href="http://funnychristmasgifts.vidiLife.com">funny christmas gifts</a> | <a href="http://funchristmasgifts.vidiLife.com ">fun christmas gifts</a> | <a href="http://lightedpalmtree.vidiLife.com ">lighted palm tree</a> | <a href="http://lightedchristmastree.vidiLife.com">lighted christmas tree</a> | <a href="http://christmasdecor.vidilife.com">christmas lighted outdoor decorations</a> | <a href="http://personalizedchristma.vidiLife.com">personalized christmas ornaments</a> | <a href="http://giftbasketsforchrist.vidilife.com">gift baskets for christmas</a>
Reply to this comment
by 008victory November 20, 2008 3:20 AM PST
[url=http://google.com]google[/url]
Reply to this comment
advertisement

In the news now

A tech veteran responds to the recession

LogLogic's Patricia Sueltz heard a clear message about the economy from investors, but she already knows a thing or two about navigating through tough times.


Obama's AG pick on privacy

Eric Holder has criticized the warrantless wiretapping program, but his views on other online policies may not be that far from those of the Bush administration.


About Digital Media

The Web is now the place to go for news and entertainment. Look here for the latest on blogs, music, video, virtual worlds, social networking and more.

Add this feed to your online news reader

Digital Media topics

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right