Google Integrates Automatic Translation into Gmail

Not exactly a stunning technical development, but still pretty interesting: Google has gotten around to integrating its translation tools right into Gmail through the Labs section. Once the “Message Translation” plug-in is activated, Gmail will detect if an email is not in your default language and will automatically give you the option to translate it. You can see the “Translate message to:” option in the screenshot below:

autotranslate
Source: Google Gmail Blog

While they readily admit “it’s not quite the universal translators we’re so fond of from science fiction,” they do make a comment that I thought was pretty interesting:

“If all parties are using Gmail, you can have entire conversations in multiple languages with each participant reading the messages in whatever language is most comfortable for them.”

This is an interesting concept, and it will certainly be quite handy for any non-mission-critical exchanges – although I have to wonder what some of the quoted text further down in the email will look like after multiple runs through the machine translation system. I’ve found in the past that even one trip through the grinder and back can leave the text pretty mangled; who knows what several exchanges back and forth will leave it looking like.

Either way, it’s an interesting addition to the Gmail system and another tiny step towards knocking down the language barrier.

Move over America…

… China is about to become the largest online population. Between the end of 2006 and the end of 2007 China added roughly 73 Million users to the Internet.

73 MILLION.

To put it in perspective – even if Canada doubled its population and put an internet connection in the house of every man, woman and child in the country, we’d still come up about 7 Million people short.

On the flip side, though, there are two factors at work here. China’s population is roughly 1.3 Billion right now, which means a total user base of 210 Million is only a 16% penetration rate. In Canada we have ~65% penetration and the US has ~71%.
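For anyone who wants to check the math, here is a quick sketch in Python. It uses only the figures quoted in this post; the Canadian population is not stated anywhere above, so it is backed out of the "7 Million short" comparison and should be treated as an implied number, not a sourced one.

```python
# Figures quoted in the post (end of 2007); none are independently sourced.
CHINA_USERS = 210_000_000
CHINA_POPULATION = 1_300_000_000
CHINA_NEW_USERS_2007 = 73_000_000
SHORTFALL = 7_000_000  # "about 7 Million people short"


def penetration_rate(users: int, population: int) -> float:
    """Share of the population that is online, as a percentage."""
    return 100.0 * users / population


china_rate = penetration_rate(CHINA_USERS, CHINA_POPULATION)

# Implied by the comparison: 2 * Canada's population = 73M - 7M.
implied_canada_population = (CHINA_NEW_USERS_2007 - SHORTFALL) // 2

# Hypothetical: China's user base if it ever reached Canada's ~65% penetration.
users_at_canadian_rate = CHINA_POPULATION * 65 // 100

print(f"China penetration: {china_rate:.1f}%")
print(f"Implied Canadian population: {implied_canada_population:,}")
print(f"China at 65% penetration: {users_at_canadian_rate:,} users")
```

The last number is the one to keep in mind: at Canadian-style penetration, China alone would have roughly four times as many users as it does today.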

India will no doubt pick up steam in the coming years and will definitely rank in the number 2, if not the number 1, spot.

So what does this mean for the Internet in general?

The connected world’s borders are no longer geographical – they’re lingual.

The world may be flattening, but there are still a few big walls running across the landscape. The reality is the “hidden web” is going to keep growing. As I’ve posted about before, your ability to access information online revolves almost exclusively around the languages you can read/write.

As countries like China & India continue to pump new users online, more and more content will be generated in their native languages, likely invisible to you unless you speak (and search in) that language.

Google’s getting better and better at opening access to these sites through their machine translation tools, but the reality is there just isn’t enough CPU horsepower to run every Google search through machine translation for all the different language variations.

Language Weaver, through Kontrib, is also making an interesting attempt at opening up more content to a broader audience through a Digg-like portal. It’s a great idea, although I think they’re going to have a hard time getting the traction they need. I’d personally love to see them work with Digg directly instead and create a licensing deal similar to what my friends at Idee have done with their image duplication detection technology.

It’s going to be interesting to watch this story play out. Whoever busts the language barrier most effectively first will dramatically change the search game. Google is clearly out in front, and the most likely victor, but you never know who’s running in stealth right now and could surprise us all.

In-chat Machine Translation via Google Talk

Saw on TechCrunch this am that Google Talk now has a few bots you can add to your chats that will “translate” your conversations for you in realtime.

trans_bot

It creates the translation through Google Translate, so at the very least you’d want to be sure whoever you’re talking with understands that some translations might be downright wacky. Needless to say, if the conversation requires clarity and exact directions (talking through heart surgery, supporting nuclear power center operators or peace negotiations, for example) this is not an appropriate tool.

The implementation is a little clunky – you need to add a bot to the chat for each language pair, in each direction. For example talking to someone in French you’d need both the English-to-French bot and the French-to-English bot. I’m guessing this is the result of someone’s 20% time at Google. (Edit: They’ve since confirmed it is)
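To show how quickly the one-bot-per-direction setup adds up, here is a rough sketch. It assumes bots are named `xx2yy` (source language, a "2", target language), which matches the pair list Google published; the `bots_needed` helper is purely hypothetical and only counts directions. It says nothing about how the bots actually behave in a chat.

```python
from itertools import permutations


def bots_needed(languages):
    """List the one-directional bots needed so that every language in the
    conversation can be translated into every other one."""
    unique = sorted(set(languages))
    # permutations(..., 2) yields every ordered (source, target) combination.
    return [f"{src}2{dst}" for src, dst in permutations(unique, 2)]


print(bots_needed(["en", "fr"]))
print(len(bots_needed(["en", "fr", "de"])))
```

Two languages need two bots, three languages would need six, and in general n languages need n × (n − 1), which is a good argument for building the feature into the client instead.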

I’d hope if they had actually roadmapped this feature the translation option could have just been built in to the tool. The system should really just know if the people who are talking to each other are using different language interfaces or preferences. If it detects that two people with different preferences start chatting just throw up a “We see you’re talking with someone who may speak a different language, would you like us to translate for you?” kind of message.

Right now it supports 29 language pairs, which is kind of odd, as an odd number leaves the conversation a little bit one-sided… From my quick look it seems English-to-Bulgarian is the pair left out in the cold (but Bulgarian-to-English is supported). (See edit below: real number is 24)

All in all, for the time being it’s a fun toy but it’ll be interesting to see how this functionality evolves…

Google Blog post

Edited to Add: If anyone out there can read the Chinese text in the screen cap I’d love to know how legible it actually is. The English is passable, which is probably why they used it, but I wouldn’t be surprised to find out there’s some crazy stuff happening on the other end.

EDIT: They published the wrong list of language pairs on the Google blog initially… there’s actually 24: ar2en, de2en, de2fr, el2en, en2ar, en2de, en2el, en2es, en2fr, en2it, en2ja, en2ko, en2nl, en2ru, en2zh, es2en, fr2de, fr2en, it2en, ja2en, ko2en, nl2en, ru2en, zh2en
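As a sanity check on the corrected list, here is a short Python sketch that parses the 24 codes above and looks for any pair missing its reverse direction. The variable names are mine; the only input is the list as published.

```python
RAW = (
    "ar2en, de2en, de2fr, el2en, en2ar, en2de, en2el, en2es, en2fr, en2it, "
    "en2ja, en2ko, en2nl, en2ru, en2zh, es2en, fr2de, fr2en, it2en, ja2en, "
    "ko2en, nl2en, ru2en, zh2en"
)

codes = [c.strip() for c in RAW.split(",")]

# Each code is "<source>2<target>"; the language codes themselves contain no "2".
pairs = {tuple(code.split("2")) for code in codes}

# A pair is one-way if its reverse direction is not in the list.
one_way = sorted(f"{src}2{dst}" for src, dst in pairs if (dst, src) not in pairs)

# Pairs that do not involve English at all.
non_english = sorted(code for code in codes if "en" not in code.split("2"))

languages = sorted({lang for pair in pairs for lang in pair})

print(len(codes), "pairs across", len(languages), "languages")
print("one-way pairs:", one_way)
print("non-English pairs:", non_english)
```

With the corrected list every pair has its reverse, so nobody is left out in the cold, and German/French is the only combination that does not go through English.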

Microsoft (silently) launches "Live Translator"

I was reading TechCrunch tonight when a post popped up indicating Microsoft now has a free machine translation site up called Windows Live Translator BETA.

My initial reaction, just like Mike’s @ TC, was “YAWN”….. After all, Babel Fish has been around, well, roughly forever as far as the web is concerned, and to top it off Microsoft’s site also bears the “Powered by SYSTRAN” badge.

I initially wrote it off as a “whatever” but just for kicks ran our site through it to see if the engine came back with anything different from Babel Fish or Google’s tools. It didn’t – at least not on the language front. Interface-wise, though, Microsoft just upped the ante.

Microsoft has four view choices when you get your translation results:

Side-by-Side

live_trans1

Note the highlight on the corresponding chunks of text! There’s also a Top/Bottom view which is identical to this.

Source w/Translation Hover

live_trans2

Hover over a sentence and a block will appear with the translation.

Target w/Source Hover

live_trans3

Like above but with the source appearing in the hover box.

Certainly nothing earth-shattering here, but I’d definitely say that, hands down, this is one of the best free machine translation interfaces on the web right now.

Source: TechCrunch

It’s Alive…. Google launches multilingual search

Google has finally launched their new translation tool I blogged about here (and predicted here).

You’ll find it buried behind the “More>>” link next to their search box: click that, then click on “Translate” in the right-hand column. Finally, click on the search tab on that page.

Or you could just click here.

For some reason clicking on “Language Tools” on the Google home page doesn’t take you there (at least not here in Canada) – but after some previous experiences with Google I can’t say I’m surprised.

All in all, it’s kind of neat but still not a final solution. From what I’ve read they really expect this to be used by non-English speakers to access more of the English web.

It would have been nice if they could have at least given you the ability to get results in English as well as one other language. Instead, you have to switch into this whole other interface just to search in one language and get results back in one other language.

Not quite a vision realized, but a good start nonetheless.