
I speak a language that is highly colloquial, and its casual written form includes a rather inscrutable system of abbreviation (one feature of this system is that it omits nearly all vowels). I had always figured that this language would be impossible for machines to translate, but I just tested it, and both Google Translate and ChatGPT can accurately identify the language and translate the slang into English (Google missed some of the subtleties between similar dialects, but still provided a correct translation). So I'm somewhat optimistic that they may already be managing these problems quite well.


What's the language?


Indonesian. ChatGPT could quite reliably tell the difference between Indonesian and Malay. Google Translate seemed biased towards thinking it was Malay. But when I tried Indonesian mixed with Javanese slang (which is a common way of talking), both just said it was Indonesian. I only tested a few phrases, though, so maybe it breaks down at some point.



