Skip to main content

Transliteration


App Code: transliteration

Transliteration

Open In Collab Run in Postman

For languages that don't use the latin script, e.g., Arabic, Hindi, Punjabi, Sinhala and many other spoken around the world, typing can be challenging as keyboards/keypads mostly default to latin characters. That makes creating content in vernacular languages difficult. With transliteration you can create content in these languages using your latin keypad.

For instance, you type a word on the latin keypad the way you would pronounce it in Punjabi, and using transliteration you can convert that into Punjabi. It transforms a word from one alphabet to the other phonetically.

APIs

  • Off-the-shelf Models: Use our pre-trained production-grade models through APIs and integrate them in any application.
  • Language Support: Over 20 language pairs are supported including Arabic and 11 Indian languages.
  • Train Your Own with AutoNLP (Coming Soon): Using AutoNLP improve/customize existing transliteration models with your own data.
  • Accelerate Dataset Creation with our Creator Studio (Coming Soon): Equipped with handy utility tools, our Creator Studio is an in-browser text editor for creating datasets.
  • Easy to Integrate and Scale (Coming Soon): Scale or replicate your deployed models for higher availability and throughput and integrate them with your application through REST APIs.

Chatbot Usecases

Integrating with Google Maps, Spotify, and Other Such APIs

If you have tried integrating Google Maps or Spotify APIs with your "non-latin" chatbots, you know they rarely give acceptable results. With transliteration you can extract entities like names, addresses, songs, etc. in your local language and convert them into English and fully utilize such APIs.

Creating NLU Data for Chatbots

A common practice is to create a dataset in English and translate it to another language using Google's Translate or similar APIs. While this can work for very simple FAQ bots, contextual bots require well structured/meaningful data in the local languages. Transliteration powered typing tools can help accelerate the dataset creation process for chatbots in "non-latin" languages.

Accessibility Usecases

Low-Cost Content Accessibility in Multiple Languages

Internationalization/Translation can be extremely expensive when your databases in English and are dynamic and ever expanding with content like songs, albums, or addresses. Transliteration can be a handy tool to convert such content on the fly to other "non-latin" languages. This can be indexed using popular frameworks like Elasticsearch, or other similar tools and instantly made accessible in multiple languages. This can potentially save thousands of dollars on manual translation efforts.

Content Creation

Something as simple as typing can be extremely challenging when we talk about languages spoken in the Middle East, Africa and South East Asia. With governments and financial institutions mandating inclusion and content creators being more used to the latin keypad, a tool like transliteration can be extremely handy to make it easy to create local language content.

What's next?