Editor's Note: Wouldn't it be nice to have a quick way to write Urdu comments on Facebook, or for that matter on any website? Or to be able to browse Hindi websites written in Devangari script? Syed Ghulam Akbar, an innovative software developer, has written a bookmarklet that allows you to do just that, without any cumbersome installation. In the following, Akbar describes his tools and their use.

Dynamic Language Tools is a bookmarklet application which I have developed, which helps users read and write Urdu easily on any web page without installing any software. This tool uses the Google Transliteration API to do the on-demand transliteration of the roman script to Urdu script.  This tool also provides on the fly Hindi to Urdu transliteration on the web-pages making all the Hindi content (in the Devanagari script) readable to Urdu readers.

As this application internally uses the Google Transliteration API, so, all the supported Google transliteration language pairs are available via this application. To use this application, one first need to save its bookmarklet in their web-browser, similar to creating a normal webpage bookmarks, and that’s it. The one main advantage of using the bookmarklet technique is that you don’t need to install anything on your machine, plus it automatically updates itself on the next load, so users will always see the latest version without running any updates.

urdu01How does it help in writing Urdu? To write Urdu in any text box on web-page, one needs to first load this application by clicking its saved bookmarklet link. Once the application is loaded in that page context, the user may enter text in Urdu using roman script in any text box on that page. To activate the Urdu Entry mode, one needs to simply press ‘Ctrl’ and click in a text box. When the Urdu mode is activated, the text-box orientation is changed from left to write, and an آ image is displayed in the background of text box indicating that Urdu mode is now active. In the Urdu Entry mode, if a user enters roman text like “Ye to bhat shandar khabar hai.”, it will be auto-transliterated and displayed in Urdu script i.e. “یہ تو بہت شاندار خبر ہے.”.

This Urdu mode entry works nearly on all the web sites pages, and all types of text boxes, i.e., one-liner, multi-liner, dynamic, and rich-text editors (as used in Google and Yahoo web mails). Using this tool, one can comment in Urdu on any website, can send emails in Urdu, can set their Facebook status in Urdu or can even chat with friends on Facebook in Urdu. These are just few examples of how and where this tool can be used, but basically using this tool you can write Urdu on the web everywhere you can currently type English without installing or changing any thing on the website.

urdu02How does it help in reading Urdu? The main inspiration behind this tool development was not actually Urdu writing. In fact, there are many existing tools and applications which let users type Urdu either using a special keyboard layout or by using roman script transliteration. What actually inspired me to develop this tool was to provide a way to easily convert the roman content on all the existing web-pages to Urdu script so that it is more readable. For example, there is a good poetry archive on www.urdupoetry.com, but all of that is in roman script. This is where this tool comes handy, and it lets user perform on-page transliteration of the existing roman Urdu content on the web-page without leaving that page. The transliterated text is displayed in place of the original text keeping the same format and layout. One can do this transliteration simply by just highlighting the text, and clicking the “Transliterate Selected Text” button in the preferences dialog of the bookmarklet. It also displays the original text (before transliteration) when the user moves the mouse over the transliteration text. Another handy feature is that one can also view alternate transliteration options and apply those by double clicking the transliterated words.

urdu03Why Hindi to Urdu transliteration? Hindi and Urdu languages share grammar, morphology, and a huge vocabulary, yet the majority of readers of both languages can’t read the other written language mainly because Urdu is written using the Arabic script while Hindi is written using the Devangari script. The speakers and readers of these two languages combined are more than a billion. So, if we can provide a good transliteration between these two languages, it can serve a very large community. Some very good work has already been done in this field, notably the Hindi to Urdu Transliterator by CRULP, and Hindi Urdu Machine Transliteration by M G Abbas Malik. Even though the results of these transliterations are good, they are not excellent because Urdu language has more than one equivalent character for a Devanagri character. For example, the character त (ta) in Hindi has two equivalent characters ت (tay) and ط (toey) in Urdu. Similarly, the Devanagari character ज़ (za) could map to ز (zay), ض (zoad), ذ (zal) and ظ (zoay) Urdu characters depending on the context and language rules. This becomes tricky to handle without complex language processing rules and dictionary. What I have done for this tool is a two step process. First, I transliterate Hindi to the corresponding roman script using the combination of the ITRANS and IAST transliteration rules to produce a roman script which is best for the transliteration. The transliterated roman text is then passed to the Google Transliteration API for English to Urdu transliteration and as a result the Urdu script you get using his technique is almost accurate. Of course, similar to other transliterations supported by this tool, one can perform Hindi to Urdu transliteration on the fly (without leaving the page), and you can also do the transliteration corrections on the transliterated words.

What’s next? It’s not the end; it is actually just a start. I have been getting a lot of feedback and suggestions from friends and the community. Other than tweaking and improving the interface, I’m also working on the Google Translation integration. I also plan to integrate dictionary and image lookup-up for selected words. So stay tuned and keep sending your valuable feedback and comments.

About the author:

Akbar has been developing software for more than 12 years. He is an Application Architect at Jin Technologies Pvt Limited.  He also does software development as hobby and some of his freeware utilities/projects are available at his site. He maintains a blog to share the software tips and tricks with community. When he is not doing software development, he loves to spend time with his family, watch movies and fly & build R/C planes!

19 Responses to “A Hindi to Urdu Converter and Urdu Transliteration Bookmarklet”

  1. Sohaib Khan says:

    I have tried it out and it works like magic! I would recommend installing the nastaleeq font whose link appears in the dialog, because then the output looks much nicer. The Ctrl-Shift-R shortcut is *really* useful. You can highlight any text which is in roman urdu or in hindi and by just pressing Ctrl-Shift-R, it will be converted to proper Urdu.

    Many thanks to Syed Ghulam Akbar for developing and sharing this!

  2. bt says:

    good stuff, this.

    however, there is an egregious error in the posted example: lab pe aati hai dua, not dawa.

    agar tamanna aap hi dawa ho jaaye to phir humein allama ki zaroorat na rahay.

    • Syed Ghulam Akbar says:

      Thanks for pointing this transliteration issue. Actually, this is the default transliteration of the following page and I left this on purpose in the article screenshot:
      http://www.urdupoetry.com/iqbal01.html

      Though once the transliteration is done, you can review and quickly fix most of the errors by double clicking the words. Like if you transliterate the above poem, the roman word “dua” will transliterate to “دوا”. Now if you double click this word, you will see the option “دعاء” in the options menu.

      So, although it’s not perfect in start, you can quickly fix most of these errors, and at least it’s better than roman reading.

      BTW, well said on “اگر تمنا آپ ہی دعا ہو جائے تو پھر ہمیں علامہ کی ضرورت نہ رہے”

      • Syed Ghulam Akbar says:

        Well, I mean to transliterate it to “اگر تمنا آپ ہی دوا ہو جائے تو پھر ہمیں علامہ کی ضرورت نہ رہے”.

        Once you correct a word, it is be remember for that session.

  3. ArpitNext says:

    Tried and it works! Well done!!

    • KHAN GM KHAN says:

      سللام…بہت شوق تھا صاحب بہت شوق تھا اودو میٹھی اردو میرے وطن عزیز کی زبان اردو کا ..رومن لکھتے لکھتے برسا برس بیت گے بڑھی- مشکل سے انٹرنیٹ پر آوارہ گهرد ی کرتے کرتے … آج الف بے کا پتہ چلا تو یہاں بھی آ نکلا …. یہ اردو کہیں اور جگہ لکھی شپنگ کسی اور جگہہ سے کی،،،،، لا ڈالا یہاں جانے یہ کونسی جگہہ ہے . . . KALABAGH_DAM_USA@YAHOO.COM

  4. Shahid Saeed says:

    It works nearly flawlessly. Great work man !
    I’ve been wishing to work on Urdu Speech Recognition this summer. Anybody here knows someone whose worked in this area?

  5. Everyone,

    A quick update on this. This set of tools are now also available as Google Chrome Extension. You can install this from here:
    https://chrome.google.com/extensions/detail/jbnobcphncooboncogejlkafgdfllcnb/publish-accepted

    Enjoy the fast and always ready translation and transliteration tools on Chrome.

    Regards, Akbar

  6. [...] English and Hindi. Syed Ghulam Akbar, the bookmarklet’s creator, describes his motivation in a post on the Pakistani science blog STEP: The main inspiration behind this tool development was not [...]

  7. [...] English and Hindi. Syed Ghulam Akbar, the bookmarklet’s creator, describes his motivation in a post on the Pakistani science blog STEP: The main inspiration behind this tool development was not [...]

  8. [...] urdu sa engleskog i hindu jezika. Sied Gulam Akbar, tvorac bukmarkleta, piše o svojoj motivaciji u postu na pakistanskom naučnom blogu STEP: Glavna inspiracija za ovu alatku nije zaista nađena u pisanju [...]

  9. [...] sa engleskog i hindu jezika. Sied Gulam Akbar, tvorac bukmarkleta, piše o svojoj motivaciji u postu na pakistanskom naučnom blogu STEP: Glavna inspiracija za ovu alatku nije zaista nađena u [...]

  10. azhar umri says:

    please give mi hindi to urdu convertaor

Discuss

  • STEP aspires to be the central place for discussion on improving the state of Science, Technology, and Education in Pakistan. Read More
  • To learn how you can contribute, click here
  • Never miss a new article! Choose your favorite method to stay up to date with STEP
  •