- A team of scientists from IIT Madras has developed a method for reading documents in Bharati script using a multi-lingual optical character recognition (OCR) scheme.
- Optical character recognition schemes first separate, or segment, the document into text and non-text. The text is then segmented into paragraphs, sentences, words and letters.
- Each letter has to be recognised as a character in a standard encoding such as ASCII or Unicode. A letter has various components, such as the basic consonant, consonant modifiers and vowels, among others.
- Bharati is a unified script for nine Indian languages that is being proposed as a common script for India. The scripts that have been integrated are Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Telugu, Kannada, Malayalam and Tamil. English and Urdu have not been integrated so far, as they have a very different phonetic organisation.
- The Roman script is used as a common script for many European languages, which facilitates communication across nations that speak and write those languages. Likewise, a common script for the entire country is hoped to bring down many communication barriers in India.
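
The idea that a letter is built from a basic consonant, consonant modifiers and vowels can be seen in how Unicode already encodes Indian scripts. The following is a minimal Python sketch, not the IIT Madras method and not Bharati itself: it takes a Devanagari syllable and lists its encoded components using the standard `unicodedata` module. The function names `classify` and `components`, the role labels, and the sample syllable are assumptions chosen for illustration.

```python
import unicodedata


def classify(ch):
    """Return the Unicode name of a character and a coarse role label.

    The role labels are illustrative only; they are derived from the
    official Unicode character name, not from any OCR model.
    """
    name = unicodedata.name(ch, "UNKNOWN")
    if "VOWEL SIGN" in name:
        role = "vowel sign"
    elif "VIRAMA" in name:
        role = "virama"  # suppresses the inherent vowel, joining consonants
    elif "LETTER" in name:
        role = "letter"  # a consonant or an independent vowel
    else:
        role = "other"
    return name, role


def components(text):
    """List (code point, Unicode name, role) for each character in text."""
    return [(f"U+{ord(ch):04X}",) + classify(ch) for ch in text]


# Hypothetical sample: KA + VIRAMA + SSA + VOWEL SIGN I (one written syllable)
syllable = "\u0915\u094D\u0937\u093F"
for row in components(syllable):
    print(row)
```

An OCR system has to work in the opposite direction: it starts from the image of the written syllable and must recover this sequence of components before it can emit the text in an encoding such as Unicode.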




