Dear All,

I have written a converter, txt2unicode [1], which is intended to convert text in any legacy encoding to Unicode. At the moment it handles only Tamil encodings.
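
For anyone unfamiliar with this kind of converter, the general idea can be sketched as a longest-match table lookup. This is only an illustration under my own assumptions: the names TOY_TABLE and convert_with_table are made up for this mail, the table entries are not real mappings from any Tamil encoding, and the actual txt2unicode code may work differently.

```python
# Toy mapping for illustration only. These entries are hypothetical and are
# NOT taken from txt2unicode or from any real Tamil font encoding.
TOY_TABLE = {
    "a": "\u0b85",   # pretend legacy "a"  stands for TAMIL LETTER A
    "aa": "\u0b86",  # pretend legacy "aa" stands for TAMIL LETTER AA
    "k": "\u0b95",   # pretend legacy "k"  stands for TAMIL LETTER KA
}


def convert_with_table(text, table):
    """Replace legacy sequences with Unicode, trying the longest keys first."""
    # Longest keys first, so "aa" is matched before the shorter "a".
    keys = sorted(table, key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(table[key])
                i += len(key)
                break
        else:
            # No table entry matches here: copy the character unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    print(convert_with_table("aak a", TOY_TABLE))
```

Trying the longest keys first matters because legacy font encodings often use multi-character sequences whose prefixes are also valid on their own.
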
The supported encodings are: 1. anjal, 2. bamini, 3. boomi, 4. dinakaran, 5. dinamani, 6. dinathanthy, 7. kavipriya, 8. murasoli, 9. mylai, 10. nakkeeran, 11. roman, 12. tab, 13. tam and 14. tscii.

I have also written an "auto2unicode" function, which detects the encoding of the input text automatically and converts it to Unicode. It can detect 11 of the 14 encodings listed above.

I took a tscii sample input from [2] and tested it. The "auto2unicode" function was able to detect the "tscii" encoding and convert the text to Unicode successfully.

I need sample text in the remaining 13 encodings to test it. If anybody has sample text files in these Tamil encodings, please send them to me off-list, or share a link here where I can get such samples.

Suggestions are welcome.

[1] https://github.com/arulalant/txt2unicode
[2] http://projectmadurai.org/index.tscii.html

Thanks.

--
Regards,
Arulalan.T
Project Associate
Centre for Atmospheric Sciences
Indian Institute of Technology Delhi

My Github Home : http://arulalant.github.io
My Experiments In Gnu/Linux : http://tuxcoder.wordpress.com

_______________________________________________
ILUGC Mailing List: http://www.ae.iitm.ac.in/mailman/listinfo/ilugc
ILUGC Mailing List Guidelines: http://ilugc.in/mailinglist-guidelines