I am working on creating Free Tamil Ebooks to read in kindle or tablet devices.
Read more here on how I am doing manually. https://onroads.wordpress.com/2012/11/23/how-to-read-tamil-books-in-kindle-ebook-reader/ We have to copy the content from various websites/blogs and paste in libreoffice write to generate pdf files. To automate this, I wrote a script, which can read the links from a text file, download them all, clean them all, merge them into single html file. Then, I convert the html into odt using xhtml2odt tool. Here is my script. https://github.com/tshrinivasan/html2odt Now, I am stuck with the process of collecting links and storing in a text file. This is a manual process. How can we automate this? For example, I want to get the URL of all the posts from a blog. Example. http://kaniyam.com or http://365process.blogspot.in/ How can we get the URL of all the posts? -- Regards, T.Shrinivasan My Life with GNU/Linux : http://goinggnu.wordpress.com Free/Open Source Jobs : http://fossjobs.in Get CollabNet Subversion Edge : http://www.collab.net/svnedge _______________________________________________ ILUGC Mailing List: http://www.ae.iitm.ac.in/mailman/listinfo/ilugc
