I am working on creating Free Tamil Ebooks to read in kindle or tablet devices.

Read more here on how I am doing manually.
https://onroads.wordpress.com/2012/11/23/how-to-read-tamil-books-in-kindle-ebook-reader/

We have to copy the content from various websites/blogs and paste in
libreoffice write to generate pdf files.

To automate this, I wrote a script, which can read the links from a
text file, download them all, clean them all,
merge them into single html file.

Then, I convert the html into odt using xhtml2odt tool.

Here is my script.
https://github.com/tshrinivasan/html2odt

Now, I am stuck with the process of collecting links and storing in a text file.
This is a manual process.

How can we automate this?

For example, I want to get the URL of all the posts from a blog.

Example.
http://kaniyam.com
or
http://365process.blogspot.in/


How can we get the URL of all the posts?



-- 
Regards,
T.Shrinivasan


My Life with GNU/Linux : http://goinggnu.wordpress.com
Free/Open Source Jobs : http://fossjobs.in

Get CollabNet Subversion Edge :     http://www.collab.net/svnedge
_______________________________________________
ILUGC Mailing List:
http://www.ae.iitm.ac.in/mailman/listinfo/ilugc

Reply via email to