On Tue, 17 Mar 2009 13:38:31 +0530, Saurabh <phoneth...@gmail.com> wrote:
> Here's the reason behind wanting to get chunks at a time.
> I'm actually retrieving data from a list of RSS feeds and need to
> continuously check for the latest posts.
> But I don't want to depend on the Last-Modified header or the pubDate tag
> in <channel>, because a lot of feeds just output date('now') instead
> of the actual last-updated timestamp.
> But when continuously checking for the latest posts, I don't want to
> bombard other people's bandwidth, so I just want to get chunks of
> bytes at a time and internally check for <item>...</item> with my
> database against timestamp values.
> Is there a better way to achieve this?

I don't know much about RSS, but one approach is "If they are too lazy
to provide the information which protects their bandwidth, they
deserve being bombarded. But they also deserve a polite mail telling
them that they have that problem."

/Jorgen

-- 
  // Jorgen Grahn <grahn@        Ph'nglui mglw'nafh Cthulhu
\X/     snipabacken.se>          R'lyeh wgah'nagl fhtagn!
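
For reference, the chunk-at-a-time idea from the quoted question could be
sketched roughly as below. This is only an illustration under stated
assumptions, not a recommended feed parser: the names iter_items, new_items
and seen are hypothetical, the feed is assumed to list its newest items
first, and the seen set stands in for the database lookup mentioned in the
question. The stream can be any object with a read(n) method, for example
an HTTP response object. Whether stopping early really saves the remote
side any bandwidth depends on the connection being closed promptly.

```python
import io
import re

# Matches one complete <item>...</item> element (non-greedy, across newlines).
ITEM_RE = re.compile(rb"<item\b.*?</item>", re.DOTALL)


def iter_items(stream, chunk_size=1024):
    """Yield complete <item> blocks from a file-like object.

    Reads chunk_size bytes at a time and keeps any trailing partial
    item in a buffer until the rest of it arrives.
    """
    buf = b""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        buf += chunk
        last_end = 0
        for match in ITEM_RE.finditer(buf):
            yield match.group(0)
            last_end = match.end()
        # Drop everything up to the end of the last complete item.
        buf = buf[last_end:]


def new_items(stream, seen, chunk_size=1024):
    """Collect items until one that is already in `seen` shows up.

    Assumes newest-first ordering; `seen` is a hypothetical stand-in
    for a database check.
    """
    fresh = []
    for item in iter_items(stream, chunk_size):
        if item in seen:
            break  # stop reading; the rest is assumed to be old
        fresh.append(item)
    return fresh


if __name__ == "__main__":
    sample = (b"<rss><channel><title>t</title>"
              b"<item><guid>3</guid></item>"
              b"<item><guid>2</guid></item>"
              b"<item><guid>1</guid></item>"
              b"</channel></rss>")
    known = {b"<item><guid>2</guid></item>"}
    print(new_items(io.BytesIO(sample), known, chunk_size=16))
```

Comparing raw item bytes is the crudest possible check; comparing a <guid>
or <link> value extracted from each item would be more robust, at the cost
of a little more parsing.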