Awesome. Great going. I also read the interview on Spike Developer Zone.
Dorai On Fri, Mar 7, 2008 at 9:58 PM, Anand Balachandran Pillai < [EMAIL PROTECTED]> wrote: > I actually went ahead and did this today. I registered a new blog > at http://pythonjobs.blogspot.com . It took me roughly 3 hours to > write a custom crawler using HarvestMan to crawl monthly archives > of bangpypers and post Jobs automatically to blogger. It uses > the Google blogger API in gdata-python-client library. > > http://code.google.com/p/gdata-python-client/ > > If someone wants to see the code of the custom crawler > it is available in the HarvestMan-2.0 trunk. > > > http://svn.eiao.net/robacc/experimental/HarvestMan-2.0/harvestman/apps/postingcrawler.py > > I wrote a custom blogger module by using sample code from the google > blogger > API. Since it contains google's code, I have not checked it into the > subversion trunk. > If someone wants the code, let me know. > > To make sure your jobs are in the Blog, just ensure that you make your > job posts with [JOB] in the title. That is all the crawler looks for. > > Regards, > --Anand > > > On Fri, Mar 7, 2008 at 6:32 PM, Anand Balachandran Pillai > <[EMAIL PROTECTED]> wrote: > > On Fri, Mar 7, 2008 at 6:30 PM, Anand Balachandran Pillai > > <[EMAIL PROTECTED]> wrote: > > > > > > On Fri, Mar 7, 2008 at 6:05 PM, Harish Krishnan < > [EMAIL PROTECTED]> wrote: > > > > > > > > > > > > On 07-Mar-08, at 4:57 PM, Anand Balachandran Pillai wrote: > > > > > > > > > > > > 1. Automate blog posting backend when a mail which seems to > mention a new > > > > job posting is posted. This can be done bye requiring specific > keyword(s) > > > > in > > > > the subject for job postings such as [JOB]. I am not sure, but > mailman > > > > might > > > > allow such customizations in the backend. > > > > > > > > Sounds like a nice idea. It would also be good if we have a policy > for not > > > > posting jobs directly on the mailing list else it will lead to > duplication. > > > > > > > > > > > > > > > > 2. An incremental crawler (always!) which monitors the group for > postings > > > > and > > > > automatically fetches JOB posting posts (similar approach, use > keywords or > > > > naive bayesian classification!) and post it to a specific blog. > > > > > > > > > > > > > > > > This is even better. what does it take for this to work? > > > > > > > > > > Nothing much. Just give me half a day to create a custom crawler for > this > > > on top of HarvestMan :) > > Ok, this is not posturing :) If someone can register an appropriate > blog and > > send me the URL and the auth credentials I will create the "job > > posting crawler". > > Only that someone has to bear the responsibility of running it on > > a frequent basis. > > > > gnuyoga, can you do this ? It would be a nice exercise to write a > custom > > crawler for this... > > > > > > > > > Harish > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > BangPypers mailing list > > > > BangPypers@python.org > > > > http://mail.python.org/mailman/listinfo/bangpypers > > > > > > > > > > > > > > > > > > > > -- > > > -Anand > > > > > > > Thanks > > > > -- > > -Anand > > > > > > -- > -Anand > _______________________________________________ > BangPypers mailing list > BangPypers@python.org > http://mail.python.org/mailman/listinfo/bangpypers > -- Dorai Thodla (http://www.thodla.com) US: 650-206-2688 India: 98408 89258
_______________________________________________ BangPypers mailing list BangPypers@python.org http://mail.python.org/mailman/listinfo/bangpypers