Miki <[EMAIL PROTECTED]> wrote:
> (You can find lynx at http://lynx.browser.org/)
not exactly -
The current version of lynx is 2.8.6
It's available at
http://lynx.isc.org/lynx2.8.6/
2.8.7 Development & patches:
http://lynx.isc.org/current/index.html
--
Thomas E. Dickey
http://i
To maintain paragraphs, replace any <p> or <br> tags with your operating
system's line ending (CRLF on Windows) before stripping the remaining tags.
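
For what it's worth, here is a minimal sketch of that idea in Python 3, assuming a regex-based tag stripper like the one posted elsewhere in this thread. The function name strip_tags_keep_paragraphs and its newline parameter are made up for illustration. A regex is not a real HTML parser, so this will not cope with script or style blocks, comments, or entities.

```python
import os
import re


def strip_tags_keep_paragraphs(html, newline=os.linesep):
    """Hypothetical helper: drop HTML tags but keep paragraph and line breaks."""
    # Turn <p>, </p>, and <br> (in any case, with or without attributes)
    # into a plain newline first, so the breaks survive tag removal.
    text = re.sub(r'(?i)</?p\b[^>]*>|<br\b[^>]*>', '\n', html)
    # Remove every remaining tag.
    text = re.sub(r'<[^<>]*>', '', text)
    # Collapse runs of blank lines into a single blank line and trim the ends.
    text = re.sub(r'\n\s*\n+', '\n\n', text).strip()
    # Finally switch to the requested line ending (os.linesep by default).
    return text.replace('\n', newline)


if __name__ == '__main__':
    sample = '<p>One<br/>two</p><p>Three <b>bold</b></p>'
    print(strip_tags_keep_paragraphs(sample, newline='\n'))
```

Pass newline='\r\n' explicitly if the output is headed for Notepad, regardless of the platform the script runs on.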
On Jul 13, 8:57 am, kublai <[EMAIL PROTECTED]> wrote:
> On Jul 13, 5:44 pm, Paul McGuire <[EMAIL PROTECTED]> wrote:
>
>
>
> > On Jul 12, 4:42 am, kublai <[EMAIL PROTECTED]> wrote:
>
> > > Hello,
>
> > > Fo
On Jul 13, 5:44 pm, Paul McGuire <[EMAIL PROTECTED]> wrote:
> On Jul 12, 4:42 am, kublai <[EMAIL PROTECTED]> wrote:
>
> > Hello,
>
> > For a project, I need to develop a corpus of online news stories. I'm
> > looking for an application that, given the url of a web page, "copies"
> > the rendered t
On Jul 12, 4:42 am, kublai <[EMAIL PROTECTED]> wrote:
> Hello,
>
> For a project, I need to develop a corpus of online news stories. I'm
> looking for an application that, given the url of a web page, "copies"
> the rendered text of the web page (not the source HTML text), opens a
> text editor (N
On Jul 13, 2:19 am, Stefan Behnel <[EMAIL PROTECTED]> wrote:
> kublai wrote:
> > For a project, I need to develop a corpus of online news stories. I'm
> > looking for an application that, given the url of a web page, "copies"
> > the rendered text of the web page (not the source HTML text), opens
kublai wrote:
> For a project, I need to develop a corpus of online news stories. I'm
> looking for an application that, given the url of a web page, "copies"
> the rendered text of the web page (not the source HTML text), opens a
> text editor (Notepad), and displays the copied text for the user
On Jul 12, 10:22 pm, Jon Rosebaugh <[EMAIL PROTECTED]> wrote:
> On 2007-07-12 04:42:25 -0500, kublai <[EMAIL PROTECTED]> said:
>
> > For a project, I need to develop a corpus of online news stories. I'm
> > looking for an application that, given the url of a web page, "copies"
> > the rendered tex
On Jul 12, 5:24 pm, "Andre Engels" <[EMAIL PROTECTED]> wrote:
> 2007/7/12, Andre Engels <[EMAIL PROTECTED]>:
>
> I forgot to include
>
> import urllib2, re
>
> here
>
> > def textonly(url):
> >     # Get the HTML source on url and give only the main text
> >     f = urllib2.urlopen(url)
> >     text =
On 2007-07-12 04:42:25 -0500, kublai <[EMAIL PROTECTED]> said:
> For a project, I need to develop a corpus of online news stories. I'm
> looking for an application that, given the url of a web page, "copies"
> the rendered text of the web page (not the source HTML text), opens a
> text editor (Not
2007/7/12, Andre Engels <[EMAIL PROTECTED]>:
I forgot to include
import urllib2, re
here
> def textonly(url):
>     # Get the HTML source on url and give only the main text
>     f = urllib2.urlopen(url)
>     text = f.read()
>     r = re.compile('\<[^\<\>]*\>')
>     newtext = r.sub('',text)
>     w
2007/7/12, kublai <[EMAIL PROTECTED]>:
> For a project, I need to develop a corpus of online news stories. I'm
> looking for an application that, given the url of a web page, "copies"
> the rendered text of the web page (not the source HTML text), opens a
> text editor (Notepad), and displays the
Hello jk,
> For a project, I need to develop a corpus of online news stories. I'm
> looking for an application that, given the url of a web page, "copies"
> the rendered text of the web page (not the source HTML text), opens a
> text editor (Notepad), and displays the copied text for the user to
Hello,
For a project, I need to develop a corpus of online news stories. I'm
looking for an application that, given the url of a web page, "copies"
the rendered text of the web page (not the source HTML text), opens a
text editor (Notepad), and displays the copied text for the user to
examine and
Does anyone know of a Python module for working with PDF files (extracting
text), or with other kinds of documents such as PostScript, DOC, etc.?
Thanks in advance.
--
http://mail.python.org/mailman/listinfo/python-list