[EMAIL PROTECTED] wrote: >I need to extract the number on each <td tags from a html file. > >i.e 49.950 from the following: > ><td align=right width=80><font size=2 face="New Times >Roman,Times,Serif"> 49.950 </font></td> > >The actual number between: 49.950 can be any number of >digits before decimal and after decimal. > ><td align=right width=80><font size=2 face="New Times >Roman,Times,Serif"> ######.#### </font></td> > >How can I just extract the real/integer number using regex? > > > If all the td's content has the [value_to_extract] pattern, things goes simplest
[untested] /<td.* ([^&]*) / the parentesis will be used to group() the result (and extract what you really want) Cheers Gerardo -- http://mail.python.org/mailman/listinfo/python-list