On Monday, 27 August 2012 12:59:02 UTC+2, mikcec82 wrote:
> Hello,
>
> I have an HTML file on my PC and I want to read it to extract some text.
> Can you help with which libs I have to use and how I can do it?
>
> Thank you so much.
>
> Michele

Hi Oscar,

I tried what you suggested and developed the code shown below. However, I have a situation in the HTML file in which a string is repeated (XX in this case):

    CODE Target: 0201
    CODE Read: XXXX
    CODE CHECK : NOT PASSED
    TEXT Target: 13
    TEXT Read: XX
    TEXT CHECK : NOT PASSED
    CHAR Target: AA
    CHAR Read: XX
    CHAR CHECK : NOT PASSED

With this code (written starting from yours):

    index = nomefile.find('XXXX')
    print 'XXXX_ found at location', index

    index2 = nomefile.find('XX')
    print 'XX_ found at location', index2

    found = nomefile.find('XX')
    while found > -1:
        print "XX found at location", found
        found = nomefile.find('XX', found+1)

I get output like this:

    XXXX_ found at location 51315
    XX_ found at location 51315
    XX found at location 51315
    XX found at location 51316
    XX found at location 51317
    XX found at location 52321
    XX found at location 53328

I wrote it to find all occurrences of the 'XXXX' and 'XX' strings. But, as you can see, the script also finds XX at locations 51315, 51316 and 51317, which all fall inside the string XXXX.

Is there a way to search for all occurrences of XX while skipping the XXXX location?

Thank you.
Michele
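
One possible way to approach the question above is a regular expression with a negative lookbehind and a negative lookahead, so that XX only matches when it is not preceded or followed by another X. This is a minimal sketch, not a definitive answer: the function name find_exact_runs and the sample string are hypothetical, and it uses Python 3 print syntax, whereas the code in the post is Python 2.

```python
import re


def find_exact_runs(text, char, length):
    """Return the start offsets of runs of `char` that are exactly `length` long.

    Hypothetical helper: a run is skipped when it is part of a longer run,
    e.g. the XX inside XXXX.
    """
    c = re.escape(char)
    # (?<!X) : the character before the match must not be X
    # X{2}   : exactly `length` copies of the character
    # (?!X)  : the character after the match must not be X
    pattern = re.compile(r"(?<!%s)%s{%d}(?!%s)" % (c, c, length, c))
    return [m.start() for m in pattern.finditer(text)]


# Made-up sample text shaped like the fields in the post.
sample = "CODE Read: XXXX TEXT Read: XX CHAR Read: XX"

print(find_exact_runs(sample, "X", 2))  # [27, 41]
print(find_exact_runs(sample, "X", 4))  # [11]
```

The same idea can be kept with str.find by checking the characters just before and after each hit, but the lookaround pattern expresses the "not part of a longer run" condition in one place.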