I have two files file1 in format <id> <val1> <test1> <test2> 'AA' 1 T T 'AB' 1 T F
file2 same as file1 <id> <val1> <test1> <test2> 'AA' 1 T T 'AB' 1 T T Also the compare should be based on id. So it should look for line starting with id 'AA' (for example) and then match the line so if in second case. so this is what I am looking for: 1. read both files. 2. read id of first line in file1 check if it matches with the same id in file2. 3. repeat step 2 for all lines in file1. 4. return a percent of success to failure. ie if one line matches and one lines does'nt then return 0.5 or 50% I wrote a boolean version ..as a start def getdata(f): try: f1 = open(f,'r') data=[] for eachline in f1.readlines(): data.append(re.split("", re.sub('\n','',strip(re.split('\s\s+',eachline)[0])))) return data except IOError: raise("Invalid File Input") if __name__=='__main__': data1 = getdata('file1') data2 = getdata('file2') if data1 == data2: print "True" else: print "False" hope I am clear... -- http://mail.python.org/mailman/listinfo/python-list