Might be of interest to someone painting a bikeshed. I played a little 
with the Levenshtein module.

$ grep -E --no-filename -R -o -w '[a-z]{4,15}' src/sage > /tmp/all
$ sort /tmp/all | uniq -c | grep -F -w 1 | colrm 1 8 > /tmp/singles
$ sort /tmp/all | uniq -c | grep -F -v -w 1 | colrm 1 8 > /tmp/nonsingles
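In case the pipeline is hard to read: the three commands above collect every lowercase word of 4 to 15 letters, count occurrences, and split the words into those seen exactly once and those seen more often. A rough Python sketch of the same counting step is below. The name split_by_frequency is made up for illustration, and it takes a string rather than walking src/sage, so treat it as an approximation and not a drop-in replacement.

```python
import re
from collections import Counter

# Roughly the same pattern as the grep call: whole words of 4 to 15
# lowercase letters (word boundaries stand in for grep's -w option).
WORD = re.compile(r'\b[a-z]{4,15}\b')

def split_by_frequency(text):
    """Return (singles, nonsingles): words seen once, words seen more often."""
    counts = Counter(WORD.findall(text))
    singles = sorted(w for w, n in counts.items() if n == 1)
    nonsingles = sorted(w for w, n in counts.items() if n > 1)
    return singles, nonsingles
```

The idea is that a word occurring only once in the whole tree is a typo candidate, while a word occurring several times is probably spelled as intended.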

And now

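#!/usr/bin/env python3

from Levenshtein import distance

# Words that occur exactly once in the source tree (typo candidates).
# split() drops the trailing newlines that readlines() would keep.
with open('/tmp/singles') as f:
    singles = f.read().split()

# Words that occur more than once (assumed to be spelled correctly).
with open('/tmp/nonsingles') as f:
    nonsingles = f.read().split()

# Report each rare word that is one edit away from some common word.
for s in singles:
    for w in nonsingles:
        if distance(s, w) == 1:
            print(s + " " + w)
            break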

This can be used to spot a few typos.
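
For readers without the Levenshtein module installed, the "distance equals 1" test can be sketched in plain Python. The helper one_edit_apart below is hypothetical and not part of that module; it only answers whether two words differ by exactly one insertion, deletion or substitution, which is all the script needs.

```python
def one_edit_apart(a, b):
    """True if a and b differ by exactly one insertion, deletion or substitution."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) string
    # Skip the common prefix.
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1
    if len(a) == len(b):
        # Substitution: everything after the differing character must match.
        return a[i + 1:] == b[i + 1:]
    # Insertion: the rest of a must match b once one character of b is skipped.
    return a[i:] == b[i + 1:]
```

Note that swapping two adjacent letters counts as two edits under plain Levenshtein distance, so a pair like "matirx" and "matrix" would not be reported by a check for distance 1.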

-- 
Jori Mäntysalo

Tampereen yliopisto - Ihminen ratkaisee
