On Jun 21, 8:47 am, cjl <[EMAIL PROTECTED]> wrote:
> I would like to be able to extract the name of the street from the
> addresses in my data, however they vary significantly. Here are some
> examples:
>
> 25 Main St
> 2500 14th St
> 12 Bennet Pkwy
> Pearl St
> Bennet Rd and Main st
> 19th St
>
> As you can see, sometimes I have the house number, and sometimes I do
> not. Sometimes the street name is a number. Sometimes I simply have
> the names of intersecting streets.
>
> How might I approach this complex parsing problem?

In the past, I used the data cleansing routines from Febrl to do this with some success.

http://www.google.com/search?q=febrl

HTH.
...
Jay Graves
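
If a full data cleansing package is more than you need, a small hand-rolled heuristic may cover the six sample addresses. The following is only a rough sketch and is not Febrl's API: the function name `street_names` and the `SUFFIXES` set are made up for illustration, and the suffix list is an assumption that would need extending for real data.

```python
import re

# Hypothetical, far-from-complete set of street-type suffixes (an assumption).
SUFFIXES = {"st", "rd", "pkwy", "ave", "blvd", "dr", "ln"}


def street_names(address):
    """Return a list of street names found in a free-form address string."""
    names = []
    # An intersection such as "Bennet Rd and Main st" holds several streets.
    parts = re.split(r"\s+(?:and|&)\s+", address.strip(), flags=re.IGNORECASE)
    for part in parts:
        tokens = part.split()
        # Treat a leading all-digit token as a house number, but only when
        # at least two more tokens (name and suffix) follow it.
        if len(tokens) > 2 and tokens[0].isdigit():
            tokens = tokens[1:]
        # Drop a trailing street-type suffix such as "St" or "Pkwy".
        if len(tokens) > 1 and tokens[-1].lower().rstrip(".") in SUFFIXES:
            tokens = tokens[:-1]
        if tokens:
            names.append(" ".join(tokens))
    return names


if __name__ == "__main__":
    samples = ["25 Main St", "2500 14th St", "12 Bennet Pkwy",
               "Pearl St", "Bennet Rd and Main st", "19th St"]
    for sample in samples:
        print(sample, "->", street_names(sample))
```

Numbered streets like "14th" survive because only all-digit tokens count as house numbers. The sketch will get things wrong on addresses it was not written for (unit numbers, directional prefixes, a house number with no suffix after the name), which is where a dedicated cleansing library earns its keep.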