On Jun 21, 8:47 am, cjl <[EMAIL PROTECTED]> wrote:
> I would like to be able to extract the name of the street from the
> addresses in my data, however they vary significantly. Here a some
> examples:
>
> 25 Main St
> 2500 14th St
> 12 Bennet Pkwy
> Pearl St
> Bennet Rd and Main st
> 19th St
>
> As you can see, sometimes I have the house number, and sometimes I do
> not. Sometimes the street name is a number. Sometimes I simply have
> the names of intersecting streets.
>
> How might I approach this complex parsing problem?

In the past, I used the data cleansing routines from Febrl to do this
with some success.

http://www.google.com/search?q=febrl

HTH.
...
Jay Graves



-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to