On Fri, Oct 25, 2002 at 01:07:08AM -0500, _brian_d_foy wrote: > In article <[EMAIL PROTECTED]>, Perl Authors Upload Server ><[EMAIL PROTECTED]> wrote: > > > The following module was proposed for inclusion in the Module List: > > > modid: Finance::CompanyNames > > > description: Searches free text for names of companies > > how does it know what a company name is?
Well, that's the trick. You tell it what company you are looking for (e.g. O'Reilly and Associates), and the module finds instances of that company in the free text (e.g. O'Reilly, O'Reilly and Assoc., ORA, etc.) The main point is that the code understands permutations applied to company names in the English language, such as contraction, abbreviation, embellishment, and so forth. > if you can only find public company names, for instance, > then the name should say something about that. No, it can find any company name specified, although the body of work mentioning public companies is much bigger. This module does not provide any list of company names whatsoever. > i also don't think this is the best name. i thought it > was something completely different (a translation from > ticker symbols to names and the other way around). perhaps > something like "Extract" should be somewhere in there. :) As usual with CPAN, we hope to put it in the section where people will actually find it. Finance or Business is the most suitable first word, but unfortunately Finance is in Chapter 23 "Also Ran", and this module is about text. So either would be appropiate. How about Text::ExtractCompanyNames? Business::ExtractCompanyNames? Regards, jwb