Hello everyone,

I’ve just come back to LiveCode and I'm pretty little rusty. I used to do some 
basic text analysis of files where the lines containing strings of interest 
were consistent and therefore easy to spot. I am now working on files where the 
chunk of text that contains the data I want is more ambiguous. I figure I 
should be using MatchChunk and was wondering if anyone might give me some tips 
on how to do the following. The chunk that I want to extract will have a 
certain word or phrase near its start and a certain word or phrase near its 
end. There may be many such chunks like it in the document, but the best 
candidate contains certain other strings. Here’s an example:

The chunk starts with the word *owner* or the phrase *beneficial owner*.

The chunk ends with *all directors* or *less than one percent*.

The chunk contains all of the following:
- At least four or five big numbers, e.g., 234,879
- At least two percentages, e.g., 3.4%, or percentage signs

If you are curious, this would more or less identify an ownership table in a 
proxy statement filed at the Securities and Exchange Commission. These are 
archived at the SEC in text and html (in vintages going back to about 1994).

Any tips or examples would be much appreciated.

Regards,

Gregory




_______________________________________________
use-livecode mailing list
use-livecode@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode

Reply via email to