Re: Re[4]: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-02 Thread Marcos Douglas
On Tue, Nov 2, 2010 at 8:32 PM, DaWorm wrote: > > > On Tue, Nov 2, 2010 at 11:45 AM, Marcos Douglas wrote: >> >> If I found a match, I need to know which page the >> token was found... >> > > That may prove most difficult, since I doubt your pdf to text will preserve > that very well. Yeah... th

Re: Re[4]: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-02 Thread DaWorm
On Tue, Nov 2, 2010 at 11:45 AM, Marcos Douglas wrote: > If I found a match, I need to know which page the > token was found... > > That may prove most difficult, since I doubt your pdf to text will preserve that very well. Jeff. ___ fpc-pascal maillis

Re: Re[4]: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-02 Thread Marcos Douglas
On Tue, Nov 2, 2010 at 11:38 AM, José Mejuto wrote: > Hello FPC-Pascal, > > Tuesday, November 2, 2010, 11:02:18 AM, you wrote: > > TH> If I understand it correctly, this assumes reading the whole file into > TH> memory at once. Depending on the size of that file and other conditions, > TH> this ma

Re[4]: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-02 Thread José Mejuto
Hello FPC-Pascal, Tuesday, November 2, 2010, 11:02:18 AM, you wrote: TH> If I understand it correctly, this assumes reading the whole file into TH> memory at once. Depending on the size of that file and other conditions, TH> this may or may not be advisable... Yes, and a pdf2text conversion will

Re: Re[2]: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-02 Thread Tomas Hajny
On Mon, November 1, 2010 21:52, José Mejuto wrote: . . > So if you are looking for ASCII words, use PDF2Text and use the POS > function over the result: > > function HaveString(Filename: String; TheString: string): Boolean; > var > F: TFileStream; > S: String; > AtPos: integer; > begin >

Re: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-01 Thread Graeme Geldenhuys
Hi, I don't know the PDF file format at all, but I would guess if you look at the many pdf-to-text conversion tools, you will see what they have done. In all likelihood, those conversion tools will be written in C, which you will have to translate to Object Pascal - but that is normally not a diff

Re[2]: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-01 Thread José Mejuto
Hello FPC-Pascal, Monday, November 1, 2010, 8:40:45 PM, you wrote: MD> On Mon, Nov 1, 2010 at 4:27 PM, Alberto Narduzzi MD> wrote: >> Sorry, >> >>> I agree. But as I search for text within PDF files? >> >> I assumed true the following statement of yours... >> >> [Somebody can help me please? >>

Re: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-01 Thread Marcos Douglas
On Mon, Nov 1, 2010 at 4:27 PM, Alberto Narduzzi wrote: > Sorry, > >> I agree. But as I search for text within PDF files? > > I assumed true the following statement of yours... > > [Somebody can help me please? > I need to search strings in Text files using just FPC.] Yes, I changed my first mail

Re: [fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-11-01 Thread Alberto Narduzzi
Sorry, I agree. But as I search for text within PDF files? I assumed true the following statement of yours... [Somebody can help me please? I need to search strings in Text files using just FPC.] so I suggested the fast-enough (in my opinion) use of the Pos function. If _now_ you want to sea

[fpc-pascal] Text scan in text files - (was: Full text scan - PDF files)

2010-10-31 Thread Marcos Douglas
On Tue, Oct 26, 2010 at 11:54 AM, Marcos Douglas wrote: > WTF? > http://lists.freepascal.org/lists/fpc-pascal/2010-October/027067.html > > Well... again: > > I need to do a full text scan in PDF files i.e. convert PDF in plain > text, parser and save the informations. > I saw the XPDF[1] lib but I