If all that you want to do is retrieve the text then you should look into Apache Tika. It’s a programmers tool that does text extraction including OCR of PDF.
Tika.apache.org/ Regards, Dave Sent from my iPhone > On Sep 22, 2019, at 6:16 AM, Rory O'Farrell <ofarr...@iol.ie> wrote: > > Copy to unsubscribed poster: > > On Sun, 22 Sep 2019 08:02:28 -0400 > "David Myers" <get2le...@optonline.net> wrote: > >> Da'anzho Apache, (in the immortal words of Yogi Berra, You can look it up) >> >> >> >> It's prob'ly been ten year since I logged in, maybe longer, so I tried to >> register, BUT . my email is already in use, so I tried to log in, BUT . the >> two most common user names I use weren't recognized, BUT . you don't have a >> link to email me my username upon some further identification. >> >> >> >> Let me ask what I want to know. I wrote a 425 page novel 30 years ago >> before computers that almost got published, BUT . merger mania intervened, >> and those editors were offed. Staples converted it to PDF. Adobe >> supposedly converted that to Word, BUT . only the first 14 pages are >> editable, and eve they had every kind of typographical reconfiguration like >> bunching words together, some of which I could space apart, others I had to >> paste and redo, as well as 50 megaspaces between words on every page that I >> had to backarrow. >> >> The rest reverts to Picture Tools in purple above the task >> bar with that insidious cross of theirs everywhere I move the cursor, like >> the target fixture in a sub periscope screen, aiming to blow up my whole >> document AND Word with it. >> >> >> >> I don't recall seeing Apache Word or Writing or Office as an option in the >> Adobe export menu. IS THERE ANYTHING YOU CAN DO, SO I CAN EDIT MY OLD >> NOVEL? >> >> >> >> Thanks. >> >> >> >> David Myers >> >> 631-724-5675 >> >> get2le...@optonline.net <mailto:get2le...@optonline.net> > > This posting gives information on how to obtain access to an older Forum > account > https://forum.openoffice.org/en/forum/viewtopic.php?f=50&t=527 > > -- > Rory O'Farrell <ofarr...@iol.ie> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscr...@openoffice.apache.org > For additional commands, e-mail: users-h...@openoffice.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscr...@openoffice.apache.org For additional commands, e-mail: users-h...@openoffice.apache.org