Hello,
1- I need help to extract a selected text from a pdf document.
Tet pdflib can do that and i need help to execute command line using
Pharo.
I am able to do the work using a bat file but i think we can do better.
2- How we can code properly the delay (when we wait the Bat file)
Thank you for your help.
https://www.pdflib.com/products/tet/
|bat pdf stream pdfAsText contents |
bat := 'C:\some\code.bat'.
pdf := 'C:\some\document.pdf'.
stream := bat asFileReference writeStream.
"Console Windows code"
stream nextPutAll: 'CD "C:\Program Files\PDFlib\TET 5.0 64-bit\bin"';
nextPut: Character linefeed;
nextPutAll: 'TET --samedir --lastpage last-1 --pageopt "includebox={
{98.38 693.32 253.41 709.91} {522.70 183.66 595 226.30} {416.97 574.79
479 598.03} {401.33 773.91 453 789.55} {294.18 575.74 369.56 598.5}
{295.6 492.3 373.35 524.43} {150.53 414.55 219.28 438.25} {47.17 494.67
98.85 520.75} {112.26 684.01 193.66 698.03} {533.61 212.57 595 620}
{100.75 657.29 217.85 673.41}}"';
nextPut: Character space;
nextPutAll: pdf;
nextPut: Character linefeed;
nextPutAll: 'EXIT';
close.
OS2Process command: bat.
pdfAsText := String streamContents: [ :s | 1 to: (pdf size - 4) do: [
:i | s nextPut: (pdf at: i) ]. s nextPutAll: '.txt'; close ].
"C:\some\document.txt"
(Delay forMilliseconds: 500)wait.
contents := pdfAsText asFileReference contents.
bat asFileReference delete.
pdfAsText asFileReference delete.
contents
================================================================
Missatge enviat a través del Webmail de Girona.com
http://www.girona.com
================================================================