Re: "ouch: the beginning of the end"

Mark Waddingham via use-livecode Wed, 08 Mar 2017 04:02:24 -0800

Hi Dr Hawkins,

I've been away on holiday for just over a week, and this thread has got
quite long, so I thought it easier to answer the original post rather
than some off shoot on it.


On 2017-03-03 00:13, Dr. Hawkins via use-livecode wrote:

I just got off the phone with the court clerk in Reno, and received the
beginning of the end . . .I figured it would come from some state oranther
in a year or two, but they are requiring me to use the *exact* pdf as
propagated by the court.

Having read the entire thread, my understanding of your problem is asfollows

(please correct if I am wrong):

----

You have PDF forms which are downloadable from a government department.Theyare intended for filling printing and then filling in - i.e. they do notuse

editable PDF forms (FPDF?).

The government department for whatever reason requires that the formsare usedexactly as is with the user filling in the relevant spaces within themand then

submitting.

There is some claim by said department that 'at some point' they willgetscanners which will be able to tell whether the original forms were usedor not

thus you are not allowed to recreate the non-user parts of the form.

----

Reading between the lines the latter requirements of the department arenotunreasonable - I suspect they would like to automate their processes asmuchas possible and as such would like to be able to have a computer via OCRorwhatever suck out the appropriate parts of forms at some point to removea

human from the equation.

Given that there is an obvious 'printing' element involved in this atpresentpixel-perfection is not exactly what they are looking for (unless theyareimagining they live in a world where all printers are capable ofabsolutelyperfect registration - some skew / offset is always going to be present)justthat whatever software they might use in the future to automate canlocatethe user written parts to suck out - therefore it is reasonable for themtorequire that the non-user sections are relatively laid out and lookprecisely

the same as if you printed the original PDF.

I'll run on these above assumptions for now.

----

First of all let me just point out that EPS is definitely *not* what youwant.


EPS is just a PostScript program with appropriate comments describing an

(optional) pre-rendered thumbnail, and other print related metadata soitcan be embedded in another document. Rendering EPS properly requires afullPostScript interpreter - many programs which 'support EPS' actually onlysupport

rendering the thumbnail and then only printing on a PostScript printer.

Indeed, there is a good reason why no non-GPL full open-sourcePostScriptinterpreter exists (as far as I'm aware at least) - they are complexpieces

of software which have a high degree of commercial value.

Whilst Linux and Mac users might be used to transparent PostScriptsupport thisis only because GhostScript is installed as an innate part of theprinting toolchain on those platforms - thus this is an innate part of the 'system'and assuch you can write non-GPL applications which use it as you don't needto distributeit with your app. On all other platforms, however, you are looking athaving todistribute a PS interpreter with your app - and at that point you arehit by theGPL (in particular, in your case, it would classify as an 'innate'requirement

of your application and non-optional and thus virality would kick in).

So, if you want a PostScript interpreter in your app you are going tohave topay $$$$$ to license such a thing. (Including such a thing in LiveCodewouldrequire license fees or development costs way above what most peoplewould wantto pay for a feature they would probably rarely if ever use and as suchit isunreasonable to expect LiveCode to support such things cross-platform aspart of

the standard license fee - event at the Business license level).

One of the main reasons that Adobe created PDF was to avoid needing aPostScriptinterpreter to accurately create 'archival' type quality representationsof printabledocuments and to provide a much easier way to edit / amend and modifysuch documents.As PDF is just a data structure the latter can be done with processing ageneratedPDF. As EPS/PS are actually a program all bets are off for editing - theprogramdoes what it is written to, and you can write it in any way you want. Ifyou want to

'edit' it, you need to edit the program.

However....

PDF is also a large complicated format whose reading, writing andrasterisation

has huge commercial value.

Up until Google bought and open-sourced *part* of FoxIT so they couldinclude afull and complete cross-platform PDF renderer in Chrome (in the form ofPDFium)there was no non-GPL open-source full and complete PDF rendereravailable in

the open-source world that I know of.

As far as I'm aware all such open-source libraries for PDF rasterisationandmanipulation which existed up until that point where GPL and all of themoffercommercial licensing terms. The costs of which are substantial - again,welloutside the cost of what you could reasonably expect to get 'built in'to the

LiveCode license at any level.

Of course, when you look into what Google did you find out that whilstPDFiumis FoxIT - it is only a *subset* of FoxIT. Google only licensed therasterisationpart - PDFium does not contain any of the public APIs which allowediting, merging,

modification and re-export of PDFs.

Again, you can understand why - the latter part of PDF manipulation hasperhapsthe greatest part of the commercial value and since Google only wantedrasterisation

that was all they were going to pay for.

----

So, just to reiterate, the expectation that LiveCode should contain afull PS/EPS/PDFrendering, manipulation and 'do whatever I want' type thing in it on allplatforms issomewhat beyond the current price of the license fee. Or should I say,far beyond whatanyone one person/organisation who does not need such functionality(which are most people)

would be willing to pay.

(I should point out here that I know what is involved in writing both aPostScriptinterpreter, and PDF renderer as I have written a partial implementationof both in thedim and distant past - for RiscOS in the early 1990's... Back when PSwas still mostlyLevel 2, and the PDF spec weighed in at around 150 pages... PostScriptis now universallyat Level 3, and the PDF spec weighs in at 700+ pages - thus I do notbegrudgethe commercialization of such libraries at all as they are large heftypieces of work whichhave to deal with inputs which may or may not completely conform tospecification).

Anyway, bemoaning about the costs of developing and supporting suchthings aside back

to your actual problem...

First of all on some platforms what you want to do is actually not allthat hard at all.

Mac and iOS both include full built-in PDF rendering and emissionsupport. CoreGraphicscan both load and render PDF directly *and* also render and save PDFdirectly which meansthat it is relatively straightforward (with a bit of LiveCode Builder orC++) to do whatyou want - i.e. render an original page of a PDF then render some texton top. However,it is important to point out that this approach will not result in thePDF necessarilybeing original PDF + extra bits since you are re-rendering the PDF(although I don'tthink this is a problem in your case as it sounds like there is animplicit may go through

an actual scanner in the government departments process).

Similarly, Linux always includes a postscript interpreter in its defaultinstall if youinstall printing support. PDF can be rendered in PostScript by using anappropriateheader PostScript program (which converts the PDF data structure into aPostScriptprogram - in fact the main rendering bits in PDF are actually PostScriptprogramsjust with a very fixed set of well defined operators which you candefine in a PSenvironment). Thus on this platform you could emit the necessary header,the PDF

and then the additions you require as PostScript programs.

Where you run into difficulty is on Windows and Android. Neither ofthese platformsinclude either publicly accessible PDF nor PS support (although itappears Windows

10 might have a built in PDF Printer at least...).

----

So what options are there?

- Option 1 - bi-level background images

Here I'm assuming that your original PDFs do not change that often and(given therequirements you have found out from the government department involved)the formsmust be used as is. Thus, I presume any 'recurring sections' would needto berendered on repeated images of the appropriate page rather than cuttingup the

original forms into pieces and just replicating those parts.

In this case, then pre-rendering all the pages as high-resolutionblack-and-white1bpp bitmaps and then rendering those underneath the LiveCode fields isprobably notthat bad an option. Given that the average printer people will be usingwill probablyonly have a true black-and-white resolution of 300-600dpi and mostprinted forms areonly about 5% black pixels you will get immensely high compressionratios. The onlyslight snafu here right now is that PDF printing support in LC does notyet existfor Android, and would need a small patch to pass PNG data straightthrough to thePDF (at present it only does this for JPEG). [ The reason PDF printingis not currentlysupported on Android is due to text rendering which is not astraightforward thing inPDF nor PostScript; the reason only JPEG image data is currentlysupported is thatwhen the pass-through was implemented the library we use to do PDFprinting - cairo -only supported it for JPEG, I *think* it does support certain PNGformats now though

since we updated the library for other reasons a while back ].

- Option 2 - augment the original PDF

PDF documents can be augmented after creation - the data structure isdesigned toallow revisions which overlay the original document. Thus it should bepossible to

generate modifications to the original PDF and append them to it.

The difficulty here is that it would require some intimate knowledge ofthe PDFdocument structure (although far less than what would be required togenerate onefrom scratch). Basically, you provide modified page objects for eachpage and amodified 'page tree' which first contains all the original things on thepageand then adds text objects (which is not too bad to generate if you justwant ASCIIcharacters in one of the built in fonts such as Helvetica) in the placesyou need.

Such a process could be implemented in LiveCode Script and would becompletelyindependent of platform. Also, it would preserve the original PDFentirely (noround-tripping through a PDF rasterizer) as you would only be adding towhat

was already there.

How much work would be involved in writing said script, however, isanother matter.

- Option 3 - wait until LiveCode can render PDFs directly as an objecton a card

This is obviously what you had hoped you could do and whilst notentirelyunreasonable, I hope you can appreciate from the above why you currentlycannot -

particular on all platforms.

PDFium does at least give us a starting point - however it isn't theeasiest of librariesto build or maintain building of and there's still a fair bit of work weneed to do toallow it to function cross-platform (not least the building of it forall platforms!).

Also, lamentably, that is only one side of the story - you also need togenerate PDFs,which means some library to output PDF is needed which is happy to bindto PDFium'srasterisation implementation. This is certainly not something which isexposed in thepublic APIs of PDFium, and would probably require bespoke customisationof PDFium to

achieve.

- Option 4 - focus on Mac/iOS and do other platforms later

As mentioned above, both Mac and iOS include PDF rendering and emissionas part ofCoreGraphics - they also include relatively straightforward APIs fordrawing typeset

text. The process here would be:

  1) Create a CG PDF output context
  2) Load your original PDF as a CG PDF object
  3) For each page:
     i) Render the original page into the PDF output context
     ii) Render the text into the appropriate places on the page
  4) Finalize the output context to generate a PDF

I recently did some work for a business services request which needed torenderportions of a PDF to a new PDF on Mac - and it turned out to be around50 lines ofC to do that. Rendering the text you would need through CoreText wouldbe a little

more than that, but nothing too onerous.

----

So anyway, sorry to be the bearer of perhaps not entirely great news,however whatyou want to do is certainly possible - but like most things will requiresome leg-work

and a little bit of patience and/or some financial investment.

I do strongly suggest you contact business services(https://livecode.com/services/)about what you need here. It is important to understand that whilst wewould like todo everything, we do need a way to prioritise what we focus on. WhilstPDF renderingand output features are (obviously) quite widely useful for lots ofthings they are alsosubstantial and large features to develop and maintain (if they weren'twe would besurrounded by lots of open-source non-GPL implementations to choose fromand base themon) thus progress on them generally in terms of additions to the coreproduct are likelyto be slow. However you do have a very specific use-case with welldefined inputs andoutputs so we may be able to help you for far less then it would costyou to commerciallylicense the relevant cross-platform libraries you need and/or a platformwhich providesthe functionality out of the box. (My gut tells me that starting withMac/iOS due totheir built in API support for what you want to do is probably the bestfirst step to takeat least then you get a product which works as it needs to to - and likeany venture, thesooner you ship, the sooner you can generate revenue to reinvest andexpand!).


Warmest Regards,

Mark.

--
Mark Waddingham ~ m...@livecode.com ~ http://www.livecode.com/
LiveCode: Everyone can create apps

_______________________________________________
use-livecode mailing list
use-livecode@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode

Re: "ouch: the beginning of the end"

Reply via email to