Yes, the index for the catalog of documents, consisting of the idx and psx 
files, not the component of a PDF file also called catalog.

Do you know of any FOSS software that supports the document catalog? I was 
assuming that I would have to track down the formats and do it on the QT (My 
Linux desktop is KDE).


--
Shmuel (Seymour J.) Metz
http://mason.gmu.edu/~smetz3

________________________________________
From: IBM Mainframe Discussion List [[email protected]] on behalf of 
Paul Gilmartin [[email protected]]
Sent: Sunday, April 18, 2021 6:05 PM
To: [email protected]
Subject: Re: Format of PDF collections?

On Sun, 18 Apr 2021 21:36:14 +0000, Seymour J Metz wrote:

>That's information on an individual PDF file, not the catalog for the entire 
>collection.
>
By "catalog" do you mean any of these?:
613 $ ( ls -l */*.idx *.pdx )
-rwxr-xr-x@ 1 paulgilm  staff        543 Apr  1 02:29 zOSV2R4-Search-Index.pdx
-rwxr-xr-x@ 1 paulgilm  staff       5104 Apr  1 02:29 
zOSV2R4-Search-Index/index.idx
-rwxr-xr-x@ 1 paulgilm  staff  241200313 Apr  1 01:42 
zOSV2R4-Search-Index/index1.idx
-rwxr-xr-x@ 1 paulgilm  staff  159263466 Apr  1 02:29 
zOSV2R4-Search-Index/index2.idx
614 $

Some have expressed an desire here to convert those indices to another format.
It might be easier to start afresh with a (FOSS) indexing tool on the PDF 
collection.

>BTW, that output doesn't show "SA23-2292-40" anywhere; does that mean that IBM 
>doesn't include the form code in the metadata?

-- gili

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN

Reply via email to