Yes, the index for the catalog of documents, consisting of the idx and psx files, not the component of a PDF file also called catalog.
Do you know of any FOSS software that supports the document catalog? I was assuming that I would have to track down the formats and do it on the QT (My Linux desktop is KDE). -- Shmuel (Seymour J.) Metz http://mason.gmu.edu/~smetz3 ________________________________________ From: IBM Mainframe Discussion List [[email protected]] on behalf of Paul Gilmartin [[email protected]] Sent: Sunday, April 18, 2021 6:05 PM To: [email protected] Subject: Re: Format of PDF collections? On Sun, 18 Apr 2021 21:36:14 +0000, Seymour J Metz wrote: >That's information on an individual PDF file, not the catalog for the entire >collection. > By "catalog" do you mean any of these?: 613 $ ( ls -l */*.idx *.pdx ) -rwxr-xr-x@ 1 paulgilm staff 543 Apr 1 02:29 zOSV2R4-Search-Index.pdx -rwxr-xr-x@ 1 paulgilm staff 5104 Apr 1 02:29 zOSV2R4-Search-Index/index.idx -rwxr-xr-x@ 1 paulgilm staff 241200313 Apr 1 01:42 zOSV2R4-Search-Index/index1.idx -rwxr-xr-x@ 1 paulgilm staff 159263466 Apr 1 02:29 zOSV2R4-Search-Index/index2.idx 614 $ Some have expressed an desire here to convert those indices to another format. It might be easier to start afresh with a (FOSS) indexing tool on the PDF collection. >BTW, that output doesn't show "SA23-2292-40" anywhere; does that mean that IBM >doesn't include the form code in the metadata? -- gili ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN ---------------------------------------------------------------------- For IBM-MAIN subscribe / signoff / archive access instructions, send email to [email protected] with the message: INFO IBM-MAIN
