This was written over 20 years ago. Things haven't changed? Modern, mostly university sites or large funded private archives, had put their documents into a form that allows for on-the-fly formatting. This also allows the files to be sorted and searched in a database format, basically almost instantly. The MIA HTML doesn't conform in large part of any database searches except brute-strength meta tag searches and text searches. With the most data now in PDF format, it makes it even harder. We started out with HTML 1.0 more...or less and we didn't enforce compliance. Bad on us but that is what volunteer groups do I suppose in trying to recruit volunteers.
David -=-=-=-=-=-=-=-=-=-=-=- Groups.io Links: You receive all messages sent to this group. View/Reply Online (#35531): https://groups.io/g/marxmail/message/35531 Mute This Topic: https://groups.io/mt/111336789/21656 -=-=- POSTING RULES & NOTES #1 YOU MUST clip all extraneous text when replying to a message. #2 This mail-list, like most, is publicly & permanently archived. #3 Subscribe and post under an alias if #2 is a concern. #4 Do not exceed five posts a day. -=-=- Group Owner: marxmail+ow...@groups.io Unsubscribe: https://groups.io/g/marxmail/leave/13617172/21656/1316126222/xyzzy [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-