If there are particular HTML errors you encountered as a pattern in the
journal website, please feel free to let us know about them on the
Journal listserv, if you're interested.
On 1/4/2011 12:30 PM, Louis St-Amour wrote:
Given my journal2epub script's experience with the Code4Lib journal
site, does Anthologize have an option to produce TOC items from post
headings, modifying the HTML to add IDs where necessary? Does it map
links to posts with their offline copies, preserving references? Does
it try to add the largest image it can, or does it include only
embedded, potentially smaller ones? (In iBooks, unlike Adobe-based
readers, you can double-tap to zoom in on an automatically resized
large image.) Are metadata and stylesheets specified manually? And
finally, does it clean up the HTML to produce strict XHTML 1.1 as
required? In the journal's case, I had to process HTML three times
with manual checks to delete invalid attributes before things would
mostly validate. (Turns out validation is the hardest thing about
automatically producing EPUB files.) As to my script's use in
producing official EPUB files, sure, that's why I made it. But if you
look closely, it makes assumptions about the HTML structure of the
pages, so it might need modifications if the design or templates
change.
Louis.
Sent from my iPhone
On 2011-01-04, at 12:01 PM, "Hanrath, Scott"<[email protected]> wrote:
Anthologize lets you be as picky as you like about the content you use
with it. Essentially you create multiple Anthologize 'projects', then add
the whatever subset of content you need (native local WordPress content or
content imported via a feed) to the project. The Anthologize content is
added as copies, preserving the originals and allowing for editing
specific to your output needs.
Eric's right that it *is* manual and a bit tedious, but it's (hopefully)
getting less so. You do need to created a 'part' structure within your
project to organize your content. But when adding content you can filter
by Tag/Category/Date Range/Post Type. And with the last release you can
add more than one post at a time.
The Anthologize dev team would certainly be interested in the code4lib
journal committee's take on the tool and ways it could be improved.
(Support for automated project creation and output generation would an
interesting feature to see on the roadmap).
-- Scott
On 1/4/11 10:45 AM, "Eric Lease Morgan"<[email protected]> wrote:
On Jan 4, 2011, at 11:40 AM, Jonathan Rochkind wrote:
...Is there any easy way to get it to, for instance, make an anthology
of
all the posts with a certain WordPress tag or category instead?...
Based on my (poor) recollection of playing with the Anthologize plug-in,
the process is a bit manual. Initialize epub. Drag postings to it.
Annotate/tweak titles. Click 'Go'. Get epub file. The process is not
laborious, just a bit tedious. I would definitely recommend the "Journal
Committee" experiment with Anthologize.
--
Eric Morgan