I think a DJVU filter for Tracker would be an important addition. The
DJVU format is becoming popular among people doing DIY book scanning,
thanks to these excellent tools:

1) Scan Tailor, for image post-processing
http://scantailor.sourceforge.net/

2) djvubind, a script for Linux that creates a DJVU file with searchable, 
positioned OCR text from Scan Tailor's output
http://code.google.com/p/djvubind/
A .deb package is available here: http://code.google.com/p/djvubind/downloads/list


It seems that eventually the goal is to incorporate the functions of djvubind 
into the Scan Tailor GUI. The tools can be used on any images of pages, whether 
generated by a flatbed scanner (which I use) or one of the homebrew 
camera-based scanners developed here:
http://www.diybookscanner.org/

All of that is to say that I think DJVU may become somewhat less obscure;
the Internet Archive uses the format, too.

The following thread from the Tracker mailing list states that what is needed 
is an extractor module for Tracker (as opposed to a filter; filters are no 
longer used in Tracker >= 0.7.x):
http://mail.gnome.org/archives/tracker-list/2010-August/msg00022.html

Here's the howto for writing an extractor module:
http://library.gnome.org/devel/libtracker-extract/unstable/libtracker-extract-How-to-use-libtracker-extract.html

I think the command needed for the extractor module is djvutxt, which is
part of the djvulibre-bin package.
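
To make the djvutxt idea concrete, here is a minimal sketch in Python of
pulling the text layer out of a DJVU file. It is not a Tracker extractor
module (the howto above covers that API), only an illustration of the
command such a module might wrap. It assumes djvutxt is on the PATH and
that the text layer decodes as UTF-8. The function names and the `runner`
parameter are hypothetical, added so the code can be tried without
djvulibre-bin installed.

```python
import subprocess


def build_djvutxt_command(djvu_path):
    """Return the argv list for dumping the text layer of a DJVU file."""
    # Given only an input file, djvutxt writes the text to standard output.
    return ["djvutxt", djvu_path]


def extract_djvu_text(djvu_path, runner=subprocess.run):
    """Return the text layer of djvu_path as a string.

    `runner` is a hypothetical hook (defaults to subprocess.run) that lets
    the function be exercised without djvutxt being installed.
    """
    result = runner(
        build_djvutxt_command(djvu_path),
        capture_output=True,  # collect stdout as bytes
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    # Assumption: output is UTF-8; undecodable bytes are replaced.
    return result.stdout.decode("utf-8", errors="replace")
```

Calling extract_djvu_text("book.djvu") on a file produced by djvubind
should return the OCR text, which is the content an extractor module would
need to hand over to Tracker for indexing.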

This is as far as I've gotten. Maybe I will be able to figure out how
to write an extractor module for text and metadata, or maybe this
information can help someone else with more skills.

-- 
Tracker ignores contents of djvu files
https://bugs.launchpad.net/bugs/428599