I am finding this discussion illuminating. Can I ask: would it make sense to use a database? For an application of mine, I had been looking at putting the text into PostgreSQL and using OpenFTS on top of it, at http://openfts.sourceforge.net/ (there is a Python module to talk to OpenFTS, although it is in an early stage of development).
Jim