Simon, We have almost everything you listed in our TODO http://www.sai.msu.su/~megera/wiki/todo
btw, there is gendict subdirectory, which help people to generate dictionaries (including snowball stemmers) for tsearch2.
Oleg On Tue, 3 Oct 2006, Simon Riggs wrote:
I'm looking at some of the code in contrib/tsearch2/snowball and see that the code there is *generated* code. The Snowball stemmer produces this C code in much the same way bison reads gram.y My understanding is that the Snowball code moves forwards regularly and there are many other stemmers we could be including with the distribution. Snowball has a BSD licence: http://snowball.tartarus.org/license.php Would it be possible to include the Snowball source directly and allow its execution to be part of the make process for tsearch2? Or have configure check for Snowball at make time? At the very least it would be good to have a Readme file explaining how to modify the Snowball stemmer and regenerate for tsearch2. That would then encourage people to improve the stemmers, as well as allow us to include French and Spanish versions etc.. Perhaps we should ask translators to provide stop word lists for their languages. It seems a shame to have docs in so many languages, but no language capability for Tsearch2. Also, why do we have another crc32 implementation in there?
Regards, Oleg _____________________________________________________________ Oleg Bartunov, Research Scientist, Head of AstroNet (www.astronet.ru), Sternberg Astronomical Institute, Moscow University, Russia Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(495)939-16-83, +007(495)939-23-83 ---------------------------(end of broadcast)--------------------------- TIP 5: don't forget to increase your free space map settings