The Cassandra integration is probably beyond the time I have available. If the locations in the code that need to be rewritten to use the FST are known, and a patch simply 'plugs in' the FST, that would be much easier. For example, I don't know how Cassandra stores the current key index...
Basically, I can write the FST serialization, deserialization, and key lookup code fairly easily by basing it on Lucene's terms dict.

On Fri, Jun 8, 2012 at 6:00 AM, Radim Kolar <h...@filez.com> wrote:
>> If you are interested I can help, I used the FST on a Hadoop project
>> to implement a fast map side range join.
>
> create JIRA item with patch attached, i will test it.