Hello,
I'm looking for a database backend for a dictionary coder project. It
would have three major tasks:
- take a text corpus, get their words and substitute each word by a 64
bit integer (the word:integer is always constant) and store the result
(encoding)
- take the previous result and s
On 01/24/2011 03:19 AM, Ben Chobot wrote:
On Jan 23, 2011, at 3:29 AM, Attila Nagy wrote:
Hello,
I'm looking for a database backend for a dictionary coder project. It would
have three major tasks:
- take a text corpus, get their words and substitute each word by a 64 bit
integer
On 01/24/2011 05:27 AM, Fredric Fredricson wrote:
I can easily do this with any RDBMS, with a table of three columns:
auto incremented ID, word and refcount, with a unique index on word.
The challenge could be:
- that it should scale to several TBs of size and several (hundred)
billion of re