I want to have IDs for the terms (words) not the documents! Also, I need the same ID for a word if it appears in more than one documents.
Example: Doc1: The sea is blue Doc2: Sky is blue For these two docs the dictionary would be [the]->1 [sea]->2 [is]->3 [blue]->4 [sky]->5 So I want to represent these docs by word-ids like this: Doc1: 1 2 3 4 Doc2: 5 3 4 Is there a way to use Lucene for this? I mean Lucene stores an internal dictionary. How can I access it? Thank you, Ilias On 10/31/07, Mark Miller <[EMAIL PROTECTED]> wrote: > The id does change. You need to index your own "id" field with the document. > > > Ilias Flaounas wrote: > > Dear experts, > > > > I need to store and index a string of text into Lucene, and later I > > want to get the Id of each term inside this string. Is it possible? > > How can I do that? > > > > I want a unique association, term (in my case a word) -> Id. I know, > > that If I delete a document, the dictionary changes. Does the term id > > change? > > > > > > Thanks a lot > > Ilias > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: [EMAIL PROTECTED] > > For additional commands, e-mail: [EMAIL PROTECTED] > > > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]