Re: index and access to lines of a CSV file

2007-12-13 Thread Mike Klaas
On 13-Dec-07, at 3:26 PM, Tobias Rothe wrote: I have a quick question. I am handling huge CSV files. They start with a key in the first column, followed by data. I need to retrieve this data at random based on the key. So it is a kind of search where I give a unique key and ideally ac

index and access to lines of a CSV file

2007-12-13 Thread Tobias Rothe
Hello, I have a quick question. I am handling huge CSV files. They start with a key in the first column, followed by data. I need to retrieve this data at random based on the key. So it is a kind of search where I give a unique key and ideally get access to the right line. The file contains
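
One way to get this kind of keyed access, sketched here under assumptions rather than taken from the thread: scan the file once, record the byte offset at which each line starts, and later seek straight to that offset. The class name CsvOffsetIndex is hypothetical, and the sketch assumes a comma delimiter, unique keys, and keys that contain no quoted commas.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: maps the first CSV column (the key) to the byte
// offset of its line, so a lookup is one seek plus one line read.
final class CsvOffsetIndex {
    private final Map<String, Long> offsets = new HashMap<>();
    private final String path;

    // Scans the file once and records where each line begins.
    CsvOffsetIndex(String path) throws IOException {
        this.path = path;
        try (RandomAccessFile file = new RandomAccessFile(path, "r")) {
            long lineStart = file.getFilePointer();
            String line;
            while ((line = file.readLine()) != null) {
                int comma = line.indexOf(',');
                String key = comma < 0 ? line : line.substring(0, comma);
                if (!key.isEmpty()) {
                    offsets.put(key, lineStart);
                }
                // The pointer now sits just past the line terminator.
                lineStart = file.getFilePointer();
            }
        }
    }

    // Returns the full line for the key, or null if the key is unknown.
    String lookup(String key) throws IOException {
        Long offset = offsets.get(key);
        if (offset == null) {
            return null;
        }
        try (RandomAccessFile file = new RandomAccessFile(path, "r")) {
            file.seek(offset);
            return file.readLine();
        }
    }
}
```

Note that RandomAccessFile.readLine reads one byte at a time and maps each byte directly to a char, so this version is slow on very large files and only handles single-byte encodings cleanly. The in-memory map also grows with the number of keys, which may or may not be acceptable for the file sizes in question.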

Fwd: Oracle-Lucene Domain Index (New Release)

2007-12-13 Thread Marcelo Ochoa
FYI. From: J. Delgado <[EMAIL PROTECTED]> Once again, LendingClub.com, a social lending network that today announced nation-wide expansion (see Tech Crunch), is pleased to contribute to the open source community a new release (2.2.0.2.0) of the Oracle-Lucene Domain Index, a fast implementation of

Re: SpellChecker: Spanish Dictionary

2007-12-13 Thread Daniel Naber
On Thursday, 13 December 2007, Haroldo Nascimento wrote: > I am using the SpellCheck classes of Lucene to create the "Did you Mean" feature. > I need to load into memory all entries of the Spanish language (it will be my dictionary). > Where can I get (download) this dictionary? Maybe .txt

Re: Indexing Wikipedia dumps

2007-12-13 Thread Petite Abeille
On Dec 13, 2007, at 8:39 AM, Dawid Weiss wrote: Just incidentally -- do you know of something that would parse the Wikipedia markup (to plain text, for example)? If you find out, let us know :) You may want to check the partial ANTLR grammar for Wikitext: http://www.mediawiki.org/wiki/User

Re: Refreshing RAMDirectory

2007-12-13 Thread Michael McCandless
Ruslan Sivak wrote: Michael McCandless wrote: Ruslan Sivak wrote: Michael McCandless wrote: Ruslan Sivak wrote: Since my app would be multithreaded, there could be multiple threads accessing the reader while I'm reloading it. This means that if I close the reader, and another thread

Re: Refreshing RAMDirectory

2007-12-13 Thread Ruslan Sivak
Michael McCandless wrote: Ruslan Sivak wrote: Michael McCandless wrote: Ruslan Sivak wrote: Since my app would be multithreaded, there could be multiple threads accessing the reader while I'm reloading it. This means that if I close the reader, and another thread is using it, it might g

SpellChecker: Spanish Dictionary

2007-12-13 Thread Haroldo Nascimento
Hi, I am using the SpellCheck classes of Lucene to create the "Did you Mean" feature. I need to load into memory all entries of the Spanish language (it will be my dictionary). Where can I get (download) this dictionary? Maybe a .txt file. Thanks

Re: Query.rewrite - help me to understand it

2007-12-13 Thread 游泳池的鱼
Hehe, you can do a test with the PrefixQuery rewrite method and extract the terms, like this: query = prefixQuery.rewrite(reader); query.extractTerms(set); for (Object term : set) { System.out.println(term); } It will give you a pleasant surprise. 2007/12/13, Helmut Jarausch <[EMAIL PROTECTED

Re: Query.rewrite - help me to understand it

2007-12-13 Thread mark harwood
rewrite turns foo* into food, foot, football, etc. Those variant terms are found by looking in the index, hence the need for the IndexReader. Searcher calls rewrite on queries to do this; the highlighter just pre-empts this conversion and sneaks a peek at the terms generated so it can
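
The expansion described here can be illustrated without Lucene. Given a sorted term dictionary (standing in for the terms an IndexReader exposes), a prefix is expanded by walking the sorted terms from the prefix onward until a term no longer starts with it. The class PrefixExpansion and the sample terms are hypothetical; this is a sketch of the idea, not Lucene's actual rewrite implementation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Hypothetical illustration of expanding "foo*" against the terms in an index.
final class PrefixExpansion {
    // Returns every indexed term that starts with the prefix, in sorted order.
    static List<String> expand(TreeSet<String> indexedTerms, String prefix) {
        List<String> matches = new ArrayList<>();
        // tailSet starts at the first term >= prefix; because the set is
        // sorted, all terms sharing the prefix are contiguous from there.
        for (String term : indexedTerms.tailSet(prefix, true)) {
            if (!term.startsWith(prefix)) {
                break; // past the last matching term
            }
            matches.add(term);
        }
        return matches;
    }
}
```

With the terms bar, fog, food, foot, football, and fox in the set, expand(terms, "foo") returns [food, foot, football]. The result depends entirely on what the index contains, which is why the expansion cannot be done from the query string alone.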

Query.rewrite - help me to understand it

2007-12-13 Thread Helmut Jarausch
Hi, since I need highlighting, I need to 'rewrite' a query. Query.rewrite takes an object of type IndexReader. But what for? As I understand it, rewrite transforms a possibly complicated query into a simplified (internal?) form which is (unfortunately) needed by QueryScorer, which is in turn nee

Re: Refreshing RAMDirectory

2007-12-13 Thread Michael McCandless
Ruslan Sivak wrote: Michael McCandless wrote: Ruslan Sivak wrote: Since my app would be multithreaded, there could be multiple threads accessing the reader while I'm reloading it. This means that if I close the reader, and another thread is using it, it might get an exception. The n
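
One common way to handle the hazard described here is reference counting: each thread acquires the current reader before searching and releases it afterwards, and a replaced reader is closed only when its last user releases it. The sketch below uses a generic Closeable in place of a Lucene IndexReader. The class SharedReaderHolder and its methods are hypothetical, are not part of Lucene's API, and are not necessarily what the thread settled on.

```java
import java.io.Closeable;
import java.io.IOException;

// Hypothetical holder that lets readers be swapped while other threads use them.
final class SharedReaderHolder<T extends Closeable> {
    // A resource plus its reference count.
    static final class Ref<T extends Closeable> {
        private final T resource;
        private int count = 1; // one reference is owned by the holder itself

        Ref(T resource) { this.resource = resource; }

        T get() { return resource; }
    }

    private final Object lock = new Object();
    private Ref<T> current;

    SharedReaderHolder(T initial) {
        current = new Ref<>(initial);
    }

    // Call before using the reader; pair every acquire with exactly one release.
    Ref<T> acquire() {
        synchronized (lock) {
            current.count++;
            return current;
        }
    }

    // Call when done; closes the resource once nobody references it any more.
    void release(Ref<T> ref) throws IOException {
        boolean lastUser;
        synchronized (lock) {
            lastUser = --ref.count == 0;
        }
        if (lastUser) {
            ref.resource.close();
        }
    }

    // Installs a fresh reader; the old one stays open until its users finish.
    void swap(T fresh) throws IOException {
        Ref<T> old;
        synchronized (lock) {
            old = current;
            current = new Ref<>(fresh);
        }
        release(old); // drop the holder's own reference to the old reader
    }
}
```

A searching thread would call acquire(), use get() for the search, and call release() in a finally block. The reloading thread calls swap() and never closes the old reader directly, so no thread sees a reader closed underneath it.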