Re: Language Detection for Analysis?

2009-08-09 Thread Lucas F. A. Teixeira
Google Translate just released (last week) its language API with translation and LANGUAGE DETECTION. :) It's very simple to use, and you can query it with some text to determine which language it is. Here is a simple example using Groovy, but all you need is the URL to query: http://groovyconsole.ap

Index "corruption" makes it return a different result

2008-03-26 Thread Lucas F. A. Teixeira
Hello all! I had a problem this week that I'd like to share with you all. The WebLogic server that generates my index writes its logs to shared storage. During my indexing process (Solr + Lucene), this shared storage became 100% full, and everything collapsed (all servers that use this shared stor

Re: Index "corruption" makes it return a different result

2008-03-26 Thread Lucas F. A. Teixeira
't quite follow the part about "_al1.cfs". It sounds like your indexing process hit a disk full event, that led to this error? Can you post the full exception(s) from the disk full? Which version of Lucene are you using? Mike Lucas F. A. Teixeira wrote: Hello all! I had a pr

Re: Index "corruption" makes it return a different result

2008-03-26 Thread Lucas F. A. Teixeira
post the CFS file that got mixed up? Be sure to cc me directly on the mail because the mailing list software removes attachments. Mike Lucas F. A. Teixeira wrote: This is just one of the index files. As I said, the local disk where the index is generated, it's not full, the full dis

Re: Index "corruption" makes it return a different result

2008-03-26 Thread Lucas F. A. Teixeira
LOL, I know. Take a look, editing the CFS file: http://img296.imageshack.us/my.php?image=indexow4.jpg []s, Lucas Yonik Seeley wrote: On Wed, Mar 26, 2008 at 2:13 PM, Lucas F. A. Teixeira <[EMAIL PROTECTED]> wrote: one of the index files has these log messages from my appli

Re: Index "corruption" makes it return a different result

2008-03-26 Thread Lucas F. A. Teixeira
back. That screenshot is spooky! Is it possible that one of the documents you indexed had that content? (It could simply be a stored field.) Mike Lucas F. A. Teixeira wrote: Thanks Michael! Lucene version: 2.3.0 Here is a screenshot of editing the CFS file: http://img296.imageshack.us/m

Re: Index "corruption" makes it return a different result

2008-03-27 Thread Lucas F. A. Teixeira
front. When I removed those lines, CheckIndex ran fine. This is very odd. Somehow your app server modified the frq file before it was added to the CFS file. Very strange. Mike Lucas F. A. Teixeira <[EMAIL PROTECTED]> wrote: 100% Impossible... My index has 1 xml, 3 numbe

Re: Question about indexing (BrazilianAnalyzer)

2008-06-04 Thread Lucas F. A. Teixeira
Are you using ISOLatin1AccentFilter? []s, Lucas Frare A. Teixeira [EMAIL PROTECTED] Tel: +55 11 3660.1622 - R3018 Vinicius Carvalho wrote: Hello there! I'm indexing documents using the BrazilianAnalyzer, and I've noticed that many words are not being indexed. I

Re: How to search on the indexed content

2008-06-24 Thread Lucas F. A. Teixeira
http://lucene.apache.org/java/docs/queryparsersyntax.html []s, Lucas Frare A. Teixeira [EMAIL PROTECTED] Tel: +55 11 3660.1622 - R3018 yugana wrote: Hi All, I have created an index and am indexing content retrieved from a database. How can I search on thi

Re: Lucene CFS naming significance

2008-06-27 Thread Lucas F. A. Teixeira
Folks, could anyone tell me the significance of the naming of the CFS files in the Lucene index, e.g. _1pp.cfs, _2kk.cfs etc. > Just names that won't repeat in the same folder. I have observed many differently named files being created temporarily while the index is being built, but the same set
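
On the naming question: as far as I know, Lucene builds each segment name from an internal counter written in base 36 (digits 0-9, then a-z) with a leading underscore, which would explain why the names never repeat within one directory. Below is a minimal Python sketch under that assumption. `segment_counter` is a hypothetical helper for illustration, not part of Lucene.

```python
import re

# Assumption: a segment file name starts with "_" followed by a base-36 counter,
# e.g. "_1pp.cfs". This helper is hypothetical and not a Lucene API.
_SEGMENT_RE = re.compile(r"^_([0-9a-z]+)")


def segment_counter(file_name: str) -> int:
    """Return the counter value encoded in a segment file name."""
    match = _SEGMENT_RE.match(file_name)
    if match is None:
        raise ValueError(f"not a segment file name: {file_name!r}")
    # int() with base 36 reads the digits 0-9 and the letters a-z.
    return int(match.group(1), 36)


if __name__ == "__main__":
    for name in ("_1pp.cfs", "_2kk.cfs"):
        print(name, segment_counter(name))
```

If the assumption holds, a larger decoded value simply means the segment was created later, so the names carry ordering information but no other meaning.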