Fwd: SpellChecker adding and removing words

2013-09-11 Thread Johnny Jenkins
rwarded message -- From: Johnny Jenkins Date: 11 September 2013 14:24 Subject: SpellChecker adding and removing words To: java-user@lucene.apache.org I want to hijack the SpellChecker class as a general spell-checking and word-suggestion tool. The idea of using this class was to avoid creat

SpellChecker adding and removing words

2013-09-10 Thread Johnny Jenkins
I want to hijack the SpellChecker class as a general spell-checking and word-suggestion tool. The idea of using this class was to avoid creating my own. At first it seems to fit the bill. However, once I’ve used indexDictionary() I cannot seem to easily add/remove words. I can re-instantiate

Re: possible bug on Spellchecker

2013-02-21 Thread Samuel García Martínez
> Ingram Content Group >> (615) 213-4311 >> >> >> -Original Message- >> From: Samuel García Martínez [mailto:samuelgmarti...@gmail.com] >> Sent: Thursday, February 21, 2013 2:33 AM >> To: java-user@lucene.apache.org >> Subject: Re: possible bug on

Re: possible bug on Spellchecker

2013-02-21 Thread Samuel García Martínez
ucene.apache.org > Subject: Re: possible bug on Spellchecker > Importance: Low > > I'm using Solr 3.6 and DirectSpellchecker is available only on v4+. > Moreover, in "big" indexes i prefer using sidekick index rather than > iterating over term dictionary. > > > O

RE: possible bug on Spellchecker

2013-02-21 Thread Dyer, James
] Sent: Thursday, February 21, 2013 2:33 AM To: java-user@lucene.apache.org Subject: Re: possible bug on Spellchecker Importance: Low I'm using Solr 3.6 and DirectSpellchecker is available only on v4+. Moreover, in "big" indexes i prefer using sidekick index rather than iterating over

Re: possible bug on Spellchecker

2013-02-21 Thread Samuel García Martínez
Message- From: Samuel García Martínez > Sent: Wednesday, February 20, 2013 3:34 PM > To: java-user@lucene.apache.org > Subject: possible bug on Spellchecker > > > Hi all, > > Debugging Solr spellchecker (IndexBasedSpellchecker, delegating on lucene > Spellchecker) behav

Re: possible bug on Spellchecker

2013-02-20 Thread Jack Krupansky
@lucene.apache.org Subject: possible bug on Spellchecker Hi all, Debugging Solr spellchecker (IndexBasedSpellchecker, delegating on lucene Spellchecker) behaviour i think i found a bug when the input is a 6 letter word: - george - anthem - argued - fluent Due to the getMin() and getMax() the

possible bug on Spellchecker

2013-02-20 Thread Samuel García Martínez
Hi all, while debugging the Solr spellchecker (IndexBasedSpellchecker, which delegates to the Lucene SpellChecker), I think I found a bug when the input is a 6-letter word: - george - anthem - argued - fluent Due to getMin() and getMax(), the gram sizes indexed for these terms are 3 and 4. So, the
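As an illustration of what "gram sizes 3 and 4" means for a six-letter term, here is a minimal sketch. The class name GramSketch and the method grams are hypothetical, the sizes 3 and 4 are simply the values reported in the post, and none of this is the Lucene source.

```java
import java.util.ArrayList;
import java.util.List;

class GramSketch {
    // Returns every contiguous substring of length n, in order of position.
    static List<String> grams(String term, int n) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i + n <= term.length(); i++) {
            out.add(term.substring(i, i + n));
        }
        return out;
    }

    public static void main(String[] args) {
        // Sizes 3 and 4 are the values the post reports for 6-letter terms.
        for (int n = 3; n <= 4; n++) {
            System.out.println(n + "-grams: " + grams("george", n));
        }
    }
}
```

For "george" this yields four 3-grams and three 4-grams; no 1-gram or 2-gram is produced under these assumed sizes.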

Re: Improving disk efficiency for autocompleter / spellchecker

2011-06-10 Thread Ian Lea
on the contrib SpellChecker > source code (it creates start1-20 grams, no middle/end grams). The use > case for this class, as with most autocompleters, is that it is called > on each keypress from the user. > What happens is, that on each keypress, a static method > 'autocomplet

Improving disk efficiency for autocompleter / spellchecker

2011-06-10 Thread Elmer
Hello again :) I've created an autocompleter class, based on the contrib SpellChecker source code (it creates start1-20 grams, no middle/end grams). The use case for this class, as with most autocompleters, is that it is called on each keypress from the user. What happens is that, on each key

RES: spellchecker

2009-12-23 Thread Mário André
-- -Original Message- From: Simon Willnauer [mailto:simon.willna...@googlemail.com] Sent: Wednesday, 23 December 2009 15:39 To: java-user@lucene.apache.org Subject: Re: spellchecker Hi mario, PlainTextDictionary expects a

Re: spellchecker

2009-12-23 Thread Simon Willnauer
Hi mario, PlainTextDictionary expects a text file with one word per line like: hello world foo bar simon 2009/12/23 Mário André : > Hello friends, > > I’m new here and in the lucene Project. I’m trying use the "spellchecker" > according to the exemple below: >
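To make the "one word per line" format concrete, here is a small self-contained sketch that reads such a word list. WordListSketch and readWords are hypothetical names; trimming whitespace and skipping blank lines are assumptions made for the sketch, not documented behaviour of PlainTextDictionary.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

class WordListSketch {
    // Reads one word per line; trimming and blank-line skipping are assumptions.
    static List<String> readWords(Reader in) throws IOException {
        List<String> words = new ArrayList<>();
        try (BufferedReader br = new BufferedReader(in)) {
            String line;
            while ((line = br.readLine()) != null) {
                String word = line.trim();
                if (!word.isEmpty()) {
                    words.add(word);
                }
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the contents of a dictionary file such as myfile.txt.
        String fileContents = "hello\nworld\nfoo\nbar\n";
        System.out.println(readWords(new StringReader(fileContents)));
    }
}
```

A file that puts several words on one line would be read as a single entry per line here, which is one plausible reason a dictionary in the wrong format produces poor suggestions.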

spellchecker

2009-12-23 Thread Mário André
Hello friends, I’m new here and to the Lucene project. I’m trying to use the "spellchecker" according to the example below: // To index a file containing words: spellchecker.indexDictionary(new PlainTextDictionary(new File("myfile.txt"))); String[] suggestions = spellch

Re: Potential leak of file resources in SpellChecker

2009-12-03 Thread Michael McCandless
ht out upgrade and deprecation policy. I really > like the way I could gradually replace deprecated API calls and then > make the final switch from 2.9.1 to 3.0.0. More projects should follow > this path. Keep up the good work! > > Eirik. > >> 2009/12/2 Eirik Bjørsnøs : >&

Re: Potential leak of file resources in SpellChecker

2009-12-03 Thread Eirik Bjørsnøs
ojects should follow this path. Keep up the good work! Eirik. > 2009/12/2 Eirik Bjørsnøs : >> Hi, >> >> I'm using SpellChecker (in Lucene contrib) to help users of SVNSearch >> who can't type right: >> >> http://svnsearch.org/svnsearch/repos/A

Re: Potential leak of file resources in SpellChecker

2009-12-03 Thread Michael McCandless
This sounds like an important bug fix -- could you open a Jira issue & attach a patch? Thanks! Mike 2009/12/2 Eirik Bjørsnøs : > Hi, > > I'm using SpellChecker (in Lucene contrib) to help users of SVNSearch > who can't type right: > > http://svnsearch.org/svnse

Potential leak of file resources in SpellChecker

2009-12-02 Thread Eirik Bjørsnøs
Hi, I'm using SpellChecker (in Lucene contrib) to help users of SVNSearch who can't type right: http://svnsearch.org/svnsearch/repos/ASF/search?logMessage=lusene SpellChecker works very well for my purpose, but I've encountered a possible file resource leak issue: I can't f

Re: SpellChecker locks folder

2009-05-01 Thread Chris Hostetter
: In the source of SpellChecker lines 352-353 I see that after the indexing is : done the index is reopened: : searcher.close(); : searcher = new IndexSearcher(this.spellIndex); : : Why does that second line exist at all? when you instantiate a SpellChecker object, it opens a Searcher for reuse

SpellChecker locks folder

2009-04-22 Thread John Cherouvim
Hello. After I call the SpellChecker.indexDictionary method, the directory which contains the Lucene index is locked. I cannot rename or delete the folder (Windows). In the source of SpellChecker lines 352-353 I see that after the indexing is done the index is reopened: searcher.close

Re: SpellChecker in use with composite query

2009-04-14 Thread Amin Mohammed-Coleman
w MultiReader(new IndexReader[] > {spellReader,spellReader2}); > > LuceneDictionary luceneDictionary = new LuceneDictionary(multiReader, > "content"); > > Directory spellDirectory = FSDirectory.getDirectory( spellcheck); > > SpellChecker spellChecker = new

Re: SpellChecker AlreadyClosedException issue

2009-04-14 Thread Chris Hostetter
: dir is a local variable inside a method, so it's not getting reused. : Should I synchronise the whole method? I think that would slow things down in : a concurrent environment. i don't really have any new advice beyond my previous comment... : > if you have a fully exeuctable test case (instea

Re: SpellChecker in use with composite query

2009-04-11 Thread Amin Mohammed-Coleman
); MultiReader multiReader = new MultiReader(new IndexReader[] {spellReader,spellReader2}); LuceneDictionary luceneDictionary = new LuceneDictionary(multiReader, "content"); Directory spellDirectory = FSDirectory.getDirectory(wrote: > Hi > I have been playing around with the SpellChecker cla

SpellChecker in use with composite query

2009-04-10 Thread Amin Mohammed-Coleman
Hi I have been playing around with the SpellChecker class and so far it looks really good. While developing a testcase to show it working I came across a couple of issues which I have resolved but I'm not certain if this is the correct approach. I would therefore be grateful if anyone could

Re: SpellChecker AlreadyClosedException issue

2009-04-10 Thread John Cherouvim
: try { :dir = FSDirectory.getDirectory("/path/to/dictionary"); : SpellChecker spell = new SpellChecker(dir); // exception thrown here :// ... :dir.close(); : This code works, but in a highly concurrent situation AlreadyClosedException : is being thrown when I try to instantiate t

Re: SpellChecker AlreadyClosedException issue

2009-04-09 Thread Chris Hostetter
: My code looks like this: : : Directory dir = null; : try { :dir = FSDirectory.getDirectory("/path/to/dictionary"); :SpellChecker spell = new SpellChecker(dir); // exception thrown here :// ... :dir.close(); : This code works, but in a highly concurrent

SpellChecker AlreadyClosedException issue

2009-04-02 Thread John Cherouvim
Hello My code looks like this: Directory dir = null; try { dir = FSDirectory.getDirectory("/path/to/dictionary"); SpellChecker spell = new SpellChecker(dir); // exception thrown here // ... dir.close(); } catch (IOException ex) { log error } finally { if

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-22 Thread Matthias W.
did, but the effect was not that big... The reason was the usage of the RAMDirectory. Now I'm using FSDirectory.getDirectory("/dictionary") instead of new RAMDirectory() and it works fine.

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-20 Thread Chris Hostetter
float accuracy) { : Vector v = new Vector(); : String[] suggestions = null; : : try { : SpellChecker spellchecker = new SpellChecker(new RAMDirectory()/*, new : Jaro

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-15 Thread Matthias W.
I fixed this problem by writing an external Java class and switching completely to Lucene-2.4.0. The SpellChecker now returns results but it is very slow. My index contains about 2 words, I don't think this is too much. How can I speed it up? My temporary method: public static V

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-12 Thread Matthias W.
Yes, I'm passing the same index for Spellchecker and IndexReader. I'm going to test if this is a reason for my problem. But I still don't understand why the same code is working on the testserver. I think this could be because of the rights from tomcat. Is there any tutorial a

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-10 Thread Grant Ingersoll
So, what changed with the server? From the looks of your code, you're passing the same index into both the Spellchecker and the IndexReader. The spelling index is separate from the main index. See the example at: http://lucene.apache.org/java/2_4_0/api/contrib-spellchecker/org/a

Re: Lucene SpellChecker returns no suggetions after changing Server

2008-12-09 Thread Matthias W.
I tested it with the lucene-spellchecker-2.4.0.jar... Then I get the queryString as suggestion if it exists in the index. But there's no chance to get other suggestions neither with LevensteinDistance nor with JaroWinklerDistance. The setAccuracy() method has no effect on the result. I

Lucene SpellChecker returns no suggetions after changing Server

2008-12-08 Thread Matthias W.
Hi, I'm using Lucene's SpellChecker (Lucene 2.1.0) class to get suggestions. Till now my testing server was a VMWare-Image from http://es.cohesiveft.com (Ubuntu 8.10, Tomcat6, Java5). Now I'm using a Debian Etch Server with Tomcat5.5 and Java6. Code

Spellchecker Evaluation Criteria

2008-10-14 Thread mattspitz
So, it appears to me that the criteria for a "good suggestion" is the n-gram overlap of a given term, not the edit distance. Thus, if we're looking for "britney", but we mess up and type "birtney", "kortney" will come up before "birtney." I
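The difference between ranking by n-gram overlap and ranking by edit distance can be sketched as follows. This is an illustration only: OverlapVsEditDistance, sharedGrams and levenshtein are hypothetical names, the gram size of 3 is an assumption, and the code does not reproduce Lucene's actual scoring.

```java
import java.util.HashSet;
import java.util.Set;

class OverlapVsEditDistance {
    // Collects the distinct n-grams of s.
    static Set<String> grams(String s, int n) {
        Set<String> out = new HashSet<>();
        for (int i = 0; i + n <= s.length(); i++) {
            out.add(s.substring(i, i + n));
        }
        return out;
    }

    // Counts the distinct n-grams that a and b have in common.
    static int sharedGrams(String a, String b, int n) {
        Set<String> common = grams(a, n);
        common.retainAll(grams(b, n));
        return common.size();
    }

    // Classic two-row dynamic-programming Levenshtein distance.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] cur = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            cur[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                cur[j] = Math.min(Math.min(cur[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = cur;
            cur = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        String typed = "birtney";
        for (String candidate : new String[] {"britney", "kortney"}) {
            System.out.println(candidate
                    + " sharedTrigrams=" + sharedGrams(typed, candidate, 3)
                    + " editDistance=" + levenshtein(typed, candidate));
        }
    }
}
```

With these definitions, "kortney" shares three trigrams with the typed "birtney" while "britney" shares only two, even though both candidates are at edit distance 2. That is consistent with the observation that an overlap-based ranking can prefer the less intuitive candidate.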

ThreadSafe SpellChecker?

2008-10-14 Thread mattspitz
I was wondering if the Lucene SpellChecker class was threadsafe, specifically, indexDictionary(). Such that: for (int i = 0; i < numReaders; i++) { //spawn new thread to run: spellchecker.indexDictionary(new LuceneDictionary(readers[i], myField)); } Would work. Thanks, M

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Mathieu Lecarme
king")), Occur.SHOULD); So will show all lines... the question now is which is best 'ranking' ... However you all help me so much , THANKS SO MUCH!!! (now I won't say bad about the constructor of SpellChecker) most word matched, the better score you have. You should use a thresol

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Leandro
all lines... the question now is which is best 'ranking' ... However you all help me so much , THANKS SO MUCH!!! (now I won't say bad about the constructor of SpellChecker)

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Karl Wettin
Leandro wrote: Sorry, I misunderstood your question. See other reply. Yes I got it. thanks Are you sure about that? Did you benchmark? Can we see the results? Hey man take it easy, I just imagine. But I guess using the ShingleFilter will help. I'm cool :) I just think you are overcompli

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Leandro
> Sorry, I misunderstood your question. See other reply. > Yes I got it. thanks > Are you sure about that? Did you benchmark? Can we see the results? Hey man take it easy, I just imagine. But I guess using the ShingleFilter will help.

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Karl Wettin
Leandro wrote: 1. Why do I need to pass a Directory object (mandatory) to the constructor of SpellChecker? Mainly because it is a nasty piece of code. But it does a good job. How can we suggest it (creating a normal constructor without parameters) to the team? Sorry, I misunderstood your question

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Karl Wettin
Mathieu Lecarme wrote: On 8 Apr 08 at 18:34, Karl Wettin wrote: dreampeppers99 wrote: 2. Suppose that in my dictionary I had these words: "The Lord of the Rings: The Two Towers" "The Lord of the Rings: The Fellowship of the Ring" "The Lord of the Rings: The Return of the King" I just want t

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Leandro
> > Mainly because it is a nasty piece of code. But it does a good job. > > > Because SpellChecker uses a directory to store data. It can be FSDirectory, > RAMDirectory Perfect explanation... !!! So using the RAMDirectory is better (performance-wise) spell

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Mathieu Lecarme
On 8 Apr 08 at 18:34, Karl Wettin wrote: dreampeppers99 wrote: 1. Why do I need to pass a Directory object (mandatory) to the constructor of SpellChecker? Mainly because it is a nasty piece of code. But it does a good job. Because SpellChecker uses a directory to store data. It can be

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Leandro
> > > 1. Why do I need to pass a Directory object (mandatory) to the constructor of > > SpellChecker? > > > > Mainly because it is a nasty piece of code. But it does a good job. > Thanks. How can we suggest it (creating a normal constructor without parameters) to the team?

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Karl Wettin
dreampeppers99 wrote: 1. Why do I need to pass a Directory object (mandatory) to the constructor of SpellChecker? Mainly because it is a nasty piece of code. But it does a good job. 2. Suppose that in my dictionary I had these words: "The Lord of the Rings: The Two Towers" "The Lo

Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Leandro
a Directory object (mandatory) to the constructor of SpellChecker? 2. Suppose that in my dictionary I had these words: "The Lord of the Rings: The Two Towers" "The Lord of the Rings: The Fellowship of the Ring" "The Lord of the Rings: The Return of the King" I just want

Re: Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread Mathieu Lecarme
Use ShingleFilter. I'm working on a wider SpellChecker, I'll post a third patch soon. https://admin.garambrogne.net/projets/revuedepresse/browser/trunk/src/java M. dreampeppers99 wrote: Hi, I have two questions about this GREAT tool.. (framework, library... "whatever"

Questions about use of SpellChecker: Constructor and Simillarity...

2008-04-08 Thread dreampeppers99
a Directory object (mandatory) to the constructor of SpellChecker? 2. Suppose that in my dictionary I had these words: "The Lord of the Rings: The Two Towers" "The Lord of the Rings: The Fellowship of the Ring" "The Lord of the Rings: The Return of the King" I just

Re: SpellChecker: Spanish Dictionary

2007-12-13 Thread Daniel Naber
load) this dictionary. Maybe .txt file. The Lucene spellchecker is supposed to be fed with the terms from your own index. This way suggestions will only contain words which actually appear in your documents. If you still want to use a dictionary, see http://wiki.services.openoffice.org/wiki/Dicti

SpellChecker: Spanish Dictionary

2007-12-13 Thread Haroldo Nascimento
Hi, I am using the spell-check classes of Lucene to create the "Did you mean" feature. I need to load all the words of the Spanish language into memory (they will be my dictionary). Where can I get (download) this dictionary? Maybe as a .txt file. Thanks

Re: Applying SpellChecker to a phrase

2007-12-11 Thread smokey
: > sequence of terms, > > : > then constructing a phrase query object using add(Term term, > > : > int position) > > : > method in org.apache.lucene.search.PhraseQuery class. Then > > I can inject > > : > similar words (suggested by SpellChecker) at appr

Re: Applying SpellChecker to a phrase

2007-12-11 Thread Doron Cohen
must much. > > : > I was thinking of parsing the phrase query string into a > : > sequence of terms, > : > then constructing a phrase query object using add(Term term, > : > int position) > : > method in org.apache.lucene.search.PhraseQuery class. Then > I can inject &

Re: Applying SpellChecker to a phrase

2007-12-10 Thread Chris Hostetter
> int position) : > method in org.apache.lucene.search.PhraseQuery class. Then I can inject : > similar words (suggested by SpellChecker) at appropriate : > positions for each : > term as I construct the final phrase query object. : > : > Do you agree that this should work too? : :

Re: Applying SpellChecker to a phrase

2007-12-08 Thread Karl Wettin
hod in org.apache.lucene.search.PhraseQuery class. Then I can inject similar words (suggested by SpellChecker) at appropriate positions for each term as I construct the final phrase query object. Do you agree that this should work too? I never tried this but I'm sure it will not work. The phrase query scorer requi

Re: Applying SpellChecker to a phrase

2007-12-07 Thread Doron Cohen
nt position) > method in org.apache.lucene.search.PhraseQuery class. Then I can inject > similar words (suggested by SpellChecker) at appropriate > positions for each > term as I construct the final phrase query object. > > Do you agree that this should work too? I never tried this but I'm

Re: Applying SpellChecker to a phrase

2007-12-04 Thread smokey
by SpellChecker) at appropriate positions for each term as I construct the final phrase query object. Do you agree that this should work too? On Dec 4, 2007 1:22 AM, Doron Cohen <[EMAIL PROTECTED]> wrote: > See below - > > smokey <[EMAIL PROTECTED]> wrote on 03/12/2007 05:1

Re: Applying SpellChecker to a phrase

2007-12-03 Thread Doron Cohen
See below - smokey <[EMAIL PROTECTED]> wrote on 03/12/2007 05:14:23: > Suppose I have an index containing the terms impostor, > imposter, fraud, and > fruad, then presumably regardless of whether I spell impostor and fraud > correctly, Lucene SpellChecker will offer the im

Re: SpellChecker performance and usage

2007-12-03 Thread Doron Cohen
smokey <[EMAIL PROTECTED]> wrote on 03/12/2007 17:23:21: > My question is for anyone who has experience with Lucene's SpellChecker, > especially around its performance characteristics/ramifications. > > 1. Given the fact that SpellChecker expands a query by adding all the

Re: Applying SpellChecker to a phrase

2007-12-03 Thread smokey
I have not tried this yet. I am trying to understand the best practices from others who have experiences with SpellChecker before actually implementing it. If I understand it correctly, the spell check class suggests alternate but similar words for a single input term. So I believe I will have to

SpellChecker performance and usage

2007-12-03 Thread smokey
My question is for anyone who has experience with Lucene's SpellChecker, especially around its performance characteristics/ramifications. 1. Given the fact that SpellChecker expands a query by adding all the permutations of a potentially misspelled word, how does it perform in general? 2. Ho

Re: Applying SpellChecker to a phrase

2007-12-03 Thread Erick Erickson
Have you actually tried this and done a query.toString() to see how this is actually expanded? Not that I'm all that familiar with SpellChecker, but before presuming how things work you would get answers faster if you ran a test. And, why do you care about performance? I know that'

Applying SpellChecker to a phrase

2007-12-02 Thread smokey
Suppose I have an index containing the terms impostor, imposter, fraud, and fruad, then presumably regardless of whether I spell impostor and fraud correctly, Lucene SpellChecker will offer the improperly spelled versions as corrections. This means that the phrase "The login fraud involv

SpellChecker performance and usage

2007-12-02 Thread smokey
My question is for anyone who has experience with Lucene's SpellChecker, especially around its performance characteristics/ramifications. 1. Given the fact that SpellChecker expands a query by adding all the permutations of a potentially misspelled word, how does it perform in general? 2. Ho

Re: Problems with stemming/SpellChecker

2007-10-13 Thread Daniel Naber
On Saturday 13 October 2007 07:57, Christian Aschoff wrote: > But as fare as i see (in the API DOC), the GermanAnalyzer is attached   > to the IndexWriter, i can't find an way to attach an analyzer it to a   > single field... Or do i miss something? See PerFieldAnalyzerWrapper. Regards Daniel

Re: Problems with stemming/SpellChecker

2007-10-12 Thread Christian Aschoff
But as far as I can see (in the API docs), the GermanAnalyzer is attached to the IndexWriter; I can't find a way to attach an analyzer to a single field... Or am I missing something? (There are tons of other fields in the index where GermanAnalyzer fits perfectly.) On 12.10.2007 at 19:01, wrote

Re: Problems with stemming/SpellChecker

2007-10-12 Thread Daniel Naber
On Friday 12 October 2007 15:48, Christian Aschoff wrote: >  indexWriter = new IndexWriter(MiscConstants.luceneDir,   > new GermanAnalyzer(), create); > [...] Not NO_NORMS is the problem but GermanAnalyzer. Try StandardAnalyzer on the field you get the suggestions from. Regards Daniel -- htt

Problems with stemming/SpellChecker

2007-10-12 Thread Christian Aschoff
Hi, I tried to implement a 'did you mean' function (and succeeded to some extent). But the hints from the SpellChecker are the stemmed versions of the keywords. For example, the search for the wrong word 'wasseraalfingen' results in the hint 'wasseralfing' but

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread Ryan O'Hara
Thanks a ton, Hoss. I just ran ant on the contrib/spellchecker directory and it produced a jar file in the LUCENE_HOME/build/ directory. Replacing the old jar file with the new jar file fixed my errors as I suspected. Thanks, again. -Ryan On Mar 15, 2007, at 1:38 PM, Chris

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread Chris Hostetter
: My question basically was is there a spellchecker-2.*1*-rc1-dev.jar? There *should* be a spellchecker 2.1 jar in the contrib/spellchecker/ directory of the 2.1 release. -Hoss

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread karl wettin
On 15 Mar 2007 at 15:33, Ryan O'Hara wrote: My question basically was is there a spellchecker-2.*1*-rc1-dev.jar? There have been changes since 2.0. You can also take a look at this: https://issues.apache.org/jira/browse/LUCENE-632 The problems I'm having...

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread Ryan O'Hara
My question basically was is there a spellchecker-2.*1*-rc1-dev.jar? The problems I'm having... Exception: java.lang.NullPointerException at org.apache.lucene.search.spell.SpellChecker.indexDictionary (SpellChecker.java:298) at DidYouMeanIndexer.createSpell

Re: SpellChecker and Lucene 2.1

2007-03-14 Thread karl wettin
On 14 Mar 2007 at 21:47, Ryan O'Hara wrote: Is there a SpellChecker.jar compatible with Lucene 2.1? After updating to Lucene 2.1, I seem to have lost the ability to create a spell index using spellchecker-2.0-rc1-dev.jar. Any help would be greatly appreciated. Can you explain the pr

SpellChecker and Lucene 2.1

2007-03-14 Thread Ryan O'Hara
Is there a SpellChecker.jar compatible with Lucene 2.1? After updating to Lucene 2.1, I seem to have lost the ability to create a spell index using spellchecker-2.0-rc1-dev.jar. Any help would be greatly appreciated. Thanks, Ryan

Re: SpellChecker::suggestSimilar() Question

2007-01-25 Thread karl wettin
On 25 Jan 2007 at 20:43, Ryan O'Hara wrote: Is there any way to sort the suggestions beforehand, so that grabbing only one suggestion would give you the best suggestion, in this case "genetics"? Without having looked at the code for a long time, I think the problem is what the lucene scoring cons

SpellChecker::suggestSimilar() Question

2007-01-25 Thread Ryan O'Hara
It seems that the suggestions returned by SpellChecker::suggestSimilar (queryText, num_sug, reader, field, bool) are randomly chosen, then sorted. By altering num_sug (10, 5, 3,2,1), I received the following suggestions for "gnetics": suggestion0: genetics suggestion1: ginetics s

Re: SpellChecker Index - remove words?

2007-01-11 Thread Otis Gospodnetic
The value of the word - the word itself, should be your unique identifier. Otis - Original Message From: Josh Joy <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Thursday, January 11, 2007 5:39:24 AM Subject: Re: SpellChecker Index - remove words? Thanks for the repl

Re: SpellChecker Index - remove words?

2007-01-11 Thread Josh Joy
well. Can the same be done for the spell checker index? Thanks in advance, Josh Otis Gospodnetic wrote: > Josh, > > The spellchecker index is just another Lucene index, so you can delete documents/words from it the same way you delete documents from any Lucene index - using IndexRead

Re: SpellChecker Index - remove words?

2007-01-11 Thread Otis Gospodnetic
Josh, The spellchecker index is just another Lucene index, so you can delete documents/words from it the same way you delete documents from any Lucene index - using IndexReader's delete(...) methods. You can pass that delete method a Term where the field name is "word" and t

SpellChecker Index - remove words?

2007-01-10 Thread Josh Joy
Hi All, The spellchecker api is very nice to use, and I can easily add words to the index. However, because the words I am adding are from another index that is user generated per se (meaning it may have spelling errors), how can I safely remove words from the spell checker index? If I know

Re: Multi-Index Spellchecker

2006-11-21 Thread Mark Miller
Thanks Hoss, I hadn't looked at the indexDictionary method yet. It does not appear to be what I am looking for though...I should have been more explicit - I am using the spellchecker for a 'did you mean search', so I am not using a dedicated spell check index. Instead I am pa

Multi-Index Spellchecker

2006-11-21 Thread Mark Miller
Does anyone have any interest in making the spellchecker work across more than one index? Does the coder of the spellchecker have any advice/don't do that moron info etc ? - Mark

Re: Strange Spellchecker behaviour

2006-10-10 Thread Doron Cohen
I believe this was fixed in http://issues.apache.org/jira/browse/LUCENE-593 - Doron Björn Ekengren <[EMAIL PROTECTED]> wrote on 10/10/2006 02:12:23: > Hello, I have found that the spellchecker behaves a bit strange. My > spell indexer class below doesn't work if I use the spellf

Strange Spellchecker behaviour

2006-10-10 Thread Björn Ekengren
Hello, I have found that the spellchecker behaves a bit strange. My spell indexer class below doesn't work if I use the spellfield string set in the constructor directly, but it does work if I use the intern() value. The problem resides in the hasNext() method of LuceneIterator where an o

SpellChecker

2006-08-17 Thread Matthew York
Hi, I'm having issues creating a SpellChecker index. I am running JRE 1.5.0_06, and have the following in my classpath - lucene-core-2.0.0.jar & lucene-spellchecker-2.0.0.jar. When I run the code below I get the following exception and a 1KB segments file in the spellIndexDire

Re: SpellChecker

2006-07-29 Thread Chris Hostetter
: import org.apache.lucene.search.spell.Dictionary; : import org.apache.lucene.search.spell.LuceneDictionary; : import org.apache.lucene.search.spell.SpellChecker; : : cause is seems like there are not included in Lucene.Net. Could this be : right ? Are there alternatives ? Those classes are pat

Re: SpellChecker

2006-07-29 Thread neils
Lucene.Net. Could this be right? Are there alternatives? Thanks N;-)eils

Re: SpellChecker

2006-07-29 Thread Mark Miller
neils wrote: Hi, I have seen that a suggestion tool (like Google's "Did you mean xyz?") can be implemented with an add-on for Lucene called SpellingChecker. Is this correct, or is there another (better) solution, and where can this add-on be downloaded (because I cannot find a working download

SpellChecker

2006-07-29 Thread neils
for your help ;-))

Re: Spellchecker Download at lucene wiki outdated

2006-06-30 Thread Otis Gospodnetic
, 2006 7:36:53 AM Subject: Spellchecker Download at lucene wiki outdated Hi all, I don't know who can update the Wiki Pages so I am just mailing here. The download of spellchecker1.1.zip contribution does not work with Lucene-2.0 anymore. http://wiki.apache.org/jakarta-lucene/SpellC

Re: Spellchecker Download at lucene wiki outdated

2006-06-30 Thread Chris Hostetter
: I don't know who can update the Wiki Pages so I am just mailing here. anyone can edit the wiki, just create an account (click "Login" and it will give you that option) : So I wanted to build _only_ the spellcheck-contrib from the : SVN-repository, but it seems to me that there are no ant-tar

Spellchecker Download at lucene wiki outdated

2006-06-30 Thread Martin Braun
Hi all, I don't know who can update the Wiki Pages so I am just mailing here. The download of spellchecker1.1.zip contribution does not work with Lucene-2.0 anymore. http://wiki.apache.org/jakarta-lucene/SpellChecker?highlight=spellchecker1.1.zip So I wanted to build _only_ the spell

Re: question with spellchecker

2006-06-13 Thread Bob Carpenter
o note Google's approach to this. Not some fancy spell checker - just mining searcher behaviour patterns. Google's Bosworth describes this approach approx 13 minutes into this podcast: http://www.itconversations.com/shows/detail571.html - Original Message From: Van Nguyen

Re: question with spellchecker

2006-06-13 Thread mark harwood
podcast: http://www.itconversations.com/shows/detail571.html - Original Message From: Van Nguyen <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, 12 June, 2006 11:09:20 PM Subject: RE: question with spellchecker I'll experiment with both. Thanks... -Original Message-

RE: question with spellchecker

2006-06-12 Thread Van Nguyen
I'll experiment with both. Thanks... -Original Message- From: mark harwood [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 07, 2006 2:16 AM To: java-user@lucene.apache.org Subject: Re: question with spellchecker I think the problem in your particular example is the suggestion sof

Re: question with spellchecker

2006-06-07 Thread mark harwood
I think the problem in your particular example is the suggestion software has no consideration of context. I've been playing with context-sensitive suggestions recently which take a bunch of validated (ie existing) words (eg "tape") and use this to help shortlist alternatives for an unknown or par

Re: question with spellchecker

2006-06-06 Thread eks dev
try your query like ((ducted^1000 duct~2) +tape) Or maybe (duct* +tape) or even better you could try to do some stemming (Porter stemmer should get rid of these ed-suffixes) and some of the above if this does not help, have a look at lingpipe spellChecker class as this looks like exactly what

question with spellchecker

2006-06-06 Thread Van Nguyen
I'm implementing a spellchecker in my search and have a question. After creating the index and spellchecker index, I pass in the word "ducted tape" to search (I am expecting "duct tape" back). I've played around with boosting the prefixes and suffixes, setti

Re: Problem using SpellChecker with run time strings

2006-05-23 Thread karl wettin
On Tue, 2006-05-23 at 18:45 +0200, karl wettin wrote: > On Tue, 2006-05-23 at 10:50 -0500, James Maes wrote: > > > It seems to be related to Strings and when they are created. > > the bug where the per instance float for maximum score (accuracy) is > modified instead of using it local in the met

Re: Problem using SpellChecker with run time strings

2006-05-23 Thread karl wettin
On Tue, 2006-05-23 at 10:50 -0500, James Maes wrote: > It seems to be related to Strings and when they are created. Try to create a new instance of SpellChecker for each suggestion. Will it work? Then you have hit the bug where the per instance float for maximum score (accuracy) is modif

Problem using SpellChecker with run time strings

2006-05-23 Thread James Maes
Here is the problem: We have implemented the Lucene engine within our application server, which is built on top of Tomcat. We've had no problems creating the indexes or searching them. The problems we are having are all related to the SpellChecker part of the system. It seems to be relat
