rwarded message --
From: Johnny Jenkins
Date: 11 September 2013 14:24
Subject: SpellChecker adding and removing words
To: java-user@lucene.apache.org
I’m wanting to high jack SpellChecker class as a general spell checking and
word suggestion tool. The idea of using this class was to avoid creat
I’m wanting to high jack SpellChecker class as a general spell checking and
word suggestion tool. The idea of using this class was to avoid creating my
own. At first it seems to fit the bill.
However, once I’ve used indexDirectory() I cannot seem to easily add/remove
words. I can re-instantiate
> Ingram Content Group
>> (615) 213-4311
>>
>>
>> -Original Message-
>> From: Samuel García Martínez [mailto:samuelgmarti...@gmail.com]
>> Sent: Thursday, February 21, 2013 2:33 AM
>> To: java-user@lucene.apache.org
>> Subject: Re: possible bug on
ucene.apache.org
> Subject: Re: possible bug on Spellchecker
> Importance: Low
>
> I'm using Solr 3.6 and DirectSpellchecker is available only on v4+.
> Moreover, in "big" indexes i prefer using sidekick index rather than
> iterating over term dictionary.
>
>
> O
]
Sent: Thursday, February 21, 2013 2:33 AM
To: java-user@lucene.apache.org
Subject: Re: possible bug on Spellchecker
Importance: Low
I'm using Solr 3.6 and DirectSpellchecker is available only on v4+.
Moreover, in "big" indexes i prefer using sidekick index rather than
iterating over
Message- From: Samuel García Martínez
> Sent: Wednesday, February 20, 2013 3:34 PM
> To: java-user@lucene.apache.org
> Subject: possible bug on Spellchecker
>
>
> Hi all,
>
> Debugging Solr spellchecker (IndexBasedSpellchecker, delegating on lucene
> Spellchecker) behav
@lucene.apache.org
Subject: possible bug on Spellchecker
Hi all,
Debugging Solr spellchecker (IndexBasedSpellchecker, delegating on lucene
Spellchecker) behaviour i think i found a bug when the input is a 6 letter
word:
- george
- anthem
- argued
- fluent
Due to the getMin() and getMax() the
Hi all,
Debugging Solr spellchecker (IndexBasedSpellchecker, delegating on lucene
Spellchecker) behaviour i think i found a bug when the input is a 6 letter
word:
- george
- anthem
- argued
- fluent
Due to the getMin() and getMax() the grams indexed for these terms are 3
and 4. So, the
on the contrib SpellChecker
> source code (it creates start1-20 grams, no middle/end grams). The use
> case for this class, as with most autocompleters, is that it is called
> on each keypress from the user.
> What happens is, that on each keypress, a static method
> 'autocomplet
Hello again :)
I've created a autocompleter class, based on the contrib SpellChecker
source code (it creates start1-20 grams, no middle/end grams). The use
case for this class, as with most autocompleters, is that it is called
on each keypress from the user.
What happens is, that on each key
--
-Mensagem original-
De: Simon Willnauer [mailto:simon.willna...@googlemail.com]
Enviada em: quarta-feira, 23 de dezembro de 2009 15:39
Para: java-user@lucene.apache.org
Assunto: Re: spellchecker
Hi mario,
PlainTextDictionary expects a
Hi mario,
PlainTextDictionary expects a text file with one word per line like:
hello
world
foo
bar
simon
2009/12/23 Mário André :
> Hello friends,
>
> I’m new here and in the lucene Project. I’m trying use the "spellchecker"
> according to the exemple below:
>
Hello friends,
Im new here and in the lucene Project. Im trying use the "spellchecker"
according to the exemple below:
// To index a file containing words:
spellchecker.indexDictionary(new PlainTextDictionary(new
File("myfile.txt")));
String[] suggestions = spellch
ht out upgrade and deprecation policy. I really
> like the way I could gradually replace deprecated API calls and then
> make the final switch from 2.9.1 to 3.0.0. More projects should follow
> this path. Keep up the good work!
>
> Eirik.
>
>> 2009/12/2 Eirik Bjørsnøs :
>&
ojects should follow
this path. Keep up the good work!
Eirik.
> 2009/12/2 Eirik Bjørsnøs :
>> Hi,
>>
>> I'm using SpellChecker (in Lucene contrib) to help users of SVNSearch
>> who can't type right:
>>
>> http://svnsearch.org/svnsearch/repos/A
This sounds like an important bug fix -- could you open a Jira issue &
attach a patch? Thanks!
Mike
2009/12/2 Eirik Bjørsnøs :
> Hi,
>
> I'm using SpellChecker (in Lucene contrib) to help users of SVNSearch
> who can't type right:
>
> http://svnsearch.org/svnse
Hi,
I'm using SpellChecker (in Lucene contrib) to help users of SVNSearch
who can't type right:
http://svnsearch.org/svnsearch/repos/ASF/search?logMessage=lusene
SpellChecker works very well for my purpose, but I've encountered a
possible file resource leak issue:
I can't f
: In the source of SpellChecker lines 352-353 I see that after the indexing is
: done the index is reopened:
: searcher.close();
: searcher = new IndexSearcher(this.spellIndex);
:
: Why does that second line exist at all?
when you instantiate a SpellChecker object, it opens a Searcher for reuse
Hello
After I call the SpellChecker.indexDictionary method the directory which
contained the lucene index is locked. I cannot rename of delete the
folder (windows).
In the source of SpellChecker lines 352-353 I see that after the
indexing is done the index is reopened:
searcher.close
w MultiReader(new IndexReader[]
> {spellReader,spellReader2});
>
> LuceneDictionary luceneDictionary = new LuceneDictionary(multiReader,
> "content");
>
> Directory spellDirectory = FSDirectory.getDirectory( spellcheck);
>
> SpellChecker spellChecker = new
: dir is a local variable inside a method, so it's not getting reused.
: Should I synchronise the whole method? I think that would slow things down in
: a concurrent environment.
i don't really have any new advice beyond my previous comment...
: > if you have a fully exeuctable test case (instea
);
MultiReader multiReader = new MultiReader(new IndexReader[]
{spellReader,spellReader2});
LuceneDictionary luceneDictionary = new LuceneDictionary(multiReader,
"content");
Directory spellDirectory = FSDirectory.getDirectory(wrote:
> Hi
> I have been playing around with the SpellChecker cla
Hi
I have been playing around with the SpellChecker class and so far it looks
really good. While developing a testcase to show it working I came across a
couple of issues which I have resolved but I'm not certain if this is the
correct approach. I would therefore be grateful if anyone could
: try {
:dir = FSDirectory.getDirectory("/path/to/dictionary");
: SpellChecker spell = new SpellChecker(dir); // exception thrown here
:// ...
:dir.close();
: This code works, but in a highly concurrent situation AlreadyClosedException
: is being thrown when I try to instantiate t
: My code looks like this:
:
: Directory dir = null;
: try {
:dir = FSDirectory.getDirectory("/path/to/dictionary");
:SpellChecker spell = new SpellChecker(dir); // exception thrown here
:// ...
:dir.close();
: This code works, but in a highly concurrent
Hello
My code looks like this:
Directory dir = null;
try {
dir = FSDirectory.getDirectory("/path/to/dictionary");
SpellChecker spell = new SpellChecker(dir); // exception thrown here
// ...
dir.close();
} catch (IOException ex) {
log error
} finally {
if
did, but the effect was not that big...
The reason was the usage of the RAMDirectory. Now I'm usig
FSDirectory.getDirectory("/dictionary") instead of new RAMDirectory() and it
works fine.
--
View this message in context:
http://www.nabble.com/Lucene-SpellChecker-returns-no-suggetions
float accuracy) {
: Vector v = new Vector();
: String[] suggestions = null;
:
: try {
: SpellChecker spellchecker = new SpellChecker(new
RAMDirectory()/*, new
: Jaro
I fixed this problem by writing an external Java class and changing completly
to Lucene-2.4.0.
The SpellChecker now returns results but it is very slow.
My index contains about 2 words, I don't think this is too much.
How can I speed it up?
My temporary method:
public static V
Yes, I'm passing the same index for Spellchecker and IndexReader.
I'm going to test if this is a reason for my problem.
But I still don't understand why the same code is working on the testserver.
I think this could be because of the rights from tomcat.
Is there any tutorial a
So, what changed with the server?
From the looks of your code, you're passing the same index into both
the Spellchecker and the IndexReader. The spelling index is separate
from the main index.
See the example at:
http://lucene.apache.org/java/2_4_0/api/contrib-spellchecker/org/a
I tested it with the lucene-spellchecker-2.4.0.jar...
Then I get the queryString as suggestion if it exists in the index. But
there's no chance to get other suggestions neither with LevensteinDistance
nor with JaroWinklerDistance. The setAccuracy() method has no effect on the
result.
I
Hi,
I'm using Lucene's SpellChecker (Lucene 2.1.0) class to get suggestions.
Till now my testing server was a VMWare-Image from http://es.cohesiveft.com
http://es.cohesiveft.com (Ubuntu 8.10, Tomcat6, Java5).
Now I'm using a Debian Etch Server with Tomcat5.5 and Java6.
Code
So, it appears to me that the criteria for a "good suggestion" is the n-gram
overlap of a given term, not the edit distance.
Thus, if we're looking for "britney", but we mess up and type "birtney",
"kortney" will come up before "birtney."
I
I was wondering if the Lucene SpellChecker class was threadsafe,
specifically, indexDictionary().
Such that:
for (int i = 0; i < numReaders; i++) {
//spawn new thread to run:
spellchecker.indexDictionary(new LuceneDictionary(readers[i],
myField));
}
Would work.
Thanks,
M
king")), Occur.SHOULD);
So will show all lines... the question now is which is best
'ranking' ...
However you all help me so much , THANKS SO MUCH!!!
(now I won't say bad about the constructor of SpellChecker)
most word matched, the better score you have.
You should use a thresol
all lines... the question now is which is best 'ranking' ...
However you all help me so much , THANKS SO MUCH!!!
(now I won't say bad about the constructor of SpellChecker)
Leandro skrev:
Sorry, I missunderstood your question. See other reply.
Yes I got it. thanks
Are you sure about that? Did you benchmark? Can we see the results?
Hey man take it easy, I just imagine. But I guess use the ShingleFilter will
help.
I'm cool :) I just think you are overcompli
> Sorry, I missunderstood your question. See other reply.
>
Yes I got it. thanks
> Are you sure about that? Did you benchmark? Can we see the results?
Hey man take it easy, I just imagine. But I guess use the ShingleFilter will
help.
Leandro skrev:
1º Why need I pass a Directory objecto (obligatory) on constructor of
SpellChecker?
Mainly because it is a nasty peice of code. But it does a good job.
How can we suggest it (create an normal constructor without param) to the
team?
Sorry, I missunderstood your question
Mathieu Lecarme skrev:
Le 8 avr. 08 à 18:34, Karl Wettin a écrit :
dreampeppers99 skrev:
2º Suposse that in my dictonary I had these words:
"The Lord of the Rings: The Two Towers"
"The Lord of the Rings: The Fellowship of the Ring"
"The Lord of the Rings: The Return of the King"
I just want t
>
> Mainly because it is a nasty peice of code. But it does a good job.
> >
> Because spellChecker use a directory to store data. It can be FSDirectory,
> RAMDirectory
Perfect explanation... !!!
So use the RAMDirectory is better (perfomatically)
spell
Le 8 avr. 08 à 18:34, Karl Wettin a écrit :
dreampeppers99 skrev:
1º Why need I pass a Directory objecto (obligatory) on constructor of
SpellChecker?
Mainly because it is a nasty peice of code. But it does a good job.
Because spellChecker use a directory to store data. It can be
>
>
> 1º Why need I pass a Directory objecto (obligatory) on constructor of
> > SpellChecker?
> >
>
> Mainly because it is a nasty peice of code. But it does a good job.
>
Thanks.
How can we suggest it (create an normal constructor without param) to the
team?
dreampeppers99 skrev:
1º Why need I pass a Directory objecto (obligatory) on constructor of
SpellChecker?
Mainly because it is a nasty peice of code. But it does a good job.
2º Suposse that in my dictonary I had these words:
"The Lord of the Rings: The Two Towers"
"The Lo
a Directory objecto (obligatory) on constructor of
SpellChecker?
2º Suposse that in my dictonary I had these words:
"The Lord of the Rings: The Two Towers"
"The Lord of the Rings: The Fellowship of the Ring"
"The Lord of the Rings: The Return of the King"
I just want
Use shingleFilter.
I'm working on a wider SpellChecker, I'll post a third patch soon.
https://admin.garambrogne.net/projets/revuedepresse/browser/trunk/src/java
M.
dreampeppers99 a écrit :
Hi,
I have two question about this GREAT tool.. (framework, library...
"whatever"
a Directory objecto (obligatory) on constructor of
SpellChecker?
2º Suposse that in my dictonary I had these words:
"The Lord of the Rings: The Two Towers"
"The Lord of the Rings: The Fellowship of the Ring"
"The Lord of the Rings: The Return of the King"
I just
load) this dictionary. Maybe .txt file.
The Lucene spellchecker is supposed to be fed with the terms from your own
index. This way suggestions will only contains words which actually appear
in your documents. If you still want to use a dictionary, see
http://wiki.services.openoffice.org/wiki/Dicti
Hi,
I am using the SpellCheck classes of Lucene for create the "Did you Mean"
feature.
I need load into memory all verbets of Spanish language (it wil be my
dictinary).
Where I can get (download) this dictionary. Maybe .txt file.
Thanks
: > sequence of terms,
> > : > then constructing a phrase query object using add(Term term,
> > : > int position)
> > : > method in org.apache.lucene.search.PhraseQuery class. Then
> > I can inject
> > : > similar words (suggested by SpellChecker) at appr
must much.
>
> : > I was thinking of parsing the phrase query string into a
> : > sequence of terms,
> : > then constructing a phrase query object using add(Term term,
> : > int position)
> : > method in org.apache.lucene.search.PhraseQuery class. Then
> I can inject
&
> int position)
: > method in org.apache.lucene.search.PhraseQuery class. Then I can inject
: > similar words (suggested by SpellChecker) at appropriate
: > positions for each
: > term as I construct the final phrase query object.
: >
: > Do you agree that this should work too?
:
:
hod in org.apache.lucene.search.PhraseQuery class. Then I can
inject
similar words (suggested by SpellChecker) at appropriate
positions for each
term as I construct the final phrase query object.
Do you agree that this should work too?
I never tried this but I'm sure it will not work.
The phrase query scorer requi
nt position)
> method in org.apache.lucene.search.PhraseQuery class. Then I can inject
> similar words (suggested by SpellChecker) at appropriate
> positions for each
> term as I construct the final phrase query object.
>
> Do you agree that this should work too?
I never tried this but I'm
by SpellChecker) at appropriate positions for each
term as I construct the final phrase query object.
Do you agree that this should work too?
On Dec 4, 2007 1:22 AM, Doron Cohen <[EMAIL PROTECTED]> wrote:
> See below -
>
> smokey <[EMAIL PROTECTED]> wrote on 03/12/2007 05:1
See below -
smokey <[EMAIL PROTECTED]> wrote on 03/12/2007 05:14:23:
> Suppose I have an index containing the terms impostor,
> imposter, fraud, and
> fruad, then presumably regardless of whether I spell impostor and fraud
> correctly, Lucene SpellChecker will offer the im
smokey <[EMAIL PROTECTED]> wrote on 03/12/2007 17:23:21:
> My question is for anyone who has experience with Lucene's SpellChecker,
> especially around its performance characteristics/ramifications.
>
> 1. Given the fact that SpellChecker expands a query by adding all the
I have not tried this yet. I am trying to understand the best practices from
others who have experiences with SpellChecker before actually implementing
it.
If I understand it correctly, the spell check class suggests alternate but
similar words for a single input term. So I believe I will have to
My question is for anyone who has experience with Lucene's SpellChecker,
especially around its performance characteristics/ramifications.
1. Given the fact that SpellChecker expands a query by adding all the
permutations of potentially misspelled word, how does it perform in general?
2. Ho
Have you actually tried this and done a query.toString() to see
how this is actually expanded? Not that I'm all that familiar
with SpellChecker, but before presuming how things work
you would get answers faster if you ran a test.
And, why do you care about performance? I know that'
Suppose I have an index containing the terms impostor, imposter, fraud, and
fruad, then presumably regardless of whether I spell impostor and fraud
correctly, Lucene SpellChecker will offer the improperly spelled versions as
corrections. This means that the phrase "The login fraud involv
My question is for anyone who has experience with Lucene's SpellChecker,
especially around its performance characteristics/ramifications.
1. Given the fact that SpellChecker expands a query by adding all the
permutations of potentially misspelled word, how does it perform in general?
2. Ho
On Saturday 13 October 2007 07:57, Christian Aschoff wrote:
> But as fare as i see (in the API DOC), the GermanAnalyzer is attached
> to the IndexWriter, i can't find an way to attach an analyzer it to a
> single field... Or do i miss something?
See PerFieldAnalyzerWrapper.
Regards
Daniel
But as fare as i see (in the API DOC), the GermanAnalyzer is attached
to the IndexWriter, i can't find an way to attach an analyzer it to a
single field... Or do i miss something? (There are tons of other
fields in the index where GermanAnalyzer fits perfect).
Am 12.10.2007 um 19:01 schrieb
On Friday 12 October 2007 15:48, Christian Aschoff wrote:
> indexWriter = new IndexWriter(MiscConstants.luceneDir,
> new GermanAnalyzer(), create);
> [...]
Not NO_NORMS is the problem but GermanAnalyzer. Try StandardAnalyzer on the
field you get the suggestions from.
Regards
Daniel
--
htt
Hi,
i tried to implement a 'did you mean'-function (and successed in some
way). But the hints from the SpellChecker are the stemmed versions of
the keywords.
For example, the search for the wrong word 'wasseraalfingen' results
in the hint 'wasseralfing' but
Thanks a ton, Hoss. I just did an ant on the contrib/spellchecker
directory and it produced a jar file in the LUCENE_HOME/build/
directory. Replacing the old jar file with the new jar file fixed my
errrors as I suspected. Thanks, again.
-Ryan
On Mar 15, 2007, at 1:38 PM, Chris
: My question basically was is there a spellchecker-2.*1*-rc1-dev.jar?
There *should* be a spellchecker 2.1 jar in the contrib/spellchecker/
directory of the 2.1 release.
-Hoss
-
To unsubscribe, e-mail: [EMAIL PROTECTED
15 mar 2007 kl. 15.33 skrev Ryan O'Hara:
My question basically was is there a spellchecker-2.*1*-rc1-dev.jar?
There has been changes since 2.0. You can also take a look at this:
https://issues.apache.org/jira/browse/LUCENE-632
The problems I'm having...
My question basically was is there a spellchecker-2.*1*-rc1-dev.jar?
The problems I'm having...
Exception:
java.lang.NullPointerException
at
org.apache.lucene.search.spell.SpellChecker.indexDictionary
(SpellChecker.java:298)
at DidYouMeanIndexer.createSpell
14 mar 2007 kl. 21.47 skrev Ryan O'Hara:
Is there a SpellChecker.jar compatible with Lucene 2.1. After
updating to Lucene 2.1, I seem to have lost the ability to create a
spell index using spellchecker-2.0-rc1-dev.jar. Any help would be
greatly appreciated.
Can you explain the pr
Is there a SpellChecker.jar compatible with Lucene 2.1. After
updating to Lucene 2.1, I seem to have lost the ability to create a
spell index using spellchecker-2.0-rc1-dev.jar. Any help would be
greatly appreciated.
Thanks,
Ryan
25 jan 2007 kl. 20.43 skrev Ryan O'Hara:
Is there anyway to sort the suggestions prior, so that grabbing
only one suggestion would give you the best suggestion, in this
case "genetics"?
Without having looked at the code for a long time, I think the
problem is what the lucene scoring cons
It seems that the suggestions returned by SpellChecker::suggestSimilar
(queryText, num_sug, reader, field, bool) are randomly chosen, then
sorted. By altering num_sug (10, 5, 3,2,1), I received the following
suggestions for "gnetics":
suggestion0: genetics
suggestion1: ginetics
s
The value of the word - the word itself, should be your unique identifier.
Otis
- Original Message
From: Josh Joy <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Thursday, January 11, 2007 5:39:24 AM
Subject: Re: SpellChecker Index - remove words?
Thanks for the repl
well.
Can the same be done for the spell checker index?
Thanks in advance,
Josh
Otis Gospodnetic wrote:
> Josh,
>
> The spellchecker index is just another Lucene index,
so you can delete documents/words from it the same way
you delete documents from any Lucene index - using
IndexRead
Josh,
The spellchecker index is just another Lucene index, so you can delete
documents/words from it the same way you delete documents from any Lucene index
- using IndexReader's delete(...) methods. You can pass that delete method a
Term where the field name is "word" and t
Hi All,
The spellchecker api is very nice to use, and I can
easily add words to
the index. However, because the words I am adding are
from another index
that is user generated per se (meaning it may have
spelling errors), how
can I safely remove words from the spell checker
index? If I know
Thanks Hoss,
I hadn't looked at the indexDictionary method yet. It does not appear to
be what I am looking for though...I should have been more explicit -
I am using the spellchecker for a 'did you mean search', so I am not
using a dedicated spell check index. Instead I am pa
Does anyone have any interested in making the spellchecker work across more
than one index? Does the coder of the spellchecker have any advice/dont do
that moron info etc ?
- Mark
I believe this was fixed in http://issues.apache.org/jira/browse/LUCENE-593
- Doron
Björn Ekengren <[EMAIL PROTECTED]> wrote on 10/10/2006 02:12:23:
> Hello, I have found that the spellchecker behaves a bit strange. My
> spell indexer class below doesn't work if I use the spellf
Hello, I have found that the spellchecker behaves a bit strange. My spell
indexer class below doesn't work if I use the spellfield string set in the
constructor directly, but it does work if I use the intern() value. The problem
resides in the hasNext() method of LuceneIterator where an o
Hi, I'm having issues creating a SpellChecker index.
I am running JRE 1.5.0_06, and have the following in my classpath -
lucene-core-2.0.0.jar & lucene-spellchecker-2.0.0.jar.
When I run the code below I get the following exception and a 1KB segments file
in the spellIndexDire
: import org.apache.lucene.search.spell.Dictionary;
: import org.apache.lucene.search.spell.LuceneDictionary;
: import org.apache.lucene.search.spell.SpellChecker;
:
: cause is seems like there are not included in Lucene.Net. Could this be
: right ? Are there alternatives ?
Those classes are pat
Lucene.Net. Could this be
right ? Are there alternatives ?
Thanks
N;-)eils
--
View this message in context:
http://www.nabble.com/SpellChecker-tf2021230.html#a5557630
Sent from the Lucene - Java Users forum at Nabble.com.
-
To
neils wrote:
Hi,
i had seen that a suggestion-tool (like google: Did you mean xyz?) can be
implemented with a addon for lucene which is called SpellingChecker.
Is this correct or is there another (better) solution and where can this
addon be downloaded (cause i do not find a working download
for you help ;-))
--
View this message in context:
http://www.nabble.com/SpellChecker-tf2021230.html#a5557433
Sent from the Lucene - Java Users forum at Nabble.com.
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional
, 2006 7:36:53 AM
Subject: Spellchecker Download at lucene wiki outdated
Hi all,
I don't know who can update the Wiki Pages so I am just mailing here.
The download of spellchecker1.1.zip
contribution does not work with Lucene-2.0 anymore.
http://wiki.apache.org/jakarta-lucene/SpellC
: I don't know who can update the Wiki Pages so I am just mailing here.
anyone can edit the wiki, just create an account (click "Login" and it
will give you that option)
: So I wanted to build _only_ the spellcheck-contrib from the
: SVN-repository, but it seems to me that there are no ant-tar
Hi all,
I don't know who can update the Wiki Pages so I am just mailing here.
The download of spellchecker1.1.zip
contribution does not work with Lucene-2.0 anymore.
http://wiki.apache.org/jakarta-lucene/SpellChecker?highlight=spellchecker1.1.zip
So I wanted to build _only_ the spell
o note Google's approach to this.
Not some fancy spell checker - just mining searcher behaviour patterns.
Google's Bosworth describes this approach approx 13 minutes into this podcast:
http://www.itconversations.com/shows/detail571.html
- Original Message
From: Van Nguyen
podcast:
http://www.itconversations.com/shows/detail571.html
- Original Message
From: Van Nguyen <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Monday, 12 June, 2006 11:09:20 PM
Subject: RE: question with spellchecker
I'll experiment with both.
Thanks...
-Original Message-
I'll experiment with both.
Thanks...
-Original Message-
From: mark harwood [mailto:[EMAIL PROTECTED]
Sent: Wednesday, June 07, 2006 2:16 AM
To: java-user@lucene.apache.org
Subject: Re: question with spellchecker
I think the problem in your particular example is the
suggestion sof
I think the problem in your particular example is the
suggestion software has no consideration of context.
I've been playing with context-sensitive suggestions
recently which take a bunch of validated (ie existing)
words (eg "tape") and use this to help shortlist
alternatives for an unknown or par
try your query like ((ducted^1000 duct~2) +tape)
Or maybe (duct* +tape)
or even better you could try to do some stemming (Porter stemmer should get rid
of these ed-suffixes) and some of the above
if this does not help, have a look at lingpipe spellChecker class as this looks
like exactly what
I'm implementing a spellchecker in my search and have a question.
After creating the index and spellchecker index, I pass in the word
"ducted tape" to search (I am expecting "duct tape" back).
I've played around with boosting the prefixes and suffixes, setti
On Tue, 2006-05-23 at 18:45 +0200, karl wettin wrote:
> On Tue, 2006-05-23 at 10:50 -0500, James Maes wrote:
>
> > It seems to be related to Strings and when they are created.
>
> the bug where the per instance float for maximum score (accuracy) is
> modified instead of using it local in the met
On Tue, 2006-05-23 at 10:50 -0500, James Maes wrote:
> It seems to be related to Strings and when they are created.
Try to create a new instance of SpellChecker for each suggestion. Will
it work? Then you have hit the bug where the per instance float for
maximum score (accuracy) is modif
Here is the problem:
We have implemented the lucene eninge within our application server which is
built ontop of Tomcat. We've had no problems creating the indexes or
searching them. The problems we are having are all related to the
SpellChecker part of the system.
It seems to be relat
1 - 100 of 104 matches
Mail list logo