gt; a possible discussion also for closed issues proceeds.
>
> Uwe
>
> Am 28. August 2022 13:03:53 MESZ schrieb Tomoko Uchida <
> tomoko.uchida.1...@gmail.com>:
>>
>>
>>
>> Jira notification to issues@ was stopped.
>> I'm going to run the scr
bout the fix-version: tagging. I wonder if going forward we
> > want to main that for new issues? I happened to notice there is also
> > this "milestone" feature in github -- does that seem like a place to
> > put version information?
> >
> > On Wed, Aug 24, 2022
tion about the fix-version: tagging. I wonder if going forward we
> want to main that for new issues? I happened to notice there is also
> this "milestone" feature in github -- does that seem like a place to
> put version information?
>
> On Wed, Aug 24, 2022 at 3:20 PM To
the issues.
>> - Pull requests are not affected. You can open/update PRs as usual.
>> Please let me know if you have any trouble with PRs.
>>
>>
>> Tomoko
>>
>>
>> 2022年8月18日(木) 18:23 Tomoko Uchida :
>>
>>> Hello all,
>>>
>&
/issues/1072
- While you cannot break these issues, migration scripts can
modify/overwrite your comments on the issues.
- Pull requests are not affected. You can open/update PRs as usual. Please
let me know if you have any trouble with PRs.
Tomoko
2022年8月18日(木) 18:23 Tomoko Uchida :
> He
Hello all,
The Lucene project decided to move our issue tracking system from Jira to
GitHub and migrate all Jira issues to GitHub.
We start issue migration on Monday, August 22 at 8:00 UTC.
1) We make Jira read-only before migration. You cannot update existing
issues until the migration is comple
) 14:41 Aditya Varun Chadha :
> Thanks Tomoko,
> There is no activity in JIRA from me as far as I can recall. This is the
> correct and only account though.
>
> On Sun 7. Aug 2022 at 05:50, Tomoko Uchida
> wrote:
>
> > Hi Aditya,
> > I found a Jira user "adichad&
ote:
>
> > jira: gnewton
> > github: gnewton (github.com/gnewton)
> >
> > Thanks,
> > Glen
> >
> >
> >
> > On Sat, 6 Aug 2022 at 14:11, Tomoko Uchida >
> > wrote:
> >
> > > Hi everyone.
> > >
> > > I wanted
>
>
>
> On Sat, 6 Aug 2022 at 14:11, Tomoko Uchida
> wrote:
>
> > Hi everyone.
> >
> > I wanted to let you know that we'll extend the deadline until the date
> the
> > migration is started (the date is not fixed yet).
> > Please let us k
Hi everyone.
I wanted to let you know that we'll extend the deadline until the date the
migration is started (the date is not fixed yet).
Please let us know your Jira/Github usernames if you don't see mapping(s)
for your account in this file:
https://github.com/apache/lucene-jira-archive/blob/main
Hi Atri and Christian,
thanks for your reply, we already have your accounts in
-
https://github.com/apache/lucene-jira-archive/blob/7654c0168a86fb05e942666d4514d48966d223bb/migration/mappings-data/account-map.csv.20220722.verified#L42
-
https://github.com/apache/lucene-jira-archive/blob/7654c0168a
Hi Mike, Marcus, and Praveen:
I verified the added two mappings - these Jira users have activity on
Lucene Jira, also corresponding GitHub accounts are valid.
- marcussorealheis
- pru30
Tomoko
2022年8月1日(月) 18:40 Michael McCandless :
> Thanks Praveen,
>
> I added your mapping here:
> https://gi
thors) still
remain unmapped since we have few clues to infer correct mappings.
Thanks,
Tomoko
2022年7月23日(土) 13:06 Tomoko Uchida :
> Hi all,
> we created account mapping (Jira user to GitHub account) for issue
> migration to GitHub. Please make sure your account is correctly mapped.
>
>
eply to this mail.
Tomoko
2022年7月10日(日) 15:02 Tomoko Uchida :
> [image: jira_profile.png]
> [image: github_profile.png]
> Hi all Lucene maintainers and contributors,
> we need your cooperation for the coming GitHub issue migration from Jira.
>
> In short, please set your GitHu
[image: jira_profile.png]
[image: github_profile.png]
Hi all Lucene maintainers and contributors,
we need your cooperation for the coming GitHub issue migration from Jira.
In short, please set your GitHub profile's "Name" to exactly the same
string in your Jira "Full Name" (see the attached images
Hi Anastasiya,
> 1) Use AnalyzingSuggester
> 2) Use PrefixQuery and SpanFirstQuery with the IndexSearcher.search()
> What is the differences between them? Do they have big differences in
> performance?
I don't think they are directly comparable but internally,
AnalyzingSuggerster decompounds the
harFilter since Lucene 9.0
so you don't need to worry about full-width and half-width
normalization if you use it.
Tomoko
2022年1月14日(金) 11:58 Tomoko Uchida :
>
> Hi,
>
> > The only thing that seems to differ is that the characters are full-width
> > vs half-width, so I was wo
Hi,
> The only thing that seems to differ is that the characters are full-width
> vs half-width, so I was wondering if this is intended behavior or a bug/too
> restrictive
This is intended behavior. The first column in the user dictionary
must be equal to the concatenated string of the second col
I will take care of Luke app. It has Lucene logo and sticker.
If by any chance a new logo for Luke itself is provided (maybe from fans
with graphic designing skills), I will also change it - the priority is low
I think.
2020年9月9日(水) 5:07 Anshum Gupta :
> Thank you, Ryan and everyone else who was
As for Lucene, "demo" module could be a help?
https://lucene.apache.org/core/8_6_1/demo/index.html
2020年8月17日(月) 23:23 Ali Akhtar :
> I don't know if there is a repo but this site helped me when i was getting
> started: http://www.lucenetutorial.com/
>
> On Mon, Aug 17, 2020 at 10:04 AM Ganapat
LUCENE-9322 as setting up a framework for experimentation +
> benchmarking (but not requiring us to commit a particular ANN
> implementation quite yet).
>
> Julie
>
> On 2020/07/17 12:16:18, Tomoko Uchida wrote:
> > > would it make sense to create a separate Lucene module
> would it make sense to create a separate Lucene module for ANN search ?
>From a bit of my experience with LUCENE-9004, it is currently impossible to
plug in or opt in custom codecs and indexing chain for aknn search without
touching lucene-core module (plz correct that if it's wrong).
I think LU
Hi,
did you try or consider SpanNearQuery?
You might need to insert some kind of spetial token (e.g., ) to the
end of the text field to match the "end of the sentence" anyways.
2020年1月10日(金) 1:30 陈志祥 :
> To be more clear,i think you need build a custom PhraseQuery class,which
> can set each slop
t; sacrifice some precision. For example mes/mess, ne/née, I'm sure there are
> others.
>
> So having both options is helpful, I don't think it's a bug on the face of
> it. However I didn't look closely at the code, so I'm not sure what the
> intent is exac
27日(土) 20:29 Tomoko Uchida :
>
> Hi Adrien,
>
> To me, it sounds simply a bug. Can you please open a JIRA (with a
> patch if possible)?
>
> Tomoko
>
> 2019年7月23日(火) 22:05 Adrien Gallou :
> >
> > Hi,
> >
> > I'm using both light and minimal Fre
Hi Adrien,
To me, it sounds simply a bug. Can you please open a JIRA (with a
patch if possible)?
Tomoko
2019年7月23日(火) 22:05 Adrien Gallou :
>
> Hi,
>
> I'm using both light and minimal French stemmers and encountered an issue
> when using the minimal stemmer.
>
> The light stemmer removes the la
Hi
forbiddenapis itself is an independent project, so maybe the github repo
for it is the right place for you (since it seems the code is not related
to Lucene)?
2019年7月17日(水) 2:05 elbek kamoliddinov :
> Hello everyone.
> We use Lucene's nice forbidden api check tool. I wonder if anonymous
> cla
; > Subject: Re: FuzzyQuery- why is it ignored?
> >
> > i see, i am using an older version 6.6 and we should switch to Your 8.1
> > version of at least 7.X.
> >
> > Tomoko i think i understood You meant MAIN NASHUA for the string :)
> >
> > Again i reall
t; version of at least 7.X.
>
> Tomoko i think i understood You meant MAIN NASHUA for the string :)
>
> Again i really appreciate all answers.
>
> How do we disable or enable stemming while indexing? :) another question.
>
> Best regards
>
>
> On 6/13/19 10:40 A
nerate hardly predictable
> > > similarities on additional fields. These add scores to the overall
> > > result and also affect normalization.
> > >
> > > Positively, the preferred matches are somewhere in the top ranks.
> > > So
> > > maybe rule
Hi Matthias,
What similarity class are you using.
Just a guess... but possibly one reason is document (field) length
normalization. Generally speaking shorter documents would get higher
scores than longer documents. (I saw that classic TFIDF similarity
tends to give much higher scores to shorter
the end except s works in this
> >>>> case.
> >>>>
> >>>> Case #3 had +contentDFLT:mains~2 but not +contentDFLT:"mains~2".
> >>>>
> >>>> i am using fuzzy query with ~ from Query.builder and that is not
> >>>>
dunstable etc., right?
>
> Best regards
>
> On 6/12/19 11:36 AM, baris.ka...@oracle.com wrote:
>
> Tomoko,-
>
> Thank You for Your suggestions. i am trying to understand it and i thought i
> did :)
>
> but it does not work with FuzzyQuery when i used with a *sin
> >> Index has a TextField named contentDFLT the following data :
> >> street="MAIN" city="NASHUA" municipality="HILLSBOROUGH" region="NEW
> >> HAMPSHIRE" country="UNITED STATES"
> >>
> >>
> >> When i
ere.)
Query parsing is a complex process, so it would be good to check
parsed raw query string especially when you have (reserved) special
characters in your query...
2019年6月11日(火) 1:10 Tomoko Uchida :
>
> Hi,
>
> I noticed one small thing in your previous mail.
>
> > when i use q1
reet=\"MAIN\""); i get same results
> which is good.
>
> But when i switch to MAINS~ then fuzzy query does not work.
>
>
> i need to say something with the q1 only in the booleanquery:
> it tries to match the MAIN in street, city, region and country which are
but it does not bring anything.
> >>
> >> createPhraseQuery builds a Phrasequery with one term equal to the
> >> string
> >> in the call.
> >>
> >> Best regards
> >>
> >>
> >>
> >> On 6/1
Hi,
What analyzer do you use for the text field? Is the term "Main"
correctly indexed?
2019年6月8日(土) 9:13 Baris Kazar :
>
> Hi,-
> i cant get FuzzyQuery working for searching with a query like Mains~2 to
> find the word Main in a TextField.
> Any suggestions please?
>
> -
will take time (and to be honest I'm not sure the patch will be
accepted) but I think it's much better than applying monkey-fixes to
the current build script.
If you are seriously interested in this work, please feel free to involve it.
Tomoko
2019年5月28日(火) 7:57 Tomoko Uchida :
>
&
why I mentioned "1-1)" and "1-2)" processes first.
> > >
> > > Anyway, in my personal opinion, Lucene does not need to consider whether
> > > the system dictionary status is good or not.
> > > I just think when some user wants to use a custo
dify the ant file or find some code for a long time
> to run the DictionaryBuilder.
> I think there should be at least a guide.
>
> Warm regards,
> Namgyu Kim
>
> P.S. Although not as good as the Tomoko's contents, there is a list of
> dictionaries supported by kuromoji.
> (
Hi,
The system dictionary is not a mere "word collection", it includes a
machine-learned language model which is carefully trained by
researchers. If you want to replace the system dictionary, you have to
start from "re-train" the model. This needs expert knowledge so I do
not recommend to just mo
Hi,
> If I provide entries in the user
dictionary is it just as if I had included them in the system
dictionary? If the same entry occurs in both, do the user dictionary
weights supersede those in the system dictionary? Is there some way to
suppress entries in the system dict?
User dictionary is
ter make huge query with all intersected items:
>
> Query query
> (String item: Intersection) {
> query.addBoolean({"foo": item, "bar": item})
> }
>
> Is any better way to find all doc's that has intersected terms?
>
> Thanks!
> --
> Regards,
> Valentin.
--
Tomoko Uchida
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
Hi,
Luke 8.0.0 is out.
Zip archive can be downloaded from here:
https://github.com/DmitryKey/luke/releases/tag/luke-swing-8.0.0
In this release,
- Lucene version was upgraded to 8.0.0.
- Added 'exact hits count' checkbox to the Search tab to return the
accurate total hit count.
- Supported 'Creat
Hi,
Luke 7.7.0 is out.
Zip archive can be downloaded at here:
https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.7.0
In this release,
- Lucene version was upgraded to 7.7.0.
- Some trivial UI bugs were fixed.
Regards,
Tomoko
s documents with given query,
> and returns TopDocs instance. TopDocs instance has array of ScoreDoc
> instances.
>
> My question is:
> 1. Will query be tokenized during search?
> 2. If so, does ScoreDoc instance have only
Hi Luke users,
Luke 7.6.0 was just released.
Zip archive can be downloaded from here:
https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.6.0
This release includes important bug fixes and improvements. Please consider
upgrading if you use Swing Luke 7.5.0.
Major changes: https://twitter.c
> --
> > Thanks & Regards,
> > Arpit Mittal
> > 650-834-6540
> >
>
>
> --
> Thanks & Regards,
> Arpit Mittal
> 650-834-6540
--
Tomoko Uchida
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
e are our Twitter account: @moco_beta_en or @DmitryKey
Thanks,
Tomoko
2018年10月4日(木) 0:50 Tomoko Uchida :
> Hi,
>
> Luke 7.5.0 was just released.
> For this release, we have two editions of Luke. (Sorry for the confusion.)
>
> * Luke 7.5.0 - Swing edition
> Swing edition can
Hi,
Luke 7.5.0 was just released.
For this release, we have two editions of Luke. (Sorry for the confusion.)
* Luke 7.5.0 - Swing edition
Swing edition can be downloaded from here:
https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.5.0
This version of Luke works with JDK 8/9/10/11.
Use
ndomization.
>
>
> -Hoss
> http://www.lucidworks.com/
>
> ---------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
--
Tomoko Uchida
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
xt condition
> and disregarding first two conditions.
>
>
> But it can be workable, if I manage to apply context condition separately.
>
>
> More probably using custom filtering through Collector interface
> https://lucene.apache
g
I'm not sure if Lucene had been having such Collector class so far.
Regards,
Tomoko
2018年8月20日(月) 19:07 alex stark :
>
> In Lucene 7.x, CollapsingTopDocsCollector is removed, is there any
> replacement of it?
--
Tomoko Uchida
oid beforeTriLevelCompositeIdRoutingTest() throws
> >> Exception {
> >> switch (random().nextInt(2)) {
> >> case 0:
> >> useFactory(null); // uses Standard or NRTCaching, FS based
> anyway.
> >> break;
> >> case 1:
&g
ble randomization, could you
give me any cues for this? Or, real test cases that disable randomization
are helpful for me, I will search Lucene/Solr code base.
Thanks,
Tomoko
2018年8月22日(水) 21:58 Tomoko Uchida :
> Thanks for your kind explanations,
>
> sorry of course I know what is the
c, which is not
> under control of the randomization. I breathe a sigh of relief,
> though, when a test _does_ reproduce with a particular seed 'cause
> then I have a hope of knowing the issue is actually fixed ;).
>
>
> Best,
> Erick
>
> On Tue, Aug 21, 2018 at 3:56 PM, Tomoko Uc
2216dc98b018ad
>
> 4> If MIke's intuition that it's one of the file system randomizations
> that occasionally gets hit _and_ you determine that that's an invalid
> test case (and for Luke requiring that the FS-basesd tests are all
> that ar
s again to see if I could get an example, and strangely
> these tests are no longer failing for me after several runs, when
> previously they failed quite often. I wonder if this is somehow related to
> running mvn from command line vs running in IntelliJ since previously I was
> doing t
Hello,
Could you give me some advice or comments about usage of LuceneTestCase.
Some of our unit tests extending LuceneTestCase fail by assertion error --
sometimes, randomly.
I suppose we use LuceneTestCase in inappropriate way, but cannot find out
how to fix it.
Here is some information about
in me or/and send some links on explanations for
> what Lucene is good and supposed to be used and for what it is not good and
> not supposed to be used?
>
> Thanks a lot in advance,
> Ira
>
> -----Original Message-
> From: Tomoko Uchida
> Sent: Wednesday,
.
Tomoko
2018年8月1日(水) 20:01 Gordin, Ira :
> Hi Tomoko,
>
> I need to search in many files and we use Lucene for this purpose.
>
> Thanks,
> Ira
>
> -Original Message-----
> From: Tomoko Uchida
> Sent: Wednesday, August 1, 2018 1:49 PM
> To: java-user@lucene.ap
es inside lines and not inside words,
> so I
> > am considering to tokenize lines. How I should to implement this idea?
> > I'll really appriciate you have more ideas/implementations.
> >
> > Thanks in advance,
> > Ira
> >
>
>
>
> -
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>
--
Tomoko Uchida
Download the release zip from here:
https://github.com/DmitryKey/luke/releases/tag/luke-7.4.0
This update is for upgrading Lucene 7.4.0.
Note: If you use OpenJDK you need to install OpenJFX separately to run
JavaFX. Oracle HotSpot JVM includes JavaFX runtime.
Best,
Tomoko Uchida
There is SynonymGraphFilterFactory, the successor of SynonymFilterFactory.
They have same parameters so you can easily switch to.
See the javadoc for details:
https://lucene.apache.org/core/7_3_0/analyzers-common/org/apache/lucene/analysis/synonym/SynonymGraphFilterFactory.html
--
Tomoko Uchida
OpenJDK you need to install OpenJFX separately to run
JavaFX. Oracle HotSpot JVM includes JavaFX runtime.
--
Tomoko Uchida
plan to switch current (Thinlet) Luke to the JavaFX version in a future
release. Please try it and feedback is welcome!
Note: If you use OpenJDK you need to install OpenJFX separately to run
JavaFX. Oracle HotSpot JVM includes JavaFX runtime.
--
Tomoko Uchida
Download the release zip here:
https://github.com/DmitryKey/luke/releases/tag/luke-7.1.0
Upgrade to Lucene 7.1.0.
and, other changes in this release:
<https://github.com/DmitryKey/luke/blob/master/CHANGES.txt>
--
Tomoko Uchida
o respective contributors!
--
Tomoko Uchida
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
Hi,
I do not fully understand your requirements, but analyzers-kuromoji
(one of extended package for Japanese morphological analysis) has some
custom token attributes.
http://lucene.apache.org/core/6_0_1/analyzers-kuromoji/index.html?org/apache/lucene/analysis/ja/tokenattributes/package-summary.ht
Hi,
> We have a Lucene 3.6-based index set which is quite large and currently
in use. What will be the upgrade path to (a) 4.x or (b) 5.x? With respect
to the data migration, etc. What are the steps and is it technically
possible? I read that 3.x to 5.x is not possible, and throws IndexTooStale
ex
t has index-time params to
> further reduce how much RAM it is allowed to use for the terms index.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Thu, May 14, 2015 at 2:26 AM, Tomoko Uchida
> wrote:
> > Hi,
> >
> > A question about API cha
Hi,
A question about API changes in Lucene 5.
The method "DirectoryReader.open(commit, termInfosIndexDivisor)" has been
removed at Lucene 5.0.
This functionality was completely removed?
Backgrounds:
I am trying to make Luke work with Lucene 5.1.
Luke for Lucene 4.x supports "TermInfos index divis
Hi,
Currently Luke is not the part of Lucene, you should check Luke project in
Github.
https://github.com/DmitryKey/luke
Here is the Issue for lucene 5.
https://github.com/DmitryKey/luke/issues/21
I'm now trying to make Luke support Lucene 5.x in my local branch.
Pull requests from other develop
>
> regards.
>
> On Thu, Feb 19, 2015 at 2:10 AM, Tomoko Uchida <
> tomoko.uchida.1...@gmail.com
> > wrote:
>
> > Hi,
> >
> > I'm afraid there are no easy or straight way for your requirement.
> > I would try create an temporary tiny in
Hi,
I'm afraid there are no easy or straight way for your requirement.
I would try create an temporary tiny index from search results on the fly
in memory, and get top N terms from it by HighFreqTerms.
http://lucene.apache.org/core/4_10_3/misc/org/apache/lucene/misc/HighFreqTerms.html
(The logic i
Hi,
> find me the 10 most similar documents
I suppose you mean "mlt.count" supported by MoreLikeThisComponent.
https://cwiki.apache.org/confluence/display/solr/MoreLikeThis
MLT is ordinary search in Lucene, so you get documents in order of
similarity (default scoring criteria) and can limit resu
Hi,
How often does this error occur?
You do not tell the lucene version, but I guess you use lucene 3.x
according to the stack trace...
IndexWriter would not be closed until IndexWriter.close() method is called
explicitly.
https://github.com/apache/lucene-solr/blob/lucene_solr_3_6_2/lucene/core/sr
:
>
> Hi Tomoko,
>
> does sorting with Locala.JAPANESE also work for Kanji. Since Hiragana and
> Katakana are based on the phonetics, I guess it is easier to define a
> sorting order. But Kanji is more similar to the Chinese.
>
> Thanks,
> Nils
>
>
> On 17.12.2014
Hi, Nils,
I don't know Chinese at all... but collation is very important in Japanese
too.
Lucene has org.apache.lucene.collation package that use ICU4J's collators
(you can find "lucene-analyzers-icu-4.10.2.jar" in analysis/icu directory).
http://lucene.apache.org/core/4_10_2/analyzers-icu/index.h
eami.com/blog/comparing-document-classification-functions-of-
> lucene-and-mahout.html
>
>
> (2014/12/14 11:11), Tomoko Uchida wrote:
>
>> Sorry again,
>> I checked the o.a.l.u.fst.TestFSTs.java, and found this modification seems
>> to be OK.
>>
>> So, I chang
someone correct it.
Regards,
Tomoko
2014-12-11 0:34 GMT+09:00 Tomoko Uchida :
>
> Sorry, I paste a wrong link...
>
> > I run the "FST Construction example:" code in package.html with lucene
> 4.10.2, and saw a compile error.
> > http://lucene.apache.org/core
Hi Christopher,
At my environment, IndexFiles demo works fine.
May be you must give directory path including source data you want to index
after '-docs'.
For example,
$ ls /path/to/source/dir
aaa.txt bbb.txt
$ java -cp
"demo/lucene-demo-4.10.2.jar:analysis/common/lucene-analyzers-common-4.10.2
2, and saw a compile error.
http://lucene.apache.org/core/4_10_2/core/index.html?org/apache/lucene/util/fst/package-summary.html
2014-12-11 0:32 GMT+09:00 Tomoko Uchida :
> Hi,
>
> I run the "FST Construction example:" code in package.html with lucene
> 4.10.2, and saw a compi
Hi,
I run the "FST Construction example:" code in package.html with lucene
4.10.2, and saw a compile error.
http://lucene.apache.org/core/4_10_2/core/index.html
Error ocurred at this line.
builder.add(Util.toIntsRef(scratchBytes, scratchInts), outputValues[i]);
Consulting with Javadoc, I found U
85 matches
Mail list logo