from:"Tomoko Uchida"

Re: [ANNOUNCE] Issue migration Jira to GitHub starts on Monday, August 22

2022-08-29 Thread Tomoko Uchida

gt; a possible discussion also for closed issues proceeds. > > Uwe > > Am 28. August 2022 13:03:53 MESZ schrieb Tomoko Uchida < > tomoko.uchida.1...@gmail.com>: >> >> >> >> Jira notification to issues@ was stopped. >> I'm going to run the scr

Re: [ANNOUNCE] Issue migration Jira to GitHub starts on Monday, August 22

2022-08-24 Thread Tomoko Uchida

bout the fix-version: tagging. I wonder if going forward we > > want to main that for new issues? I happened to notice there is also > > this "milestone" feature in github -- does that seem like a place to > > put version information? > > > > On Wed, Aug 24, 2022

Re: [ANNOUNCE] Issue migration Jira to GitHub starts on Monday, August 22

2022-08-24 Thread Tomoko Uchida

tion about the fix-version: tagging. I wonder if going forward we > want to main that for new issues? I happened to notice there is also > this "milestone" feature in github -- does that seem like a place to > put version information? > > On Wed, Aug 24, 2022 at 3:20 PM To

Re: [ANNOUNCE] Issue migration Jira to GitHub starts on Monday, August 22

2022-08-24 Thread Tomoko Uchida

the issues. >> - Pull requests are not affected. You can open/update PRs as usual. >> Please let me know if you have any trouble with PRs. >> >> >> Tomoko >> >> >> 2022年8月18日(木) 18:23 Tomoko Uchida : >> >>> Hello all, >>> >&

Re: [ANNOUNCE] Issue migration Jira to GitHub starts on Monday, August 22

2022-08-22 Thread Tomoko Uchida

/issues/1072 - While you cannot break these issues, migration scripts can modify/overwrite your comments on the issues. - Pull requests are not affected. You can open/update PRs as usual. Please let me know if you have any trouble with PRs. Tomoko 2022年8月18日(木) 18:23 Tomoko Uchida : > He

[ANNOUNCE] Issue migration Jira to GitHub starts on Monday, August 22

2022-08-18 Thread Tomoko Uchida

Hello all, The Lucene project decided to move our issue tracking system from Jira to GitHub and migrate all Jira issues to GitHub. We start issue migration on Monday, August 22 at 8:00 UTC. 1) We make Jira read-only before migration. You cannot update existing issues until the migration is comple

Re: [HELP] Link your Apache Lucene Jira and GitHub account ids before Thursday August 4 midnight (in your local time)

2022-08-06 Thread Tomoko Uchida

) 14:41 Aditya Varun Chadha : > Thanks Tomoko, > There is no activity in JIRA from me as far as I can recall. This is the > correct and only account though. > > On Sun 7. Aug 2022 at 05:50, Tomoko Uchida > wrote: > > > Hi Aditya, > > I found a Jira user "adichad&

Re: [HELP] Link your Apache Lucene Jira and GitHub account ids before Thursday August 4 midnight (in your local time)

2022-08-06 Thread Tomoko Uchida

ote: > > > jira: gnewton > > github: gnewton (github.com/gnewton) > > > > Thanks, > > Glen > > > > > > > > On Sat, 6 Aug 2022 at 14:11, Tomoko Uchida > > > wrote: > > > > > Hi everyone. > > > > > > I wanted

Re: [HELP] Link your Apache Lucene Jira and GitHub account ids before Thursday August 4 midnight (in your local time)

2022-08-06 Thread Tomoko Uchida

> > > > On Sat, 6 Aug 2022 at 14:11, Tomoko Uchida > wrote: > > > Hi everyone. > > > > I wanted to let you know that we'll extend the deadline until the date > the > > migration is started (the date is not fixed yet). > > Please let us k

Re: [HELP] Link your Apache Lucene Jira and GitHub account ids before Thursday August 4 midnight (in your local time)

2022-08-06 Thread Tomoko Uchida

Hi everyone. I wanted to let you know that we'll extend the deadline until the date the migration is started (the date is not fixed yet). Please let us know your Jira/Github usernames if you don't see mapping(s) for your account in this file: https://github.com/apache/lucene-jira-archive/blob/main

Re: [HELP] Link your Apache Lucene Jira and GitHub account ids before Thursday August 4 midnight (in your local time)

2022-08-01 Thread Tomoko Uchida

Hi Atri and Christian, thanks for your reply, we already have your accounts in - https://github.com/apache/lucene-jira-archive/blob/7654c0168a86fb05e942666d4514d48966d223bb/migration/mappings-data/account-map.csv.20220722.verified#L42 - https://github.com/apache/lucene-jira-archive/blob/7654c0168a

Re: [HELP] Link your Apache Lucene Jira and GitHub account ids before Thursday August 4 midnight (in your local time)

2022-08-01 Thread Tomoko Uchida

Hi Mike, Marcus, and Praveen: I verified the added two mappings - these Jira users have activity on Lucene Jira, also corresponding GitHub accounts are valid. - marcussorealheis - pru30 Tomoko 2022年8月1日(月) 18:40 Michael McCandless : > Thanks Praveen, > > I added your mapping here: > https://gi

Re: [IMPORTANT] Please set your GitHub profile "name" to your Jira "Full Name" for account mapping

2022-07-25 Thread Tomoko Uchida

thors) still remain unmapped since we have few clues to infer correct mappings. Thanks, Tomoko 2022年7月23日(土) 13:06 Tomoko Uchida : > Hi all, > we created account mapping (Jira user to GitHub account) for issue > migration to GitHub. Please make sure your account is correctly mapped. > >

Re: [IMPORTANT] Please set your GitHub profile "name" to your Jira "Full Name" for account mapping

2022-07-22 Thread Tomoko Uchida

eply to this mail. Tomoko 2022年7月10日(日) 15:02 Tomoko Uchida : > [image: jira_profile.png] > [image: github_profile.png] > Hi all Lucene maintainers and contributors, > we need your cooperation for the coming GitHub issue migration from Jira. > > In short, please set your GitHu

[IMPORTANT] Please set your GitHub profile "name" to your Jira "Full Name" for account mapping

2022-07-09 Thread Tomoko Uchida

[image: jira_profile.png] [image: github_profile.png] Hi all Lucene maintainers and contributors, we need your cooperation for the coming GitHub issue migration from Jira. In short, please set your GitHub profile's "Name" to exactly the same string in your Jira "Full Name" (see the attached images

Re: Auto-complete in Lucene

2022-05-27 Thread Tomoko Uchida

Hi Anastasiya, > 1) Use AnalyzingSuggester > 2) Use PrefixQuery and SpanFirstQuery with the IndexSearcher.search() > What is the differences between them? Do they have big differences in > performance? I don't think they are directly comparable but internally, AnalyzingSuggerster decompounds the

Re: Issue with Japanese User Dictionary

2022-01-29 Thread Tomoko Uchida

harFilter since Lucene 9.0 so you don't need to worry about full-width and half-width normalization if you use it. Tomoko 2022年1月14日(金) 11:58 Tomoko Uchida : > > Hi, > > > The only thing that seems to differ is that the characters are full-width > > vs half-width, so I was wo

Re: Issue with Japanese User Dictionary

2022-01-13 Thread Tomoko Uchida

Hi, > The only thing that seems to differ is that the characters are full-width > vs half-width, so I was wondering if this is intended behavior or a bug/too > restrictive This is intended behavior. The first column in the user dictionary must be equal to the concatenated string of the second col

Re: [VOTE] Lucene logo contest, third time's a charm

2020-09-08 Thread Tomoko Uchida

I will take care of Luke app. It has Lucene logo and sticker. If by any chance a new logo for Luke itself is provided (maybe from fans with graphic designing skills), I will also change it - the priority is low I think. 2020年9月9日(水) 5:07 Anshum Gupta : > Thank you, Ryan and everyone else who was

Re: Reg Java Learning Git repo for Apache Lucene & Solr.

2020-08-17 Thread Tomoko Uchida

As for Lucene, "demo" module could be a help? https://lucene.apache.org/core/8_6_1/demo/index.html 2020年8月17日(月) 23:23 Ali Akhtar : > I don't know if there is a repo but this site helped me when i was getting > started: http://www.lucenetutorial.com/ > > On Mon, Aug 17, 2020 at 10:04 AM Ganapat

Re: ANN search current state

2020-07-23 Thread Tomoko Uchida

LUCENE-9322 as setting up a framework for experimentation + > benchmarking (but not requiring us to commit a particular ANN > implementation quite yet). > > Julie > > On 2020/07/17 12:16:18, Tomoko Uchida wrote: > > > would it make sense to create a separate Lucene module

Re: ANN search current state

2020-07-17 Thread Tomoko Uchida

> would it make sense to create a separate Lucene module for ANN search ? >From a bit of my experience with LUCENE-9004, it is currently impossible to plug in or opt in custom codecs and indexing chain for aknn search without touching lucene-core module (plz correct that if it's wrong). I think LU

Re: How to query for 'any word' in a phrase

2020-01-09 Thread Tomoko Uchida

Hi, did you try or consider SpanNearQuery? You might need to insert some kind of spetial token (e.g., ) to the end of the text field to match the "end of the sentence" anyways. 2020年1月10日(金) 1:30 陈志祥 : > To be more clear，i think you need build a custom PhraseQuery class，which > can set each slop

Re: Question about the light and minimal French stemmers

2019-07-27 Thread Tomoko Uchida

t; sacrifice some precision. For example mes/mess, ne/née, I'm sure there are > others. > > So having both options is helpful, I don't think it's a bug on the face of > it. However I didn't look closely at the code, so I'm not sure what the > intent is exac

Re: Question about the light and minimal French stemmers

2019-07-27 Thread Tomoko Uchida

27日(土) 20:29 Tomoko Uchida : > > Hi Adrien, > > To me, it sounds simply a bug. Can you please open a JIRA (with a > patch if possible)? > > Tomoko > > 2019年7月23日(火) 22:05 Adrien Gallou : > > > > Hi, > > > > I'm using both light and minimal Fre

Re: Question about the light and minimal French stemmers

2019-07-27 Thread Tomoko Uchida

Hi Adrien, To me, it sounds simply a bug. Can you please open a JIRA (with a patch if possible)? Tomoko 2019年7月23日(火) 22:05 Adrien Gallou : > > Hi, > > I'm using both light and minimal French stemmers and encountered an issue > when using the minimal stemmer. > > The light stemmer removes the la

Re: SuppressForbidden for anonymous classes

2019-07-16 Thread Tomoko Uchida

Hi forbiddenapis itself is an independent project, so maybe the github repo for it is the right place for you (since it seems the code is not related to Lucene)? 2019年7月17日(水) 2:05 elbek kamoliddinov : > Hello everyone. > We use Lucene's nice forbidden api check tool. I wonder if anonymous > cla

Re: FuzzyQuery- why is it ignored?

2019-06-22 Thread Tomoko Uchida

; > Subject: Re: FuzzyQuery- why is it ignored? > > > > i see, i am using an older version 6.6 and we should switch to Your 8.1 > > version of at least 7.X. > > > > Tomoko i think i understood You meant MAIN NASHUA for the string :) > > > > Again i reall

Re: FuzzyQuery- why is it ignored?

2019-06-16 Thread Tomoko Uchida

t; version of at least 7.X. > > Tomoko i think i understood You meant MAIN NASHUA for the string :) > > Again i really appreciate all answers. > > How do we disable or enable stemming while indexing? :) another question. > > Best regards > > > On 6/13/19 10:40 A

Re: Best fuzzy match on multiple terms

2019-06-14 Thread Tomoko Uchida

nerate hardly predictable > > > similarities on additional fields. These add scores to the overall > > > result and also affect normalization. > > > > > > Positively, the preferred matches are somewhere in the top ranks. > > > So > > > maybe rule

Re: Best fuzzy match on multiple terms

2019-06-14 Thread Tomoko Uchida

Hi Matthias, What similarity class are you using. Just a guess... but possibly one reason is document (field) length normalization. Generally speaking shorter documents would get higher scores than longer documents. (I saw that classic TFIDF similarity tends to give much higher scores to shorter

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread Tomoko Uchida

the end except s works in this > >>>> case. > >>>> > >>>> Case #3 had +contentDFLT:mains~2 but not +contentDFLT:"mains~2". > >>>> > >>>> i am using fuzzy query with ~ from Query.builder and that is not > >>>>

Re: FuzzyQuery- why is it ignored?

2019-06-12 Thread Tomoko Uchida

dunstable etc., right? > > Best regards > > On 6/12/19 11:36 AM, baris.ka...@oracle.com wrote: > > Tomoko,- > > Thank You for Your suggestions. i am trying to understand it and i thought i > did :) > > but it does not work with FuzzyQuery when i used with a *sin

Re: FuzzyQuery

2019-06-11 Thread Tomoko Uchida

> >> Index has a TextField named contentDFLT the following data : > >> street="MAIN" city="NASHUA" municipality="HILLSBOROUGH" region="NEW > >> HAMPSHIRE" country="UNITED STATES" > >> > >> > >> When i

Re: FuzzyQuery

2019-06-10 Thread Tomoko Uchida

ere.) Query parsing is a complex process, so it would be good to check parsed raw query string especially when you have (reserved) special characters in your query... 2019年6月11日(火) 1:10 Tomoko Uchida : > > Hi, > > I noticed one small thing in your previous mail. > > > when i use q1

Re: FuzzyQuery

2019-06-10 Thread Tomoko Uchida

reet=\"MAIN\""); i get same results > which is good. > > But when i switch to MAINS~ then fuzzy query does not work. > > > i need to say something with the q1 only in the booleanquery: > it tries to match the MAIN in street, city, region and country which are

Re: FuzzyQuery

2019-06-10 Thread Tomoko Uchida

but it does not bring anything. > >> > >> createPhraseQuery builds a Phrasequery with one term equal to the > >> string > >> in the call. > >> > >> Best regards > >> > >> > >> > >> On 6/1

Re: FuzzyQuery

2019-06-09 Thread Tomoko Uchida

Hi, What analyzer do you use for the text field? Is the term "Main" correctly indexed? 2019年6月8日(土) 9:13 Baris Kazar : > > Hi,- > i cant get FuzzyQuery working for searching with a query like Mains~2 to > find the word Main in a TextField. > Any suggestions please? > > -

Re: JapaneseAnalyzer's system vs user dict

2019-05-28 Thread Tomoko Uchida

will take time (and to be honest I'm not sure the patch will be accepted) but I think it's much better than applying monkey-fixes to the current build script. If you are seriously interested in this work, please feel free to involve it. Tomoko 2019年5月28日(火) 7:57 Tomoko Uchida : > &

Re: JapaneseAnalyzer's system vs user dict

2019-05-27 Thread Tomoko Uchida

why I mentioned "1-1)" and "1-2)" processes first. > > > > > > Anyway, in my personal opinion, Lucene does not need to consider whether > > > the system dictionary status is good or not. > > > I just think when some user wants to use a custo

Re: JapaneseAnalyzer's system vs user dict

2019-05-26 Thread Tomoko Uchida

dify the ant file or find some code for a long time > to run the DictionaryBuilder. > I think there should be at least a guide. > > Warm regards, > Namgyu Kim > > P.S. Although not as good as the Tomoko's contents, there is a list of > dictionaries supported by kuromoji. > (

Re: JapaneseAnalyzer's system vs user dict

2019-05-26 Thread Tomoko Uchida

Hi, The system dictionary is not a mere "word collection", it includes a machine-learned language model which is carefully trained by researchers. If you want to replace the system dictionary, you have to start from "re-train" the model. This needs expert knowledge so I do not recommend to just mo

Re: JapaneseAnalyzer's system vs user dict

2019-05-25 Thread Tomoko Uchida

Hi, > If I provide entries in the user dictionary is it just as if I had included them in the system dictionary? If the same entry occurs in both, do the user dictionary weights supersede those in the system dictionary? Is there some way to suppress entries in the system dict? User dictionary is

Re: fields contains equals term docs search

2019-04-19 Thread Tomoko Uchida

ter make huge query with all intersected items: > > Query query > (String item: Intersection) { > query.addBoolean({"foo": item, "bar": item}) > } > > Is any better way to find all doc's that has intersected terms? > > Thanks! > -- > Regards, > Valentin. -- Tomoko Uchida - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

[ANNOUNCE] Luke 8.0.0 released

2019-03-23 Thread Tomoko Uchida

Hi, Luke 8.0.0 is out. Zip archive can be downloaded from here: https://github.com/DmitryKey/luke/releases/tag/luke-swing-8.0.0 In this release, - Lucene version was upgraded to 8.0.0. - Added 'exact hits count' checkbox to the Search tab to return the accurate total hit count. - Supported 'Creat

[ANNOUNCE] Luke 7.7.0 released

2019-02-11 Thread Tomoko Uchida

Hi, Luke 7.7.0 is out. Zip archive can be downloaded at here: https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.7.0 In this release, - Lucene version was upgraded to 7.7.0. - Some trivial UI bugs were fixed. Regards, Tomoko

Re: Question about Indexsearcher.search()

2019-01-25 Thread Tomoko Uchida

s documents with given query, > and returns TopDocs instance. TopDocs instance has array of ScoreDoc > instances. > > My question is: > 1. Will query be tokenized during search? > 2. If so, does ScoreDoc instance have only

[ANNOUNCE] Luke 7.6.0 released

2018-12-15 Thread Tomoko Uchida

Hi Luke users, Luke 7.6.0 was just released. Zip archive can be downloaded from here: https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.6.0 This release includes important bug fixes and improvements. Please consider upgrading if you use Swing Luke 7.5.0. Major changes: https://twitter.c

Re: Question about upgrading lucene 4.4.0 to 7.5.0

2018-11-06 Thread Tomoko Uchida

> -- > > Thanks & Regards, > > Arpit Mittal > > 650-834-6540 > > > > > -- > Thanks & Regards, > Arpit Mittal > 650-834-6540 -- Tomoko Uchida - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: [ANNOUNCE] Luke 7.5.0 released

2018-10-05 Thread Tomoko Uchida

e are our Twitter account: @moco_beta_en or @DmitryKey Thanks, Tomoko 2018年10月4日(木) 0:50 Tomoko Uchida : > Hi, > > Luke 7.5.0 was just released. > For this release, we have two editions of Luke. (Sorry for the confusion.) > > * Luke 7.5.0 - Swing edition > Swing edition can

[ANNOUNCE] Luke 7.5.0 released

2018-10-03 Thread Tomoko Uchida

Hi, Luke 7.5.0 was just released. For this release, we have two editions of Luke. (Sorry for the confusion.) * Luke 7.5.0 - Swing edition Swing edition can be downloaded from here: https://github.com/DmitryKey/luke/releases/tag/luke-swing-7.5.0 This version of Luke works with JDK 8/9/10/11. Use

Re: Question about usage of LuceneTestCase

2018-08-27 Thread Tomoko Uchida

ndomization. > > > -Hoss > http://www.lucidworks.com/ > > --------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > -- Tomoko Uchida - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: SQL OR in lucene : where ((term1=a and term2=b) OR (term3=a and term4=b)) and context in (2,3,4,5.....200)

2018-08-24 Thread Tomoko Uchida

xt condition > and disregarding first two conditions. > > > But it can be workable, if I manage to apply context condition separately. > > > More probably using custom filtering through Collector interface > https://lucene.apache

Re: Replacement of CollapsingTopDocsCollector

2018-08-24 Thread Tomoko Uchida

g I'm not sure if Lucene had been having such Collector class so far. Regards, Tomoko 2018年8月20日(月) 19:07 alex stark : > > In Lucene 7.x, CollapsingTopDocsCollector is removed, is there any > replacement of it? -- Tomoko Uchida

Re: Question about usage of LuceneTestCase

2018-08-22 Thread Tomoko Uchida

oid beforeTriLevelCompositeIdRoutingTest() throws > >> Exception { > >> switch (random().nextInt(2)) { > >> case 0: > >> useFactory(null); // uses Standard or NRTCaching, FS based > anyway. > >> break; > >> case 1: &g

Re: Question about usage of LuceneTestCase

2018-08-22 Thread Tomoko Uchida

ble randomization, could you give me any cues for this? Or, real test cases that disable randomization are helpful for me, I will search Lucene/Solr code base. Thanks, Tomoko 2018年8月22日(水) 21:58 Tomoko Uchida : > Thanks for your kind explanations, > > sorry of course I know what is the

Re: Question about usage of LuceneTestCase

2018-08-22 Thread Tomoko Uchida

c, which is not > under control of the randomization. I breathe a sigh of relief, > though, when a test _does_ reproduce with a particular seed 'cause > then I have a hope of knowing the issue is actually fixed ;). > > > Best, > Erick > > On Tue, Aug 21, 2018 at 3:56 PM, Tomoko Uc

Re: Question about usage of LuceneTestCase

2018-08-21 Thread Tomoko Uchida

2216dc98b018ad > > 4> If MIke's intuition that it's one of the file system randomizations > that occasionally gets hit _and_ you determine that that's an invalid > test case (and for Luke requiring that the FS-basesd tests are all > that ar

Re: Question about usage of LuceneTestCase

2018-08-21 Thread Tomoko Uchida

s again to see if I could get an example, and strangely > these tests are no longer failing for me after several runs, when > previously they failed quite often. I wonder if this is somehow related to > running mvn from command line vs running in IntelliJ since previously I was > doing t

Question about usage of LuceneTestCase

2018-08-21 Thread Tomoko Uchida

Hello, Could you give me some advice or comments about usage of LuceneTestCase. Some of our unit tests extending LuceneTestCase fail by assertion error -- sometimes, randomly. I suppose we use LuceneTestCase in inappropriate way, but cannot find out how to fix it. Here is some information about

Re: Search in lines, so need to index lines?

2018-08-02 Thread Tomoko Uchida

in me or/and send some links on explanations for > what Lucene is good and supposed to be used and for what it is not good and > not supposed to be used? > > Thanks a lot in advance, > Ira > > -----Original Message- > From: Tomoko Uchida > Sent: Wednesday,

Re: Search in lines, so need to index lines?

2018-08-01 Thread Tomoko Uchida

. Tomoko 2018年8月1日(水) 20:01 Gordin, Ira : > Hi Tomoko, > > I need to search in many files and we use Lucene for this purpose. > > Thanks, > Ira > > -Original Message----- > From: Tomoko Uchida > Sent: Wednesday, August 1, 2018 1:49 PM > To: java-user@lucene.ap

Re: Search in lines, so need to index lines?

2018-08-01 Thread Tomoko Uchida

es inside lines and not inside words, > so I > > am considering to tokenize lines. How I should to implement this idea? > > I'll really appriciate you have more ideas/implementations. > > > > Thanks in advance, > > Ira > > > > > > - > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > -- Tomoko Uchida

[ANNOUNCE] Luke 7.4.0 released

2018-07-01 Thread Tomoko Uchida

Download the release zip from here: https://github.com/DmitryKey/luke/releases/tag/luke-7.4.0 This update is for upgrading Lucene 7.4.0. Note: If you use OpenJDK you need to install OpenJFX separately to run JavaFX. Oracle HotSpot JVM includes JavaFX runtime. Best, Tomoko Uchida

Re: Lucene add synonyms to serch

2018-06-21 Thread Tomoko Uchida

There is SynonymGraphFilterFactory, the successor of SynonymFilterFactory. They have same parameters so you can easily switch to. See the javadoc for details: https://lucene.apache.org/core/7_3_0/analyzers-common/org/apache/lucene/analysis/synonym/SynonymGraphFilterFactory.html -- Tomoko Uchida

[ANNOUNCE] Luke 7.3.1 released

2018-05-15 Thread Tomoko Uchida

OpenJDK you need to install OpenJFX separately to run JavaFX. Oracle HotSpot JVM includes JavaFX runtime. -- Tomoko Uchida

[ANNOUNCE] Luke 7.2.0 released

2018-01-07 Thread Tomoko Uchida

plan to switch current (Thinlet) Luke to the JavaFX version in a future release. Please try it and feedback is welcome! Note: If you use OpenJDK you need to install OpenJFX separately to run JavaFX. Oracle HotSpot JVM includes JavaFX runtime. -- Tomoko Uchida

[ANNOUNCE] Luke 7.1.0 released

2017-10-20 Thread Tomoko Uchida

Download the release zip here: https://github.com/DmitryKey/luke/releases/tag/luke-7.1.0 Upgrade to Lucene 7.1.0. and, other changes in this release: <https://github.com/DmitryKey/luke/blob/master/CHANGES.txt> -- Tomoko Uchida

[ANNOUNCEMENT] Luke 6.5.0 released

2017-05-09 Thread Tomoko Uchida

o respective contributors! -- Tomoko Uchida - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Custom TokenStream + custom Attributes

2016-06-12 Thread Tomoko Uchida

Hi, I do not fully understand your requirements, but analyzers-kuromoji (one of extended package for Japanese morphological analysis) has some custom token attributes. http://lucene.apache.org/core/6_0_1/analyzers-kuromoji/index.html?org/apache/lucene/analysis/ja/tokenattributes/package-summary.ht

Re: Upgrading Lucene Indices and maintaining same resultset

2015-05-31 Thread Tomoko Uchida

Hi, > We have a Lucene 3.6-based index set which is quite large and currently in use. What will be the upgrade path to (a) 4.x or (b) 5.x? With respect to the data migration, etc. What are the steps and is it technically possible? I read that 3.x to 5.x is not possible, and throws IndexTooStale ex

Re: DirectoryReader.open(commit, termInfosIndexDivisor) has gone at 5.0 ?

2015-05-14 Thread Tomoko Uchida

t has index-time params to > further reduce how much RAM it is allowed to use for the terms index. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Thu, May 14, 2015 at 2:26 AM, Tomoko Uchida > wrote: > > Hi, > > > > A question about API cha

DirectoryReader.open(commit, termInfosIndexDivisor) has gone at 5.0 ?

2015-05-13 Thread Tomoko Uchida

Hi, A question about API changes in Lucene 5. The method "DirectoryReader.open(commit, termInfosIndexDivisor)" has been removed at Lucene 5.0. This functionality was completely removed? Backgrounds: I am trying to make Luke work with Lucene 5.1. Luke for Lucene 4.x supports "TermInfos index divis

Re: Luke for Lucene 5.x?

2015-04-27 Thread Tomoko Uchida

Hi, Currently Luke is not the part of Lucene, you should check Luke project in Github. https://github.com/DmitryKey/luke Here is the Issue for lucene 5. https://github.com/DmitryKey/luke/issues/21 I'm now trying to make Luke support Lucene 5.x in my local branch. Pull requests from other develop

Re: High frequency terms in results document....

2015-02-19 Thread Tomoko Uchida

> > regards. > > On Thu, Feb 19, 2015 at 2:10 AM, Tomoko Uchida < > tomoko.uchida.1...@gmail.com > > wrote: > > > Hi, > > > > I'm afraid there are no easy or straight way for your requirement. > > I would try create an temporary tiny in

Re: High frequency terms in results document....

2015-02-18 Thread Tomoko Uchida

Hi, I'm afraid there are no easy or straight way for your requirement. I would try create an temporary tiny index from search results on the fly in memory, and get top N terms from it by HighFreqTerms. http://lucene.apache.org/core/4_10_3/misc/org/apache/lucene/misc/HighFreqTerms.html (The logic i

Re: Filtering MoreLikeThis results

2015-01-09 Thread Tomoko Uchida

Hi, > find me the 10 most similar documents I suppose you mean "mlt.count" supported by MoreLikeThisComponent. https://cwiki.apache.org/confluence/display/solr/MoreLikeThis MLT is ordinary search in Lucene, so you get documents in order of similarity (default scoring criteria) and can limit resu

Re: AlreadyClosedException on new index

2015-01-06 Thread Tomoko Uchida

Hi, How often does this error occur? You do not tell the lucene version, but I guess you use lucene 3.x according to the stack trace... IndexWriter would not be closed until IndexWriter.close() method is called explicitly. https://github.com/apache/lucene-solr/blob/lucene_solr_3_6_2/lucene/core/sr

Re: Chinese sorting

2014-12-18 Thread Tomoko Uchida

: > > Hi Tomoko, > > does sorting with Locala.JAPANESE also work for Kanji. Since Hiragana and > Katakana are based on the phonetics, I guess it is easier to define a > sorting order. But Kanji is more similar to the Chinese. > > Thanks, > Nils > > > On 17.12.2014

Re: Chinese sorting

2014-12-17 Thread Tomoko Uchida

Hi, Nils, I don't know Chinese at all... but collation is very important in Japanese too. Lucene has org.apache.lucene.collation package that use ICU4J's collators (you can find "lucene-analyzers-icu-4.10.2.jar" in analysis/icu directory). http://lucene.apache.org/core/4_10_2/analyzers-icu/index.h

Re: o.a.l.u.fst package's sample code might be outdated?

2014-12-13 Thread Tomoko Uchida

eami.com/blog/comparing-document-classification-functions-of- > lucene-and-mahout.html > > > (2014/12/14 11:11), Tomoko Uchida wrote: > >> Sorry again, >> I checked the o.a.l.u.fst.TestFSTs.java, and found this modification seems >> to be OK. >> >> So, I chang

Re: o.a.l.u.fst package's sample code might be outdated?

2014-12-13 Thread Tomoko Uchida

someone correct it. Regards, Tomoko 2014-12-11 0:34 GMT+09:00 Tomoko Uchida : > > Sorry, I paste a wrong link... > > > I run the "FST Construction example:" code in package.html with lucene > 4.10.2, and saw a compile error. > > http://lucene.apache.org/core

Re: Configuration

2014-12-10 Thread Tomoko Uchida

Hi Christopher, At my environment, IndexFiles demo works fine. May be you must give directory path including source data you want to index after '-docs'. For example, $ ls /path/to/source/dir aaa.txt bbb.txt $ java -cp "demo/lucene-demo-4.10.2.jar:analysis/common/lucene-analyzers-common-4.10.2

Re: o.a.l.u.fst package's sample code might be outdated?

2014-12-10 Thread Tomoko Uchida

2, and saw a compile error. http://lucene.apache.org/core/4_10_2/core/index.html?org/apache/lucene/util/fst/package-summary.html 2014-12-11 0:32 GMT+09:00 Tomoko Uchida : > Hi, > > I run the "FST Construction example:" code in package.html with lucene > 4.10.2, and saw a compi

o.a.l.u.fst package's sample code might be outdated?

2014-12-10 Thread Tomoko Uchida

Hi, I run the "FST Construction example:" code in package.html with lucene 4.10.2, and saw a compile error. http://lucene.apache.org/core/4_10_2/core/index.html Error ocurred at this line. builder.add(Util.toIntsRef(scratchBytes, scratchInts), outputValues[i]); Consulting with Javadoc, I found U

85 matches

Mail list logo