Re: Persistence/Serialization of Automaton

2016-03-24 Thread McKinley, James T
Here's an archive link from this mailing list regarding serializing queries, I guess this would work for Automaton objects as well. http://mail-archives.apache.org/mod_mbox/lucene-java-user/201603.mbox/browser Hope it helps. Jim From: José Tomás Atria S

Re: Serializing Queries

2016-03-18 Thread McKinley, James T
We use Kryo to pass query objects between hosts: https://github.com/EsotericSoftware/kryo We initially had some trouble with it creating dynamic classes and running out of PermGen space but we got around that using an ObjectPool: http://commons.apache.org/proper/commons-pool/api-1.6/org/apache/

Re: Scoring over Multiple Indexes

2015-10-22 Thread McKinley, James T
Hi Scott, I don't know your reasons for splitting your index up, but assuming you want to do that and then merge the search results back together I think you could re-unify the term document frequencies across all your indexes and then extend IndexSearcher and override termStatistics and collec

Re: Lucene 5 : any merge performance metrics compared to 4.x?

2015-09-30 Thread McKinley, James T
ks? Meaning Adrien's comment is a > whole lot significant? Merges ALWAYS pre-merge CheckIntegrity? Is this > a 5.0 feature drop? You can't deprecate, um, er totally remove an > index time audit feature on a point release of any level IMHO. > > > -Original Message- &g

Re: Lucene 5 : any merge performance metrics compared to 4.x?

2015-09-29 Thread McKinley, James T
compared to 4.x? This sounds robust. Is the index batch creation workflow a separate process? Distributed shared filesystems? --will -Original Message- From: McKinley, James T [mailto:james.mckin...@cengage.com] Sent: Tuesday, September 29, 2015 2:22 PM To: java-user@lucene.apache.org

Re: Lucene 5 : any merge performance metrics compared to 4.x?

2015-09-29 Thread McKinley, James T
Hi Adrien and Will, Thanks for your responses. I work with Selva and he's busy right now with other things, so I'll add some more context to his question in an attempt to improve clarity. The merge in question is part of our batch indexing workflow wherein we index new content for a given par

Re: How to index & search arrays of double?

2015-08-06 Thread McKinley, James T
Hi Stan, I played around with LIRE a couple years ago. I don't know exactly how it works, but it doesn't just use Lucene from what I remember, it has its own classes built around Lucene to perform the image search. There used to be a PDF of a paper on the site, but I couldn't find a link when

Re: Lucene Searcher Caching and Performance

2015-08-04 Thread McKinley, James T
Hi Clive, We essentially do what you're suggesting, namely we create a single index searcher (as well as the directory reader it uses) on each partition that is shared amongst all threads. We also perform various index operations (searching, browsing terms etc.) for a while to "warm up" Lucene

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-12 Thread McKinley, James T
are away users, because I don't want them having index corruption. I am sick of people asking "but isnt it fine on the latest version" and so on. It is not. On Wed, Feb 11, 2015 at 11:41 AM, McKinley, James T wrote: > Hi, > > A couple mailing list members have brought t

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-11 Thread McKinley, James T
ng G1GC with the 64-bit JVM given that it has better performance on large heaps which are becoming more common today. FWIW, Jim From: McKinley, James T [james.mckin...@cengage.com] Sent: Monday, February 09, 2015 11:00 AM To: java-user@lucene.apache.org Subject: RE:

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-09 Thread McKinley, James T
course, is critically important to Solr so from that perspective it is about Solr too. https://wiki.apache.org/lucene-java/JavaBugs And, I assume, it also applies to your custom app. FWIW, Erick On Fri, Feb 6, 2015 at 12:10 PM, McKinley, James T wrote: > Just to be clear in case there wa

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-02-06 Thread McKinley, James T
/lucene-java/JavaBugs? Regards Piotr On Tue, Jan 27, 2015 at 9:55 PM, McKinley, James T < james.mckin...@cengage.com> wrote: > Hi Uwe, > > OK, thanks for the info. We'll see if we can download the Lucene test > suite and check it out. > > FWIW, we use G1GC in our pro

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-01-27 Thread McKinley, James T
h_these_settings.html) In fact, the problems with G1GC can sometimes lead to index corruption, and are hard to reproduce. So better don't use... Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- &

RE: Lucene Version Upgrade (3->4) and Java JVM Versions(6->8)

2015-01-27 Thread McKinley, James T
Why do you say not to use G1GC? We are using Java 7 & G1GC with Lucene 4.8.1 in production. Thanks. Jim From: Uwe Schindler [u...@thetaphi.de] Sent: Tuesday, January 27, 2015 2:49 PM To: java-user@lucene.apache.org; 'kiwi clive' Subject: RE: Lucene Versi

RE: ToChildBlockJoinQuery question

2015-01-22 Thread McKinley, James T
r) -Mike On 1/22/15 11:27 AM, McKinley, James T wrote: > Hi Greg, > > Thanks describing how block join queries were intended to work. Your > description makes sense to me, however according to the API docs: > > http://lucene.apache.org/core/4_8_0/join/org/apache/lucene/search/join/ToC

RE: ToChildBlockJoinQuery question

2015-01-22 Thread McKinley, James T
ument. A child can't be its own parent. BlockJoin is a very powerful feature, but what it's really doing is modelling relationships using an index that doesn't know what a relationship is. The relationships are determined by a combination of the order that you indexed the block,

RE: ToChildBlockJoinQuery question

2015-01-21 Thread McKinley, James T
in query, but the mailing list rejected my message, if there's a better place to send/upload this index let me know and I surely will. Thanks again for any help. Jim ____ From: Gregory Dearing [gregdear...@gmail.com] Sent: Wednesday, January 21, 2015 1:01 PM To: java-user@lucene.ap

ToChildBlockJoinQuery question

2015-01-21 Thread McKinley, James T
Hi, I'm attempting to use ToChildBlockJoinQuery in Lucene 4.8.1 by following Mike McCandless' blog post: http://blog.mikemccandless.com/2012/01/searching-relational-content-with.html I have a set of child documents which are named works and a set of parent documents which are named persons tha

Re: Doubt Lucene

2014-09-16 Thread James Chirinos Pinedo
Hello, I have a problem when i search with lucene, but i dont know if this is a problem of my Index or my Search. At Index, I have like parameter a list of data of an individual(the list is with data of my Ontology Individuals) The list has values like "Name:PeterRooney", "Age:25", "From:NY" if

Doubt Lucene

2014-09-16 Thread James Chirinos Pinedo
Hello, I have a problem when i search with lucene, but i dont know if this is a problem of my Index or my Search. At Index, I have like parameter a list of data of an individual The list has values like "Name:PeterRooney", "Age:25", "From:NY" if the individual is a person but if the individual i

Re: Optimizing Filters

2013-10-16 Thread James Clarke
filters is slow? > > > -- > Ian. > > > On Thu, Oct 10, 2013 at 7:01 PM, James Clarke wrote: >> Are there any best practices for constructing Filters to search efficiently? >> From my non-exhaustive experiments I cannot intuit how to construct my >> filters

Optimizing Filters

2013-10-10 Thread James Clarke
advice on optimizing these filters would be appreciated! James - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Equivalent LatLongDistanceFilter in Lucene 4.4 API

2013-09-24 Thread james bond
Hi All, Can you please let me know if there is an equivalent of LatLongDistanceFilter in Lucene 4.4 API. This API was present in Lucene 3.6 API. I have to mainly compute whether a point(lat,lang) is present at a distance d from another point(lat,lang). I have checked different classes from the s

RE: Handling a closed IndexWriter in Solr

2013-03-13 Thread Dyer, James
your solr-related question by posting to the solr-user list rather than the java-user list. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Danzig, Scott [mailto:scott.dan...@nymag.com] Sent: Wednesday, March 13, 2013 11:03 AM To: java-user@lucene.apache.org Subject:

RE: possible bug on Spellchecker

2013-02-21 Thread Dyer, James
Samuel, Do you think you could write a failing unit test and open a JIRA issue? Or at the least open a JIRA issue with all the details without a test? James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Samuel García Martínez [mailto:samuelgmarti...@gmail.com

search influenced by token attributes

2012-11-15 Thread Masanz, James J.
ed through these means. Is this the same thing as flexible indexing, or are we missing something fundamental? Regards, James Masanz - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: questions on PerFieldSimilarityWrapper

2012-11-09 Thread James Clarke
? Improvement? New Feature? Thanks, James Clarke Basis Technology Corp. On Fri, Nov 9, 2012 at 6:08 AM, Ian Lea wrote: > Feels a bit of a hack, but you might be able to make it work by > storing the field name when MyPerFieldxxx.get(name) is called and > using that in MyPerFieldxxx.qu

Re: FilteredQuery returning entire index

2012-07-17 Thread James Nolan
filtering during this step. I am certain there is a better way, but this one does work. Thanks for the help getting me here. Jim Nolan On Fri, Jul 13, 2012 at 4:02 PM, James Nolan wrote: > Thanks for the heads up. > > I was using org.apache.lucene.search.FilteredQuery which says it applies a

Re: FilteredQuery returning entire index

2012-07-13 Thread James Nolan
- > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: James Nolan [mailto:nolansolrlistse...@gmail.com] > > Sent: Friday, July 13, 2012 7:26 PM > > T

Use of enablePositionIncrements in StopFilter and QueryParser

2011-09-08 Thread James Faulkner
own investigations I believe that the only time a conflict will occur is if enablePositionIncrements in Stopfilter is true while enablePositionIncrements in QueryParser is false. Would I be correct in this assumption? Thanks for the help, James

Re: about pdf search

2011-03-07 Thread James Wilson
ell. Oh almost forgot -- I use PDFBox to extract the images from the PDFs. James THX -- James J. Wilson II Systems Engineer U.S. District Court District of New Mexico 333 Lomas Blvd., NW Albuquerque, NM 87102 Phone: (505) 348-2081 Fax:

Re: Question about QueryParser

2008-10-23 Thread James liu
thks steve, i get it. 2008/10/24 Steven A Rowe <[EMAIL PROTECTED]> > Hi James, > > On 10/23/2008 at 8:30 AM, James liu wrote: > > public class AnalyzerTest { > >@Test > >public void test() throws ParseException { > >QueryParser parser =

Question about QueryParser

2008-10-23 Thread James liu
public class AnalyzerTest { @Test public void test() throws ParseException { QueryParser parser = new MultiFieldQueryParser(new String[]{"title", "body"}, new StandardAnalyzer()); Query query1 = parser.parse("中文"); Query query2 = parser.parse("中 文"); System.out.pri

RE: anyone

2007-12-19 Thread Hartrich, James CTR USTRANSCOM J6
--JP On Dec 19, 2007 1:14 PM, Michael McCandless <[EMAIL PROTECTED]> wrote: > > Yes. > > Mike > > Hartrich, James CTR USTRANSCOM J6 wrote: > > > Is this at least a semi-active list? > > > > James > > > > >

anyone

2007-12-19 Thread Hartrich, James CTR USTRANSCOM J6
Is this at least a semi-active list? James

Lucene and Eastern languages (Japanese, Korean and Chinese)

2007-07-24 Thread Shaw, James
Hi, guys, I found Analyzers for Japanese, Korean and Chinese, but not stemmers; the Snowball stemmers only include European languages. Does stemming not make sense for ideograph-based languages (i.e., no stemming is needed for Japanese, Korean and Chinese)? Also for spell checking, does the defau

Re: IndexWriter.Optimize() is too slow and IOException! How Can I do?

2007-06-06 Thread James liu
U should split your index. large index will cause query speed, optimize speed when u index. 2007/6/7, 童小军 <[EMAIL PROTECTED]>: my indexfile to 8G when I optionze() the index.program is too slow . and some time IOException. And use TOO memory . xiaojun tong 010-64489518-613 [EMAIL PROTEC

Re: times of match in a document

2007-05-27 Thread James liu
sorry,,i not see it clearly match in a document. 2007/5/28, James liu <[EMAIL PROTECTED]>: maybe solr facet fit for u .http://wiki.apache.org/solr/SolrFacetingOverview 2007/5/28, Anny Bridge <[EMAIL PROTECTED]>: > > Hi, > > I use lucene in my project and it works w

Re: times of match in a document

2007-05-27 Thread James liu
maybe solr facet fit for u.http://wiki.apache.org/solr/SolrFacetingOverview 2007/5/28, Anny Bridge <[EMAIL PROTECTED]>: Hi, I use lucene in my project and it works well. Now I hope that the search result presenting to the user include the times of the keyword match in a document. Is there so

Re: Command line search tool using Lucene (targeted for Site Search)

2007-05-20 Thread James liu
it seems not quick. http://demo1.minalyzer.com/minalyzerlite/search4.php?q=test&offset=0 Results 1 - 15 of 16 for test.(1.586 seconds) 2007/5/20, Saurabh Dani <[EMAIL PROTECTED]>: Greetings All, We would like to introduce our java lucene based command line search tool, Minalyzer Lite.

date window

2007-05-17 Thread James O'Rourke
Also, note that there are multiple windows here for a single document, we can't just search between min start and max end. Ideas from those more familiar with lucene would be greatly appreciated. James - To unsubsc

Re: QueryFilter usage

2007-05-15 Thread James Rhodes
unsubscribe

RE: customizing index file name

2007-05-04 Thread Shaw, James
Sent: Thursday, May 03, 2007 6:25 PM To: java-user@lucene.apache.org Subject: Re: customizing index file name Oh, fix as in make constant, not fix as in broken ... No, I don't know of any way to do this. Can your installer just pack up everything in a directory? Erick On 5/3/07, Shaw, Jam

RE: customizing index file name

2007-05-03 Thread Shaw, James
hing with it at all. What behavior are you observing that you want to change and why? Erick On 5/3/07, Shaw, James <[EMAIL PROTECTED]> wrote: > > Does anyone know how to fix the .cfs file name in an index directory? > The deletable and segments file names are always the same, but we

customizing index file name

2007-05-03 Thread Shaw, James
Does anyone know how to fix the .cfs file name in an index directory? The deletable and segments file names are always the same, but we have observed that the .cfs file name changes each time you index a content directory with some changes to the directory (some deleted files, added files, etc). H

Re: DBSight Turns Free! Instant Lucene Search on Database!

2007-04-24 Thread James liu
Does it mean DBSight is Free? 2007/4/24, Chris Lu <[EMAIL PROTECTED]>: For those who may be interested, DBSight 1.4.0 now has unlimited index size with Free version! Basically DBSight is more like SOLR + database adapter. You just point it with one or several SQLs to any database, and you can

Re: How to use StandardAnalyzer

2007-03-16 Thread James liu
You can read demo source code from lucene source package. 2007/3/16, [EMAIL PROTECTED] <[EMAIL PROTECTED]>: Hi, I am new to Lucene Java API. I want to use StandardAnalyzer for tokenizing my document. How can I use it? Further how can I index "Acronym" and "Company name" as one term.

Re: Huge Index

2007-01-11 Thread James Rhodes
I used the following settings for speeding up indexing on a similarly sized db table If you have enough ram it might help you. IndexWriter writer = *new* IndexWriter(fdDir,*new* StandardAnalyzer(), *true *); writer.setMergeFactor(100); writer.setMaxMergeDocs(99); writer.setMaxBufferedDocs(

Re: 2.0 and Tokenized versus UN_TOKENIZED

2006-11-05 Thread James Rhodes
by default) if you want it to work iwthout tokenizing, you need to use something like hte PerFieldAnalyzerWrapper, and the KeywordAnalyzer for hte city field ... the KeywordAnalyzer at query time will leave the query text untokenized so it can match the untokenized value you indexed. : Date: Sat, 4

Re: Re: lucene and web services?

2006-11-04 Thread James Rhodes
plating like Velocity. -- Chris Lu - Instant Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com On 11/4/06, James Rhodes <[EMAIL PROTECTED]> wrote: > Has anyone successfully implemented a web services front end

Re: 2.0 and Tokenized versus UN_TOKENIZED

2006-11-04 Thread James Rhodes
on't break the query places you don't expect. But if you index Austin UN_TOKENZED, then search for it using, say StandardAnalyzer, it'll look for austin and they won't match. This may apply to Luke too. In Luke, you can choose a different analyzer (WhitespaceAnalyzer comes to

lucene and web services?

2006-11-04 Thread James Rhodes
Has anyone successfully implemented a web services front end to remotely search a Lucene index? I've tried to do it with the Xfire stuff in MyEclipse, but their default Aegis xml mapping stuff doesn't support the Lucene Hits object. I'd like to avoid searching a remote index via RMI, but for now i

2.0 and Tokenized versus UN_TOKENIZED

2006-11-04 Thread James Rhodes
I'm using the 2.0 branch and I've had issues with searching indexes where the fields aren't tokenized. For instance, my index consists of count,lastname,city,state and I used the following code to index it (the data is in a sql server db): * if*(count != 0) { doc.add(*new* Field("count", NumberU

highlighting with WildcardQuery

2006-10-13 Thread James O'Rourke
ter(fragmentLength) highlighter.setTextFragmenter(fragmenter) tokenStream = analyzer.tokenStream('f', StringReader(text)) return highlighter.getBestFragments(tokenStream, text, numFragments) Basically, I want to show partial word matches also. James -

Re: Problem searching

2006-09-28 Thread James O'Rourke
Wow, thanks, just found a significant bug - thought I was using StandardAnalyzer throughout but was not!! ouch! Nasty typo. Thanks again. James On Sep 28, 2006, at 5:57 PM, Erick Erickson wrote: Sure, what analyzer are you using? Both for indexing and parsing the query? I'll bet

Problem searching

2006-09-28 Thread James O'Rourke
Query q = parser.parse("jorourke burmilla") does return the document!! Anyone have any ideas? James PS. The code is a rough translation but you get the idea. For some reason I think the index doesn't like "."'s in words. -

Re: which way to index pdf,word,excel

2006-09-05 Thread James liu
thk,,,Cohen and lin. 2006/9/6, Doron Cohen <[EMAIL PROTECTED]>: I think that Nutch would crawl and search all these 3 types. Not sure that Nutch would provide the framework you seem to look for, but perhaps it is worth to take a look - http://lucene.apache.org/nutch/ "James l

Re: which way to index pdf,word,excel

2006-09-05 Thread James liu
u must have knowledge how to extract information from them and what kind of keys they need to be set. Then you can do what you want to. On 9/5/06, James liu <[EMAIL PROTECTED]> wrote: > > i wanna find frame which can index xml,word,excel,pdf,,,not one. > > > 2006/9/6, Do

Re: which way to index pdf,word,excel

2006-09-05 Thread James liu
files? "James liu" <[EMAIL PROTECTED]> wrote on 05/09/2006 19:14:24: > i find lius many question so i wanna give up and find new. > > who recommend ? - To unsubscribe, e-mail: [EMAIL PROTECTED]

which way to index pdf,word,excel

2006-09-05 Thread James liu
i find lius many question so i wanna give up and find new. who recommend ?

Anyone know which way lucene read index file

2006-09-04 Thread James liu
After reading 'lucene in action', i know the format of indexfile. which way lucene read index file? line by line? I m very interesting.

how to write this regexps?

2006-08-29 Thread James liu
i wanna index html,,,but it have image,flash,javascript, and i wanna make index quick,, but i don't know how to get textmode content,,, anyone can help me?

Re: Can lucene query this result?

2006-07-19 Thread James liu
You misunderstand. This sql just tell what i wanna do. i have five user_group and i wanna group result which lucene do. 2006/7/20, Otis Gospodnetic <[EMAIL PROTECTED]>: No. Lucene is not a relational database and doesn't speak SQL. Otis - Original Message From: James

Can lucene query this result?

2006-07-19 Thread James liu
for example: $sql = "select count(*), user_group from groups where uid>0 group by user_group; can lucene query this result?

how to use BNF

2006-07-16 Thread James liu
i wanna use "+","-","and","or","not" "*"

Re: is it wrong with my code?

2006-07-15 Thread James liu
who know? 2006/7/15, James liu <[EMAIL PROTECTED]>: yes,it can do...but now work for chinese search. english search is ok. and how can i fix it? 2006/7/15, Doron Cohen <[EMAIL PROTECTED]>: > > Hits hits = searcher.search(qp.Query(queryStr)); > > I think it should

Re: is it wrong with my code?

2006-07-14 Thread James liu
yes,it can do...but now work for chinese search. english search is ok. and how can i fix it? 2006/7/15, Doron Cohen <[EMAIL PROTECTED]>: > Hits hits = searcher.search(qp.Query(queryStr)); I think it should be qp.parse(String query) (rather than qp.Query(String field))

is it wrong with my code?

2006-07-14 Thread James liu
i can compile it correctly.but it can not search anything. i can't find error...so wanna u help me? package kk; import java.io.*; import java.util.Date; //import org.apache.lucene.analysis.standard.StandardAnalyzer; import org.apache.lucene.analysis.cjk.CJKAnalyzer; import org.apache.lucene.d

SortComparatorSources and ScoreDocComparators

2006-07-11 Thread James Pine
ctor in my search as opposed to a search method which returns a Hits object. Thanx. JAMES __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com

Re: General Approach: Analyzer versus Query

2006-07-10 Thread James Pine
Would Lucene's FuzzyQuery be useful in this case? I suppose it would depend on how meaningful the sequences of numbers are. http://lucene.apache.org/java/docs/api/org/apache/lucene/search/FuzzyQuery.html --- Chris Hostetter <[EMAIL PROTECTED]> wrote: > > : I could (1) up front, put in both vers

Re: FileDocument : cannot resolve symbol

2006-07-10 Thread James liu
i fix it...when i "import org.apache.lucene.demo.FileDocument;" and thk u for ur answer. 2006/7/10, Chris Hostetter <[EMAIL PROTECTED]>: : when i try javac Package: org.apache.lucene.demo; : name is IndexFiles.java : : it show me : FileDocument ,error info : cannot resolve symbol : : : how c

FileDocument : cannot resolve symbol

2006-07-08 Thread James liu
when i try javac Package: org.apache.lucene.demo; name is IndexFiles.java it show me : FileDocument ,error info : cannot resolve symbol how can i compile it?

RE: Managing a large archival (and constantly changing) database

2006-07-07 Thread James Pine
txt will contain "update"; however if the lucene internals do this: mkdir x echo original > x/x.txt cp -lr x x.copy --> rm x/x.txt echo update > x/x.txt diff x/x.txt x.copy/x.txt Then x/x.txt will have a different inode from x.copy/x.txt and thei

RE: Managing a large archival (and constantly changing) database

2006-07-06 Thread James Pine
a little extra maintenance overhead. JAMES --- Larry Ogrodnek <[EMAIL PROTECTED]> wrote: > We have a similar setup, although probably only > 1/5th the number of > documents and updates. I'd suggest just making > periodic index backups. > > I've been storin

Re: BitSet in a HitCollector

2006-07-06 Thread James Pine
constructor, before calling the collect method, or calculate the total result count/document id set in the collect method and then execute some sort of post-collect method, right? So I was just wondering which method was better/faster. Thanx. JAMES --- Chris Hostetter <[EMAIL PROTECTED]>

BitSet in a HitCollector

2006-07-05 Thread James Pine
r the index in order to populate the BitSet and then doing another pass while calling the collect method? Thanx. JAMES __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yaho

HitCollector and Sort Objects

2006-06-29 Thread James Pine
e cache hit ratio is very low). If someone thinks my use of HitCollector is flawed, I'm certainly open to suggestions. Thanx. JAMES __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection arou

Re: Searching is taking a lot...

2006-06-29 Thread James Pine
eve is slightly more efficient. JAMES --- "heritrix.lucene" <[EMAIL PROTECTED]> wrote: > This will break performance. It is better to first > collect all the document > > numbers (code without the proper declarations): > > > > public void collect(int id,

Re: Searching is taking a lot...

2006-06-28 Thread James Pine
new MyHitCollector(yourSearcher,500,20); yourSearcher.search(yourQuery,hc); List results = hc.getDocuments(); Now results should contain the 1001st-1020th results. JAMES --- "heritrix.lucene" <[EMAIL PROTECTED]> wrote: > I am using Hits object to collect all documents. > Let me

Re: batch indexing using RAMDirectory

2006-06-28 Thread James Pine
Hey Eric, I think you want: fsWriter.addIndexes(Directory[] {ramDir}); to be: fsWriter.addIndexes(new Directory[]{ramDir}); JAMES --- zheng <[EMAIL PROTECTED]> wrote: > I am a novice in lucene. I write some code to do > batch indexing using > RAMDirectory according to the c

Re: termquery beginners question

2006-06-26 Thread James Pine
completeQuery.add(userEntered,true,false); completeQuery.add(otherQuery,true,false); Then just use completeQuery in your search. Hope I understood your issue correctly. Also as a note, the syntax above will work in Lucene 1.4.3, but it looks like BooleanQuery.add() has changed in 2.0.0. JAMES ---

Re: Can't open index

2006-06-22 Thread James Pine
ing the indexing process that cause a file not found error? JAMES --- WATHELET Thomas <[EMAIL PROTECTED]> wrote: > I'm creating my index file and in the same time I > try to do some > searches inside. > Sometimes I retrieve this error message: > "\\tradluxstmp01\Ja

Re: faceting and categorizing on color?

2006-06-20 Thread James Pine
t's say 25 of the documents have FF in the color field) and I do a search for FF...using a HitCollector I'm iterating over all 100 documents while extracting values whereas with the regular Hits based search I would only be iterating over 25?

faceting and categorizing on color?

2006-06-16 Thread James Pine
FFF00 7-F0 1-FF 2-E0 4-EE 8-EEE000 5-00 PIXELS 1 FORMAT JPG That would be interpreted as FF is the most dominant of the top 10 most dominant colors and F0 is the least. Would that be a terrible document structure? Thanx in advance, especially for wading t

Problem using SpellChecker with run time strings

2006-05-23 Thread James Maes
OS: Linux (Suse 9.2) Java SDK Version: 1.5.6 Lucene Version: 1.9.1 Installed Jars: lucene-core-1.9.1.jar lucene-snowball-1.9.1.jar lucene-spellchecker-1.9.1.jar Thanks -JM James Maes Software Architect

Alternate Use of Lucene

2006-03-10 Thread James Cook
Is there a technique, using Lucene, to retrieve the remaining index values from the results of a query? Here is a better explanation of the question. I have various fields that have been indexed on a particular collection of Java POJO's. For example, an Employee object may have the following fiel

Re: thinking about the pronunciation of lucene

2005-09-23 Thread James Huang
I'm having trouble figuring that out ... I pronounced "lu-CENE", and I've heard people who didn't know it pronounce the same way. But, I got "corrected" into "LOO-sen" one time. Any concensus? --- [EMAIL PROTECTED] wrote: > Thank you for your answer! > I know the people in Germany pronounce it

Re: Example ....Lucene with Java Server Pages?

2005-09-21 Thread James Huang
ially new index.) Perhaps a special HTTP command to trigger that? I don't know. -James --- Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > James, > > Well, if you want to do things right, then you > should probably also get > that IndexSearcher (or any other Java code)

Re: Example ....Lucene with Java Server Pages?

2005-09-21 Thread James Huang
ll not complete sure if this is the best way, but at least not every new IndexSearcher per request. -James --- Daniel Naber <[EMAIL PROTECTED]> wrote: > On Wednesday 21 September 2005 19:21, Patricio > Galeas wrote: > > > I search for a JSP examples of a simple searc

Re: Sort by relevance+distance

2005-09-19 Thread James Huang
of this exercise -- scalability -- just want to make sure. Thanks, -James --- markharw00d <[EMAIL PROTECTED]> wrote: > Here's an example I put together to illustrate the > point. > > > package distance; > > import java.io.IOException; > impo

Re: Sort by relevance+distance

2005-09-19 Thread James Huang
I think this is probably the closest thing I like to/am able to do now. If I ever get to do this, I'll share the idea/code and seek review and suggestions. Thank you very much, Mark, and all others that have helped! -James mark harwood <[EMAIL PROTECTED]> wrote: I think the H

Re: Sort by relevance+distance

2005-09-18 Thread James Huang
--- Jeff Rodenburg <[EMAIL PROTECTED]> wrote: > trimming the post further: > > On 9/18/05, James Huang <[EMAIL PROTECTED]> wrote: > > > > >The problem is quite generic, I believe. What I > like to do is similar to > > LIA-ch6, i.e. to find a "

Re: Sort by relevance+distance

2005-09-18 Thread James Huang
See comments below. --- Erik Hatcher <[EMAIL PROTECTED]> wrote: > [trimming the post a bit] > > On Sep 18, 2005, at 11:51 AM, James Huang wrote: > > The problem is quite generic, I believe. What I > like > > to do is similar to LIA-ch6, i.e. to find a "good

Re: Sort by relevance+distance

2005-09-18 Thread James Huang
--- Erik Hatcher <[EMAIL PROTECTED]> wrote: > > On Sep 18, 2005, at 10:24 AM, James Huang wrote: > > > --- Erik Hatcher <[EMAIL PROTECTED]> > wrote: > > > > > >> Get back to using your DistanceComparatorSource, > and > &g

Re: Sort by relevance+distance

2005-09-18 Thread James Huang
ew DistanceComparatorSource( you need>)), > SortField.FIELD_SCORE}); Thanks! Does the order of thest two fields matter? I mean, with your code, would distance take precedence over relevance? Anyway, I'll try it out and play with ordering and such. -James > > Erik > _

Re: Sort by relevance+distance

2005-09-17 Thread James Huang
rmance (and cleaner code). Previously, I have created a DistanceComparatorSource (similar to that in LIA-ch6); sorting by distance works but relevance is lost. -James --- Erik Hatcher <[EMAIL PROTECTED]> wrote: > > On Sep 17, 2005, at 4:10 PM, James Huang wrote: > > > Hi, > >

Re: Sort by relevance+distance

2005-09-17 Thread James Huang
I guess I can use HitCollector and implement my own sorting, right? Is there a better approach? --- James Huang <[EMAIL PROTECTED]> wrote: > Hi, > > I can sort the search results by distance now. But, > the relevance is lost. > > I like to have the results sorted b

Sort by relevance+distance

2005-09-17 Thread James Huang
Hi, I can sort the search results by distance now. But, the relevance is lost. I like to have the results sorted by relevance + distance, i.e., relevance first; for results of similar relevance, order by distance. How to do that? Thanks a lot in advance! -James --- James Huang <[EM

Re: Question: force a field must be matched?

2005-09-15 Thread James Huang
Yes, "+" is what I missed! Thanks. Suppose there is a book published by 3 publishers (I don't know how that works in real world): // At index time: doc.add( Field.Keyword("publisher", "Manning") ); doc.add( Field.Keyword("publisher", "SAMS") ); doc.add( Field.Keyword("publisher", "O'Reilly"

Re: Question: dynamic sorting

2005-09-15 Thread James Huang
d 747, the closest first, please". I'll re-read the book/chapter tonight, but look forward to any expert advises. Thanks, -James --- Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Hi James, > > Check out the org.apache.lucene.search.package, > there are several sort >

  1   2   >