Hi everybody,
I have always used Lucene with its default configuration values. Now I want to
know whether there is any document that specifies initial values for
configuring a Lucene project (values for Lucene, the app server and the JDK).
If anyone has experience with achieving the best performance of Lucene
ul Elschot <[EMAIL PROTECTED]> wrote:
>
> On Thursday 20 September 2007 09:19, Mohammad Norouzi wrote:
> > well, you mean we should separate documents just like relational tables
> in
> > databases ?
>
> Quite the contrary, it's called _de_normalization. This means
Well, do you mean we should separate documents just like relational tables in
databases?
If yes, how do we make the relationship between those documents?
Thank you so much, Paul.
On 9/20/07, Paul Elschot <[EMAIL PROTECTED]> wrote:
>
> On Thursday 20 September 2007 07:29, Mohammad N
incorrect because patient 1 has no service
with name service2:
patient-id  service_name  patient_result
1           s1            12
1           s3            13
On 9/20/07, Mohammad Norou
Hi Paul,
Would you tell me what the difference is between AND and +?
I tried both but get different results:
with AND I get 1777 documents and with + I get nearly 25000.
On 9/17/07, Paul Elschot <[EMAIL PROTECTED]> wrote:
>
> On Monday 17 September 2007 11:40, Mohammad Norouzi w
Hi
You may put two fields in your document, one containing the decoded values and
the other the original values, but you still need to implement your own query
builder, so that while searching you specify the decoded field and you show
the original field to the user.
On 9/18/07, [EMAIL PROTECTED] <[EMAIL PROTECTE
Hi
I have a problem getting correct results from Lucene. Consider an
index containing documents with fields "field1" and "field2" etc. Now I want
to find documents whose field1 values are equal to one another and whose
field2 holds two different values.
To clarify, consider this query:
or not, or it
is on their to-do list or they are going to provide it in the future.
Thank you, all Lucene developers and producers.
--
Regards,
Mohammad Norouzi
--
see my blog: http://brainable.blogspot.com/
another in Persian: http://fekre-motefavet.blogspot.com/
n the Searchable interface.
>
> So, you can either cast to a known Searcher or I suppose you can
> figure out a way to get the IndexReader. What kind of Searcher are
> you using?
>
> -Grant
>
> On Sep 13, 2007, at 4:50 AM, Mohammad Norouzi wrote:
>
> > Thanks
> >
Actually, I show the results with pagination support, and users have the option to
choose the number of records per page. And yes, I should provide a test
program. As for the HitCollector, I have already created one; it collects all of
Lucene's document ids and also the ID I need, which is stored in the index.
>> yo
mend
> that you measure to find out where the problem is before trying
> a solution. Otherwise you'll optimize code that isn't the problem.
>
> Best
> Erick
>
> On 9/13/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
> >
> > Thanks
> > a
he peculiar characteristics of my data set, I really suspect
> that a 10x improvement in retrieval speed is atypical...
>
> As for when lazily-loaded fields actually get loaded, I didn't really
> have to explore it very fully, but a short experiment should do it
> for you
on Europe presentation http://cnlp.org/presentations/
> slides/AdvancedLuceneEU.pdf for a few slides (towards the end) on
> FieldSelector.
>
> On Sep 12, 2007, at 5:13 AM, Mohammad Norouzi wrote:
>
> > Hi all,
> >
> > Can anyone explain what is the FieldS
Hi all,
Can anyone explain what the FieldSelector is and what the usage or benefits of
this structure are? I read the javadocs but I can't see what purpose it
serves in Lucene.
Thanks in advance
--
Regards,
Mohammad
--
see my blog: http://brainable.blogspot.com/
another in
ng code, Context and Dispatcher are parts of interceptor
> pattern
> in which I change the given values if they are number and has nothing to
> do
> with queries with string values"
>
> and been totally wrong .
>
> Best
> Erick
>
> On 8/18/07, Mohammad Norouzi <
141 Ca. Oxalate:many
> > 394321 Ca. Oxalate:few
> > 397671 Ca. Oxalate:nod
> > 387549 Ca. Oxalate: mod
> >
> > however this is not the required result but another problem is when I put
> > patientResult:Oxalate or patientResult:Oxalate* no result will return!!!!
Let me tell you that I have extended MultiFieldQueryParser to override its
methods, and in the getFieldQuery(...) method I return a TermQuery.
I don't know what I did wrong.
On 8/15/07, karl wettin <[EMAIL PROTECTED]> wrote:
>
>
> 15 aug 2007 kl. 07.18 skrev Mohammad Norouzi:
Hi
I am using WhitespaceAnalyzer and the query is " icdCode:H* ", but there is
no result, although I know that there are many documents with this field value,
such as H20, H20.5 etc. This field is tokenized and indexed. What is
wrong with this?
When I test this query with Luke it returns no res
Thanks Erick, but unfortunately NumberTools works only with the long primitive
type. I am wondering why you didn't add methods for double and float.
On 8/13/07, Nilesh Bansal <[EMAIL PROTECTED]> wrote:
>
> Thanks. Probably this should be mentioned on the documentation page.
>
> -Nilesh
>
> On 8/12
You should not store them in an array structure, since they will take up too
much memory.
A BitSet is the best structure to store them in.
On 8/2/07, makkhar <[EMAIL PROTECTED]> wrote:
>
>
> Heres my index structure :
>
> Document -> contract ID -id (index AND store)
> -> paramName
Yes, it decreases performance, but it is the only solution.
I've spent many weeks looking for the best way to retrieve my own IDs and ended up
with this way as the last resort.
Now I am storing the IDs in a BitSet structure and it's fast enough:
public void collect(...){
idBitSet.set(Integer.valueOf(searcher.doc(id).get("MyOwnI
Yes, if you extend your class from HitCollector and override the collect()
method with the following signature you can get the IDs:
public void collect(int id, float score)
On 8/2/07, makkhar <[EMAIL PROTECTED]> wrote:
>
>
> Hi all,
>
>Can I get just a list of document Ids given a search criteria ? To
>
don't know why this happens. I am wondering whether it has something to do with
character encoding. Have you had such a problem?
thanks
--
Regards,
Mohammad Norouzi
--
see my blog: http://brainable.blogspot.com/
another in Persian: http://fekre-motefavet.blogspot.com/
ayzer and I don't think it needs to be customized.
> On 7/16/07, Dima May <[EMAIL PROTECTED]> wrote:
> >
> > Mohammad,
> >
> > see for my 2 cents below,
> >
> > Good luck.
> >
> > D
> >
> >
> > On 7/16/07, Mohamm
e lead developer is quick reacting.
M.
Mohammad Norouzi a écrit :
> Mathieu,
> I need an object mapper for lucene would you please give me the
> Compass web
> site? is it open source?
>
> thanks
ad + body.
>
> It will increase your index size, no doubt. Might increase indexing time too.
>
> --Renaud
>
>
> -Original Message-
> From: Mohammad Norouzi [mailto:[EMAIL PROTECTED]
> Sent: Sunday, July 15, 2007 9:40 PM
> To: java-user@lucene.apache.org
> Subject: Re
/16/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
>
> Hello
> I have problem in range queries, for example, I have queries like
> "field:[1
> TO 25]" or "field:[1.1 TO 11.25]"
> currently these queries not work. field:[20 TO 25] works fine but when
the
>
Hello
I have a problem with range queries. For example, I have queries like "field:[1
TO 25]" or "field:[1.1 TO 11.25]".
Currently these queries do not work. field:[20 TO 25] works fine, but when
the two limits of the range have a different number of digits the query won't
work. so the solution is NumberToo
On 7/14/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
I think he means index all your different fields into a single field
named "all". Not sure what makes it special, it is just like any
other field.
But that is really impossible, because I have nearly millions of records to be
indexed so this j
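Grant's suggestion of one catch-all field can be sketched in plain Java,
independent of any particular Lucene version: concatenate every field value of
a record into one string, which would then be indexed under one extra field
name such as "all". The class name CatchAllField and the sample field values
are hypothetical; only the field name "all" comes from the thread.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: builds the text for a catch-all field from a record's fields.
final class CatchAllField {
    // Joins all non-blank field values with single spaces, in iteration order.
    static String build(Map<String, String> fields) {
        StringBuilder sb = new StringBuilder();
        for (String value : fields.values()) {
            if (value == null || value.trim().isEmpty()) {
                continue; // skip missing or blank values
            }
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(value.trim());
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        Map<String, String> record = new LinkedHashMap<>();
        record.put("field1", "test phrase");
        record.put("field2", "test");
        // The result would be indexed as one more field, e.g. "all".
        System.out.println(build(record));
    }
}
```

The text is built once per record at indexing time, so the number of records
affects indexing cost rather than query cost. As Mathieu notes in this thread,
per-field boost values are lost with this approach.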
I've just tried "all" as a special field but it doesn't work! Please take a
look at my code snippet to see whether I made a mistake:
QueryParser parser = new QueryParser("All",getAnalyzer(1));
Query q = parser.parse(myQuery.getQueryString());
thanks again
On 7/14/07,
Thanks Mathieu
Do you mean using a query like "all:test" with QueryParser?
On 7/14/07, Mathieu Lecarme <[EMAIL PROTECTED]> wrote:
you can use the "all" special field, but you lose the different
boost values.
M.
Le 14 juil. 07 à 10:50, Mohammad Norouzi a écrit :
>
Hello all
Is there any way to search through all the fields without using
MultiFieldQueryParser? Currently I am using this parser, but it requires
passing all fields as an array to its parse method, and QueryParser will search
only the default field if no field name is specified.
Many thanks
Hi
Your IP is restricted and you are not allowed to download from that site.
They have blocked some countries from downloading because of sanctions.
Good Luck
On 6/30/07, Mahdi Rahimi <[EMAIL PROTECTED]> wrote:
Hi.
Thanks To Steven Rowe
I'm using wget right like your suggestion. But there is cer
Hi karl,
We did something like Hibernate to map an object (Entity) to Lucene by
defining a bunch of annotations, just like the Limax project (as far as I
know it is led by you).
The only problem we had was how to build relationships between two or more
separate indexes. I managed to resolve it but
Thanks Narednra and Daniel
But it accepts only long. I found LUCENE-530, which is meant to support double
and float etc.
Have you seen it?
https://issues.apache.org/jira/browse/LUCENE-530
Another question: do I have to use it while indexing? What about an index
that has already been created?
--
Regards,
Mohammad
Hello
Consider a query like this:
patientResult:[8 TO 9]
This returns the correct documents, but
patientResult:[8 TO 10]
returns nothing. Also, the range [0.1 TO 10] returns results like 11,
10.8, 10.9 etc.
Is there any trick or workaround to obtain correct results? I know
lucene compares
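The empty result for [8 TO 10] is what plain string comparison produces,
because "8" sorts after "10". A minimal sketch of the usual workaround follows:
encode every number as a fixed-width, zero-padded string so that string order
matches numeric order. This is not the encoding NumberTools uses; the class
PaddedNumber, the width of 12 digits and the scale of 2 decimal places are
assumptions made for illustration, and negative numbers are not handled.

```java
import java.math.BigDecimal;
import java.math.RoundingMode;

// Hypothetical helper: encodes non-negative numbers as fixed-width strings
// whose lexicographic order matches their numeric order.
final class PaddedNumber {
    private static final int WIDTH = 12; // assumed maximum number of digits
    private static final int SCALE = 2;  // assumed number of decimal places kept

    // "8" -> "000000000800", "10.8" -> "000000001080"
    static String encode(String number) {
        BigDecimal value = new BigDecimal(number).setScale(SCALE, RoundingMode.HALF_UP);
        if (value.signum() < 0) {
            throw new IllegalArgumentException("negative values are not handled in this sketch");
        }
        String digits = value.unscaledValue().toString();
        if (digits.length() > WIDTH) {
            throw new IllegalArgumentException("value too large: " + number);
        }
        StringBuilder sb = new StringBuilder();
        for (int i = digits.length(); i < WIDTH; i++) {
            sb.append('0');
        }
        return sb.append(digits).toString();
    }

    // Inclusive range test on the encoded strings, i.e. plain string comparison.
    static boolean inRange(String value, String lower, String upper) {
        String v = encode(value);
        return v.compareTo(encode(lower)) >= 0 && v.compareTo(encode(upper)) <= 0;
    }

    public static void main(String[] args) {
        System.out.println("8".compareTo("10") > 0);      // raw strings: is "8" after "10"?
        System.out.println(inRange("9", "8", "10"));      // encoded: is 9 inside [8 TO 10]?
        System.out.println(inRange("10.8", "0.1", "10")); // encoded: is 10.8 inside [0.1 TO 10]?
    }
}
```

Because the index stores terms as strings, the same encoding would have to be
applied both when indexing and when building the range query, so an index that
already holds unpadded values would presumably need to be rebuilt.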
Hi
You can get them from the searcher, if the documents in your index all have the same fields:
Enumeration fields = searcher.doc(0).fields();
On 6/5/07, Stadler Hans-Christian <[EMAIL PROTECTED]> wrote:
Hi,
how would I efficiently retrieve the names of all possible fields present
in an index?
One way would be
I'm not sure whether this fulfills your needs:
IndexReader.terms(), which returns a TermEnum:
TermEnum te = reader.terms();
while(te.next()) {
if(te.term().field().equals("Class")) {
System.out.println(te.term().field()+":"+te.term().text());
Hi Chris,
* It is a Unicode space character (SPACE_SEPARATOR, LINE_SEPARATOR, or
PARAGRAPH_SEPARATOR) but is not also a non-breaking space ('\u00A0',
'\u2007', '\u202F').
* It is '\u0009', HORIZONTAL TABULATION.
* It is '\u000A', LINE FEED.
* It is '\u000B', VERTICAL
So why does field1:"Test" return results with
"Test one"
"Test two"?
How can I tell Lucene to return only "Test"?
On 5/26/07, Erick Erickson <[EMAIL PROTECTED]> wrote:
It becomes a phrase query.
Erick
On 5/26/07, Mohammad Norouzi <[EMAIL PR
Hi All
I want to search for an exact phrase, for example field1:"This is a Text",
or have the query field1:"Test" return only results that contain just the
word "Test" and not "Test one" or "Test two".
I have a page that builds a query and finally sends that query to a
MultiFieldQueryParser.
I have a clas
Hi
I am using Lucene 2.0.0
I get following exception while indexing...
Has anyone faced this exception?
thanks
java.io.IOException: Cannot rename E:\index-lab\segments.new to
E:\index-lab\segments
at org.apache.lucene.store.FSDirectory.renameFile(FSDirectory.java:294)
a
Sorry Steven
That change is in WhitespaceTokenizer, not WhitespaceAnalyzer, but in the Analyzer
I had to call the tokenizer.
On 5/24/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
Hi Steven
Thank you so much for your thorough comments about Analyzer
I write that class a couple of months ag
Hi Steven
Thank you so much for your thorough comments about Analyzer
I wrote that class a couple of months ago. I have now taken a look at my
customized Analyzer.
The only change I've made is as follows.
The original class has this method:
protected boolean isTokenChar(char c) {
return !Character.isW
Wow, very nice comments
Thank you so much Erick. You really showed me the way
--
Regards,
Mohammad
--
see my blog: http://brainable.blogspot.com/
Hi Walter,
let me explain my problem in detail
I have a web page that lets a user create his own simple query.
For example, a user wants to locate a service with a specific value. He/she
doesn't know the exact name of the service, so I have to provide a list of
the services available (say in a combo box) and
w to get started:
http://wiki.apache.org/jakarta-lucene/HowToContribute
Thanks,
Steve
Mohammad Norouzi wrote:
> Walter,
> Yes I am using a customized WhiteSpaceAnalyzer while indexing.
> I said customized because I realized that standard WhiteSpaceAnalyzer
dont
> accept unicode terms i
I would use a term vector to get this. See
IndexReader.getTermFreqVector. You can get the term vector for just
field 3.
Grant, thanks. In my case, getTermFreqVector returns null and I don't know why.
It accepts a docnumber as a parameter. What is it? Is that the same as the doc id?
If yes it restrict the r
e, the Analyzer, maybe writing your own one.
Mohammad Norouzi wrote:
> Hi all
>
> consider following index
>
> field1 field2 field3
> text1 text1 text2 text3 text4
> text4 text2
returns: (however i have to put an if
statement to filter result of the field3 only)
text3
text4
text2
text5
but I need following result:
"text3 text4"
"text2 text3 text5"
Is this possible? If yes, how? And if no, is there any trick to get
this result?
Thank you so much.
Any more ideas?
On 4/17/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
Chris,
Thank you for your reply. Your solution doesn't work in my case because I
was thinking of indexing more than one document type in a single index, with each
document representing a table in the database. so if I put more th
www.dbsight.net
demo: http://search.dbsight.com
Lucene Database Search in 3 minutes:
http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes
On 4/15/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
> well, let me tell you what we're going to do.
> ou
nd use the database for some portion of your joins.
Best
Erick
On 4/15/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
>
> Thanks Karl,
> sorry I was on business vacation and could not reply soon.
>
> Ok, I want to index my database and search through the indexes, and the
s and the speed up accessing
information. Now our users are very satisfied with this, but we need to create
some relationships between documents.
On 4/11/07, karl wettin <[EMAIL PROTECTED]> wrote:
11 apr 2007 kl. 11.19 skrev Mohammad Norouzi:
> is it possible (or a trickery way) to searc
Hi
Is it possible (or is there a tricky way) to search with a given query in which we
can require equality between two fields?
For example:
Document:
field1 field2 field3 field4
Query:
field1:"test phrase" AND field2:"test" AND field3:field4
in this query we said that d
Well
Philipp and Ronnie Thank you very much indeed
--
Regards,
Mohammad
Thanks Philipp
2007/4/5, Philipp Nanz <[EMAIL PROTECTED]>:
> That *is* the actual id in the index. There is no other.
> You should be careful using it outside of Lucene though, because
> Lucene may rearrange the document ids during optimization for example.
>
> If you need an application id, ad
Hi
I need the id of the document that is returned by Hits as the result of a query.
Hits result = searchable.find(myQuery);
Now I need something like result.getId().
Is there any way to get it?
Thanks so much
--
Regards,
Mohammad Norouzi
Sorry, to correct my question:
I need something like this: result.doc( i ).getId();
The id from the result (the i) starts from 1, but I need the actual id
in the index.
On 4/5/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
Hi
I need the id of the document that returned by Hits as a
It has no performance problem and works fine.
Whenever you access a document, the searcher loads the
document from the index.
On 4/1/07, Mohsen Saboorian <[EMAIL PROTECTED]> wrote:
This is possible, but the problem here is performance. Why is it not
possible
to support paginati
Mohsen,
In order to support pagination, I wrapped the Hits in a class just like
java.sql.ResultSet.
You can create a wrapper class, put the Hits in it and implement some
methods like next() and prev() to move forward and backward through the documents.
Hope this helps you.
--
Regards,
Mohammad
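A minimal sketch of such a ResultSet-style wrapper follows. It pages over a
plain java.util.List, used here as a stand-in for the search results so that
the example does not depend on a specific Lucene version; the class name
PagedCursor and its methods are hypothetical, not part of Lucene.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical ResultSet-style cursor over an already materialised result list.
final class PagedCursor<T> {
    private final List<T> rows;
    private final int pageSize;
    private int page = 0; // zero-based index of the current page

    PagedCursor(List<T> rows, int pageSize) {
        if (pageSize <= 0) {
            throw new IllegalArgumentException("pageSize must be positive");
        }
        this.rows = rows;
        this.pageSize = pageSize;
    }

    // Number of pages, rounding up so a partial last page is counted.
    int pageCount() {
        return (rows.size() + pageSize - 1) / pageSize;
    }

    // Rows of the current page; the last page may be shorter.
    List<T> current() {
        int from = Math.min(page * pageSize, rows.size());
        int to = Math.min(from + pageSize, rows.size());
        return rows.subList(from, to);
    }

    // Moves forward one page; returns false when already on the last page.
    boolean next() {
        if (page + 1 >= pageCount()) {
            return false;
        }
        page++;
        return true;
    }

    // Moves back one page; returns false when already on the first page.
    boolean previous() {
        if (page == 0) {
            return false;
        }
        page--;
        return true;
    }

    public static void main(String[] args) {
        PagedCursor<String> cursor =
                new PagedCursor<>(Arrays.asList("d1", "d2", "d3", "d4", "d5"), 2);
        do {
            System.out.println(cursor.current());
        } while (cursor.next());
    }
}
```

Only the rows of the current page are touched on each call, which is the
property that matters when loading each result is expensive.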
ding Lucene in a DB,
but I can't remember who. Search the archive for Oracle and you'll find
an extensive discussion....
Best
Erick
On 3/26/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
>
> I mean when I get result from the first index, find the common records
> from
> t
Hi all
I am going to index our database. One approach is to join the tables and then
index the fields, but the data is very large, say more than 3
million records, so SQL Server fails to select them.
I want to know if anyone has experience with indexing a huge
information of database using lu
I just want to add: the common documents should be based on a field and not on the document id.
On 3/29/07, is_maximum <[EMAIL PROTECTED]> wrote:
Hi
suppose we have two Hits, now we need the documents which exists in both
of
them and ignore the others.
is there any workaround?
thanks
Regards
Mohammad
--
View
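Finding the documents common to two result sets by a field value, rather than
by document id, amounts to a set intersection on that field. A minimal sketch,
assuming the key field of every hit has already been read into a list of
strings; the class name CommonKeys and the sample keys are hypothetical.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper: intersects two result sets on a shared key field.
final class CommonKeys {
    // Returns the keys present in both lists, in the order of the first list.
    static Set<String> intersect(List<String> first, List<String> second) {
        Set<String> common = new LinkedHashSet<>(first);
        common.retainAll(new HashSet<>(second)); // keep only keys that the second set also has
        return common;
    }

    public static void main(String[] args) {
        List<String> hitsA = Arrays.asList("p1", "p2", "p3");
        List<String> hitsB = Arrays.asList("p3", "p1", "p9");
        System.out.println(intersect(hitsA, hitsB));
    }
}
```

If the key is a small non-negative integer, a java.util.BitSet per result set
combined with BitSet.and would do the same job with less memory.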
I registered with Nabble, but to post a message you have to subscribe to the Lucene
mailing list, and if you subscribe to the mailing list your inbox will become
full of messages. This is very bad!
On 3/28/07, John Haxby <[EMAIL PROTECTED]> wrote:
Grant Ingersoll wrote:
> I like the mailing list approach m
Sorry, I can't follow. Why should we use two separate indexes? Can't we
merge them into one index?
On 3/27/07, Steven Rowe <[EMAIL PROTECTED]> wrote:
Mohammad Norouzi wrote:
> Steven,
> what this means:
> "Each index added must have the same number of documents, but
>
Steven,
what this means:
"Each index added must have the same number of documents, but typically each
contains different fields. Each document contains the union of the fields of
all documents with the same document number. When searching, matches for a
query term are from the first index added th
Karl,
Maybe I am out of date!
do you mean with Nabble I can access this mailing list?
On 3/27/07, karl wettin <[EMAIL PROTECTED]> wrote:
27 mar 2007 kl. 08.03 skrev Mohammad Norouzi:
> I am using some JBoss products and they have a very nice and great
> forum,
>
> I am wo
I am using some JBoss products and they have a very nice and great forum,
I am wondering why Apache still uses this old-fashioned mailing list??
--
Regards,
Mohammad
--
2Michaelvalue2 fval1-1fval2-1 fval3-1
On 3/26/07, jafarim <[EMAIL PROTECTED]> wrote:
what do you mean by "applying the result to the second one"?
On 3/26/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
>
> hi
> I have tw
hi
I have two separate indexes, but some fields are common between
them. Now I want to search one index and then apply the result to the
second one. What solution do you suggest?
What happens to the fields? I mean, the first document has some fields that are
not present in the second
Hi Oramas
If I use that jar file, it conflicts with the lucene-core.jar file. For example,
the IndexSearcher class that you defined is different from the original one. Do
I have to remove the lucene-core jar file?
If yes, what about the other original classes?
On 3/8/07, oramas martín <[EMAIL PROTECTED]> wr
ousand
documents from Hits, you are doing something that is extremely
inefficient. As Erick said, Hits will continuously re-query to fill up its
cache. After grabbing info for 100 docs it will re-query, then
after 200 it will re-query, etc.
Stateless man. Trust me.
- Mark
Mohammad Norouzi wrote:
>
t check out a Searcher for each of the
underlying Indexes, so you will have to do that yourself...then
remember to release them all when you release the MultiSearcher.
I think in general, you are overly concerned. IndexAccessor will handle
most of this for you without much intervention on your part.
he.org/jira/browse/LUCENE-390
- Mark
Mohammad Norouzi wrote:
> Hi all,
> I am going to build a Searcher pooling. if any one has experience on
> this, I
> would be glad to hear his/her recommendation and suggestion. I want to
> know
> what issues I should be apply. considering I am
t
is, by String comparison).
Inspect your index with Luke to see if they're being stored as you
expect. Use QueryParser.toString to see how the query string
is broken up. Don't override anything until you do this or you'll waste
much effort .
Best
Erick
On 3/5/07, Mohammad Norouzi <
Hi Erick
I am completely confused about this IndexReader.
In my case, I have to keep the reader open because of pagination of the
results, so I have to have a reader per session. The thing that baffles me is:
can a single reader serve all the sessions at the same time?
I mean
1- having one reader
Hi
You know, actually we didn't index this field as a Date. We always use strings
instead of the Date type because we use both Hijri and Gregorian dates, so if
we put in a Hijri date the DateField does not work properly. That is why we index
such fields as Strings.
As I read the javadoc for QueryParser, i
Hi again
I even removed "/" from my query but it is still not working properly.
Do I have to index the dates without "/"?
Right now the index holds 1978/05/05. Should I change it to 19780505?
On 3/4/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
Hi
I want to search through l
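The conversion asked about here, from 1978/05/05 to 19780505, can be sketched
without any calendar class, so it treats Hijri and Gregorian dates alike.
Whether the "/" is really what breaks the range query is not established in
this thread; the sketch only shows the compaction itself and that the compact
strings compare in date order. The class name CompactDate is hypothetical.

```java
import java.util.Locale;

// Hypothetical helper: turns yyyy/MM/dd strings into sortable yyyyMMdd strings.
final class CompactDate {
    // "1978/5/5" or "1978/05/05" -> "19780505"; no calendar validation is done,
    // so dates from any calendar with year/month/day parts are accepted.
    static String compact(String date) {
        String[] parts = date.trim().split("/");
        if (parts.length != 3) {
            throw new IllegalArgumentException("expected yyyy/MM/dd: " + date);
        }
        int year = Integer.parseInt(parts[0]);
        int month = Integer.parseInt(parts[1]);
        int day = Integer.parseInt(parts[2]);
        // Locale.ROOT keeps the digits ASCII regardless of the platform locale.
        return String.format(Locale.ROOT, "%04d%02d%02d", year, month, day);
    }

    // Inclusive range test using plain string comparison on the compact form.
    static boolean inRange(String date, String from, String to) {
        String d = compact(date);
        return d.compareTo(compact(from)) >= 0 && d.compareTo(compact(to)) <= 0;
    }

    public static void main(String[] args) {
        System.out.println(compact("1978/05/05"));
        System.out.println(inRange("1990/1/15", "1978/05/05", "2005/05/05"));
    }
}
```

The indexed values and the query bounds would both need this form, so an
existing index holding 1978/05/05 would presumably have to be rebuilt.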
Hi
I want to search through Lucene's index from a start date to an end date.
When I pass a single-date query it works, say, admitDate:1978/05/05.
However, when I use range syntax it returns no records:
admitDate:[1978/05/05 TO 2005/05/05]
I even tried this: admitDate:["1978/05/05" TO "2005/05/05"]
but still
Hello
I have implemented an IndexResultSet just like java.sql.ResultSet with all its
methods. When I call searcher.search(...) I pass the returned Hits to my
IndexResultSet.
In the IndexResultSet I have getString(String) getString(int) getInt()
next() previous() absolute() and all methods of the j
want to de-normalize your data for easy lookup etc.
There'll be some up-front design work to get optimal performance. Especially,
there's no sense of performing joins in Lucene, and you shouldn't try.
Overall, use Lucene for searching/sorting text, use your RDBMS for
relational thin
create a pool of searchers and one reader for all searchers?
3- or a pool of searchers, each with its own reader, but never close them,
just set a flag to true.
Which one?
On 2/25/07, Nicolas Lalevée <[EMAIL PROTECTED]> wrote:
Le dimanche 25 février 2007 16:55, Mohammad Norouzi a écrit
table_2_field1 table_2_field2 etc.
These documents will never interfere with each other during searches
because
they share no fields (and each query goes against a particular field).
I mention this because your maintenance will be much easier if you only
have
one index
Best
Erick
archer). Doing so would just slow you down.
Mohammad Norouzi wrote:
> Hi
> actually I dont have any writer or writing reader. I just have reader.
> when
> a reader is created by the user because the document returned by hits is
> very much, for example 20,000 so I display th
> safely for every Searcher with different queries?
>
> Thank you very much again.
>
> On 2/22/07, Mark Miller <[EMAIL PROTECTED]> wrote:
>>
>> I would not do this from scratch...if you are interested in Solr go
that
>> route else I would build off
>> http
wrote:
I would not do this from scratch...if you are interested in Solr go that
route else I would build off
http://issues.apache.org/jira/browse/LUCENE-390
- Mark
Mohammad Norouzi wrote:
> Hi all,
> I am going to build a Searcher pooling. if any one has experience on
> this, I
> wo
, take a few minutes and ask
yourself if this design is appropriate, and continue only if you can
answer
in the affirmative...
Best
Erick
On 2/22/07, Mohammad Norouzi <[EMAIL PROTECTED]> wrote:
>
> Hello
> In our application we have to index the database tables, there is two
way
> to
Hello
In our application we have to index the database tables. There are two ways to
do this:
1- index each table in a separate directory and then keep all relations in
order to get the right result. In this method, we should use filters to overcome
the problem of searching within another search result.
2.
advance.
--
Regards,
Mohammad Norouzi
Hi
I've overcome this problem without HitCollector. I built an interface just
like java.sql.ResultSet, and its implementation class accepts a Hits as a
parameter and provides next(), previous() etc. methods to navigate between
records.
In my opinion this is a good solution.
Hope this helps you
On 2/21/0
, fields, flags, analyzer);
--
Regards,
Mohammad Norouzi
On 2/14/07, Akbar Devani <[EMAIL PROTECTED]> wrote:
Hi,
I'm in need of your help. I was looking for how to search multiple fields
using Lucene, and in one of the mailing lists I found your solution
to the issue.
It's working perfectly but
Hi
I want to index data with UTF-8 encoding, so when adding a field to a document
I am using the code new String(value.getBytes("utf-8")).
On the other hand, when searching I was using the same code snippet
to convert to UTF-8, but it did not work, so finally I found somewhere
that had been
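One thing worth checking with the snippet above: new String(bytes) without a
charset argument decodes with the platform default charset, so
new String(value.getBytes("utf-8")) returns the original text only when that
default happens to be UTF-8, and garbles non-ASCII text otherwise. A minimal
sketch of the difference follows; the class name Utf8RoundTrip is hypothetical,
and ISO-8859-1 merely stands in for "some other default charset".

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo of matched versus mismatched charset decoding.
final class Utf8RoundTrip {
    // Encodes to UTF-8 bytes and decodes with the same charset: lossless.
    static String roundTrip(String s) {
        return new String(s.getBytes(StandardCharsets.UTF_8), StandardCharsets.UTF_8);
    }

    // Decodes UTF-8 bytes with a different charset, which is what new String(bytes)
    // does whenever the platform default charset is not UTF-8.
    static String mismatched(String s) {
        return new String(s.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String word = "\u0633\u0644\u0627\u0645"; // a four-letter Persian word
        System.out.println(roundTrip(word).equals(word));
        System.out.println(mismatched(word).equals(word));
        System.out.println(mismatched(word).length());
    }
}
```

A Java String is already Unicode in memory, so the charset matters where bytes
enter or leave the program (files, HTTP requests, database drivers), and that
is usually the place to name UTF-8 explicitly.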
Hi
Would you tell me how we can create a searcher pool? I am interested in your
idea.
thanks
On 2/7/07, spamsucks <[EMAIL PROTECTED]> wrote:
I currently am sharing my IndexSearcher across multiple requests as a
static
object.
While I understand that the IndexSearcher is threadsafe, this does no
Hi
As far as I know there is no way to obtain the structure of the index. For
example, I need a list of the fields available in the index and their
types.
I need this feature. What shall I do?
--
Regards,
Mohammad Norouzi