Re: Use the lucene for searching in the Semantic Web.

2006-01-17 Thread xing jiang
Hi, I think one problem of the existing method is that, to query on the RDF files or similar structures, we have to form SQL like queries. However, for searching in the text files, we only need to type several keywords. Can we combine the two methods and how can we combine the two methods. For ins

Re: Why we use Lucene for Database search like Oracle / Sybase ?

2006-01-17 Thread Kan Deng
For many applications, search engine and database are competitive solutions. One has to consider in depth to choose either search engine or database, and in some cases, the border is blurred. There are several issues to consider when selecting database or Lucene or both as the solution to a speci

RE: best practices

2006-01-17 Thread Chris Hostetter
The concept of a category means very differnet things to different people, but if you've got several differnet use cases that you think other people can benefit from, by all means feel free to write up descriptions of the problem you solved, and how you solved it and post it in the HowTo section

Re: Lucene Query Writing question

2006-01-17 Thread Paul Elschot
On Tuesday 17 January 2006 20:52, Dan Katz wrote: ... > Question 1) Is there a way in Lucene to have some sort of limit based > on term count. For example, "atleast5 Apple" to find items with the > word apple only when it has at least 5 mentions. This can be done, but you'll need to write your

Re: Why we use Lucene for Database search like Oracle / Sybase ?

2006-01-17 Thread Peter A. Daly
In many cases that essentially require traditional RDBMS but also require lucene like functionality, I would use the database as the primary data store. I would then either: 1. Update the lucene index using data from the database based on a scheduled process. 2. As records are added, add them to

RE: Why we use Lucene for Database search like Oracle / Sybase ?

2006-01-17 Thread John Powers
Would you say as a best practice that you can use both?When would you and when wouldn't you? I trust databases more then free files, so I am putting my more sensitive and volatile data in the database. If you built a commenting system.. like a blog or an flickr type app, would just a lucene

Re: Why we use Lucene for Database search like Oracle / Sybase ?

2006-01-17 Thread Kan Deng
1. The conventional database uses B+tree as the indexing mechanism, while search engine uses inverted-index. When user needs to update the data frequently, then B+tree is a better choice. However, for search engine, the data and index doesn't change too often. Inverted indexes are tables

Lucene Query Writing question

2006-01-17 Thread Dan Katz
I apologize if sending this to the wrong place, but I need some help writing some lucene queries.  I am not the Lucene manager here at our company.  Just a lowly unsophisticated user who would be appreciative of any help that can be provided.   Question 1)   Is there a way in Lucene to have

RE: best practices

2006-01-17 Thread John Powers
If that's it, that's fine. I guess I had in mind something else? For example, one of mine uses categories (something mentioned quite a bit), but it has some slight differences from what I've seen before. Items can be in multiple locations, the categories can move around amoungst themselves, and

AW: Use the lucene for searching in the Semantic Web.

2006-01-17 Thread Klaus
Hi Jiang, I'm currently facing a similar problem. Up to now I have to use for the semantic query a graph matching algorithm, but the fulltext search in the semantic web is performed by lucene. At first I wrote the whole text into a one index. The document contains one field for the unique id and

RE: best practices

2006-01-17 Thread Pasha Bizhan
Hi, > -Original Message- > From: John Powers [mailto:[EMAIL PROTECTED] > > Is there any repository of best practices? Does LIA represent that? > I was thinking about a blog or something that everyone could > post their solutions into. I think http://wiki.apache.org/jakarta-lucene/Ho

Re: Use the lucene for searching in the Semantic Web.

2006-01-17 Thread jason
hi Erik, thx for your reply. I think the Kowari is a system for searching information in the RDF files. It is only for finding information in the meta data files. However, i think one problem of the Semantic Web is that, if we have a document and its RDF annotate, how do we retrieve the documents

best practices

2006-01-17 Thread John Powers
Hia Is there any repository of best practices? Does LIA represent that? I was thinking about a blog or something that everyone could post their solutions into.I've written only 4 implementations of lucene, but each was so very different, I was thinking it might be nice to know what everyone el

Re: Why we use Lucene for Database search like Oracle / Sybase ?

2006-01-17 Thread Chris Lu
1) Because database mainly uses B-tree to index records' some columns. Lucene or other full text search software uses to analyze the content of each document and saved to inverted index. "Lucene's index falls into the family of indexes known as an inverted index. This is because it can list, for a

Re: Use the lucene for searching in the Semantic Web.

2006-01-17 Thread Erik Hatcher
Have a look at Kowari - http://www.kowari.org It is a scalable RDF engine that also has full-text search support via Lucene. Professionally I tinker with semweb and search topics, and eventually we'll have something to show for these efforts :) Erik On Jan 17, 2006, at 9:34 AM,

Use the lucene for searching in the Semantic Web.

2006-01-17 Thread jason
Hi friends, How do you think use the lucene for searching in the Semantic Web? I am trying using the lucene for searching documents with ontological annotation. But i do not get a better model to combine the keywords information and the ontological information. regards jiang xing

Re: One problem of using the lucene

2006-01-17 Thread jason
Ok, i will try it. On 1/17/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: > > > On Jan 17, 2006, at 5:58 AM, jason wrote: > > I have test the snowballFilter and it does not stem the term > > "support". It > > means the term "support" should be in all the papers. However, i > > add the > > synonymF

Re: Memory

2006-01-17 Thread John Haxby
Aigner, Thomas wrote: I did a man on top and sure enough there was a PPID command on Linux (f then B) for parent process. And yes, they always have the same parent command. Thanks for your help as I'm obviously still a noob on Unix. Nope, that doesn't tell you they're different thre

Re: One problem of using the lucene

2006-01-17 Thread Erik Hatcher
On Jan 17, 2006, at 5:58 AM, jason wrote: I have test the snowballFilter and it does not stem the term "support". It means the term "support" should be in all the papers. However, i add the synonymFilter, the "support" is missing. Two very valuable troubleshooting techniques: 1) Run yo

Re: One problem of using the lucene

2006-01-17 Thread jason
hi, thx for your replies. I have test the snowballFilter and it does not stem the term "support". It means the term "support" should be in all the papers. However, i add the synonymFilter, the "support" is missing. I think i have to read the lucene source code again. yours truly Jiang Xing On

Why we use Lucene for Database search like Oracle / Sybase ?

2006-01-17 Thread Vikas Khengare
Hi Friends, I have very basic question that 1] Why we use Lucene for Database search like Oracle / Sybase ? 2] For that first we have to convert all records one bye one in string then build lucene document and then index it ? Thanks From, [ [EMAIL PROTECTED] ]

Memory Issues

2006-01-17 Thread Rob Young
Hi, I've developed a service which accepts search requests over the network, runs them with Lucene and pumps out results. I have noticed that if I use RAMDirectory the memory usage is much more (more than expected) and it grows as the service is left running. The lucene index is 34Mb but when

Re: One problem of using the lucene

2006-01-17 Thread Erik Hatcher
On Jan 17, 2006, at 12:14 AM, jason wrote: It is adding tokens into the same position as the original token. And then, I used the QueryParser for searching and the snowball analyzer for parsing. Ok, so you're only using the SynonymAnalyzer for indexing, and the SnowballAnalyzer for QueryP

Files in index directory

2006-01-17 Thread shailesh kumar
This question is based on the file format document. http://lucene.apache.org/java/docs/fileformats.html The following is my understanding. Is this correct 1) The .frq file is the most important file and that is where we get the mapping of terms to documents. 2) The .prx file is used f