Hi,
I think one problem of the existing method is that, to query on the RDF
files or similar structures, we have to form SQL like queries. However, for
searching in the text files, we only need to type several keywords. Can we
combine the two methods and how can we combine the two methods. For
ins
For many applications, search engine and database are
competitive solutions. One has to consider in depth to
choose either search engine or database, and in some
cases, the border is blurred.
There are several issues to consider when selecting
database or Lucene or both as the solution to a
speci
The concept of a category means very differnet things to different people,
but if you've got several differnet use cases that you think other people
can benefit from, by all means feel free to write up descriptions of the
problem you solved, and how you solved it and post it in the HowTo
section
On Tuesday 17 January 2006 20:52, Dan Katz wrote:
...
> Question 1) Is there a way in Lucene to have some sort of limit based
> on term count. For example, "atleast5 Apple" to find items with the
> word apple only when it has at least 5 mentions.
This can be done, but you'll need to write your
In many cases that essentially require traditional RDBMS but also require
lucene like functionality, I would use the database as the primary data
store. I would then either:
1. Update the lucene index using data from the database based on a
scheduled process.
2. As records are added, add them to
Would you say as a best practice that you can use both?When would
you and when wouldn't you? I trust databases more then free files, so I
am putting my more sensitive and volatile data in the database. If you
built a commenting system.. like a blog or an flickr type app, would
just a lucene
1. The conventional database uses B+tree as the
indexing mechanism, while search engine uses
inverted-index.
When user needs to update the data frequently, then
B+tree is a better choice. However, for search engine,
the data and index doesn't change too often.
Inverted indexes are tables
I apologize if sending this to the wrong
place, but I need some help writing some lucene queries. I am not the
Lucene manager here at our company. Just a lowly unsophisticated user who
would be appreciative of any help that can be provided.
Question 1) Is there
a way in Lucene to have
If that's it, that's fine. I guess I had in mind something else? For
example, one of mine uses categories (something mentioned quite a bit),
but it has some slight differences from what I've seen before. Items
can be in multiple locations, the categories can move around amoungst
themselves, and
Hi Jiang,
I'm currently facing a similar problem. Up to now I have to use for the
semantic query a graph matching algorithm, but the fulltext search in the
semantic web is performed by lucene.
At first I wrote the whole text into a one index. The document contains one
field for the unique id and
Hi,
> -Original Message-
> From: John Powers [mailto:[EMAIL PROTECTED]
>
> Is there any repository of best practices? Does LIA represent that?
> I was thinking about a blog or something that everyone could
> post their solutions into.
I think http://wiki.apache.org/jakarta-lucene/Ho
hi Erik,
thx for your reply.
I think the Kowari is a system for searching information in the RDF files.
It is only for finding information in the meta data files. However, i think
one problem of the Semantic Web is that, if we have a document and its RDF
annotate, how do we retrieve the documents
Hia
Is there any repository of best practices? Does LIA represent that?
I was thinking about a blog or something that everyone could post their
solutions into.I've written only 4 implementations of lucene, but
each was so very different, I was thinking it might be nice to know what
everyone el
1) Because database mainly uses B-tree to index records' some columns.
Lucene or other full text search software uses to analyze the content
of each document and saved to inverted index. "Lucene's index falls
into the family of indexes known as an inverted index. This is because
it can list, for a
Have a look at Kowari - http://www.kowari.org
It is a scalable RDF engine that also has full-text search support
via Lucene.
Professionally I tinker with semweb and search topics, and eventually
we'll have something to show for these efforts :)
Erik
On Jan 17, 2006, at 9:34 AM,
Hi friends,
How do you think use the lucene for searching in the Semantic Web? I am
trying using the lucene for searching documents with ontological annotation.
But i do not get a better model to combine the keywords information and the
ontological information.
regards
jiang xing
Ok, i will try it.
On 1/17/06, Erik Hatcher <[EMAIL PROTECTED]> wrote:
>
>
> On Jan 17, 2006, at 5:58 AM, jason wrote:
> > I have test the snowballFilter and it does not stem the term
> > "support". It
> > means the term "support" should be in all the papers. However, i
> > add the
> > synonymF
Aigner, Thomas wrote:
I did a man on top and sure enough there was a PPID command on
Linux (f then B) for parent process. And yes, they always have the same
parent command. Thanks for your help as I'm obviously still a noob on
Unix.
Nope, that doesn't tell you they're different thre
On Jan 17, 2006, at 5:58 AM, jason wrote:
I have test the snowballFilter and it does not stem the term
"support". It
means the term "support" should be in all the papers. However, i
add the
synonymFilter, the "support" is missing.
Two very valuable troubleshooting techniques:
1) Run yo
hi,
thx for your replies.
I have test the snowballFilter and it does not stem the term "support". It
means the term "support" should be in all the papers. However, i add the
synonymFilter, the "support" is missing.
I think i have to read the lucene source code again.
yours truly
Jiang Xing
On
Hi Friends,
I have very basic question that
1] Why we use Lucene for Database search like Oracle / Sybase ?
2] For that first we have to convert all records one bye one in
string then build lucene document and then index it ?
Thanks
From,
[ [EMAIL PROTECTED] ]
Hi,
I've developed a service which accepts search requests over the network,
runs them with Lucene and pumps out results. I have noticed that if I
use RAMDirectory the memory usage is much more (more than expected) and
it grows as the service is left running.
The lucene index is 34Mb but when
On Jan 17, 2006, at 12:14 AM, jason wrote:
It is adding tokens into the same position as the original token.
And then,
I used the QueryParser for searching and the snowball analyzer for
parsing.
Ok, so you're only using the SynonymAnalyzer for indexing, and the
SnowballAnalyzer for QueryP
This question is based on the file format document.
http://lucene.apache.org/java/docs/fileformats.html
The following is my understanding. Is this correct
1) The .frq file is the most important file and that is where we get the
mapping of terms to documents.
2) The .prx file is used f
24 matches
Mail list logo