Re: SpanQuery parser? Update (ugly hack inside...)

2005-11-07 Thread Sean O'Connor
Paul Elschot wrote: The goal(s) I am trying to accomplish is rather specific I think, so I imagine the use of my hacking is rather limited (i.e. just to me). At the moment my code: * parses the search text (i.e. user entered query) Are you using QueryParser? If

Re: SpanQuery parser? Update (ugly hack inside...)

2005-11-07 Thread Sean O'Connor
Erik Hatcher wrote: On 4 Nov 2005, at 18:32, Sean O'Connor wrote: I'm posting this primarily hoping to give back a tiny bit to a very helpful community. More likely however, someone else will open my eyes to an easier approach than what I outline below... I've come up w

Re: SpanQuery parser? Update (ugly hack inside...)

2005-11-04 Thread Sean O'Connor
ct hit found. This is really only useful for "termA near 'some phrase'" at the moment, but might become more advanced in the next 2-3 months. Sean Paul Elschot wrote: On Thursday 20 October 2005 00:40, Sean O'Connor wrote: Hello, I have user entered search

SpanQuery parser?

2005-10-19 Thread Sean O'Connor
Hello, I have user entered search commands which I want to convert to SpanQueries. I have seen in the book "Lucene in Action" that no parser existed at time of publication, but there was someone working on a SpanQuery parser. Can anyone point me to that code, or provide any suggestions?

Location of code which determines a Hit for PhraseQuery

2005-09-07 Thread Sean O'Connor
Hi, I am trying to work through the Hit collection process for a PhraseQuery (using an exact phrase). For an example search, say I'm looking for: "lucene action" (quotes indicating exact phrase) in a one doc, one field index consisting of: wow, lucene rocks, lucene action items are cool, v

Re: Hits document offset information? Span query or Surround? - thanks

2005-09-06 Thread Sean O'Connor
Thanks for the input. I am looking at the suggested links now. If I make any progress I will return to see if any of my work would be appropriate to contribute back. Sean Paul Elschot wrote: On Tuesday 06 September 2005 08:52, markharw00d wrote: >>I believe I have heard that Span queries

Hits document offset information? Span query or Surround?

2005-09-05 Thread Sean O'Connor
I believe I have heard that Span queries provide some way to access document offset information for their hits somehow. Does anyone know if this is true, and if so, how I would go about it? Alternatively (preferably actually) does the surround code from the SVN development area have a way of r

Re: Can Span Queries contain boolean, prefix and other component queries?

2005-09-04 Thread Sean O'Connor
help, Sean Paul Elschot wrote: Sean, On Sunday 04 September 2005 20:43, Sean O'Connor wrote: Hello, I am trying to do some complex queries such as: [Field contents] The movie Napoleon Dynamite is a movie about a kid named Napoleon who has no Dynamite. [Query] "Napol* Dynam

Re: Lucene contrib (surround), Subversion, and Eclipse

2005-09-04 Thread Sean O'Connor
ml file that can be used directly for both building and testing just the surround package, but it does require that lucene/java/trunk/contrib/contrib-build.xml and some other files from lucene/java/trunk/ are checked out, so you might as well just check out the whole thing. : Date: Sun, 04 Sep 20

Lucene contrib (surround), Subversion, and Eclipse

2005-09-04 Thread Sean O'Connor
Hello, I am new to subversion, junit and the Lucene contrib repository. I am looking over the 'surround' project at the moment. If there is anyone out there with Eclipse experience who uses the contrib subversion (or cvs) repository could you look over my approach listed below? I am using

Re: Phrase frequency

2005-09-04 Thread Sean O'Connor
I believe the index just contains information about single terms. A PhraseQuery then searches the index for the parts of the phrase and returns the hit information. So, as far as I understand, there is no way to get the frequency of phrase directly from an index, but you could create a PhraseQ

Can Span Queries contain boolean, prefix and other component queries?

2005-09-04 Thread Sean O'Connor
Hello, I am trying to do some complex queries such as: [Field contents] The movie Napoleon Dynamite is a movie about a kid named Napoleon who has no Dynamite. [Query] "Napol* Dynamite" near "film|movie" Is this possible with some version of a span query? Something like a PhrasePrefixQ

Example of Field.TermVector.WITH_POSITIONS_OFFSETS usage?

2005-08-23 Thread Sean O'Connor
Hello, I am trying to work through term positions and how to get them from a collection of hits. Does setting TermVector.WITH_POSITIONS_OFFSETS to true save the start/end position of the term in the source text file? (I _think_ it does). If so, where would I start for trying to make th

Re: Search Hit frequency and location

2005-06-16 Thread Sean O'Connor
to educate myself would be welcome as well. Cheers, Sean Erik Hatcher wrote: On Jun 16, 2005, at 12:03 PM, Sean O'Connor wrote: Yes, see the Javadoc for IndexReader.termPositions(). I'm probably missing the obvious here, but I assume this refers to the analyzed ter

Search Hit frequency and location

2005-06-16 Thread Sean O'Connor
Hello, I am trying to find the right approach for finding frequency (and, slightly lower in priority, location) of search hits in a document. I am working through the online documentation and the helpful "Lucene in Action" book. There are several examples and explanations which seem close, but