Done. LUCENE-5223 <https://issues.apache.org/jira/browse/LUCENE-5223>
On Mon, Sep 16, 2013 at 6:39 PM, Uwe Schindler wrote:
> Hi Bruce,
>
> Thanks for investigating! Can you open a bug report on
> https://issues.apache.org/jira/browse/LUCENE ?
>
> Uwe
>
> -
Here it fails because -verbose is not set:
$ java -cp ./lucene-core-4.4-SNAPSHOT.jar
org.apache.lucene.index.IndexUpgrader ./INDEX
Exception in thread "main" java.lang.IllegalArgumentException: printStream
must not be null
at
org.apache.lucene.index.IndexWriterConfig.setInfoStream(IndexWriterConf
hi list...
sorry to post here, but i figured you might be able to help...
i'm working on a project, that deals with building a crawler, and i'm
working out the details for the mgmt app for the crawler. i'm currently
looking at how to deal with the status/actions of the crawler, and how the
differ
ment, etc and that the classes also have
some kind of metadata that could be used to correlate. But unless
you can give us some clue what information you *do* have access to,
I don't think there's much we can do to help...
Best
Erick
On Thu, Mar 19, 2009 at 12:47 PM, bruce wrote:
>
Hi...
This may/may not have anything to do with Lucene/Nutch, but I figured I'd
ask/post anyway.
I'm working on a project, dealing with courses/classes on college sites. I'm
trying to figure out how to create an automated process where I can create a
process to link a given faculty member to a gi
Hi...
Sorry that this is a bit off track. Ok, maybe way off track!
But I don't have anyone to bounce this off of..
I'm working on a crawling project, crawling a college website, to extract
course/class information. I've built a quick test app in python to crawl the
site. I crawl at the top level
as someone who's posted, and replied to other project requests on other
sites, if you're not apamming... then i'd say go ahead!!
in this day/time, when you don't know if your job is safe next week, who's
really going to frown upon a potentially serious project/offer...
just be cool on the spammin
application/process.
Thanks
-bruce
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
test
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
override or
> re-implement PrefixFilter?
One possible solution that may or may not work for you is explained here:
http://mail-archives.apache.org/mod_mbox/lucene-java-user/200503.mbox/[EMAIL
PROTECTED] For general purpose searching I'm not sure it's a great choice
however I wanted t
hey cass...
anyway you could let us know the site/app that we're powering!!!
always good to see what's going on in the world!
thanks
-Original Message-
From: Cass Costello [mailto:[EMAIL PROTECTED]
Sent: Tuesday, March 20, 2007 12:58 PM
To: solr-user@lucene.apache.org; java-user@lucene
" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
implied.
* See the License for the specific language governing permissions and
* limitations under the License.
*/
Regards,
Bruce Ritchie
> -Original Message-
> From: Bill Taylor [mailto:[EMA
Rejiv,
You may want to take a look at http://akismet.com/development/ - I don't
believe it's open source but it may be worth looking into.
Regards,
Bruce Ritchie
> -Original Message-
> From: Rajiv Roopan [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, October 04
levels in a similar
manner. this would/should allow the app to return all the required pages.
i could then create the parser(s) to extract information from each page.
although it would be better to actually have the ability to somehow manage
the data extraction from the DOM within the crawling ap
hi...
anybody on the list provide consulting/support for lucene/nutch...
get back to me with your contact info if you do...
thanks
-bruce
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL
extract certain information based upon the
DOM of the page... if the returned information from the 'crawler' is of a
textfile format, i can easily create a parsing function to go through the
files and generate the information...
can someone provide me with insight as to whether lucen
is
to send the first post I really want to send, and go check an archive
of the list a little later.
-Yonik
On 6/21/06, bruce <[EMAIL PROTECTED]> wrote:
> hi..
>
> can someone please respond to this so i can see if i'm getting through..
>
> thanks
>
> -bruce
-Yonik
hi..
can someone please respond to this so i can see if i'm getting through..
thanks
-bruce
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
ction
of the college site that i crawl...
btw, how does lucene/nutch compare to heritrix?
thanks...
-bruce
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
.
any research/papers/sample apps i could look at...
any thoughts/comments/etc
i could brute force this by writing a bunch of perl
scripts, with each script tied to a given registrar site,
but i'd like a more generalizable solution if one exists..
thanks
-bruce
--
> the document.
We use 2/ and use filters when we want to search only within a particular
language. Just be sure touse the same analyzer when indexing and
searching within a particular language.
Regards,
Bruce Ritchie
age (assumming that the
document isn't already loaded into cache).
The third approach was deemed unusable for the exact reason you outlined.
Regards,
Bruce Ritchie
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Is there any syntax to find all the documents where a particular
Lucene-indexed field is empty?
For instance, say there is a field called "author". I can do a field search
for
author:cutting
to find all documents with "cutting" in the author field; I could also
search for
author:
23 matches
Mail list logo