Re: Index Field feeded from Reader that also stores cleartext

Ian Lea Fri, 03 Sep 2010 06:02:46 -0700

If you can't use one of the Reader based Field methods then no.
You'll have to convert the data to a string.  If you do it a doc at a
time and you still don't have enough memory then I don't know what you
can do.



--
Ian.


On Fri, Sep 3, 2010 at 10:23 AM, Gregor Dorfbauer
<gregor.dorfba...@lagentz.at> wrote:
> Hi!
>
> I'm working on an indexer that should process documents on hard-disk which
> are of arbitrary size and type. I use Apache Tika for plain text extraction
> which offers the feature to stream the parsers output through a reader.
>
> My problem is following:
> Is there a possibility to generate a document field that gets its data from
> an Reader-instance and where the plain text is also stored into the index
> (like the Store.YES field denotes)?
> If I can't stream the data, memory usage is exceeding the limits of my
> machine.
>
>
> Thanks for your help,
> Gregor
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Index Field feeded from Reader that also stores cleartext

Reply via email to