Hi,
Can anyone suggest how I can use Lucene for text classification?
--
Thanks and Regards
Vignesh Srinivasan
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@
Hi,
Has anyone used the Naive Bayesian classifier?
It would be really helpful if someone could post how to use the
classifiers in Lucene.
--
Thanks and Regards
Vignesh Srinivasan
9739135640
side.
I thought the Lucene Naive Bayesian classifier would be useful.
Please suggest whether emails can be classified using the
Lucene Naive Bayesian classifier or any other Lucene classifier.
Thanks and Regards
Vignesh Srinivasan
On Mon, Jan 14, 2013 at 7:23 PM, VIGNESH S wrote:
> Hi,
>
> An
Hi,
I recently started using Lucene. There are a lot of articles and news
items about understanding Lucene at the API level.
Can anyone help me understand in depth what happens internally
inside a Lucene index: how it stores data, and which
data structure Lucene uses for the inverted index?
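As a conceptual aid for the question above, here is a minimal sketch of what "inverted index" means: a sorted term dictionary where each term maps to a list of document IDs. This is a toy written with the Java standard library only, and the class name ToyInvertedIndex and its methods are made up for illustration. It is not Lucene's implementation; Lucene's real on-disk format is split into segments and is far more compact and elaborate.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

/** Toy inverted index (illustration only): sorted terms -> ascending document IDs. */
class ToyInvertedIndex {
    // TreeMap keeps the term dictionary sorted, which makes prefix and range scans possible.
    private final Map<String, List<Integer>> postings = new TreeMap<>();
    private int nextDocId = 0;

    /** Splits the text on whitespace, lowercases each token, and records the document ID. */
    int addDocument(String text) {
        int docId = nextDocId++;
        for (String token : text.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            if (token.isEmpty()) {
                continue;
            }
            List<Integer> docs = postings.computeIfAbsent(token, t -> new ArrayList<>());
            // Document IDs only grow, so checking the last entry is enough to avoid duplicates.
            if (docs.isEmpty() || docs.get(docs.size() - 1) != docId) {
                docs.add(docId);
            }
        }
        return docId;
    }

    /** Returns the IDs of all documents that contain the term, or an empty list. */
    List<Integer> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptyList());
    }
}
```

After adding two documents that both contain the word "index", search("index") returns both document IDs, while a term that was never added returns an empty list.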
Apache Tika: you can use it to extract text from PDF and Word documents.
It internally uses Apache POI to extract text from Office documents.
It uses PDFBox to extract text from PDF documents.
On Sat, Jan 26, 2013 at 4:24 AM, saisantoshi wrote:
> I want to index the document content(
Hi All,
I have a basic question.
I am trying to update a Lucene document field with a new value.
My code is below. It does not give any errors, but it also does not
update the document with the new field value.
Document d = searcher.doc(docId);
writer1 = new IndexWriter(csDirectory, new
IndexWriterC
> Separately, that's the right way to call .updateDocument, but you must
> ensure FILE_NAME_FIELD was indexed for the first document, with the
> value "new1.docx".
>
> Can you include how you indexed the original document?
>
> Mike McCandless
>
> htt
Hi,
For the basics of Lucene, such as how to create a Lucene index,
look into the book Lucene in Action.
On Tue, Feb 5, 2013 at 6:28 PM, Álvaro Vargas Quezada wrote:
> Hello,
> I want to implement a central index, and I heard about Lucene, so I would
> like to ask your help to install it an
Hi,
I have two questions.
1. How do I get an enumeration of terms ending with a given word?
I saw that we can enumerate terms starting at a given term with the
IndexReader.terms(Term) method.
2. I am actually doing a MultiPhraseQuery, and in it I need to do a suffix
query on the first word. How can I do this?
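One common way to approach the suffix question is to keep a second, reversed copy of every term, so that "ends with X" becomes "starts with reverse(X)" on a sorted dictionary. The sketch below shows only that idea using the Java standard library; the class SuffixTermLookup and its methods are hypothetical and are not part of Lucene. Lucene's analysis module has a ReverseStringFilter that may serve the same purpose at index time, but check it against your Lucene version before relying on it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

/** Toy suffix lookup (illustration only): stores reversed terms in sorted order. */
class SuffixTermLookup {
    private final TreeSet<String> reversedTerms = new TreeSet<>();

    void addTerm(String term) {
        reversedTerms.add(reverse(term));
    }

    /** Returns every stored term that ends with the given suffix. */
    List<String> termsEndingWith(String suffix) {
        String prefix = reverse(suffix);
        List<String> result = new ArrayList<>();
        // All reversed terms sharing the prefix are contiguous, starting at the ceiling of the prefix.
        for (String reversed : reversedTerms.tailSet(prefix, true)) {
            if (!reversed.startsWith(prefix)) {
                break;
            }
            result.add(reverse(reversed));
        }
        return result;
    }

    private static String reverse(String s) {
        return new StringBuilder(s).reverse().toString();
    }
}
```

The terms found this way could then be turned into a Term[] and passed to MultiPhraseQuery.add(Term[]) for the first position of the phrase, in the same way the prefix case is handled.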
Hi Rafaela,
Look at Lucene Transform. It might help to encrypt Lucene documents.
https://code.google.com/p/lucenetransform/
On Wed, Jun 26, 2013 at 2:36 PM, Rafaela Voiculescu <
rafaela.voicule...@gmail.com> wrote:
> Hello,
>
> Thank you all for your help and the suggestions. They are very usef
Hi,
Is it mandatory to use "Store.YES" when using the highlighting feature?
Is it possible to use highlighting without "Store.YES" while
indexing? Storing the field almost doubles the index size.
Please kindly help.
--
Thanks and Regards
Vignesh Srinivasan
9739135640
Hi,
I think using the CompressingStoredFieldsFormat feature introduced in Lucene
4.1 may help reduce the index size.
Any other comments and suggestions on this topic are welcome.
Thanks and Regards
Vignesh Srinivasan
9739135640
On Thu, Jul 4, 2013 at 6:38 PM, VIGNESH S wrote:
> Hi,
>
compression techniques I can apply so
that disk space is reduced without increasing the search time.
Thanks and Regards
Vignesh srinivasan
On Fri, Jul 5, 2013 at 7:03 PM, Roberto Ragusa wrote:
> On 07/05/2013 01:27 PM, VIGNESH S wrote:
> > Hi,
> >
Hi,
I am trying to migrate to Lucene 4.3.1.
I just want to do basic indexing. I added the Lucene core JAR and I am
getting the following exception:
07-01 15:11:13.763: E/AndroidRuntime(17123): Caused by:
java.lang.IllegalArgumentException: A SPI class of type
org.apache.lucene.codecs.Codec with name '
Hi,
I tried indexing on desktop. It works fine.
The loading error above occurs only on Android.
Any comments?
On Sat, Jul 13, 2013 at 12:39 PM, VIGNESH S wrote:
> Hi,
>
> I am Trying to migrate to Lucene 4.3.1
>
> I just want to do basic indexing.I added the Lucene C
, Jul 13, 2013 at 1:57 PM, Thomas Matthijs wrote:
> On Sat, Jul 13, 2013 at 10:25 AM, VIGNESH S
> wrote:
>
> > Hi,
> >
> > I tried indexing in Desktop..It works fine.
> > The above error loading error comes only in android..
> > Any comments..
>
>
> Don
ava (and
> doesn't implement/follow the spec)
>
> On Sat, Jul 13, 2013 at 4:31 AM, VIGNESH S
> wrote:
>
> > Hi,
> >
> > I did not strip META-INF/services and it contains the files.
> >
> > Even when I combined with other jars, I manually appended those entri
's an
> unsupported configuration.
>
> Uwe
>
> -
> Uwe Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
> > -Original Message-
> > From: VIGNESH S [mailto:vigneshkln...@gmail.com]
>
Hi,
I am trying to upgrade our older index from Lucene 3.6 to Lucene 4.2.
I need to understand the changes in the index structure.
Can anyone please post some articles and links through which I can
understand the indexing and search changes?
--
Thanks and Regards
Vignesh Srinivasan
9739135
Hi,
I am creating an index like this using Lucene 4.3.1.
I am using 3 fields like this:
FieldType offsetsType = new FieldType(TextField.TYPE_STORED);
offsetsType.setIndexed(true);
offsetsType.setStored(true);
offsetsType.setStoreTermVectors(true);
offsetsType.setTokenized(true);
offsetsType.setStoreTe
ss.com> wrote:
> MultiFields.getFields returns null if the reader has no postings. Are
> you sure you actually indexed any content?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Tue, Jul 16, 2013 at 3:06 AM, VIGNESH S
> wrote:
> > Hi,
> >
> > I am cr
Hi Mike,
If I use StringField.TYPE_STORED instead of TextField.TYPE_STORED, it
shows up in the contents of the MultiReader.
On Tue, Jul 16, 2013 at 7:16 PM, VIGNESH S wrote:
> Hi Mike,
>
> Thanks for your Reply.
>
> Contents are Getting Indexed..In MultiReader the other two fiel
term vectors) are inherited from
> TextField.TYPE_STORED.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Tue, Jul 16, 2013 at 9:56 AM, VIGNESH S
> wrote:
> > Hi Mike,
> >
> > If i use StringField.Type_STORED instead of TextField.Type_Stor
Hi,
I need to highlight the first sentence that matches the search keyword
in a document using PostingsHighlighter.
How can I do this?
Any help or suggestions are welcome.
--
Thanks and Regards
Vignesh Srinivasan
<
luc...@mikemccandless.com> wrote:
> On Wed, Jul 17, 2013 at 1:52 AM, VIGNESH S
> wrote:
> > Hi Mike,
> >
> > The Problem I mentioned is I used 3 Fields subject title, Content.
> >
> > I indexed Subject and Title like this..
> >
> > doc.add(new StringField(su
Hi Mike,
I tried TestPostingsHighlighter.java, supplying my own
content.
When I search for "Android", it always returns the first
sentence as the highlighted text, whether or not that sentence contains the
searched keyword.
On Wed, Jul 17, 2013 at 3:48 PM, VIGNES
which happens when the document did not have any matched
> terms from the query. Are you sure your content matches Android? Can
> you post a full test case showing the issue?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Wed, Jul 17, 2013 at 10:12 AM, VIG
sage" (first N
> sentences), which happens when the document did not have any matched
> terms from the query. Are you sure your content matches Android? Can
> you post a full test case showing the issue?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On
> Yes, PH supports any analyzer.
>
> Mike
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Thu, Jul 18, 2013 at 2:57 AM, VIGNESH S
> wrote:
> > Hi Mike,
> >
> > I am getting the Search Hits.
> >
> > Will PostingsHighlighter
Hi,
What is the best way to delete a Lucene document in a large index?
--
Thanks and Regards
Vignesh Srinivasan
9739135640
5:13 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Use IndexWriter.deleteDocuments.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Tue, Aug 6, 2013 at 7:38 AM, VIGNESH S wrote:
> > Hi,
> >
> > which is the best
Hi
I have a large index with an open IndexWriter. Someone unknowingly deleted
some files in the index and corrupted it.
Once I get a CorruptIndexException, I delete the entire index folder,
close my IndexWriter, and restart the indexing process.
At that time, I am still getti
Hi,
I tried FastVectorHighlighter to highlight the first sentence in which the
match occurred.
I tried both ScoreOrderFragmentsBuilder and SimpleFragmentsBuilder. Neither of
them returns the first sentence where the search query matches.
Please kindly help me.
--
Thanks and Regards
Vignesh Sr
Hi,
In the example for MultiPhraseQuery, it is mentioned:
"To use this class, to search for the phrase "Microsoft app*" first use
add(Term) on the term "Microsoft", then find all terms that have "app" as
prefix using IndexReader.terms(Term), and use MultiPhraseQuery.add(Term[]
terms) to add them to
null) {
> l.add(br.utf8ToString());
> }
> return l;
> }
>
> --
> Ian.
>
> On Wed, Sep 25, 2013 at 3:04 PM, VIGNESH S
> wrote:
> > Hi,
> >
> > In the Example of Multiphrase Query it is mentioned
> >
> > "To use this class, to search for the p
esults.
What may be the reason?
On Fri, Sep 27, 2013 at 11:49 AM, VIGNESH S wrote:
> Hi Ian,
>
> Thanks for your Reply.
>
> I am doing similar to this only..In MultiPhraseQuery object actual phrase
> is going proper but it is not returning any hits..
>
> In Lucene 3.6,I
ComplexPhraseQueryParser
> b) SurroundQueryParser: oal.queryparser.surround.parser.QueryParser
> c) experimental:
> http://issues.apache.org/jira/browse/LUCENE-5205
>
>
> -Original Message-
> From: VIGNESH S [mailto:vigneshkln...@gmail.com]
> Sent: Friday, September
more obscure, and only related to
> the multi phrase stuff, I suggest you build a tiny but complete
> RAMDirectory based program or test case that shows the problem and
> post it here.
>
>
> --
> Ian.
>
>
>
> On Mon, Sep 30, 2013 at 6:46 AM, VIGNESH S
> wrote:
ne of the core lucene analyzers
> instead? Have you used Luke to verify that the index holds what you
> think it does?
>
>
> --
> Ian.
>
>
> On Mon, Sep 30, 2013 at 3:21 PM, VIGNESH S
> wrote:
> > Hi,
> >
> > It is not the problem with case..Beca
",".
On Thu, Oct 3, 2013 at 11:23 AM, VIGNESH S wrote:
> Hi Ian,
>
> In Lucene Is there any Default Analyzer we can use which will ignore only
> Spaces.
> All other numbers,punctuation,dates everything it should preserve.
>
> I created my analyzer wi
t a new
> thread if you need help with that. But from your previous email it
> sounds like you could use WhitespaceTokenizer chained with
> LowerCaseFilter.
>
>
> --
> Ian.
>
>
> On Thu, Oct 3, 2013 at 7:16 AM, VIGNESH S wrote:
> > Hi,
> >
> > In my A
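To make the effect of the suggestion in the reply above (WhitespaceTokenizer chained with LowerCaseFilter) easier to picture, here is a rough stand-in written with the Java standard library only. It is an approximation and not Lucene code: the class WhitespaceLowercaseTokens is hypothetical, and Java's \s regex class may not split on exactly the same characters as Lucene's tokenizer. The point it shows is that splitting on whitespace keeps punctuation, digits, and hyphens inside the tokens.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Toy stand-in (illustration only) for whitespace tokenizing followed by lowercasing. */
class WhitespaceLowercaseTokens {
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        // Split on runs of whitespace only; nothing else is treated as a separator.
        for (String piece : text.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }
}
```

With this scheme, the text "Dremel is a scalable, interactive ad-hoc query" yields the token "scalable," with its trailing comma and "ad-hoc" with its hyphen, so a query term has to carry the same punctuation to match.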
Hi,
I am trying to do a MultiPhraseQuery in Lucene 4.3. It works perfectly
for all scenarios except the one below.
When I try to search for a phrase that is preceded by any punctuation, it
does not work.
TextContent: Dremel is a scalable, interactive ad-hoc query system for
analysis
of
Hi,
Sorry, that's my typo.
It's not failing because of that.
On Thu, Oct 3, 2013 at 8:17 PM, Ian Lea wrote:
> Are you sure it's not failing because "adhoc" != "ad-hoc"?
>
>
> --
> Ian.
>
>
> On Thu, Oct 3, 2013 at 3:07 PM, VIGNESH S wrote:
>}
>while (trm.next() != null);
>
>if (!termsWithPrefix.isEmpty()) {
> MultiPhraseQuery mpquery = new MultiPhraseQuery();
> mpquery.add(termsWithPrefix.toArray(new Term[0]));
> bq.add(mpquery, BooleanClause.Occur.MUST);
>}
> }
>
> IndexSearcher sear
Hi,
How can I implement synonym search for all languages?
As far as I know, WordNet supports only English. Is there anything else we
can use to get support for all languages?
--
Thanks and Regards
Vignesh Srinivasan
.
Please kindly help.
On Mon, Oct 7, 2013 at 8:06 PM, Koji Sekiguchi wrote:
> (13/10/07 18:33), VIGNESH S wrote:
>
>> Hi,
>>
>> How to implement synonym Search for All languages..
>>
>> As far as i know,Wordnet has only English Support..Is there any other
from the index.
> Wikipedia was a concrete example for the description I used.
> Please see the figure in the article for the system architecture.
>
> koji
>
>
>
> (13/10/09 20:46), VIGNESH S wrote:
>
>> Hi Koji,
>>
>> Thanks for your reply and guidance.
>
Hi,
If someone removes some of the segments of my Lucene index from the file
system, how can I find out which documents were deleted?
--
Thanks and Regards
Vignesh Srinivasan
9739135640
Hi,
I have indexed the text file "filename.txt" below using the test code
G1.java.
When I search for "check for old", the trm.seekCeil() method returns "checking"
and "checks" but skips "check", which is present in the text document.
It works for most cases except a few.
Please kindly help me.
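For readers unfamiliar with the "ceiling" contract behind a seek like this, the following sketch imitates it with a sorted set from the Java standard library: the lookup returns the smallest term that is greater than or equal to the target. The class SeekCeilSketch is hypothetical and this is not Lucene's TermsEnum; Lucene orders terms by their bytes, while TreeSet orders Java strings, which can differ for some characters. The sketch only shows that a ceiling lookup can land on "checking" when the exact term "check" is not in the dictionary.

```java
import java.util.TreeSet;

/** Toy ceiling lookup (illustration only) over a sorted term dictionary. */
class SeekCeilSketch {
    /** Returns the smallest term that is >= target, or null if there is none. */
    static String seekCeil(TreeSet<String> sortedTerms, String target) {
        return sortedTerms.ceiling(target);
    }
}
```

If the dictionary holds "checking", "checks", and "old" but not "check" (for example because the token was indexed with an extra leading symbol, which is an assumption made here for illustration), seekCeil(terms, "check") returns "checking". Once "check" itself is present as a term, the same call returns "check".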
--
ll test case?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Wed, Oct 30, 2013 at 10:59 AM, VIGNESH S
> wrote:
> > Hi,
> >
> > I have indexed the below text file "filename.txt" using the test code
> > G1.java..
> >
Hi Mike,
I found the problem. The term is not indexed properly.
On Thu, Oct 31, 2013 at 7:19 AM, VIGNESH S wrote:
> Hi Mike,
>
> Please find the attached test case G1.java.
>
>
> On Wed, Oct 30, 2013 at 8:41 PM, Michael McCandless <
> luc...@mikemccandless.com> wrote
In that file, "Check" is not indexed properly because it is attached to the symbol
"•". How can I index it properly?
On Thu, Oct 31, 2013 at 9:58 AM, VIGNESH S wrote:
> Hi Mike,
> I got the problem.The term is not indexed properly..
>
>
> On Thu, Oct 31, 2
:12 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Pick a better analyzer.
>
> Maybe StandardAnalyzer?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Thu, Oct 31, 2013 at 2:22 AM, VIGNESH S
> wrote:
> > Hi Mike,
> >
Hi,
WhitespaceAnalyzer would be ideal for your requirement.
On Sat, Nov 16, 2013 at 1:51 AM, wrote:
> Hi,
>
> I implemented my Lucene solution using StandardAnalyzer for both indexing
> and searching. While testing, I noticed that special characters such as
> hyphens, forward slash etc. are o
Hi,
I tried indexing for PostingsHighlighter with
TextField.TYPE_NOT_STORED and then used PostingsHighlighter. I am
getting a NullPointerException.
But if I use TextField.TYPE_STORED, it works properly.
Can't I use PostingsHighlighter without storing the field? Please kindly help.
Below is t
ou hit? We should improve that if we can...
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Mon, Nov 25, 2013 at 6:43 AM, VIGNESH S
> wrote:
> > Hi,
> >
> > I tried indexing for PostingsHighligher with
> > TextField."TYPE_NO
Hi,
I am trying to delete a document from the index as shown below. It works with
Lucene 3.6, but the document is not deleted with Lucene 4.3.
Term term = new Term("path", value);
mWriter.deleteDocuments(term);
mWriter.commit();
Please kindly help..
--
Thanks and Regards
Vignesh Srinivasan
2 in this case, for 3.6 and 4.3, that demonstrates the problem.
>
>
> --
> Ian.
>
>
> On Fri, Nov 29, 2013 at 6:00 AM, VIGNESH S
> wrote:
> > Hi,
> >
> > I try deleting the document from the Index like below.It is working in
> case
> >
Hi ,
I tried Lucene 3.6 on both desktop and Android, and deleteDocuments
works fine.
I tried the Lucene 4.3 code on desktop and Android. On desktop it
deletes the index files without any problem.
On Android, I changed the following files to avoid the JAR problem:
{Codec.java,DocValuesForma
we Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
>
> > -Original Message-
> > From: VIGNESH S [mailto:vigneshkln...@gmail.com]
> > Sent: Tuesday, December 17, 2013 1:55 PM
> > To: java-user@lucene.