from:"VIGNESH S"

Help needed Regarding classification of Text Data using Lucene..

2013-01-09 Thread VIGNESH S

Hi, can anyone suggest how I can use Lucene for text classification? -- Thanks and Regards Vignesh Srinivasan

Reg Lucene Naive Bayesian classifier.

2013-01-14 Thread VIGNESH S

Hi, has anyone used the Naive Bayesian classifier? It would be really helpful if someone could post how to use the classifiers in Lucene. -- Thanks and Regards Vignesh Srinivasan 9739135640

Re: Reg Lucene Naive Bayesian classifier.

2013-01-14 Thread VIGNESH S

side. I thought the Lucene Naive Bayesian classifier would be useful. Please suggest whether classifying emails can be done using the Lucene Naive Bayesian classifier or any other Lucene classifier. Thanks and Regards Vignesh Srinivasan On Mon, Jan 14, 2013 at 7:23 PM, VIGNESH S wrote: > Hi, > > An

Need help regarding understanding internals of Lucene Index.

2013-01-25 Thread VIGNESH S

Hi, I recently started using Lucene. There are a lot of articles and news items about understanding Lucene at the API level. Can anyone help me understand in depth what happens internally inside a Lucene index: how it stores data, and which data structure Lucene uses for the inverted index?
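For readers with the same question, the core idea of an inverted index can be sketched in a few lines of plain Java. This is only a toy model, not Lucene code: the class name TinyInvertedIndex and its methods are made up for illustration, and Lucene's real index is split into segments with compressed on-disk term dictionaries and postings lists. The sketch maps each term to the sorted list of document IDs that contain it.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

/** Toy inverted index (hypothetical class, not part of Lucene). */
public class TinyInvertedIndex {
    // Sorted term dictionary: term -> postings list of document IDs.
    private final Map<String, List<Integer>> postings = new TreeMap<>();
    private int nextDocId = 0;

    /** Tokenize the text naively and record the new document ID under each term. */
    public int addDocument(String text) {
        int docId = nextDocId++;
        for (String token : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (token.isEmpty()) {
                continue;
            }
            List<Integer> list = postings.computeIfAbsent(token, k -> new ArrayList<>());
            // Document IDs only grow, so appending keeps each list sorted;
            // the check avoids listing a document twice for a repeated term.
            if (list.isEmpty() || list.get(list.size() - 1) != docId) {
                list.add(docId);
            }
        }
        return docId;
    }

    /** Look up the documents that contain the term (exact match after lowercasing). */
    public List<Integer> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT), Collections.emptyList());
    }

    public static void main(String[] args) {
        TinyInvertedIndex index = new TinyInvertedIndex();
        index.addDocument("Lucene stores an inverted index");
        index.addDocument("An index maps terms to documents");
        System.out.println(index.search("index"));
    }
}
```

A search is then a dictionary lookup followed by a walk over the postings list, which is why term queries stay fast as the number of documents grows.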

Re: Readers for extracting textual info from pd/doc/excel for indexing the actual content

2013-01-28 Thread VIGNESH S

Apache Tika: you can use it to extract text from PDF and Word documents. Internally it uses Apache POI to extract text from Office documents and PDFBox to extract text from PDF documents. On Sat, Jan 26, 2013 at 4:24 AM, saisantoshi wrote: > I want to index the document content(

How to properly use updatedocument in lucene.

2013-01-31 Thread VIGNESH S

Hi All, I have a basic doubt. I am trying to update a Lucene document field with a new value. Below is my code. It does not give any errors, but it also does not update the document's field. Document d = searcher.doc(docId); writer1 = new IndexWriter(csDirectory, new IndexWriterC

Re: How to properly use updatedocument in lucene.

2013-01-31 Thread VIGNESH S

; > Separately, that's the right way to call .updateDocument, but you must > ensure FILE_NAME_FIELD was indexed for the first document, with the > value "new1.docx". > > Can you include how you indexed the original document? > > Mike McCandless > > htt

Re: How to implement Lucene

2013-02-05 Thread VIGNESH S

Hi, for the basics of Lucene, such as how to create a Lucene index, look into the book Lucene in Action. On Tue, Feb 5, 2013 at 6:28 PM, Álvaro Vargas Quezada wrote: > Hello, > I want to implement a central index, and I heard about Lucene, so I would > like to ask your help to install it an

Need Help:How to Get the enumeration of Terms Ending with a given word

2013-02-14 Thread VIGNESH S

Hi, I have two questions. 1. How do I get the enumeration of terms ending with a given word? I saw that we can get an enumeration of terms starting at a given term with the IndexReader.terms(Term) method. 2. I am actually doing a multi-phrase query, in which I do a suffix query on the first word. How can I do please
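One common way to approach the first question is to index each term in reversed form as well, so that a suffix lookup becomes a prefix scan over a sorted dictionary. The sketch below models that idea in plain Java with a TreeSet. It is not Lucene code, and the class name SuffixTerms is hypothetical. In a real index, Lucene's ReverseStringFilter could be used to produce the reversed terms, but wiring it into an analyzer is left out here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

/** Toy model of suffix lookup over a sorted term dictionary (not Lucene code). */
public class SuffixTerms {
    // Terms are stored reversed, so a shared suffix becomes a shared prefix.
    private final TreeSet<String> reversedTerms = new TreeSet<>();

    private static String reverse(String s) {
        return new StringBuilder(s).reverse().toString();
    }

    public void add(String term) {
        reversedTerms.add(reverse(term));
    }

    /** Return every stored term that ends with the suffix, in reversed-term order. */
    public List<String> endingWith(String suffix) {
        String prefix = reverse(suffix);
        List<String> result = new ArrayList<>();
        // tailSet starts at the first reversed term >= prefix; all matches are contiguous.
        for (String reversed : reversedTerms.tailSet(prefix)) {
            if (!reversed.startsWith(prefix)) {
                break;
            }
            result.add(reverse(reversed));
        }
        return result;
    }

    public static void main(String[] args) {
        SuffixTerms dict = new SuffixTerms();
        for (String term : new String[] {"checking", "running", "check", "ring"}) {
            dict.add(term);
        }
        System.out.println(dict.endingWith("ing"));
    }
}
```

The terms collected this way could then be passed as one position of a multi-phrase query, which is what the second question is asking about.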

Re: Securing stored data using Lucene

2013-07-03 Thread VIGNESH S

Hi Rafaela, look at Lucene Transform. It might help to encrypt Lucene documents. https://code.google.com/p/lucenetransform/ On Wed, Jun 26, 2013 at 2:36 PM, Rafaela Voiculescu < rafaela.voicule...@gmail.com> wrote: > Hello, > > Thank you all for your help and the suggestions. They are very usef

Regarding Lucene Highlighting feature.

2013-07-04 Thread VIGNESH S

Hi, is it mandatory to use "Store.YES" when using the highlighting feature? Is it possible to use highlighting without "Store.YES" while indexing? It almost doubles the index size. Please kindly help. -- Thanks and Regards Vignesh Srinivasan 9739135640

Re: Regarding Lucene Highlighting feature.

2013-07-05 Thread VIGNESH S

Hi, I think using the CompressingStoredFieldsFormat feature introduced in Lucene 4.1 may help reduce index size. Any other comments and suggestions are welcome on this topic. Thanks and Regards Vignesh Srinivasan 9739135640 On Thu, Jul 4, 2013 at 6:38 PM, VIGNESH S wrote: > Hi, >

Re: Regarding Lucene Highlighting feature.

2013-07-10 Thread VIGNESH S

compression techniques I can apply so that they reduce disk space without increasing the search time. Thanks and Regards Vignesh Srinivasan On Fri, Jul 5, 2013 at 7:03 PM, Roberto Ragusa wrote: > On 07/05/2013 01:27 PM, VIGNESH S wrote: > > Hi, > >

A SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene42' does not exist

2013-07-13 Thread VIGNESH S

Hi, I am trying to migrate to Lucene 4.3.1. I just want to do basic indexing. I added the Lucene Core jar and I am getting an exception: 07-01 15:11:13.763: E/AndroidRuntime(17123): Caused by: java.lang.IllegalArgumentException: A SPI class of type org.apache.lucene.codecs.Codec with name '

Re: A SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene42' does not exist

2013-07-13 Thread VIGNESH S

Hi, I tried indexing on desktop and it works fine. The above loading error occurs only on Android. Any comments? On Sat, Jul 13, 2013 at 12:39 PM, VIGNESH S wrote: > Hi, > > I am Trying to migrate to Lucene 4.3.1 > > I just want to do basic indexing.I added the Lucene C

Re: A SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene42' does not exist

2013-07-13 Thread VIGNESH S

, Jul 13, 2013 at 1:57 PM, Thomas Matthijs wrote: > On Sat, Jul 13, 2013 at 10:25 AM, VIGNESH S > wrote: > > > Hi, > > > > I tried indexing in Desktop..It works fine. > > The above error loading error comes only in android.. > > Any comments.. > > > Don

Re: A SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene42' does not exist

2013-07-13 Thread VIGNESH S

ava (and > doesnt implement/follow the spec) > > On Sat, Jul 13, 2013 at 4:31 AM, VIGNESH S > wrote: > > > Hi, > > > > I did not striped META-INF/services and it contains the files. > > > > Even when i combined with other jars,i manually appended those entri

Re: A SPI class of type org.apache.lucene.codecs.Codec with name 'Lucene42' does not exist

2013-07-13 Thread VIGNESH S

;s an > unsupported configuration. > > Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: VIGNESH S [mailto:vigneshkln...@gmail.com] >

Features added after Lucene 4

2013-07-15 Thread VIGNESH S

Hi, I am trying to upgrade our older index from Lucene 3.6 to Lucene 4.2. I need to understand the changes in the index structure. Can anyone please post some articles and links through which I can understand the indexing and search changes? -- Thanks and Regards Vignesh Srinivasan 9739135

MultiFields.getReader() returns null

2013-07-16 Thread VIGNESH S

Hi, I am creating an index like this using Lucene 4.3.1. I am using 3 fields like: FieldType offsetsType = new FieldType(TextField.TYPE_STORED); offsetsType.setIndexed(true); offsetsType.setStored(true); offsetsType.setStoreTermVectors(true); offsetsType.setTokenized(true); offsetsType.setStoreTe

Re: MultiFields.getReader() returns null

2013-07-16 Thread VIGNESH S

ss.com> wrote: > MultiFields.getFields returns null if the reader has no postings. Are > you sure you actually indexed any content? > > Mike McCandless > > http://blog.mikemccandless.com > > > On Tue, Jul 16, 2013 at 3:06 AM, VIGNESH S > wrote: > > Hi, > > > > I am cr

Re: MultiFields.getReader() returns null

2013-07-16 Thread VIGNESH S

Hi Mike, if I use StringField.TYPE_STORED instead of TextField.TYPE_STORED, the field shows up in the contents of the MultiReader. On Tue, Jul 16, 2013 at 7:16 PM, VIGNESH S wrote: > Hi Mike, > > Thanks for your Reply. > > Contents are Getting Indexed..In MultiReader the other two fiel

Re: MultiFields.getReader() returns null

2013-07-16 Thread VIGNESH S

term vectors) are inherited from > TextField.TYPE_STORED. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Tue, Jul 16, 2013 at 9:56 AM, VIGNESH S > wrote: > > Hi Mike, > > > > If i use StringField.Type_STORED instead of TextField.Type_Stor

PostingsHighlighter to highlight the first Match ion the document

2013-07-17 Thread VIGNESH S

Hi, I need to highlight the first sentence that matches the search keyword in a document using PostingsHighlighter. How can I do this? Any help or suggestions are welcome. -- Thanks and Regards Vignesh Srinivasan

Re: MultiFields.getReader() returns null

2013-07-17 Thread VIGNESH S

t; luc...@mikemccandless.com> wrote: > On Wed, Jul 17, 2013 at 1:52 AM, VIGNESH S > wrote: > > Hi Mike, > > > > The Problem I mentioned is I used 3 Fields subject title, Content. > > > > I indexed Subject and Title like this.. > > > > doc.add(new StringField(su

Re: PostingsHighlighter to highlight the first Match ion the document

2013-07-17 Thread VIGNESH S

Hi Mike, I tried TestPostingsHighlighter.java with my own content. If I search for "Android", it always returns the first sentence as the highlighted text, whether or not the sentence contains the searched keyword. On Wed, Jul 17, 2013 at 3:48 PM, VIGNES

Re: PostingsHighlighter to highlight the first Match ion the document

2013-07-17 Thread VIGNESH S

which happens when the document did not have any matched > terms from the query. Are you sure your content matches Android? Can > you post a full test case showing the issue? > > Mike McCandless > > http://blog.mikemccandless.com > > > On Wed, Jul 17, 2013 at 10:12 AM, VIG

Re: PostingsHighlighter to highlight the first Match ion the document

2013-07-17 Thread VIGNESH S

sage" (first N > sentences), which happens when the document did not have any matched > terms from the query. Are you sure your content matches Android? Can > you post a full test case showing the issue? > > Mike McCandless > > http://blog.mikemccandless.com > > > On

Re: PostingsHighlighter to highlight the first Match ion the document

2013-07-22 Thread VIGNESH S

> Yes, PH supports any analyzer. > > Mike > > Mike McCandless > > http://blog.mikemccandless.com > > > On Thu, Jul 18, 2013 at 2:57 AM, VIGNESH S > wrote: > > Hi Mike, > > > > I am getting the Search Hits. > > > > Will PostingsHighlighter

Best Way to delete a Lucene Index

2013-08-06 Thread VIGNESH S

Hi, what is the best way to delete a Lucene document in a large index? -- Thanks and Regards Vignesh Srinivasan 9739135640

Re: Best Way to delete a Lucene Index

2013-08-06 Thread VIGNESH S

5:13 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > Use IndexWriter.deleteDocument. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Tue, Aug 6, 2013 at 7:38 AM, VIGNESH S wrote: > > Hi, > > > > which is the best

Getting LockObtainedFailedException while deleting corrupted index and indexing on a newly created index

2013-08-06 Thread VIGNESH S

Hi, I have a large index with an open IndexWriter. Someone unknowingly deleted some files in the index and corrupted it. Once I get the CorruptIndexException, I delete the entire index folder, close my IndexWriter, and restart the indexing process. At that time, I am Still Getti

Reg FastVectorHighlighter returning first sentence of match

2013-08-27 Thread VIGNESH S

Hi, I tried FastVectorHighlighter to highlight the first sentence in which the match occurred. I tried both ScoreOrderFragmentsBuilder and SimpleFragmentsBuilder. Neither of them returns the first sentence where the search query matches. Please kindly help me. -- Thanks and Regards Vignesh Sr

Multiphrase Query in Lucene 4.3

2013-09-25 Thread VIGNESH S

Hi, In the Example of Multiphrase Query it is mentioned "To use this class, to search for the phrase "Microsoft app*" first use add(Term) on the term "Microsoft", then find all terms that have "app" as prefix using IndexReader.terms(Term), and use MultiPhraseQuery.add(Term[] terms) to add them to
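The expansion step in the quoted Javadoc (find every term that starts with "app", then hand them all to MultiPhraseQuery.add(Term[])) boils down to a seek into a sorted term dictionary followed by a forward scan. The sketch below models only that step in plain Java, with a TreeSet standing in for the term dictionary. The class name PrefixExpansion is hypothetical and this is not Lucene code. In Lucene 4.x the scan is done with TermsEnum.seekCeil and next() rather than IndexReader.terms(Term).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

/** Toy model of prefix expansion over a sorted term dictionary (not Lucene code). */
public class PrefixExpansion {

    /** Collect every term in the sorted dictionary that starts with the prefix. */
    public static List<String> expand(TreeSet<String> terms, String prefix) {
        List<String> matches = new ArrayList<>();
        // Ceiling-style seek: start at the smallest term >= prefix, then scan forward.
        for (String term : terms.tailSet(prefix, true)) {
            if (!term.startsWith(prefix)) {
                break; // sorted order guarantees no later term can match
            }
            matches.add(term);
        }
        return matches;
    }

    public static void main(String[] args) {
        TreeSet<String> terms = new TreeSet<>(Arrays.asList(
                "microsoft", "app", "apple", "application", "apply", "banana"));
        System.out.println(expand(terms, "app"));
    }
}
```

Note that the seek lands on the first term greater than or equal to the prefix, so an exact match such as "app" is included only if that exact term is present in the dictionary.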

Re: Multiphrase Query in Lucene 4.3

2013-09-26 Thread VIGNESH S

null) { > l.add(br.utf8ToString()); > } > return l; > } > > -- > Ian. > > On Wed, Sep 25, 2013 at 3:04 PM, VIGNESH S > wrote: > > Hi, > > > > In the Example of Multiphrase Query it is mentioned > > > > "To use this class, to search for the p

Re: Multiphrase Query in Lucene 4.3

2013-09-27 Thread VIGNESH S

esults. What may be the reason.. On Fri, Sep 27, 2013 at 11:49 AM, VIGNESH S wrote: > Hi Lan, > > Thanks for your Reply. > > I am doing similar to this only..In MultiPhraseQuery object actual phrase > is going proper but it is not returning any hits.. > > In Lucene 3.6,I

Re: Multiphrase Query in Lucene 4.3

2013-09-29 Thread VIGNESH S

ComplexPhraseQueryParser > b) SurroundQueryParser: oal.queryparser.surround.parser.QueryParser > c) experimental: > http://issues.apache.org/jira/browse/LUCENE-5205 > > > -Original Message- > From: VIGNESH S [mailto:vigneshkln...@gmail.com] > Sent: Friday, September

Re: Multiphrase Query in Lucene 4.3

2013-09-30 Thread VIGNESH S

more obscure, and only related to > the multi phrase stuff, I suggest you build a tiny but complete > RAMDirectory based program or test case that shows the problem and > post it here. > > > -- > Ian. > > > > On Mon, Sep 30, 2013 at 6:46 AM, VIGNESH S > wrote:

Re: Multiphrase Query in Lucene 4.3

2013-10-02 Thread VIGNESH S

ne of the core lucene analyzers > instead? Have you used Luke to verify that the index holds what you > think it does? > > > -- > Ian. > > > On Mon, Sep 30, 2013 at 3:21 PM, VIGNESH S > wrote: > > Hi, > > > > It is not the problem with case..Beca

Re: Multiphrase Query in Lucene 4.3

2013-10-02 Thread VIGNESH S

",". On Thu, Oct 3, 2013 at 11:23 AM, VIGNESH S wrote: > Hi Ian, > > In Lucene Is there any Default Analyzer we can use which will ignore only > Spaces. > All other numbers,punctuation,dates everything it should preserve. > > I created my analyzer wi

Re: Multiphrase Query in Lucene 4.3

2013-10-03 Thread VIGNESH S

t a new > thread if you need help with that. But from your previous email it > sounds like you could use WhitespaceTokenizer chained with > LowerCaseFilter. > > > -- > Ian. > > > On Thu, Oct 3, 2013 at 7:16 AM, VIGNESH S wrote: > > Hi, > > > > In my A

Problem with MultiPhrase Query in Lucene 4.3

2013-10-03 Thread VIGNESH S

Hi, I am trying to do a MultiPhraseQuery in Lucene 4.3. It works perfectly for all scenarios except the one below. When I try to search for a phrase that is preceded by any punctuation, it does not work. TextContent: Dremel is a scalable, interactive ad-hoc query system for analysis of

Re: Problem with MultiPhrase Query in Lucene 4.3

2013-10-03 Thread VIGNESH S

Hi, sorry, that's my typo. It's not failing because of that. On Thu, Oct 3, 2013 at 8:17 PM, Ian Lea wrote: > Are you sure it's not failing because "adhoc" != "ad-hoc"? > > > -- > Ian. > > > On Thu, Oct 3, 2013 at 3:07 PM, VIGNESH S wrote:

Re: Problem with MultiPhrase Query in Lucene 4.3

2013-10-03 Thread VIGNESH S

>} >while (trm.next() != null); > >if (!termsWithPrefix.isEmpty()) { > MultiPhraseQuery mpquery = new MultiPhraseQuery(); > mpquery.add(termsWithPrefix.toArray(new Term[0])); > bq.add(mpquery, BooleanClause.Occur.MUST); >} > } > > IndexSearcher sear

Synonym Search in Lucene..

2013-10-07 Thread VIGNESH S

Hi, how can I implement synonym search for all languages? As far as I know, WordNet has only English support. Is there anything else we can use to get support for all languages? -- Thanks and Regards Vignesh Srinivasan

Re: Synonym Search in Lucene..

2013-10-09 Thread VIGNESH S

. Please kindly help. On Mon, Oct 7, 2013 at 8:06 PM, Koji Sekiguchi wrote: > (13/10/07 18:33), VIGNESH S wrote: > >> Hi, >> >> How to implement synonym Search for All languages.. >> >> As far as i know,Wordnet has only English Support..Is there any other

Re: Synonym Search in Lucene..

2013-10-09 Thread VIGNESH S

from the index. > Wikipedia was a concrete example for the description I used. > Please see the figure in the article for the system architecture. > > koji > > > > (13/10/09 20:46), VIGNESH S wrote: > >> Hi Koji, >> >> Thanks for your reply and guidance. >

Finding which document is deleted when someone Manually delete the Index

2013-10-11 Thread VIGNESH S

Hi, if someone removes some of the segments in my Lucene index from the file system, how can I find out which documents were deleted? -- Thanks and Regards Vignesh Srinivasan 9739135640

trm.seekCeil() not giving proper value when used in MP Query for some words

2013-10-30 Thread VIGNESH S

Hi, I have indexed the text file "filename.txt" below using the test code G1.java. When I search for "check for old", the trm.seekCeil() method gives "checking" and "checks" and ignores "check", which is present in the text document. It works for most cases except a few. Please kindly help me. --

Re: trm.seekCeil() not giving proper value when used in MP Query for some words

2013-10-30 Thread VIGNESH S

ll test case? > > Mike McCandless > > http://blog.mikemccandless.com > > > On Wed, Oct 30, 2013 at 10:59 AM, VIGNESH S > wrote: > > Hi, > > > > I have indexed the below text file "filename.txt" using the test code > > G1.java.. > > &g

Re: trm.seekCeil() not giving proper value when used in MP Query for some words

2013-10-30 Thread VIGNESH S

Hi Mike, I found the problem. The term is not indexed properly. On Thu, Oct 31, 2013 at 7:19 AM, VIGNESH S wrote: > Hi Mike, > > please find the attached test case G1.java.. > > > On Wed, Oct 30, 2013 at 8:41 PM, Michael McCandless < > luc...@mikemccandless.com> wrote

Re: trm.seekCeil() not giving proper value when used in MP Query for some words

2013-10-30 Thread VIGNESH S

In that file, "Check" is not getting indexed properly since it has the symbol "•". How can I index it properly? On Thu, Oct 31, 2013 at 9:58 AM, VIGNESH S wrote: > Hi Mike, > I got the problem.The term is not indexed properly.. > > > On Thu, Oct 31, 2
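The effect described here can be reproduced without Lucene. The sketch below assumes the bullet sits directly in front of the word with no space (the original file is not shown, so that placement is a guess) and compares two toy tokenizers: one that splits only on whitespace, and one that splits on anything that is not a letter or digit and lowercases the result. Neither is a Lucene analyzer, and the class name BulletTokens is hypothetical. The second tokenizer only roughly approximates what a punctuation-aware analyzer does.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

/** Toy tokenizers showing how an attached bullet changes the indexed term (not Lucene code). */
public class BulletTokens {

    /** Split on whitespace only: punctuation stays glued to the word. */
    public static List<String> whitespaceTokens(String text) {
        return Arrays.asList(text.trim().split("\\s+"));
    }

    /** Split on every run of non-letter, non-digit characters and lowercase the tokens. */
    public static List<String> letterDigitTokens(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                tokens.add(token.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // \u2022 is the bullet character; its position in the text is an assumption.
        String line = "\u2022Check for old files";
        System.out.println(whitespaceTokens(line));
        System.out.println(letterDigitTokens(line));
    }
}
```

With whitespace splitting, the first token keeps the bullet and the capital letter, so a lookup for the plain term "check" cannot find it. With the second tokenizer the bullet is dropped and the term becomes "check".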

Re: trm.seekCeil() not giving proper value when used in MP Query for some words

2013-10-31 Thread VIGNESH S

:12 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > Pick a better analyzer. > > Maybe StandardAnalyzer? > > Mike McCandless > > http://blog.mikemccandless.com > > > On Thu, Oct 31, 2013 at 2:22 AM, VIGNESH S > wrote: > > Hi Mike, > >

Re: WhitespaceAnalyzer vs StandardAnalyzer

2013-11-17 Thread VIGNESH S

Hi, WhitespaceAnalyzer would be ideal for your requirement. On Sat, Nov 16, 2013 at 1:51 AM, wrote: > Hi, > > I implemented my Lucene solution using StandardAnalyzer for both indexing > and searching. While testing, I noticed that special characters such as > hyphens, forward slash etc. are o

Help in Lucene Postings Highlighter..

2013-11-25 Thread VIGNESH S

Hi, I tried indexing for PostingsHighlighter with TextField.TYPE_NOT_STORED and used the postings highlighter. I am getting a NullPointerException. But if I use TextField.TYPE_STORED it works properly. Can't I use PostingsHighlighter without storing the field? Please kindly help. Below is t

Re: Help in Lucene Postings Highlighter..

2013-11-25 Thread VIGNESH S

ou hit? We should improve that if we can... > > Mike McCandless > > http://blog.mikemccandless.com > > > On Mon, Nov 25, 2013 at 6:43 AM, VIGNESH S > wrote: > > Hi, > > > > I tried indexing for PostingsHighligher with > > TextField."TYPE_NO

Deletion of Index not happening in Lucene 4.3

2013-11-28 Thread VIGNESH S

Hi, I try deleting the document from the index as shown below. It works with Lucene 3.6, but the document is not deleted with Lucene 4.3. Term term = new Term("path", value); mWriter.deleteDocuments(term); mWriter.commit(); Please kindly help. -- Thanks and Regards Vignesh Srinivasan

Re: Deletion of Index not happening in Lucene 4.3

2013-11-29 Thread VIGNESH S

2 in this case, for 3.6 and 4.3, that demonstrates the problem. > > > -- > Ian. > > > On Fri, Nov 29, 2013 at 6:00 AM, VIGNESH S > wrote: > > Hi, > > > > I try deleting the document from the Index like below.It is working in > case > >

Lucene deleteDocuments(Term)not working in android working in desktop..

2013-12-17 Thread VIGNESH S

Hi, I tried Lucene 3.6 on both desktop and Android, and deleteDocuments works fine. I tried the Lucene 4.3 code on desktop and Android. On desktop it deletes the index files without any problem. On Android, I changed the following files to avoid the jar problem. {Codec.java,DocValuesForma

Re: Lucene deleteDocuments(Term)not working in android working in desktop..

2013-12-17 Thread VIGNESH S

we Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: VIGNESH S [mailto:vigneshkln...@gmail.com] > > Sent: Tuesday, December 17, 2013 1:55 PM > > To: java-user@lucene.

60 matches
