Did you double check the <lib> element in your solrconfig.xml which points to the Tika jar you're using?
2011/3/9 Deepak Singh <deep...@praumtech.com> > > downloaded apache-solr-3.1 still it giving TIKA Exception > > > On Wed, Mar 9, 2011 at 5:11 PM, Deepak Singh <deep...@praumtech.com>wrote: > >> oh, thanks for the better solution. >> >> >> On Wed, Mar 9, 2011 at 4:36 PM, Uwe Schindler <u...@thetaphi.de> wrote: >> >>> Hi, >>> >>> >>> >>> These are all bugs in Apache TIKA not Solr, some of them are already >>> fixed in later TIKA versions (so you may try the soon-to-be-released Solr >>> 3.1 version which contains a newer TIKA bundled). >>> >>> >>> >>> Uwe >>> >>> >>> >>> ----- >>> >>> Uwe Schindler >>> >>> H.-H.-Meier-Allee 63, D-28213 Bremen >>> >>> http://www.thetaphi.de >>> >>> eMail: u...@thetaphi.de >>> >>> >>> >>> *From:* Deepak Singh [mailto:deep...@praumtech.com] >>> *Sent:* Wednesday, March 09, 2011 12:03 PM >>> *To:* dev@lucene.apache.org >>> *Subject:* Re: Solr Exception >>> >>> >>> >>> >>> *HTTP ERROR :500 (INTERNAL SERVER ERROR)* >>> >>> *For DOC files:* >>> org.apache.tika.exception. >>> >>> TikaException : >>> -Unexpected RuntimeException from >>> org.apache.tika.parser.microsoft.OfficeParser@1248f2 >>> Caused by: org.apache.poi.hpsf.IllegalPropertySetDataException: The >>> property set claims to have a size of 16 bytes. However, it exceeds 16 >>> bytes. >>> >>> -TIKA-198: Illegal IOException from >>> org.apache.tika.parser.microsoft.OfficeParser@1248f2 >>> Caused by: java.io.IOException: block[ 0 ] already removed - does your >>> POIFS have circular or duplicate block references? >>> >>> >>> *For PDF files:* >>> org.apache.tika.exception.TikaException : >>> -Unexpected RuntimeException from >>> org.apache.tika.parser.Pdfparser@1b4cd65 >>> Caused by: java.lang.ClassCastException: org.pdfbox.cos.COSArray cannot >>> be cast to org.pdfbox.cos.COSDictionar >>> Caused by: java.lang.NullPointerException >>> >>> >>> >>> -Unable to extract PDF content >>> >>> *HTTP ERROR:400 (BAD REQUEST)* >>> -This error come when some fields are missing >>> ERROR:unknown field 'language' (Ex:content_status, description,version) >>> >>> >>> >>> On Wed, Mar 9, 2011 at 4:19 PM, Gora Mohanty <g...@mimirtech.com> wrote: >>> >>> Hi, >>> >>> This is probably better directed to the user list. Also, please provide >>> details of the exceptions from your log files. >>> >>> Regards, >>> Gora >>> >>> >>> >> >> >