RE: Tika 1.2 PDF parse error - org.apache.pdfbox.cos.COSString cannot be cast to org.apache.pdfbox.cos.COSDictionary

2013-02-12 Thread Phani Kumar Samudrala
Sorry, I just realized I posted this to the wrong mailing list. Please ignore it.

-----Original Message-----
From: Phani Kumar Samudrala [mailto:phanikuma...@arisglobal.co.in]
Sent: Tuesday, February 12, 2013 3:53 PM
To: dev@tika.apache.org
Subject: Tika 1.2 PDF parse error

Tika 1.2 PDF parse error - org.apache.pdfbox.cos.COSString cannot be cast to org.apache.pdfbox.cos.COSDictionary

2013-02-12 Thread Phani Kumar Samudrala
I am using the Tika 1.2 Java API to extract text from a PDF, and I am getting the following exception. The error occurs only for some PDF documents; for other PDFs it works fine. I couldn't figure out a reason for this. When I tried Tika 1.1, it worked fine. Please let me know if any of