Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++
>
>
>
>
--
---
Hong-Thai NGUYEN
Tel.: 06 27 04 86 22
+1 for me
Built on Windows and tested with an internal corpus. There is no regression.
Moreover, more ppt documents are converted successfully compared with 1.9.
Great job, David and others!
Thanks,
Hong-Thai
-Original Message-
From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
Sent: mar
hief Architect
> > Instrument Software and Science Data Systems Section (398)
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 168-519, Mailstop: 168-527
> > Email: chris.a.mattm...@nasa.gov
> > WWW: http://sunset.usc.edu/~mattmann/
> > ++
> > Adjunct Associate Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++
> >
> >
> >
>
--
---
Hong-Thai NGUYEN
Tel.: 06 27 04 86 22
ycastle.org/latest_releases.html).
>
> --
> Regards,
> Konstantin Gribov
>
> Wed, 29 Apr 2015 at 16:43, Hong-Thai Nguyen:
>
> > Hi folks,
> >
> > I'm +1 for announcing the end of JDK 1.6 support in the upcoming 1.9.
> >
> > FYI, we are havin
Hi folks,
I'm +1 for announcing the end of JDK 1.6 support in the upcoming 1.9.
FYI, we still have some legacy dependencies built specifically for JDK 1.5
(*jdk15*):
$ mvn dependency:tree
[INFO] Scanning for projects...
[INFO]
[INFO]
Hi,
+1 for me.
Great work, Tyler!
Hong-Thai
-Original Message-
From: Tyler Palsulich [mailto:tpalsul...@apache.org]
Sent: Monday, 13 April 2015 19:56
To: dev@tika.apache.org; u...@tika.apache.org
Subject: [VOTE] Apache Tika 1.8 Release Candidate #2
Hi Folks,
A candidate for the Tika
[
https://issues.apache.org/jira/browse/TIKA-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492084#comment-14492084
]
Hong-Thai Nguyen commented on TIKA-1600:
The root exception is an NPE when par
Not yet, I'm investigating more on TIKA-1600 today.
Hong-Thai
-Original Message-
From: Allison, Timothy B. [mailto:talli...@mitre.org]
Sent: Monday, 13 April 2015 01:07
To: dev@tika.apache.org
Subject: RE: [VOTE] Release Apache Tika 1.8 Candidate #1
I don't think we've solved TIKA-1600,
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14386900#comment-14386900
]
Hong-Thai Nguyen commented on TIKA-1581:
And many thanks to [~kkrugler] with
+1 for 1.8
Hong-Thai
> On 28 Mar 2015, at 16:01, Tyler Palsulich wrote:
>
> Hi Folks,
>
> Now that TIKA-1581 (JHighlight licensing issues) is resolved, we need to
> release a new version of Tika. I'll volunteer to be the release manager
> again.
>
> Should we release this as 1.8 or 1.7.1?
>
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1581.
Resolution: Fixed
> jhighlight license conce
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1581:
---
Fix Version/s: 1.8
> jhighlight license conce
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14383827#comment-14383827
]
Hong-Thai Nguyen commented on TIKA-1581:
On r1669583, I switched to la
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371432#comment-14371432
]
Hong-Thai Nguyen edited comment on TIKA-1581 at 3/20/15 3:3
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371432#comment-14371432
]
Hong-Thai Nguyen edited comment on TIKA-1581 at 3/20/15 3:1
[
https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371432#comment-14371432
]
Hong-Thai Nguyen commented on TIKA-1581:
I've contacted also 'g
I've rerun some regression tests. They seem fine to me too, so +1.
Great job, Tyler!
On Fri, Jan 9, 2015 at 11:02 PM, Tyler Palsulich
wrote:
> Hi All,
>
> A candidate for the Tika 1.7 release is available at:
> https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a z
Seems fine to me: +1
No major regression on our test corpus of 23K docs:
15-01-07 18:19:27 INFO (DocumentConversionErrorPlugin.java : 116)
[pool-3-thread-1] Summary of document conversion errors:
- pdf (4)
* (2) org.apache.tika.exception.TikaException: TIKA-198: Illegal
IOException from org.apach
[
https://issues.apache.org/jira/browse/TIKA-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14264786#comment-14264786
]
Hong-Thai Nguyen commented on TIKA-1505:
Can you provide also problem files
[
https://issues.apache.org/jira/browse/TIKA-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-672.
---
Resolution: Fixed
Checked: no more System.err/System.out calls inside the CHM parser
> Proper er
[
https://issues.apache.org/jira/browse/TIKA-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-672:
--
Fix Version/s: 1.7
> Proper error handling in the CHM par
[
https://issues.apache.org/jira/browse/TIKA-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1448:
---
Fix Version/s: 1.7
> CHM parser : defect in file extract
[
https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1446:
---
Fix Version/s: 1.7
> CHM parser : wrong decompression of aligned blo
[
https://issues.apache.org/jira/browse/TIKA-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1430:
---
Fix Version/s: 1.7
> CHM parser gets faulty text (fix fo
[
https://issues.apache.org/jira/browse/TIKA-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1430.
Resolution: Fixed
> CHM parser gets faulty text (fix fo
[
https://issues.apache.org/jira/browse/TIKA-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1447:
---
Fix Version/s: 1.7
> CHM parser: wrong directory l
[
https://issues.apache.org/jira/browse/TIKA-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1448.
Resolution: Fixed
> CHM parser : defect in file extract
[
https://issues.apache.org/jira/browse/TIKA-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1447.
Resolution: Fixed
> CHM parser: wrong directory l
[
https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1446.
Resolution: Fixed
> CHM parser : wrong decompression of aligned blo
Hi,
I've pushed a minor fix to pass this test on Windows.
Thanks,
On Mon, Nov 17, 2014 at 4:28 PM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:
> +1, agreed, Dave would be nice to have one as a default.
>
> ++
>
Yes, that's exactly what I'm doing. If we move to Git, we'll avoid all the SVN
stuff. Anyway, this concerns committers only.
On Mon, Nov 17, 2014 at 12:08 PM, Nick Burch wrote:
> On Mon, 17 Nov 2014, Hong-Thai Nguyen wrote:
>
>> I didn't realize that we could commit/pu
I didn't realize that we could commit/push directly to the Git repo. Could we?
Cheers
On Mon, Nov 17, 2014 at 11:46 AM, Nick Burch wrote:
> On Mon, 17 Nov 2014, Hong-Thai Nguyen wrote:
>
>> Git is supported everywhere and offers many new features. Should we
>> abandon S
[
https://issues.apache.org/jira/browse/TIKA-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214535#comment-14214535
]
Hong-Thai Nguyen commented on TIKA-1447:
[~binhawking], The work on TIKA-
Hi all,
Git is supported everywhere and offers many new features. Should we
abandon the SVN repo and move to Git permanently, to make it easier to apply
fixes and contributions?
Thanks,
--
Hong-Thai
[
https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14208079#comment-14208079
]
Hong-Thai Nguyen edited comment on TIKA-1446 at 11/12/14 2:3
[
https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14208079#comment-14208079
]
Hong-Thai Nguyen commented on TIKA-1446:
Hi [~binhawking], I've merge
[
https://issues.apache.org/jira/browse/TIKA-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14196343#comment-14196343
]
Hong-Thai Nguyen commented on TIKA-1463:
Thanks [~lfcnassif], without
[
https://issues.apache.org/jira/browse/TIKA-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen closed TIKA-1463.
--
Resolution: Fixed
> TesseractOCRParser does not work in Wind
[
https://issues.apache.org/jira/browse/TIKA-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1463:
---
Description:
STR:
* Case 1:
** Setting tesseractPath to a common installation path of
[
https://issues.apache.org/jira/browse/TIKA-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1463:
---
Summary: TesseractOCRParser does not work in Windows (was:
TesseractOCRParser does work in
[
https://issues.apache.org/jira/browse/TIKA-1463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14194694#comment-14194694
]
Hong-Thai Nguyen commented on TIKA-1463:
Fixed in r1636382
> TesseractOC
Hong-Thai Nguyen created TIKA-1463:
--
Summary: TesseractOCRParser does work in Windows
Key: TIKA-1463
URL: https://issues.apache.org/jira/browse/TIKA-1463
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181530#comment-14181530
]
Hong-Thai Nguyen commented on TIKA-1446:
Thanks a lot [~binhawking], I've q
Hi Chris,
Yes, I made a mistake in this commit: I missed a renamed file and broke the
build. The next commit corrected it:
Revision: 161
Author: thaichat04
Date: Tuesday, 21 October 2014 11:47:54
Message:
TIKA-1422 - Fixing build & minor refactory of naming test class
Modified :
/tika/trunk/tika-
[
https://issues.apache.org/jira/browse/TIKA-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14178186#comment-14178186
]
Hong-Thai Nguyen edited comment on TIKA-1422 at 10/21/14 9:4
[
https://issues.apache.org/jira/browse/TIKA-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14178186#comment-14178186
]
Hong-Thai Nguyen commented on TIKA-1422:
Applied latest fix on r1633325 with
Hi Andrzej,
We are eager for the 1.7 release too.
I'm having a compilation problem with TIKA-1422 on my side. If anyone can build
successfully on Windows, I have no objection to releasing 1.7.
Thanks,
On Thu, Oct 16, 2014 at 10:51 AM, Andrzej Białecki wrote:
> Hi,
>
> Any news on the 1.7 release? or at leas
[
https://issues.apache.org/jira/browse/TIKA-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14173537#comment-14173537
]
Hong-Thai Nguyen commented on TIKA-1422:
I'm not using
[
https://issues.apache.org/jira/browse/TIKA-1176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169146#comment-14169146
]
Hong-Thai Nguyen commented on TIKA-1176:
Hi [~mdgeek], thanks for your offe
[
https://issues.apache.org/jira/browse/TIKA-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169130#comment-14169130
]
Hong-Thai Nguyen commented on TIKA-1422:
Strange, I'm unable to build cau
[
https://issues.apache.org/jira/browse/TIKA-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169098#comment-14169098
]
Hong-Thai Nguyen commented on TIKA-1446:
Thanks [~binhawking], any chance you
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14169090#comment-14169090
]
Hong-Thai Nguyen commented on TIKA-1445:
Interesting question !
For me, pars
[
https://issues.apache.org/jira/browse/TIKA-1428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147880#comment-14147880
]
Hong-Thai Nguyen commented on TIKA-1428:
Thanks [~theoettheo], any chance to
odp, .ods documents
> From: Hong-Thai Nguyen
> Sent: September 11, 2014 1:40:08pm PDT
> To: dev@tika.apache.org
> Subject: Re: NPE on all *.odt, odp, .ods documents
>
> I was wrong to say that all OpenDocument files failed; some files
> passed, but a lot of them failed
[
https://issues.apache.org/jira/browse/TIKA-1412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14143043#comment-14143043
]
Hong-Thai Nguyen commented on TIKA-1412:
Add a test at r1626706
>
[
https://issues.apache.org/jira/browse/TIKA-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1421:
---
Priority: Blocker (was: Major)
> Tika-Parsers tests fail on CentOS6 if tesseract is
[
https://issues.apache.org/jira/browse/TIKA-1421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14143041#comment-14143041
]
Hong-Thai Nguyen commented on TIKA-1421:
Not only CentOS, this test failed als
- pptx (10)
- doc (6)
- ppt (14)
- xls (9)
- dwg (4)
- odp (2)
- pps (2)
On Thu, Sep 11, 2014 at 8:55 PM, Ken Krugler
wrote:
>
> > From: Hong-Thai Nguyen
> > Sent: September 11, 2014 5:21:41am PDT
> > To: dev@tika.apache.org
> > Subject: NPE on all *.odt, odp, .ods d
pache.org"
> Subject: RE: NPE on all *.odt, odp, .ods documents
>
> >Probably want to add TIKA-1411.
> >
> >Nick and all, anything else?
> >
> >-Original Message-
> >From: Hong-Thai Nguyen [mailto:thaicha...@gmail.com]
> >Sent: Thursday, S
Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++
>
>
>
>
>
>
> -Original Message-
> From: Hong-Thai Nguyen
> Reply-To: "dev@tika.apache.org"
Hi all,
I've tested conversion with Tika 1.6 on our corpus, and all OpenOffice
document types fail with an NPE. The fix has been made in
https://issues.apache.org/jira/browse/TIKA-1412, but it is only available from 1.7.
That's a fatal error for me.
Should we release a 1.6.1 with the fix for TIKA-1412?
Tack tr
[
https://issues.apache.org/jira/browse/TIKA-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1413.
Resolution: Fixed
> OOXML thumbnail name added to b
[
https://issues.apache.org/jira/browse/TIKA-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14126949#comment-14126949
]
Hong-Thai Nguyen commented on TIKA-1413:
I agree. Fixed in r1623819 and _id
's not
> even referenced in the pom.xml and isn't done yet?
>
> How about we fix it in 1.7 but give this one a pass?
>
>
> Cheers,
> Chris
>
> -Original Message-
> From: Hong-Thai Nguyen
> Reply-To: "dev@tika.apache.org"
> Date: Monday,
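-1 for me because tika-dotnet/pom.xml refers to the parent POM with a snapshot
version:
<parent>
  <groupId>org.apache.tika</groupId>
  <artifactId>tika-parent</artifactId>
  <version>1.6-SNAPSHOT</version>
  <relativePath>../tika-parent/pom.xml</relativePath>
</parent>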
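A check for this kind of leftover snapshot reference could be scripted before a release vote. The following is only a minimal sketch in Python, assuming the POM declares the standard Maven 4.0.0 namespace. The function names and the sample POM (including its `tika-dotnet` artifactId) are illustrative and are not part of the Tika build.

```python
import xml.etree.ElementTree as ET

# Standard Maven POM namespace (assumed to be declared on <project>).
POM_NS = {"m": "http://maven.apache.org/POM/4.0.0"}


def parent_version(pom_xml: str):
    """Return the text of <parent><version>, or None if the POM has no parent version."""
    root = ET.fromstring(pom_xml)
    node = root.find("m:parent/m:version", POM_NS)
    if node is None or not node.text:
        return None
    return node.text.strip()


def has_snapshot_parent(pom_xml: str) -> bool:
    """True when the parent version ends with Maven's -SNAPSHOT qualifier."""
    version = parent_version(pom_xml)
    return version is not None and version.endswith("-SNAPSHOT")


# Hypothetical module POM mirroring the parent coordinates quoted in the mail.
SAMPLE_POM = """<project xmlns="http://maven.apache.org/POM/4.0.0">
  <modelVersion>4.0.0</modelVersion>
  <parent>
    <groupId>org.apache.tika</groupId>
    <artifactId>tika-parent</artifactId>
    <version>1.6-SNAPSHOT</version>
    <relativePath>../tika-parent/pom.xml</relativePath>
  </parent>
  <artifactId>tika-dotnet</artifactId>
</project>"""

if __name__ == "__main__":
    print(parent_version(SAMPLE_POM), has_snapshot_parent(SAMPLE_POM))
```

Running such a check over every module's pom.xml in a release candidate would flag modules that the release process did not update.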
On Mon, Sep 1, 2014 at 7:16 AM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:
> Hi Folks,
>
> A candidate for
Nice idea.
We could do more than samples. We could provide a Maven archetype for a parser,
detector, or translator: a kind of template so that users can quickly set up a
project to develop a new one.
Regards,
Hong-Thai
> On 07 Aug 2014, at 18:56, Tyler Palsulich wrote:
>
> Hi All,
>
> I think we should add
[
https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14077885#comment-14077885
]
Hong-Thai Nguyen commented on TIKA-1373:
Normally it's on next off
[
https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1373.
Resolution: Fixed
> AutoDetectParser extracts no text when SourceCodeParser is selec
[
https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073042#comment-14073042
]
Hong-Thai Nguyen commented on TIKA-1373:
HtmlParser skips tags generate
[
https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071643#comment-14071643
]
Hong-Thai Nguyen edited comment on TIKA-1373 at 7/23/14 1:4
[
https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071713#comment-14071713
]
Hong-Thai Nguyen commented on TIKA-1373:
Yes, I saw the trouble when implemen
[
https://issues.apache.org/jira/browse/TIKA-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071643#comment-14071643
]
Hong-Thai Nguyen commented on TIKA-1373:
Can you format your description
[
https://issues.apache.org/jira/browse/TIKA-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1095:
---
Labels: pdfbox (was: patch)
> Only gibberish extracted from this
[
https://issues.apache.org/jira/browse/TIKA-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1095:
---
Component/s: (was: general)
parser
> Only gibberish extracted from t
[
https://issues.apache.org/jira/browse/TIKA-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061867#comment-14061867
]
Hong-Thai Nguyen commented on TIKA-1095:
Event with latest Tika can't con
[
https://issues.apache.org/jira/browse/TIKA-1332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14044706#comment-14044706
]
Hong-Thai Nguyen commented on TIKA-1332:
What you are describing is somet
Hi all,
Sorry about the previous mail, which was sent in error.
I'm unable to build the latest snapshot on my Windows machine. Any ideas?
Thanks
Tests in error:
initializationError(org.apache.tika.bundle.BundleIT): Problem starting
test container.
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0
--
--
Hong-Thai
[
https://issues.apache.org/jira/browse/TIKA-1350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14040519#comment-14040519
]
Hong-Thai Nguyen commented on TIKA-1350:
Richard Johnson (author of java-ps
[
https://issues.apache.org/jira/browse/TIKA-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14017473#comment-14017473
]
Hong-Thai Nguyen commented on TIKA-1320:
OCR is a solution: TIKA-93. Unfortuna
[
https://issues.apache.org/jira/browse/TIKA-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008704#comment-14008704
]
Hong-Thai Nguyen commented on TIKA-1308:
A virtual FileSystem may be a solu
And for Java 7 and later, we need a profile to activate building the 'tika-java7' module.
Hong-Thai
-Original Message-
From: Nick Burch [mailto:apa...@gagravarr.org]
Sent: Wednesday, 14 May 2014 18:30
To: dev@tika.apache.org
Subject: Re: [DISCUSS] Nightly Jenkins Builds for Trunk
On Wed, 14 May 2014,
[
https://issues.apache.org/jira/browse/TIKA-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1290.
Resolution: Fixed
r1592780
> Upgrade to PDFBOX 1.
[
https://issues.apache.org/jira/browse/TIKA-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1290:
---
Labels: trivial (was: )
> Upgrade to PDFBOX 1.
Hong-Thai Nguyen created TIKA-1290:
--
Summary: Upgrade to PDFBOX 1.8.5
Key: TIKA-1290
URL: https://issues.apache.org/jira/browse/TIKA-1290
Project: Tika
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/TIKA-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13987521#comment-13987521
]
Hong-Thai Nguyen commented on TIKA-1287:
Technically, not difficult to upload
[
https://issues.apache.org/jira/browse/TIKA-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13983434#comment-13983434
]
Hong-Thai Nguyen commented on TIKA-1283:
+1 for me to create a thumbnail fiel
[
https://issues.apache.org/jira/browse/TIKA-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1279.
Resolution: Fixed
Thanks [~rgauss] for this good catch. I fixed it, with more tests, in r1589742
[
https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1276.
Resolution: Fixed
Thanks [~rwesten], I added your patch in r1589717
> Missing embed
[
https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-1276:
---
Fix Version/s: 1.6
> Missing embedded dependencies in tika-bun
[
https://issues.apache.org/jira/browse/TIKA-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1279.
Resolution: Fixed
Fixed at r1589687
> Missing return lines at output of SourceCodePar
[
https://issues.apache.org/jira/browse/TIKA-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13979614#comment-13979614
]
Hong-Thai Nguyen commented on TIKA-1224:
Thanks [~ben.12] for the feedback.
For
Hong-Thai Nguyen created TIKA-1279:
--
Summary: Missing return lines at output of SourceCodeParser
Key: TIKA-1279
URL: https://issues.apache.org/jira/browse/TIKA-1279
Project: Tika
Issue Type
Hi Tika members,
Thanks for this great initiative. I think several use cases are possible
when creating such a service:
1. Tika exploitation
We may create a free accessible Tika Server to parse documents coming from
public requests, a kind of demo or free-try document parser to check Tika
fe
[
https://issues.apache.org/jira/browse/TIKA-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen updated TIKA-623:
--
Assignee: (was: Hong-Thai Nguyen)
> Add support for Outlook
[
https://issues.apache.org/jira/browse/TIKA-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-623.
---
Resolution: Fixed
Improvement: extract each mail as attachment document. Recursion down to
Hi Tika folks,
I get a 500 error when committing to the Tika SVN. Do you have the same problem?
POST request on '/repos/asf/!svn/me' failed: 500 Internal Server Error
Thanks,
Hong-Thai
Yes, but only from 1.6: https://issues.apache.org/jira/browse/TIKA-623
I'm finishing the work to return mails as extracted documents, as requested,
but we'll have this format in 1.6.
Hong-Thai
-Original Message-
From: Michael McCandless [mailto:luc...@mikemccandless.com]
Sent: Tuesday, 1 April 2014 13:42
To
[
https://issues.apache.org/jira/browse/TIKA-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen resolved TIKA-1244.
Resolution: Fixed
Fix Version/s: 1.6
Committed in r1583305, thanks [~lfcnassif]
I
[
https://issues.apache.org/jira/browse/TIKA-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong-Thai Nguyen reassigned TIKA-1244:
--
Assignee: Hong-Thai Nguyen
> Better parsing of Mbox fi