Hey Ken,

This is fine. I wanted to get going with our Julia/MIT-LL Text.jl based
detector and turning LanguageIdentifier into an interface. Me and
Trevor (CC’ed) are working on it, but not sure where we’re at and
shouldn’t be a blocker to moving forward.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: Ken Krugler <kkrugler_li...@transpac.com>
Reply-To: "dev@tika.apache.org" <dev@tika.apache.org>
Date: Thursday, February 4, 2016 at 12:23 PM
To: "tika-...@lucene.apache.org" <tika-...@lucene.apache.org>
Subject: Tika 2.0 and language detection

>Hi all,
>
>Over at https://issues.apache.org/jira/browse/TIKA-1723, Tim & I have
>been discussing whether to focus these pending changes on the 2.0 branch,
>and leave 1.x as-is.
>
>As part of that, we could do a cut-and-run in 2.0, and not spend the time
>to port the current (Tika 1.x) language detector code.
>
>I'm in favor of that approach, as I think leveraging the new detector
>project(s) gives us faster & more accurate results over more languages.
>
>But we're posting to the more general audience here, to gather input on
>things that we might not be considering.
>
>Thanks,
>
>-- Ken
>
>
>
>--------------------------
>Ken Krugler
>+1 530-210-6378
>http://www.scaleunlimited.com
>custom big data solutions & training
>Hadoop, Cascading, Cassandra & Solr
>
>
>
>
>

Reply via email to