Package: wnpp
Severity: wishlist
Owner: Markus Koschany <[email protected]>
X-Debbugs-Cc: [email protected],
[email protected],[email protected]
* Package name : liblanguage-detector-java
Version : 0.6
Upstream Author : Nakatani Shuyo, Francois ROLAND, Fabian Kessler,
Nicole Torres, Robert Theis
* URL : https://github.com/optimaize/language-detector
* License : Apache-2.0
Programming Lang: Java
Description : Language Detection Library for Java
This software uses language profiles which were created based on
common text for each language. N-grams, a contiguous sequence of n
items from a given sample of text, were then extracted from that text
and stored in the profiles. When trying to figure out in what
language a certain text is written, the program goes through the same
process: It creates the same kind of n-grams of the input text. Then
it compares the relative frequency of them, and finds the language
that matches best. Currently 71 languages are supported.