On Tue, Jan 25, 2011 at 8:01 PM, Gary Gregory <ggreg...@seagullsoftware.com> wrote:
> Hi All:
>
> I just found a data set that I would like to integrate with [codec] to test
> the language package:
>
> http://sourceforge.net/projects/familynamephon/
>
> The test data file contains 837K German names (37MB) in a text file and
> encodings in Cham (?) phonetics, Cologne phonetics, Metaphone, and Soundex.
>
> I have no idea how long it would take to run a test for our language encoders
> on this but I imagine making it an optional unit test. How do you do THAT in
> Maven?
>

One way would be to have a separate profile which configures the
surefire-plugin to include only that test, and have the *normal*
surefire-plugin config exclude it. Lang has this for the
RandomUtilsFreqTest in the 2.x branch:

http://svn.apache.org/repos/asf/commons/proper/lang/tags/LANG_2_6/pom.xml

Niall

> The data is covered (I think, I do not read German) by this license:
> http://www.opendatacommons.org/licenses/odbl/1.0/
>
> Thoughts?
> Gary Gregory
> Senior Software Engineer
> Rocket Software
> 3340 Peachtree Road, Suite 820 * Atlanta, GA 30326 * USA
> Tel: +1.404.760.1560
> Email: ggreg...@seagullsoftware.com
> Web: http://www.seagull.rocketsoftware.com/
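
For illustration, the profile approach suggested in the reply might look roughly like the pom.xml fragment below. This is only a sketch and not a copy of the LANG_2_6 pom: the test class name GermanNamesEncoderTest and the profile id test-german-names are made up for the example, and the combine.self="override" attribute on the profile's excludes is an assumption about how to stop the default exclude from being merged into the profile's configuration, so it is worth checking against the Maven version in use.

```xml
<project>
  <!-- Only the relevant parts of the pom are shown. -->
  <build>
    <plugins>
      <plugin>
        <groupId>org.apache.maven.plugins</groupId>
        <artifactId>maven-surefire-plugin</artifactId>
        <configuration>
          <!-- Normal build: skip the slow data-driven test
               (hypothetical class name). -->
          <excludes>
            <exclude>**/GermanNamesEncoderTest.java</exclude>
          </excludes>
        </configuration>
      </plugin>
    </plugins>
  </build>

  <profiles>
    <profile>
      <!-- Hypothetical profile id. -->
      <id>test-german-names</id>
      <build>
        <plugins>
          <plugin>
            <groupId>org.apache.maven.plugins</groupId>
            <artifactId>maven-surefire-plugin</artifactId>
            <configuration>
              <!-- In this profile, run only the slow test. -->
              <includes>
                <include>**/GermanNamesEncoderTest.java</include>
              </includes>
              <!-- Assumption: replace, rather than merge with, the
                   excludes inherited from the normal build above. -->
              <excludes combine.self="override"/>
            </configuration>
          </plugin>
        </plugins>
      </build>
    </profile>
  </profiles>
</project>
```

With something like this in place, a plain "mvn test" would skip the large data set, and "mvn -Ptest-german-names test" would run only that test.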