+1 for an early release without the name parser. We can implement the algorithms from the book in 1.1.
Benedikt Gary Gregory <garydgreg...@gmail.com> schrieb am Di. 29. Nov. 2016 um 00:34: > +1 to removing the human name code for 1.0. Put it in a branch IMO. > > Gary > > On Mon, Nov 28, 2016 at 3:12 PM, Bruno P. Kinoshita < > brunodepau...@yahoo.com.br.invalid> wrote: > > > Hi Rob, > > > > First of all, kudos for the great work moving things from [lang] into > > [text]. > > > > I got a copy of the Lothaire book last weekend, but haven't had a chance > > to read it yet. > > > > There was also some discussion around the name-parser, and since we > > couldn't reach a consensus, > > I think we could either try to have another discussion thread, or stash > it > > somewhere so that > > it doesn't block a release. > > > > > > I also would like to implement more edit distance and string > similarities, > > as well as > > look into the duration unit parser, probably adapting code from > > github.com/jchampemont/gunip > > > > > > But I'd vote for (4). First moving the human name parser elsewhere, > > reviewing the edit distances, > > and checking if there's anything else we could put into this initial > > release from [lang]. > > > > Once it has been released, we will be able to add things from Lothaire > > book, > > more edit distances, maybe bring back the name parser, as well as any > > enhancement > > bug fixing. > > > > Bruno > > > > >________________________________ > > > From: Rob Tompkins <chtom...@gmail.com> > > >To: Commons Developers List <dev@commons.apache.org> > > >Sent: Tuesday, 29 November 2016 11:45 AM > > >Subject: [text] Next steps. > > > > > > > > >Hello, > > > > > >I'm a tad curious what folks (along with Gary, Benedikt, and Bruno) > think > > >the next steps are for text in the hopeful thought that we are > eventually > > >heading towards a 1.0 release. Some thoughts that come to mind are: > > > > > >(1) Go over lang with fine tooth comb and see what we think should move, > > >(2) Go through the Lothaire "Applied Combinatorics on Words" book ( > > >http://lipn.univ-paris13.fr/~duchamp/Books&more/Lothaire/( > > Encyclopedia_of_Mathematics_and_its_Applications_)M._Lothaire-Applied_ > > Combinatorics_On_Words-Cambridge_University_Press(2005).pdf) > > >and minimally implement some of the standard algorithms. > > >(3) Implement, from the Lothaire book, some of the more complex stuff: > > >heavier pattern matching, and/or natural language processing, > > >and/or > > >(4) Go straight for a release. > > > > > >I'm less for (4) because I think there's probably some smaller bits of > > code > > >in lang that probably come over. I like the idea of (2) before heading > out > > >the door. Regarding (3), I would have to do considerable reading to make > > >considerable headway here, which I'm not opposed to doing it would just > > >merely prolong getting to a 1.0 release if we predicated the release > upon > > >my getting that done. > > > > > >So, what do you guys think? > > > > > >Cheers, > > >-Rob > > > > > > > > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org > > For additional commands, e-mail: dev-h...@commons.apache.org > > > > > > > -- > E-Mail: garydgreg...@gmail.com | ggreg...@apache.org > Java Persistence with Hibernate, Second Edition > < > https://www.amazon.com/gp/product/1617290459/ref=as_li_tl?ie=UTF8&camp=1789&creative=9325&creativeASIN=1617290459&linkCode=as2&tag=garygregory-20&linkId=cadb800f39946ec62ea2b1af9fe6a2b8 > > > > <http://// > ir-na.amazon-adsystem.com/e/ir?t=garygregory-20&l=am2&o=1&a=1617290459> > JUnit in Action, Second Edition > < > https://www.amazon.com/gp/product/1935182021/ref=as_li_tl?ie=UTF8&camp=1789&creative=9325&creativeASIN=1935182021&linkCode=as2&tag=garygregory-20&linkId=31ecd1f6b6d1eaf8886ac902a24de418%22 > > > > <http://// > ir-na.amazon-adsystem.com/e/ir?t=garygregory-20&l=am2&o=1&a=1935182021> > Spring Batch in Action > < > https://www.amazon.com/gp/product/1935182951/ref=as_li_tl?ie=UTF8&camp=1789&creative=9325&creativeASIN=1935182951&linkCode=%7B%7BlinkCode%7D%7D&tag=garygregory-20&linkId=%7B%7Blink_id%7D%7D%22%3ESpring+Batch+in+Action > > > <http://// > ir-na.amazon-adsystem.com/e/ir?t=garygregory-20&l=am2&o=1&a=1935182951> > Blog: http://garygregory.wordpress.com > Home: http://garygregory.com/ > Tweet! http://twitter.com/GaryGregory >