commons-dev mailing list archives

From Benedikt Ritter <brit...@apache.org>
Subject Re: [text] Next steps.
Date Tue, 29 Nov 2016 22:02:14 GMT
+1 for an early release without the name parser. We can implement the
algorithms from the book in 1.1.

Benedikt

Gary Gregory <garydgregory@gmail.com> wrote on Tue, Nov 29, 2016 at 00:34:

> +1 to removing the human name code for 1.0. Put it in a branch IMO.
>
> Gary
>
> On Mon, Nov 28, 2016 at 3:12 PM, Bruno P. Kinoshita <
> brunodepaulak@yahoo.com.br.invalid> wrote:
>
> > Hi Rob,
> >
> > First of all, kudos for the great work moving things from [lang] into
> > [text].
> >
> > I got a copy of the Lothaire book last weekend, but haven't had a chance
> > to read it yet.
> >
> > There was also some discussion around the name-parser, and since we
> > couldn't reach a consensus, I think we could either try to have another
> > discussion thread, or stash it somewhere so that it doesn't block a
> > release.
> >
> >
> > I would also like to implement more edit distances and string
> > similarities, as well as look into the duration unit parser, probably
> > adapting code from github.com/jchampemont/gunip
> >
> >
> > But I'd vote for (4): first moving the human name parser elsewhere,
> > reviewing the edit distances, and checking if there's anything else
> > from [lang] that we could put into this initial release.
> >
> > Once it has been released, we will be able to add things from the
> > Lothaire book, more edit distances, maybe bring back the name parser,
> > as well as any enhancements and bug fixes.
> >
> > Bruno
> >
> > >________________________________
> > > From: Rob Tompkins <chtompki@gmail.com>
> > >To: Commons Developers List <dev@commons.apache.org>
> > >Sent: Tuesday, 29 November 2016 11:45 AM
> > >Subject: [text] Next steps.
> > >
> > >
> > >Hello,
> > >
> > >I'm a tad curious what folks (along with Gary, Benedikt, and Bruno)
> > >think the next steps are for text, in the hope that we are eventually
> > >heading towards a 1.0 release. Some thoughts that come to mind are:
> > >
> > >(1) Go over lang with a fine-tooth comb and see what we think should move,
> > >(2) Go through the Lothaire "Applied Combinatorics on Words" book (
> > >http://lipn.univ-paris13.fr/~duchamp/Books&more/Lothaire/(Encyclopedia_of_Mathematics_and_its_Applications_)M._Lothaire-Applied_Combinatorics_On_Words-Cambridge_University_Press(2005).pdf)
> > >and minimally implement some of the standard algorithms.
> > >(3) Implement, from the Lothaire book, some of the more complex stuff:
> > >heavier pattern matching, and/or natural language processing,
> > >and/or
> > >(4) Go straight for a release.
> > >
> > >I'm less for (4) because I think there are probably some smaller bits of
> > >code in lang that could come over. I like the idea of (2) before heading
> > >out the door. Regarding (3), I would have to do considerable reading to
> > >make considerable headway here. I'm not opposed to doing that, but it
> > >would prolong getting to a 1.0 release if we predicated the release upon
> > >my getting that done.
> > >
> > >So, what do you guys think?
> > >
> > >Cheers,
> > >-Rob
> > >
> > >
> > >
> >
> >
>
>
> --
> E-Mail: garydgregory@gmail.com | ggregory@apache.org
> Blog: http://garygregory.wordpress.com
> Home: http://garygregory.com/
> Tweet! http://twitter.com/GaryGregory
>
