commons-dev mailing list archives

From Claude Warren <cla...@xenei.com>
Subject Re: New Sub-project Proposal.
Date Wed, 11 Sep 2019 15:11:15 GMT
As another note, we have had discussions here at ApacheCon about developing
a method to exchange bloom filter hashing algorithms to make it easier for
systems to publish interfaces where bloom filters are passed as the search
parameters.

Also, bloom filters are good for testing "and"ed values.  If I create a
bloom filter for each car from its model, make, and color, and merge those
into the bucket filter, then I can check whether the bucket might contain
any red cars, or any red cars made by Ford.
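
To make the "and"ed lookup concrete, here is a minimal sketch in Java.  It
is not the code I am proposing to contribute: the class name BloomSketch,
the 256-bit width, the three hash probes, and the FNV-style hash are all
assumptions chosen only to keep the example short.  A query filter matches
a bucket when every bit set in the query is also set in the bucket.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Hypothetical illustration class; sizes and hash are assumptions.
final class BloomSketch {
    static final int BITS = 256;  // filter width (assumed)
    static final int HASHES = 3;  // bit positions probed per value (assumed)

    // Build a filter with up to HASHES bits set for each value.
    static BitSet filterOf(String... values) {
        BitSet bits = new BitSet(BITS);
        for (String v : values) {
            byte[] data = v.getBytes(StandardCharsets.UTF_8);
            int h1 = hash(data, 0x811C9DC5);
            int h2 = hash(data, 0x01000193) | 1; // odd step for double hashing
            for (int i = 0; i < HASHES; i++) {
                bits.set(Math.floorMod(h1 + i * h2, BITS));
            }
        }
        return bits;
    }

    // Simple FNV-1a style hash with a caller-supplied seed.
    static int hash(byte[] data, int seed) {
        int h = seed;
        for (byte b : data) {
            h = (h ^ (b & 0xFF)) * 16777619;
        }
        return h;
    }

    // Merge one item's filter into the bucket filter (bitwise OR).
    static void merge(BitSet bucket, BitSet item) {
        bucket.or(item);
    }

    // True when every bit of the query is set in the bucket.
    // True means "possibly present"; false means "definitely absent".
    static boolean mightContain(BitSet bucket, BitSet query) {
        BitSet missing = (BitSet) query.clone();
        missing.andNot(bucket);
        return missing.isEmpty();
    }

    public static void main(String[] args) {
        BitSet bucket = new BitSet(BITS);
        merge(bucket, filterOf("Focus", "Ford", "red"));
        merge(bucket, filterOf("Corolla", "Toyota", "blue"));
        System.out.println(mightContain(bucket, filterOf("red")));
        System.out.println(mightContain(bucket, filterOf("red", "Ford")));
    }
}
```

Note that because the bucket filter is the OR of every car's filter, a
query for "red" and "Ford" would also match a bucket holding a red Toyota
and a blue Ford.  A match only says the bucket is worth searching; the
individual car filters (or the cars themselves) settle the question.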

Claude

On Wed, Sep 11, 2019 at 3:49 PM sebb <sebbaz@gmail.com> wrote:

> On Wed, 11 Sep 2019 at 12:36, Gary Gregory <garydgregory@gmail.com> wrote:
> >
> > I would like to know more. I am curious since looking up whether an
> > element is in a set is done via a hash code. How do you do better than
> > that?
>
> Wikipedia has a good explanation:
>
> https://en.wikipedia.org/wiki/Bloom_filter
>
> Basically, instead of storing a hash per element you hash each element
> to a few positions in a bit mask and set/test those bits.
>
> This can give false positives, but not false negatives.
>
> > Gary
> >
> > On Tue, Sep 10, 2019, 16:51 Bruno P. Kinoshita <kinow@apache.org> wrote:
> >
> > >  +1 Collections sounds like a good place for a bloom filter.
> > >
> > > Bruno
> > >
> > >     On Wednesday, 11 September 2019, 8:00:45 am NZST, Jochen Wiedmann <jochen.wiedmann@gmail.com> wrote:
> > >
> > >  Hi, Claude,
> > >
> > > having read what a bloom filter is, a subproject sounds unnecessary
> > > to me. I'd recommend that you contribute your code to Commons
> > > Collections, which seems to me to be the logical target.
> > >
> > > Jochen
> > >
> > > On Tue, Sep 10, 2019 at 8:45 PM Claude Warren <claude@xenei.com> wrote:
> > > >
> > > > Having spoken with several people at ApacheCon, I would like to see a
> > > > bloomfilter sub-project.  I have code that is already under the Apache
> > > > License that I am willing to contribute as the basis.  The goal of the
> > > > sub-project would be to produce a reference implementation that could
> > > > be used by other projects that want to use bloom filters and bloom
> > > > filter based collections.
> > > >
> > > > Is there any objection to doing this?  Other than asking here, what is
> > > > the proper path to get a sub-project created?  What does the Commons
> > > > PMC require?
> > > >
> > > > Any assistance and comments would be appreciated.
> > > > Claude
> > > >
> > > > --
> > > > I like: Like Like - The likeliest place on the web
> > > > <http://like-like.xenei.com>
> > > > LinkedIn: http://www.linkedin.com/in/claudewarren
> > >
> > > ---------------------------------------------------------------------
> > > To unsubscribe, e-mail: dev-unsubscribe@commons.apache.org
> > > For additional commands, e-mail: dev-help@commons.apache.org
> > >
> > >

-- 
I like: Like Like - The likeliest place on the web
<http://like-like.xenei.com>
LinkedIn: http://www.linkedin.com/in/claudewarren
