lucene-general mailing list archives

From David Smiley <david.w.smi...@gmail.com>
Subject Re: Apache Solr and Tika used to index Panama Papers
Date Wed, 06 Apr 2016 12:57:48 GMT
😀 awesome
On Wed, Apr 6, 2016 at 4:45 AM Uwe Schindler <uschindler@apache.org> wrote:

> Hi all,
>
> I just wanted to repost the following by Chris Mattmann on the Tika list:
>
> If you have been following the news you’ve seen the Panama papers and how
> the world’s rich and elite have been storing all their money offshore to
> hide it. Two of the ASF’s key technologies were used in uncovering that
> story and showing the world what was going on: Apache Tika and Apache Solr.
>
> Solr was used for making the terabytes of Panama Papers available to
> journalists. The preprocessing of the documents for indexing was done with
> Tika (maybe through the contrib/extraction module).
>
> Here is the article by Forbes about that:
>
> http://www.forbes.com/sites/thomasbrewster/2016/04/05/panama-papers-amazon-encryption-epic-leak
>
> Uwe
>
> -----
> Uwe Schindler
> uschindler@apache.org
> ASF Member, Apache Lucene PMC / Committer
> Bremen, Germany
> http://lucene.apache.org/
>
> --
Lucene/Solr Search Committer, Consultant, Developer, Author, Speaker
LinkedIn: http://linkedin.com/in/davidwsmiley | Book:
http://www.solrenterprisesearchserver.com
