tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mattmann, Chris A (3010)" <chris.a.mattm...@jpl.nasa.gov>
Subject Re: Query Regarding Apache Tika Language Ditector
Date Wed, 08 Mar 2017 05:01:54 GMT
Resending this to dev@tika.apache.org<mailto:dev@tika.apache.org> rather than dev-owner.

Chris Mattmann, Ph.D.
Principal Data Scientist, Engineering Administrative Office (3010)
Manager, NSF & Open Source Projects Formulation and Development Offices (8212)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 180-503E, Mailstop: 180-503
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/

From: Supriti Dan <supritid2010@gmail.com>
Date: Tuesday, March 7, 2017 at 5:21 PM
To: "dev-owner@tika.apache.org" <dev-owner@tika.apache.org>
Subject: Query Regarding Apache Tika Language Ditector

Hi Team,

I want to use Apache Tika for language detection purpose, could you please suggest me how
many different language are detected well by Apache Tika. From the source (https://www.tutorialspoint.com/tika/tika_language_detection.htm)
I found that Apache Tika support 18 languages but I believe the latest version support more
then that.

Thanking you in advance.

Supriti Dan
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message