manifoldcf-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Furkan KAMACI <>
Subject Re: Language Detection for the data
Date Wed, 21 Nov 2018 09:05:20 GMT
Hi Nikita,

First of all, OpenNLP is a transformation connector at ManifoldCF and
should be enabled by default. It extracts named entities (people, locations
and organizations) from document.

You should download trained models to run OpenNLP connector. You can check
here for such purpose:

Check here for a detailed explanation:

Feel free to ask any questions when you try to integrate it. Also, you
should explain the points if you cannot success to run it.

Kind Regards,

On Wed, Nov 21, 2018 at 11:54 AM Karl Wright <> wrote:

> Hi Nikita,
> Can you be more specific when you say "OpenNLP is not working"?  All that
> this connector does is integrate OpenNLP as a ManifoldCF transformer.  It
> uses a specific directory to deliver the models that OpenNLP uses to match
> and extract content from documents.  Thus, you can provide any models you
> want that are compatible with the OpenNLP version we're including.
> Can you describe the steps you are taking and what you are seeing?
> On Wed, Nov 21, 2018 at 12:44 AM Nikita Ahuja <>
> wrote:
>> Hi,
>> I have query related to detect the language of the records/data which is
>> going to be ingest in the Output Connector.
>> OpenNLP connector is not working for the detection as per the user
>> documentation, but this is not working appropriately. Please suggest is NLP
>> has to be used if yes, then how it should be used or is there any other
>> solution for this?
>> --
>> Thanks and Regards,
>> Nikita
>> Email:
>> United Sources Service Pvt. Ltd.
>> a "Smartshore" Company
>> Mobile: +91 99 888 57720

View raw message