lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alexandre Rafalovitch (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection
Date Fri, 21 Sep 2018 15:13:00 GMT

    [ https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623756#comment-16623756
] 

Alexandre Rafalovitch commented on SOLR-12789:
----------------------------------------------

Hi Aaron,

It is great to hear that there would be a healthy discussion about this. Please feel free
to share the outcome of this on the developer list and it may spark further developers discussion
too. Nothing is set in stone, given enough evidence to the contrary.

Still, just to re-summarize, the issue we were facing was that all the shipped examples were
dead (Alchemy API...) and over multiple issues we could not figure out a way to get to the
new local maximum of latest version and useful examples (UIMA has a bit of a learning curve).
Nor were we able to find anybody helping us to push the discussion forward within either development
community (Jira discussions) or the user community (Solr Users mailing list).

Additionally, we are trying to slim Solr down in general and have done several things towards
that, including removing Javadoc from the distribution. If you were more closely connected
to the community, you would see multiple of these drives all pointing in the same general
direction. So, having a dead weight we could not figure what to do with over several years
was very much "not cool" on all those users downloading Solr and trying to navigate their
way through very full-featured product. 

And then, of course, there is a fact that we now incorporate Apache OpenNLP as well. So, there
are trade-offs to keep in mind.

 

> UIMA enhancements to allow for dynamic AE detection
> ---------------------------------------------------
>
>                 Key: SOLR-12789
>                 URL: https://issues.apache.org/jira/browse/SOLR-12789
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: contrib - UIMA
>    Affects Versions: 6.0
>            Reporter: Aaron LaBella
>            Priority: Major
>              Labels: ready-to-commit
>         Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, SOLR-12789-3.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been running IN production
for the same) ... finally got around to contributing it back to the community.  This change
prepares the UIMAUpdateRequestProcessor to allow subclasses to have additional control over
how the analysis engine is selected.  In my case, I wrote a sub-class that allows for *dynamic*
detection of the UIMA analysis engine based on the document fields.  ie: a field in the document
can be used to select different UIMA configurations and rules.
>  
> Can someone please commit this as soon as possible.  I don't necessarily need it to
be back-ported, having in 7.4.1 would suffice.
> Thanks!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message