lucene-dev mailing list archives

From "Aaron LaBella (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-12789) UIMA enhancements to allow for dynamic AE detection
Date Mon, 24 Sep 2018 12:52:00 GMT

    [ https://issues.apache.org/jira/browse/SOLR-12789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16625774#comment-16625774 ]

Aaron LaBella commented on SOLR-12789:
--------------------------------------

Hi Alexandre, thank you for the additional detail and background.  While I understand the
goal here, I don't agree with how the end result was achieved.  I think the real "issue"
here was that the examples and documentation are stale.  Likewise, UIMA core can (and should)
be upgraded to the latest 2.10.2, and the additional unnecessary dependencies should absolutely
be removed from the dist.  I'm attaching a simple patch (*SOLR-12789-4.patch*) that does
just this.  I would like to propose that we reinstate the contrib/uima project and apply
my patch instead.  I think this is a fair compromise, since 6 Java classes don't quite qualify
as "dead weight", especially if those 6 classes provide direct end-user value.  While I would
certainly agree that UIMA has a steep learning curve, there are folks out there who are using it,
and removing it entirely from the Solr dist is likely to do a disservice to those folks who
are in fact doing text analytics with it.

 

All that being said, I think the only thing that really remains is a clean-up of the documentation
and examples.  I'm happy to do that over the next couple of weeks if we agree to this strategy.

 

Thanks so much.

 

> UIMA enhancements to allow for dynamic AE detection
> ---------------------------------------------------
>
>                 Key: SOLR-12789
>                 URL: https://issues.apache.org/jira/browse/SOLR-12789
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: contrib - UIMA
>    Affects Versions: 6.0
>            Reporter: Aaron LaBella
>            Priority: Major
>              Labels: ready-to-commit
>         Attachments: SOLR-12789-1.patch, SOLR-12789-2.patch, SOLR-12789-3.patch, SOLR-12789-4.patch
>
>
> I've been sitting on this patch for over 2 years (and likewise it's been running in production
> for the same) ... finally got around to contributing it back to the community.  This change
> prepares the UIMAUpdateRequestProcessor to allow subclasses to have additional control over
> how the analysis engine is selected.  In my case, I wrote a subclass that allows for *dynamic*
> detection of the UIMA analysis engine based on the document fields, i.e. a field in the document
> can be used to select different UIMA configurations and rules.
>  
> Can someone please commit this as soon as possible?  I don't necessarily need it to
> be back-ported; having it in 7.4.1 would suffice.
> Thanks!
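
The subclass hook described in the quoted issue can be sketched roughly as below. This is only an illustration of the idea and not the attached patch: it does not use the real Solr or UIMA APIs, and every name in it (BaseProcessor, FieldBasedProcessor, selectEngine, the "doc_type" field, the descriptor file names) is hypothetical. A base processor exposes an overridable method that picks the analysis engine, and a subclass overrides it to choose an engine from a document field, falling back to the default.

```java
import java.util.HashMap;
import java.util.Map;

public class DynamicEngineSelectionSketch {

    /** Hypothetical stand-in for a processor that always uses one configured engine. */
    static class BaseProcessor {
        private final String defaultEngine;

        BaseProcessor(String defaultEngine) {
            this.defaultEngine = defaultEngine;
        }

        /** Hook: subclasses may override this to choose an engine per document. */
        protected String selectEngine(Map<String, Object> doc) {
            return defaultEngine;
        }

        /** Returns the name of the engine that would be used to analyze the document. */
        public final String process(Map<String, Object> doc) {
            return selectEngine(doc);
        }
    }

    /** Hypothetical subclass that picks the engine from the value of one document field. */
    static class FieldBasedProcessor extends BaseProcessor {
        private final String selectorField;
        private final Map<String, String> enginesByValue;

        FieldBasedProcessor(String defaultEngine, String selectorField,
                            Map<String, String> enginesByValue) {
            super(defaultEngine);
            this.selectorField = selectorField;
            this.enginesByValue = new HashMap<>(enginesByValue);
        }

        @Override
        protected String selectEngine(Map<String, Object> doc) {
            Object value = doc.get(selectorField);
            String engine = (value == null) ? null : enginesByValue.get(value.toString());
            // Fall back to the default engine when the field is missing or unmapped.
            return (engine != null) ? engine : super.selectEngine(doc);
        }
    }

    public static void main(String[] args) {
        Map<String, String> engines = new HashMap<>();
        engines.put("legal", "legal-engine.xml");

        BaseProcessor processor =
                new FieldBasedProcessor("default-engine.xml", "doc_type", engines);

        Map<String, Object> legalDoc = new HashMap<>();
        legalDoc.put("doc_type", "legal");

        Map<String, Object> otherDoc = new HashMap<>();
        otherDoc.put("doc_type", "news");

        System.out.println(processor.process(legalDoc));
        System.out.println(processor.process(otherDoc));
    }
}
```

Keeping the selection logic in a single protected method means the base class behaves exactly as before unless a subclass opts in, which matches the stated goal of giving subclasses additional control without changing the default behavior.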



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

