uima-dev mailing list archives

From Thilo Goetz <twgo...@gmx.de>
Subject Re: [jira] Updated: (UIMA-1299) Contribution of Lucene CAS Indexer
Date Wed, 04 Mar 2009 10:16:08 GMT
In order to move this along, I'll call for a vote.
No use prevaricating about the bush ;-)


Rico Landefeld (JIRA) wrote:
>      [ https://issues.apache.org/jira/browse/UIMA-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> Rico Landefeld updated UIMA-1299:
> ---------------------------------
>     Attachment: lucene-indexer.tar.gz
>> Contribution of Lucene CAS Indexer
>> ----------------------------------
>>                 Key: UIMA-1299
>>                 URL: https://issues.apache.org/jira/browse/UIMA-1299
>>             Project: UIMA
>>          Issue Type: New Feature
>>          Components: Sandbox
>>            Reporter: Rico Landefeld
>>         Attachments: lucene-indexer.tar.gz
>> Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index.
>> It is based on an XML-based "mapping configuration file" in which the user can determine
>> which UIMA annotations should be put into which Lucene field, and how this field is set up
>> (e.g. indexed and/or stored). In addition, some basic functionality for (ontological)
>> hypernym indexing is provided.
>> Additionally, Lucas is able to perform offset-based token stream alignment and merging
>> of UIMA annotations (via token position increment) in the same Lucene field (e.g. "documenttext"
>> or "title").
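
For readers unfamiliar with the idea of merging annotations "via token position increment", here is a minimal sketch in plain Java. It is not the Lucas implementation and does not use the Lucene or UIMA APIs: the classes Ann and Tok and the method merge are hypothetical stand-ins invented for illustration. The assumption is that two annotation streams over the same text are ordered by begin offset, and that a token starting at the same offset as the previous one gets a position increment of 0 (so it is stacked on the same position), while any other token gets 1.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class AlignSketch {
    // Hypothetical stand-in for a UIMA annotation: covered text plus character offsets.
    static final class Ann {
        final String text; final int begin; final int end;
        Ann(String text, int begin, int end) { this.text = text; this.begin = begin; this.end = end; }
    }

    // Hypothetical stand-in for an index token: term plus position increment.
    static final class Tok {
        final String term; final int posInc;
        Tok(String term, int posInc) { this.term = term; this.posInc = posInc; }
    }

    // Merge two annotation streams by begin offset. A token that starts at the
    // same offset as the previous token gets increment 0, otherwise 1.
    static List<Tok> merge(List<Ann> first, List<Ann> second) {
        List<Ann> all = new ArrayList<>(first);
        all.addAll(second);
        // List.sort is stable, so on equal offsets 'first' stays ahead of 'second'.
        all.sort(Comparator.comparingInt((Ann a) -> a.begin));
        List<Tok> out = new ArrayList<>();
        int lastBegin = -1;
        for (Ann a : all) {
            out.add(new Tok(a.text, a.begin == lastBegin ? 0 : 1));
            lastBegin = a.begin;
        }
        return out;
    }
}
```

With word tokens "Aspirin" (0-7) and "inhibits" (8-16) in one stream and an entity annotation "DRUG" (0-7) in the other, the sketch yields Aspirin (+1), DRUG (+0), inhibits (+1), so a search at the position of "Aspirin" would also find "DRUG".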
