uima-dev mailing list archives

From "Rico Landefeld (JIRA)" <uima-...@incubator.apache.org>
Subject [jira] Created: (UIMA-1299) Contribution of Lucene CAS Indexer
Date Mon, 02 Mar 2009 11:58:12 GMT
Contribution of Lucene CAS Indexer

                 Key: UIMA-1299
                 URL: https://issues.apache.org/jira/browse/UIMA-1299
             Project: UIMA
          Issue Type: New Feature
          Components: Sandbox
            Reporter: Rico Landefeld

Lucas is a UIMA CAS consumer component that writes CAS data to a Lucene index. It is configured
through an XML-based "mapping configuration file" in which the user specifies which UIMA annotations
go into which Lucene field, and how each field is set up (e.g. indexed and/or stored).
In addition, some basic functionality for (ontological) hypernym indexing is provided.
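
To make the idea of the mapping concrete, here is a minimal Java sketch of the information such a mapping carries: an annotation type name, the target Lucene field, and the indexed/stored flags. This is not the actual Lucas mapping file format or API. The class names, the annotation type names (org.example.Token, org.example.Title) and the flag values are assumptions made up for illustration; only the field names "documenttext" and "title" come from this description.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical model of what a mapping configuration expresses.
// None of these names are taken from the Lucas code base.
class FieldMappingSketch {

    /** Settings for one target Lucene field (illustrative only). */
    static final class FieldSetup {
        final String fieldName;
        final boolean indexed;
        final boolean stored;

        FieldSetup(String fieldName, boolean indexed, boolean stored) {
            this.fieldName = fieldName;
            this.indexed = indexed;
            this.stored = stored;
        }
    }

    /** Builds an example mapping from annotation type name to field setup. */
    static Map<String, FieldSetup> exampleMapping() {
        Map<String, FieldSetup> mapping = new LinkedHashMap<>();
        // Assumed annotation types; a real mapping would name types from the user's type system.
        mapping.put("org.example.Token", new FieldSetup("documenttext", true, false));
        mapping.put("org.example.Title", new FieldSetup("title", true, true));
        return mapping;
    }

    public static void main(String[] args) {
        for (Map.Entry<String, FieldSetup> e : exampleMapping().entrySet()) {
            FieldSetup f = e.getValue();
            System.out.println(e.getKey() + " -> field=" + f.fieldName
                    + " indexed=" + f.indexed + " stored=" + f.stored);
        }
    }
}
```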

Additionally, Lucas can perform offset-based token stream alignment and merge UIMA annotations
(via token position increments) into the same Lucene field (e.g. "documenttext"
or "title").
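
The following Java sketch illustrates one plausible reading of offset-based merging, without depending on Lucene or UIMA. It assumes that two annotation streams are combined in order of their begin offsets, and that a token starting at the same offset as the previous one gets a position increment of 0 (so both occupy the same position in the field), while any other token gets an increment of 1. The Span and Token classes and the merge method are hypothetical; the actual Lucas alignment logic may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch of offset-based token stream merging; not Lucas code.
class TokenMergeSketch {

    /** Stand-in for a UIMA annotation: a term plus character offsets. */
    static final class Span {
        final String term;
        final int begin;
        final int end;

        Span(String term, int begin, int end) {
            this.term = term;
            this.begin = begin;
            this.end = end;
        }
    }

    /** Output token: the span plus its position increment. */
    static final class Token {
        final Span span;
        final int positionIncrement;

        Token(Span span, int positionIncrement) {
            this.span = span;
            this.positionIncrement = positionIncrement;
        }
    }

    /** Merges two streams by begin offset; equal begin offsets share a position. */
    static List<Token> merge(List<Span> first, List<Span> second) {
        List<Span> all = new ArrayList<>(first);
        all.addAll(second);
        // List.sort is stable, so spans from 'first' precede spans from 'second' on ties.
        all.sort(Comparator.comparingInt((Span s) -> s.begin));

        List<Token> out = new ArrayList<>();
        int lastBegin = -1; // offsets are assumed to be non-negative
        for (Span s : all) {
            int increment = (s.begin == lastBegin) ? 0 : 1;
            out.add(new Token(s, increment));
            lastBegin = s.begin;
        }
        return out;
    }

    public static void main(String[] args) {
        // Text: "aspirin inhibits COX"
        List<Span> words = Arrays.asList(
                new Span("aspirin", 0, 7), new Span("inhibits", 8, 16), new Span("COX", 17, 20));
        List<Span> entities = Arrays.asList(
                new Span("DRUG", 0, 7), new Span("GENE", 17, 20));
        for (Token t : merge(words, entities)) {
            System.out.println(t.span.term + " [" + t.span.begin + "," + t.span.end
                    + ") posInc=" + t.positionIncrement);
        }
    }
}
```

In this example the entity labels end up at the same positions as the words they cover, so a phrase query over the field could match either the word or the label at that position.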
