uima-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Rico Landefeld (JIRA)" <uima-...@incubator.apache.org>
Subject [jira] Created: (UIMA-1299) Contribution of Lucene CAS Indexer
Date Mon, 02 Mar 2009 11:58:12 GMT
Contribution of Lucene CAS Indexer
----------------------------------

                 Key: UIMA-1299
                 URL: https://issues.apache.org/jira/browse/UIMA-1299
             Project: UIMA
          Issue Type: New Feature
          Components: Sandbox
            Reporter: Rico Landefeld


Lucas is a UIMA CAS consumer component which writes CAS data into a Lucene index. It is based
on a XML-based "mapping configuration  file" in which the user can determine which UIMA annotations
should be put into which Lucene field, and how this field is set up (e.g. indexed and/or stored).
In addition, some basic functionality for (ontolgical) hypernym indexing is provided.

Additionally, Lucas is able to perform offset-based token stream alignment and merging of
UIMA annotations (via token position increment) in the same Lucene field (e.g. "documenttext"
or "title")


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message