uima-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Peter Klügl (JIRA) <...@uima.apache.org>
Subject [jira] [Commented] (UIMA-3530) UIMA Rute - allow WORDLIST and WORDTABLE files to include not just plain text to be matched but also regular expressions
Date Wed, 08 Jan 2014 14:03:53 GMT

    [ https://issues.apache.org/jira/browse/UIMA-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865452#comment-13865452
] 

Peter Klügl commented on UIMA-3530:
-----------------------------------

That is not as simple as it seems in the currect implementation, because the dictionaries
are internally compiled into a trie, a tree structure of characters. I will try to think of
something and all suggestions are welcome. In the meantime, you could use the simple regexp
rules: http://uima.apache.org/d/ruta-current/tools.ruta.book.html#ugr.tools.ruta.language.regexprule

> UIMA Rute - allow WORDLIST and WORDTABLE files to include not just plain text to be matched
but also regular expressions 
> -------------------------------------------------------------------------------------------------------------------------
>
>                 Key: UIMA-3530
>                 URL: https://issues.apache.org/jira/browse/UIMA-3530
>             Project: UIMA
>          Issue Type: Wish
>          Components: ruta
>            Reporter: Dimitris Vassos
>            Priority: Minor
>
> It would greatly speed up and simplify the implementation of dictionary lookups using
WORDLIST and WORDTABLE, if instead of just plain text entries in the file we could enter regular
expressions.
> Especially for inflectional languages such as Greek or Russian, this feature is invaluable.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message