tajo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hyunsik Choi (JIRA)" <j...@apache.org>
Subject [jira] [Created] (TAJO-230) Support unicode identifiers
Date Fri, 04 Oct 2013 01:15:42 GMT
Hyunsik Choi created TAJO-230:

             Summary: Support unicode identifiers
                 Key: TAJO-230
                 URL: https://issues.apache.org/jira/browse/TAJO-230
             Project: Tajo
          Issue Type: New Feature
          Components: parser
            Reporter: Hyunsik Choi

The current parse only recognizes a combination of alphabet characters and underscore as an
identifier used for function names, column names, and table names. This is because of the
following antlr lexer rules:
  : Nonreserved_keywords
  | Regular_Identifier

  : ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|Digit|'_')*

In some CJK country, their characters can be used as identifiers. We need to support unicode

This message was sent by Atlassian JIRA

View raw message