Hyunsik Choi created TAJO-230:
---------------------------------
Summary: Support unicode identifiers
Key: TAJO-230
URL: https://issues.apache.org/jira/browse/TAJO-230
Project: Tajo
Issue Type: New Feature
Components: parser
Reporter: Hyunsik Choi
The current parse only recognizes a combination of alphabet characters and underscore as an
identifier used for function names, column names, and table names. This is because of the
following antlr lexer rules:
{code}
Identifier
: Nonreserved_keywords
| Regular_Identifier
;
Regular_Identifier
: ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|Digit|'_')*
;
{code}
In some CJK country, their characters can be used as identifiers. We need to support unicode
identifier.
--
This message was sent by Atlassian JIRA
(v6.1#6144)
|