jakarta-oro-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Daniel F. Savarese" <...@savarese.org>
Subject Re: Doubt about ORO
Date Thu, 10 Jan 2002 15:13:33 GMT

In message <295A9D64E5DC2D469405DE8037DDAB694FC14D@delmmsx01.sapient.com>, Chan
dramouli Kharidehal writes:
>How do i detect Unicode  Characters using the ORO package 
>For example usign \w i can detect all the ASCII characters 
>How about the characters beyond the ASCII Unicode Characters 

As I said before:

>\d matches based on Character.isDigit() and \w matches based on
>Character.isLetterOrDigit() or '_'.  So, you see, it's all Unicode based
>on Java's interpretation of how to classify characters with some attempt
>to remain true to Perl (e.g., the inclusion of '_' in \w).  Even though
>the source is somewhat inscrutable, these types of questions can be
>answered by looking at the source.

If you skimmed it the first time, please reread the first sentence of
my original reply which explicitly answers your question.  Since
Character.isLetterOrDigit(), or any other Character.isFoo()
method, is not restricted to ASCII, neither is \w, \d, or any of the
other character set specifiers.  Also pay special attention to the last
sentence of my original reply :)

daniel



--
To unsubscribe, e-mail:   <mailto:oro-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:oro-user-help@jakarta.apache.org>


Mime
View raw message