spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xiangrui Meng (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-15064) Locale support in StopWordsRemover
Date Mon, 02 May 2016 16:03:13 GMT
Xiangrui Meng created SPARK-15064:
-------------------------------------

             Summary: Locale support in StopWordsRemover
                 Key: SPARK-15064
                 URL: https://issues.apache.org/jira/browse/SPARK-15064
             Project: Spark
          Issue Type: New Feature
          Components: ML
    Affects Versions: 2.0.0
            Reporter: Xiangrui Meng


We support case insensitive filtering (default) in StopWordsRemover. However, case insensitive
matching depends on the locale and region, which cannot be explicitly set in StopWordsRemover.
We should consider adding this support in MLlib.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message