spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Joseph K. Bradley (JIRA)" <>
Subject [jira] [Created] (SPARK-2796) DecisionTree bug with ordered categorical features
Date Fri, 01 Aug 2014 21:51:40 GMT
Joseph K. Bradley created SPARK-2796:

             Summary: DecisionTree bug with ordered categorical features
                 Key: SPARK-2796
             Project: Spark
          Issue Type: Bug
          Components: MLlib
    Affects Versions: 1.0.0
            Reporter: Joseph K. Bradley

In DecisionTree, the method sequentialBinSearchForOrderedCategoricalFeatureInClassification()
indexed bins from 0 to (math.pow(2, featureCategories.toInt - 1) - 1).  This upper bound is
the bound for unordered categorical features, not ordered ones.  The upper bound should be
the arity (i.e., max value) of the feature.

This message was sent by Atlassian JIRA

View raw message