mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Should I be using OnlineLogisticRegression?
Date Fri, 07 Sep 2012 00:14:24 GMT
Try transforming them as well, likely with a log if they are positive and
have heavily skewed values.

Can you suck the data into R and paste in the results of summary(x)?
(assuming you put the data into the variable x).  This should look
something like:

> summary(x)
>        v1                 v2                  v3
>  Min.   :-3.41939   Min.   :0.0002538   Min.   :1.188
>  1st Qu.:-0.66695   1st Qu.:0.3122501   1st Qu.:3.321
>  Median :-0.07277   Median :0.6830144   Median :3.972
>  Mean   :-0.05619   Mean   :1.0286261   Mean   :4.010
>  3rd Qu.: 0.56784   3rd Qu.:1.4619058   3rd Qu.:4.712
>  Max.   : 2.74271   Max.   :7.7754864   Max.   :7.252
> >


On Thu, Sep 6, 2012 at 4:58 PM, Diederik van Liere <
Diederik.vanLiere@rotman.utoronto.ca> wrote:

>
> > - My (6) predictor variables are all numeric; some of the variables range
> > from 0...5, others range from 0...1,000,000.
> Have you tried rescaling your predictor variables so they have the same
> range?
>
> Diederik

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message