mahout-user mailing list archives

From Pat Ferrel <>
Subject Re: Sequence and vectors from CSV file
Date Sat, 24 May 2014 17:38:55 GMT
It depends: what are you doing with the data? Also, what are the value types? Are you interested
in all of them for analysis?

One thing to note is that if there are unique IDs in your data, they may need to be converted
into ordinal Ints for Mahout. So you need to map your IDs to ordinal Ints for input; then, when
you get the data out, you may need to apply the reverse map to get your IDs back.
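
As a rough illustration of that round trip (not Mahout code, just a plain Python sketch), the mapping could look like the following. The helper name `build_id_maps` and the sample names are hypothetical, and it assumes ordinals are handed out starting at 0 in first-seen order, which may not match what a given Mahout job expects.

```python
def build_id_maps(ids):
    """Assign each distinct ID an ordinal int (0-based, first-seen order).

    Hypothetical helper for illustration; not part of Mahout.
    """
    id_to_ordinal = {}   # forward map: original ID -> ordinal int
    ordinal_to_id = []   # reverse map: list index is the ordinal
    for raw_id in ids:
        if raw_id not in id_to_ordinal:
            id_to_ordinal[raw_id] = len(ordinal_to_id)
            ordinal_to_id.append(raw_id)
    return id_to_ordinal, ordinal_to_id


# Made-up IDs, e.g. values from a "name" column
names = ["alice", "bob", "alice", "carol"]
id_to_ordinal, ordinal_to_id = build_id_maps(names)

# Forward map before handing the data to the job
encoded = [id_to_ordinal[n] for n in names]

# Reverse map on whatever ordinals come back out
decoded = [ordinal_to_id[i] for i in encoded]
```

Both maps have to be saved alongside the data, since the ordinals in the output mean nothing without the reverse map.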

On May 22, 2014, at 10:47 PM, Chhaya Vishwakarma wrote:


I have a CSV file with the following columns: name,age,salary,experience

When I convert it to a sequence file, what exactly happens to the data?
What will the sequence file look like?

And once the sequence file is converted to vectors, what do they look like?
I want to understand what happens when we create sequence files and vectors from input data.

Chhaya Vishwakarma

