hadoop-mapreduce-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "bc Wong (JIRA)" <j...@apache.org>
Subject [jira] [Created] (MAPREDUCE-5777) Support utf-8 text with BOM (byte order marker)
Date Tue, 04 Mar 2014 20:06:23 GMT
bc Wong created MAPREDUCE-5777:

             Summary: Support utf-8 text with BOM (byte order marker)
                 Key: MAPREDUCE-5777
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5777
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
    Affects Versions: 2.2.0, 0.22.0
            Reporter: bc Wong

UTF-8 text may have a BOM. TextInputFormat, KeyValueTextInputFormat and friends should recognize
the BOM and not treat it as actual data.

This message was sent by Atlassian JIRA

View raw message