commons-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Yonik Seeley (JIRA)" <>
Subject [jira] Commented: (SANDBOX-206) backslash before quote character gives an error
Date Fri, 04 Jan 2008 20:11:34 GMT


Yonik Seeley commented on SANDBOX-206:

I've done some more investigation of what common CSV producers use:
Excel does not do backslash escaping.
MySQL does backslash escaping to escape separators and or to escape encapsulators if they
are used.  The escape is used to escape itself.

This is the behavior I think we should mimic (no backslash escaping, but escape consistently
if enabled)  I haven't been able to find any basis for the current behavior... so if you understand
why the current code purposely does what it does, speak up!

> backslash before quote character gives an error
> -----------------------------------------------
>                 Key: SANDBOX-206
>                 URL:
>             Project: Commons Sandbox
>          Issue Type: Bug
>          Components: CSV
>         Environment: Windows, SOLR 1.2
>            Reporter: Michael Lackhoff
> A CSV-field with the contents "This is text with a \""quoted"" string" gives the error
> "invalid  char between encapsualted token end delimiter". If the backslash is not immediately
before the double quote, everything is fine.
> The same error occurs when the backslash is the last character in the field (directly
before the delimiter), like:
> "This is a text with a backslash \".
> Here the reason might be that the backslash also works as an escape character like in
> "This is a field with a \"quoted\" text" (no error, just the quotes in the resulting

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message