Ruslan Dautkhanov created ZEPPELIN-1803:
-------------------------------------------

             Summary: CSV export doesn't conform to RFC-4180: exported csv is broken in some cases
                 Key: ZEPPELIN-1803
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1803
             Project: Zeppelin
          Issue Type: Bug
          Components: Core, front-end, zeppelin-server
    Affects Versions: 0.7.0
            Reporter: Ruslan Dautkhanov


CSV export doesn't conform to RFC-4180: exported csv is broken in some cases

RFC-4180:
{quote}
If double-quotes are used to enclose fields, then a double-quote appearing inside a field must be escaped by preceding it with another double quote.
{quote}

Because Zeppelin does not escape them this way, CSV with double quotes (") exported from Zeppelin is not importable by any tools, including Excel.

It looks like CSV export has other issues too. For example, in some cases an exported column value was a negative number instead of a character field. This could be a new bug, or it could again be related to the fact that Zeppelin's CSV export doesn't conform to the RFC-4180 standard.

Some related quotes from RFC-4180:

{noformat}
   5.  Each field may or may not be enclosed in double quotes (however
       some programs, such as Microsoft Excel, do not use double quotes
       at all).  If fields are not enclosed with double quotes, then
       double quotes may not appear inside the fields.  For example:

       "aaa","bbb","ccc" CRLF
       zzz,yyy,xxx

   6.  Fields containing line breaks (CRLF), double quotes, and commas
       should be enclosed in double-quotes.  For example:

       "aaa","b CRLF
       bb","ccc" CRLF
       zzz,yyy,xxx

   7.  If double-quotes are used to enclose fields, then a double-quote
       appearing inside a field must be escaped by preceding it with
       another double quote.  For example:

       "aaa","b""bb","ccc"
{noformat}
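
For illustration only, here is a minimal Python sketch of the field escaping that RFC-4180 rules 5 to 7 describe. It is not Zeppelin's export code; the names `escape_field` and `to_csv_line` are hypothetical, and the sketch assumes every value has already been converted to a string. The last lines parse the output back with Python's standard `csv` module to check that a conforming reader recovers the original values.

```python
import csv
import io
from typing import Iterable


def escape_field(value: str) -> str:
    """Quote a single field as RFC-4180 rules 5-7 describe (hypothetical helper)."""
    # Rule 6: fields containing line breaks, double quotes, or commas
    # should be enclosed in double quotes.
    needs_quotes = any(ch in value for ch in ('"', ",", "\r", "\n"))
    if not needs_quotes:
        return value
    # Rule 7: a double quote inside a quoted field is escaped by doubling it.
    return '"' + value.replace('"', '""') + '"'


def to_csv_line(fields: Iterable[str]) -> str:
    """Join already-stringified fields into one CRLF-terminated record."""
    return ",".join(escape_field(f) for f in fields) + "\r\n"


# A value containing a double quote, as in the RFC example "b""bb".
line = to_csv_line(["aaa", 'b"bb', "ccc"])

# Round trip: a standard CSV reader should recover the original values.
parsed = next(csv.reader(io.StringIO(line, newline="")))
print(repr(line))
print(parsed)
```

The sketch quotes only the fields that need it, which rule 5 permits. Quoting every field would also conform, as long as embedded double quotes are doubled.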