[ 
https://issues.apache.org/jira/browse/SPARK-5685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15657502#comment-15657502
 ] 

Aditya commented on SPARK-5685:
-------------------------------

If this issue is still open, I can work on it

> Show warning when users open text files compressed with non-splittable 
> algorithms like gzip
> -------------------------------------------------------------------------------------------
>
>                 Key: SPARK-5685
>                 URL: https://issues.apache.org/jira/browse/SPARK-5685
>             Project: Spark
>          Issue Type: Improvement
>          Components: Spark Core
>            Reporter: Nicholas Chammas
>            Priority: Minor
>
> This is a usability or user-friendliness issue.
> It's extremely common for people to load a text file compressed with gzip, 
> process it, and then wonder why only 1 core in their cluster is doing any 
> work.
> Some examples:
> * http://stackoverflow.com/q/28127119/877069
> * http://stackoverflow.com/q/27531816/877069
> I'm not sure how this problem can be generalized, but at the very least it 
> would be helpful if Spark displayed some kind of warning in the common case 
> when someone opens a gzipped file with {{sc.textFile}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to