[jira] [Commented] (SPARK-21338) AggregatedDialect doesn't override isCascadingTruncateTable() method

2017-07-23 Thread Ostap Gonchar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097994#comment-16097994 ] Ostap Gonchar commented on SPARK-21338: --- Can anyone check this issue? > Aggregated

[jira] [Assigned] (SPARK-21516) overriding afterEach() in DatasetCacheSuite must call super.afterEach()

2017-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21516: Assignee: (was: Apache Spark) > overriding afterEach() in DatasetCacheSuite must call

[jira] [Assigned] (SPARK-21516) overriding afterEach() in DatasetCacheSuite must call super.afterEach()

2017-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21516: Assignee: Apache Spark > overriding afterEach() in DatasetCacheSuite must call super.after

[jira] [Commented] (SPARK-21516) overriding afterEach() in DatasetCacheSuite must call super.afterEach()

2017-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097981#comment-16097981 ] Apache Spark commented on SPARK-21516: -- User 'kiszk' has created a pull request for

[jira] [Updated] (SPARK-21443) Very long planning duration for queries with lots of operations

2017-07-23 Thread Eyal Zituny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eyal Zituny updated SPARK-21443: Priority: Minor (was: Major) > Very long planning duration for queries with lots of operations > -

[jira] [Commented] (SPARK-21443) Very long planning duration for queries with lots of operations

2017-07-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097952#comment-16097952 ] Takeshi Yamamuro commented on SPARK-21443: -- [~eyalzit] Could you update the titl

[jira] [Created] (SPARK-21516) overriding afterEach() in DatasetCacheSuite must call super.afterEach()

2017-07-23 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-21516: Summary: overriding afterEach() in DatasetCacheSuite must call super.afterEach() Key: SPARK-21516 URL: https://issues.apache.org/jira/browse/SPARK-21516 Proje

[jira] [Comment Edited] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097565#comment-16097565 ] Kazuaki Ishizaki edited comment on SPARK-21512 at 7/24/17 4:53 AM:

[jira] [Commented] (SPARK-21508) Documentation on 'Spark Streaming Custom Receivers' has error in example code

2017-07-23 Thread Remis Haroon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097937#comment-16097937 ] Remis Haroon commented on SPARK-21508: -- Thankyou [~srowen] , I will raise the PR. I

[jira] [Created] (SPARK-21515) Spark ML Random Forest

2017-07-23 Thread KovvuriSriRamaReddy (JIRA)
KovvuriSriRamaReddy created SPARK-21515: --- Summary: Spark ML Random Forest Key: SPARK-21515 URL: https://issues.apache.org/jira/browse/SPARK-21515 Project: Spark Issue Type: Question

[jira] [Commented] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097931#comment-16097931 ] Liang-Chi Hsieh commented on SPARK-21513: - Thanks a lot. Note that as I have not

[jira] [Commented] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097926#comment-16097926 ] Hyukjin Kwon commented on SPARK-21513: -- Okay. I added it back. Please cc me if anyon

[jira] [Updated] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-21513: - Labels: Starter (was: ) > SQL to_json should support all column types >

[jira] [Commented] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097922#comment-16097922 ] Liang-Chi Hsieh commented on SPARK-21513: - If for scala part only, it seems a sta

[jira] [Commented] (SPARK-21491) Performance enhancement: eliminate creation of intermediate collections

2017-07-23 Thread Iurii Antykhovych (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097903#comment-16097903 ] Iurii Antykhovych commented on SPARK-21491: --- Performed some micro-benchmarking

[jira] [Resolved] (SPARK-19490) Hive partition columns are case-sensitive

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19490. -- Resolution: Duplicate I am resolving this per https://github.com/apache/spark/pull/16832#issue

[jira] [Resolved] (SPARK-19490) Hive partition columns are case-sensitive

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19490. -- Resolution: Cannot Reproduce > Hive partition columns are case-sensitive >

[jira] [Reopened] (SPARK-19490) Hive partition columns are case-sensitive

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-19490: -- > Hive partition columns are case-sensitive > - > >

[jira] [Resolved] (SPARK-21269) MetadataFetchFailedException: Missing an output location for shuffle 0

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21269. -- Resolution: Cannot Reproduce I am resolving this per https://github.com/apache/spark/pull/1849

[jira] [Commented] (SPARK-21495) DIGEST-MD5: Out of order sequencing of messages from server

2017-07-23 Thread Xin Yu Pan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097885#comment-16097885 ] Xin Yu Pan commented on SPARK-21495: Hi Sean, Any suggestion or guidance regard to a

[jira] [Resolved] (SPARK-21454) Decimal up cast to higher scale fails while reading parquet to Dataset

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21454. -- Resolution: Invalid I am resolving this assuming the reporter has no argument for ^ or is inact

[jira] [Resolved] (SPARK-21487) WebUI-Executors Page results in "Request is a replay (34) attack"

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21487. -- Resolution: Invalid I am resolving this assuming the reporter is inactive or has no argument fo

[jira] [Commented] (SPARK-21493) Add more metrics to External Shuffle Service

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097865#comment-16097865 ] Hyukjin Kwon commented on SPARK-21493: -- gentle ping [~raajay] > Add more metrics to

[jira] [Updated] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-21513: - Labels: (was: Starter) > SQL to_json should support all column types >

[jira] [Commented] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097853#comment-16097853 ] Hyukjin Kwon commented on SPARK-21513: -- It won't be a starter BTW (assuming we shoul

[jira] [Commented] (SPARK-21491) Performance enhancement: eliminate creation of intermediate collections

2017-07-23 Thread Iurii Antykhovych (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097815#comment-16097815 ] Iurii Antykhovych commented on SPARK-21491: --- Sure, I'll try to write a performa

[jira] [Updated] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson updated SPARK-21513: --- Labels: Starter (was: ) > SQL to_json should support all column types >

[jira] [Created] (SPARK-21514) Hive has updated with new support for S3 and InsertIntoHiveTable.scala should update also

2017-07-23 Thread Javier Ros (JIRA)
Javier Ros created SPARK-21514: -- Summary: Hive has updated with new support for S3 and InsertIntoHiveTable.scala should update also Key: SPARK-21514 URL: https://issues.apache.org/jira/browse/SPARK-21514

[jira] [Updated] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron Davidson updated SPARK-21513: --- Description: The built-in SQL UDF "to_json" currently supports serializing StructType column

[jira] [Created] (SPARK-21513) SQL to_json should support all column types

2017-07-23 Thread Aaron Davidson (JIRA)
Aaron Davidson created SPARK-21513: -- Summary: SQL to_json should support all column types Key: SPARK-21513 URL: https://issues.apache.org/jira/browse/SPARK-21513 Project: Spark Issue Type: I

[jira] [Resolved] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21512. - Resolution: Fixed Assignee: Kazuaki Ishizaki Fix Version/s: 2.3.0 > DatasetCacheSuite nee

[jira] [Resolved] (SPARK-20871) Only log Janino code in debug mode

2017-07-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-20871. - Resolution: Fixed Fix Version/s: 2.3.0 > Only log Janino code in debug mode >

[jira] [Assigned] (SPARK-20871) Only log Janino code in debug mode

2017-07-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-20871: --- Assignee: PJ Fanning > Only log Janino code in debug mode > -- > >

[jira] [Commented] (SPARK-12008) Spark hive security authorization doesn't work as Apache hive's

2017-07-23 Thread Eugene Ilchenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097722#comment-16097722 ] Eugene Ilchenko commented on SPARK-12008: - [~pin_zhang] While I don't have answer

[jira] [Resolved] (SPARK-20904) Task failures during shutdown cause problems with preempted executors

2017-07-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-20904. - Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 Issue resolved by pull req

[jira] [Assigned] (SPARK-20904) Task failures during shutdown cause problems with preempted executors

2017-07-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-20904: --- Assignee: Marcelo Vanzin > Task failures during shutdown cause problems with preempted execu

[jira] [Updated] (SPARK-21507) Exception when using spark.jars.packages

2017-07-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21507: --- Description: When more than one process is using packages option it's possible to create fol

[jira] [Updated] (SPARK-21507) Exception when using spark.jars.packages

2017-07-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-21507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-21507: --- Description: When more than one process is using packages option it's possible to create exc

[jira] [Updated] (SPARK-20712) [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has length greater than 4000 bytes

2017-07-23 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-20712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-20712: --- Summary: [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has length

[jira] [Commented] (SPARK-21491) Performance enhancement: eliminate creation of intermediate collections

2017-07-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097580#comment-16097580 ] Sean Owen commented on SPARK-21491: --- [~sereneant] I agree with the above. Do we have an

[jira] [Updated] (SPARK-21506) The description of "spark.executor.cores" may be not correct

2017-07-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-21506: -- Affects Version/s: (was: 2.3.0) 2.2.0 Priority: Trivial (was: M

[jira] [Assigned] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21512: Assignee: Apache Spark > DatasetCacheSuite needs to execute unpersistent after executing p

[jira] [Assigned] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21512: Assignee: (was: Apache Spark) > DatasetCacheSuite needs to execute unpersistent after

[jira] [Commented] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097571#comment-16097571 ] Apache Spark commented on SPARK-21512: -- User 'kiszk' has created a pull request for

[jira] [Commented] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16097565#comment-16097565 ] Kazuaki Ishizaki commented on SPARK-21512: -- When {DatasetCacheSuite} is executed

[jira] [Updated] (SPARK-21512) DatasetCacheSuite needs to execute unpersistent after executing peristent

2017-07-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-21512: - Summary: DatasetCacheSuite needs to execute unpersistent after executing peristent (was:

[jira] [Updated] (SPARK-21512) DatasetCacheSuites need to execute unpersistent after executing peristent

2017-07-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-21512: - Summary: DatasetCacheSuites need to execute unpersistent after executing peristent (was:

[jira] [Resolved] (SPARK-21511) Unable to load Pipeline or PipelineModel

2017-07-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21511. --- Resolution: Invalid This should start as a question on the mailing list. It looks like an error in y