[ https://issues.apache.org/jira/browse/SPARK-16211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15350294#comment-15350294 ]
Dongjoon Hyun commented on SPARK-16211: --------------------------------------- Great! :) > DataFrame filter is buggy when used with "and" > ---------------------------------------------- > > Key: SPARK-16211 > URL: https://issues.apache.org/jira/browse/SPARK-16211 > Project: Spark > Issue Type: Bug > Components: Spark Shell, SQL > Affects Versions: 1.5.0 > Environment: CDH 5.5.0/YARN > Reporter: Renat Bekbolatov > > df was a result of several joins with some upstream tables having column > names renamed. > {code} > scala> df.filter(col("ad_market_id") === 4 && col("event_date") === > "2016-05-30").show > +----------+------------+ > |event_date|ad_market_id| > +----------+------------+ > +----------+------------+ > scala> df.filter("ad_market_id = 4 and event_date = '2016-05-30'").show > +----------+------------+ > |event_date|ad_market_id| > +----------+------------+ > +----------+------------+ > scala> df.filter("ad_market_id = 4").coalesce(20).filter("event_date = > '2016-05-30'").show > +----------+------------+ > |event_date|ad_market_id| > +----------+------------+ > |2016-05-30| 4| > +----------+------------+ > scala> sc.version > res40: String = 1.5.0 > scala> df > res41: org.apache.spark.sql.DataFrame = [event_date: string, ad_market_id: > int] > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org