I see, thanks Reynold. Reynold Xin <r...@databricks.com> 于2018年7月20日周五 上午8:46写道:
> Looking at the list of pull requests it looks like this is the ticket: > https://issues.apache.org/jira/browse/SPARK-24867 > > > > On Thu, Jul 19, 2018 at 5:25 PM Reynold Xin <r...@databricks.com> wrote: > >> I don't think my ticket should block this release. It's a big general >> refactoring. >> >> Xiao do you have a ticket for the bug you found? >> >> >> On Thu, Jul 19, 2018 at 5:24 PM Saisai Shao <sai.sai.s...@gmail.com> >> wrote: >> >>> Hi Xiao, >>> >>> Are you referring to this JIRA ( >>> https://issues.apache.org/jira/browse/SPARK-24865)? >>> >>> Xiao Li <gatorsm...@gmail.com> 于2018年7月20日周五 上午2:41写道: >>> >>>> dfWithUDF.cache() >>>> dfWithUDF.write.saveAsTable("t") >>>> dfWithUDF.write.saveAsTable("t1") >>>> >>>> >>>> Cached data is not being used. It causes a big performance regression. >>>> >>>> >>>> >>>> >>>> 2018-07-19 11:32 GMT-07:00 Sean Owen <sro...@gmail.com>: >>>> >>>>> What regression are you referring to here? A -1 vote really needs a >>>>> rationale. >>>>> >>>>> On Thu, Jul 19, 2018 at 1:27 PM Xiao Li <gatorsm...@gmail.com> wrote: >>>>> >>>>>> I would first vote -1. >>>>>> >>>>>> I might find another regression caused by the analysis barrier. Will >>>>>> keep you posted. >>>>>> >>>>>> >>>>