timsaucer opened a new pull request, #1167: URL: https://github.com/apache/datafusion-python/pull/1167
# Which issue does this PR close? None # Rationale for this change By design in a Jupyter notebook `display()` calls both `__repr__` and `_repr_html_`. This currently causes `collect()` on DataFrames to occur twice, which can lead to double the execution time during evaluation. This PR causes collect to only happen once. # What changes are included in this PR? If we are in a jupyter notebook, `__repr__` will return an empty string. Only `_repr_html_` will collect and display a formatted table. Users can still use `df.show()` if they want the table view. # Are there any user-facing changes? None. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org