timsaucer opened a new pull request, #1167:
URL: https://github.com/apache/datafusion-python/pull/1167

   # Which issue does this PR close?
   
   None
   
    # Rationale for this change
   
   By design in a Jupyter notebook `display()` calls both `__repr__` and 
`_repr_html_`. This currently causes `collect()` on DataFrames to occur twice, 
which can lead to double the execution time during evaluation. This PR causes 
collect to only happen once.
   
   # What changes are included in this PR?
   
   If we are in a jupyter notebook, `__repr__` will return an empty string. 
Only `_repr_html_` will collect and display a formatted table. Users can still 
use `df.show()` if they want the table view.
   
   # Are there any user-facing changes?
   
   None.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to