timsaucer opened a new pull request, #711:
URL: https://github.com/apache/datafusion-python/pull/711

   # Which issue does this PR close?
   
   Closes #696
   
    # Rationale for this change
   
   This PR sets up a work flow to generate TPH-C 1Gb data set in CI, runs the 
22 examples, and compares their results to the
   known answer file. By adding this PR we improve the robustness of our test 
suite.
   
   # What changes are included in this PR?
   
   This PR adds the following changes:
   
   - Adds CI workflow to generate TPH-C data set and answer file. The results 
are cached from run to run.
   - Adds pytest file to run all 22 examples and compare them to the answer file
   - Exposes substring operation that was missing. Sometime between 36.0.0 and 
38.0.0 substr was performing the operation instead.
   
   # Are there any user-facing changes?
   
   `substring` function exposed in python.
   
   # Additional context
   
   This PR replaces https://github.com/apache/datafusion-python/pull/710 which 
contains a lot of intermediate testing steps. This MR should be cleaner to 
review.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to