Yuchen Liu created SPARK-51362:
----------------------------------

             Summary: change toJSON to use NextIterator API to reduce latency
                 Key: SPARK-51362
                 URL: https://issues.apache.org/jira/browse/SPARK-51362
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 4.0.0
            Reporter: Yuchen Liu


The current toJSON operation uses the Iterator API where iter.hasNext is called 
after iter.next, which means the return of current row depends on the next row 
to arrive. If we change it to use the NextIterator API, iter.next will be 
called after iter.hasNext, so the current row will return immediately. This 
eliminates the dependency between adjacent rows, which reduces record-level 
latency.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to