Re: [PR] [SPARK-51362][SQL] Change toJSON to use NextIterator API to eliminate adjacent record dependency [spark]

via GitHub Mon, 03 Mar 2025 19:15:11 -0800


HeartSaVioR commented on PR #50124:
URL: https://github.com/apache/spark/pull/50124#issuecomment-2696069749


   If you read through my explanation, you would know it's not about the 
latency slowdown of "every single" row, but maybe a single row per 1000-2000 
rows (or even higher, depending on the Kafka poll size). So the microbenchmark 
you asked only makes sense if you pick "Pmax" rather than "P99" or so. This 
isn't the "overall" performance improvement. This is a fix for an edge case.
   
   I'd also argue that the fix itself is good with the purpose of better code. 
That's why I asked which one you are not sure.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Re: [PR] [SPARK-51362][SQL] Change toJSON to use NextIterator API to eliminate adjacent record dependency [spark]

Reply via email to