sunriseXDM opened a new issue, #30222:
URL: https://github.com/apache/doris/issues/30222

   环境:
   spark2.3
   doris2.0.3
   
   connector:spark-doris-connector-2.3_2.11-1.2.0
   
   报错内容:
   24/01/19 14:52:20 INFO BackendClient: Success connect to Doris 
BE{host='192.168.1.11', port=9060}.
   24/01/19 14:52:20 INFO BackendClient: Success connect to Doris 
BE{host='192.168.1.11', port=9060}.
   24/01/19 14:52:20 ERROR RowBatch: Schema size '1' is not equal to arrow 
field size '2'.
   24/01/19 14:52:20 ERROR RowBatch: Schema size '1' is not equal to arrow 
field size '2'.
   24/01/19 14:52:20 ERROR RowBatch: Read Doris Data failed because: 
   org.apache.doris.spark.exception.DorisException: Load Doris data failed, 
schema size of fetch data is wrong.
        at 
org.apache.doris.spark.serialization.RowBatch.<init>(RowBatch.java:99)
        at 
org.apache.doris.spark.rdd.ScalaValueReader.hasNext(ScalaValueReader.scala:210)
        at 
org.apache.doris.spark.rdd.AbstractDorisRDDIterator.hasNext(AbstractDorisRDDIterator.scala:56)
        at scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:409)
        at 
org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage10.agg_doAggregateWithKeys_0$(Unknown
 Source)
        at 
org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage10.processNext(Unknown
 Source)
        at 
org.apache.spark.sql.execution.BufferedRowIterator.hasNext(BufferedRowIterator.java:43)
        at 
org.apache.spark.sql.execution.WholeStageCodegenExec$$anonfun$10$$anon$1.hasNext(WholeStageCodegenExec.scala:614)
        at scala.collection.Iterator$$anon$11.hasNext(Iterator.scala:409)
        at 
org.apache.spark.shuffle.sort.BypassMergeSortShuffleWriter.write(BypassMergeSortShuffleWriter.java:125)
        at 
org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:96)
        at 
org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:53)
        at org.apache.spark.scheduler.Task.run(Task.scala:109)
        at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:345)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
   
   
   
   排查发现,在github上查到doris-flink-connector 
1.5.1修复了这个问题。链接:https://github.com/apache/doris-flink-connector/pull/261
   请问spark-doris-connector也有这个问题吗?而生产在用spark2导入doris2.0.2没有这个问题
   
   _Originally posted by @sunriseXDM in 
https://github.com/apache/doris/discussions/30134_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to