Light-Towers opened a new issue, #8427:
URL: https://github.com/apache/seatunnel/issues/8427

   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22)
 and found no similar issues.
   
   
   ### What happened
   
   Hive source connector 出现文件读取异常,但文件确实存在(见截图)。业务表将近有4亿条数据,共200个文件。请问为什么会出现这个问题?
   
![image](https://github.com/user-attachments/assets/e410858b-572e-44cd-a145-610090ff69c2)
   
   
   ### SeaTunnel Version
   
   2.3.8
   
   ### SeaTunnel Config
   
   ```conf
   env {
     execution.parallelism = 4
     job.mode = "BATCH"
     read_limit.rows_per_second = 50000
   }
   
   source {
    Hive {
      table_name = "dw.app_enterprise_info"
      metastore_uri = "thrift://master02:9083"
      hive_site_path = "/etc/hive/conf/hive-site.xml"
      hdfs_site_path = "/etc/hadoop/conf/hdfs-site.xml"
      read_partitions = ["dt=2024-12-31"]
      compress_codec = "orc"
    }
   }
   
   sink {
     Console { }
   }
   ```
   
   
   ### Running Command
   
   ```shell
   /opt/seatunnel/apache-seatunnel-2.3.8/bin/seatunnel.sh --config ./hive2mongo 
-m local
   ```
   
   
   ### Error Exception
   
   ```log
   2025-01-02 16:47:04,888 INFO  [.a.s.c.s.c.s.ConsoleSinkWriter] 
[st-multi-table-sink-writer-1] - subtaskIndex=0  rowIndex=1806171:  
SeaTunnelRow#tableId=dw.app_enterprise_info SeaTunnelRow#kind=INSERT : 
d783b7bc34975a, 银川市建材销售部, , , , , , xxx, 一般项目:, 银川市商住楼105, 银川市商住楼105, 
2022-02-24, 926xxxxNW4Y, 001, 注销, [106.30129, 38.494615], 640000, 宁夏回族自治区, 
640100, 银川市, , , , , , , , , , , , , , , , , , , [1], 18100000066, , , 
[{"category":"状态","tagName":"续"}], 6, 2022-02-24, 9999-12-31, , , 人民币, , 
MAxxxxx4, 2024-04-29, 1, 0, 2024-07-18, 2024-12-31
   2025-01-02 16:47:04,890 INFO  [c.h.i.s.t.TcpServerConnection ] 
[hz.main.IO.thread-in-1] - [localhost]:5801 [seatunnel-930048] [5.1] 
Connection[id=1, /127.0.0.1:5801->/127.0.0.1:55002, qualifier=null, 
endpoint=[127.0.0.1]:55002, remoteUuid=412e8da3-b9de-4db3-b558-bf654a01e6af, 
alive=false, connectionType=JVM, planeIndex=-1] closed. Reason: Connection 
closed by the other side
   2025-01-02 16:47:04,890 INFO  [.c.i.c.ClientConnectionManager] [main] - 
hz.client_1 [seatunnel-930048] [5.1] Removed connection to endpoint: 
[localhost]:5801:a3575d69-8d06-49d7-b52e-c16135fdeb48, connection: 
ClientConnection{alive=false, connectionId=1, 
channel=NioChannel{/127.0.0.1:55002->localhost/127.0.0.1:5801}, 
remoteAddress=[localhost]:5801, lastReadTime=2025-01-02 16:47:04.622, 
lastWriteTime=2025-01-02 16:47:01.020, closedTime=2025-01-02 16:47:04.886, 
connected server version=5.1}
   2025-01-02 16:47:04,890 INFO  [c.h.c.LifecycleService        ] [main] - 
hz.client_1 [seatunnel-930048] [5.1] HazelcastClient 5.1 (20220228 - 21f20e7) 
is CLIENT_DISCONNECTED
   2025-01-02 16:47:04,892 INFO  [c.h.c.i.ClientEndpointManager ] 
[hz.main.event-3] - [localhost]:5801 [seatunnel-930048] [5.1] Destroying 
ClientEndpoint{connection=Connection[id=1, /127.0.0.1:5801->/127.0.0.1:55002, 
qualifier=null, endpoint=[127.0.0.1]:55002, 
remoteUuid=412e8da3-b9de-4db3-b558-bf654a01e6af, alive=false, 
connectionType=JVM, planeIndex=-1], 
clientUuid=412e8da3-b9de-4db3-b558-bf654a01e6af, clientName=hz.client_1, 
authenticated=true, clientVersion=5.1, creationTime=1735805435983, latest 
clientAttributes=lastStatisticsCollectionTime=1735807621018,enterprise=false,clientType=JVM,clientVersion=5.1,clusterConnectionTimestamp=1735805435971,clientAddress=127.0.0.1,clientName=hz.client_1,credentials.principal=null,os.committedVirtualMemorySize=24853250048,os.freePhysicalMemorySize=113494515712,os.freeSwapSpaceSize=0,os.maxFileDescriptorCount=100000,os.openFileDescriptorCount=69,os.processCpuTime=1177350000000,os.systemLoadAverage=0.64,os.totalPhysicalMemorySize=270185119744,os
 
.totalSwapSpaceSize=0,runtime.availableProcessors=88,runtime.freeMemory=186469144,runtime.maxMemory=513802240,runtime.totalMemory=513802240,runtime.uptime=2188449,runtime.usedMemory=327335416,
 labels=[]}
   2025-01-02 16:47:04,893 INFO  [c.h.c.LifecycleService        ] [main] - 
hz.client_1 [seatunnel-930048] [5.1] HazelcastClient 5.1 (20220228 - 21f20e7) 
is SHUTDOWN
   2025-01-02 16:47:04,894 INFO  [s.c.s.s.c.ClientExecuteCommand] [main] - 
Closed SeaTunnel client......
   2025-01-02 16:47:04,894 INFO  [c.h.c.LifecycleService        ] [main] - 
[localhost]:5801 [seatunnel-930048] [5.1] [localhost]:5801 is SHUTTING_DOWN
   2025-01-02 16:47:04,898 INFO  [c.h.i.p.i.MigrationManager    ] 
[hz.main.cached.thread-16] - [localhost]:5801 [seatunnel-930048] [5.1] Shutdown 
request of Member [localhost]:5801 - a3575d69-8d06-49d7-b52e-c16135fdeb48 this 
master is handled
   2025-01-02 16:47:04,904 INFO  [c.h.i.i.Node                  ] [main] - 
[localhost]:5801 [seatunnel-930048] [5.1] Shutting down connection manager...
   2025-01-02 16:47:04,906 INFO  [c.h.i.i.Node                  ] [main] - 
[localhost]:5801 [seatunnel-930048] [5.1] Shutting down node engine...
   2025-01-02 16:47:04,919 INFO  [.c.c.DefaultClassLoaderService] [main] - 
close classloader service
   2025-01-02 16:47:04,919 INFO  [o.a.s.c.u.RetryUtils          ] 
[event-forwarder-0] - Failed to execute due to java.lang.NullPointerException: 
Target cannot be null!
        at 
com.hazelcast.internal.util.Preconditions.checkNotNull(Preconditions.java:59)
        at 
com.hazelcast.spi.impl.operationservice.impl.OperationServiceImpl.createInvocationBuilder(OperationServiceImpl.java:300)
        at 
org.apache.seatunnel.engine.server.utils.NodeEngineUtil.sendOperationToMasterNode(NodeEngineUtil.java:37)
        at 
org.apache.seatunnel.engine.server.EventService.lambda$initEventForwardService$1(EventService.java:72)
        at 
org.apache.seatunnel.common.utils.RetryUtils.retryWithException(RetryUtils.java:48)
        at 
org.apache.seatunnel.engine.server.EventService.lambda$initEventForwardService$2(EventService.java:70)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
   . Retrying attempt (1/2) after backoff of 0 ms
   2025-01-02 16:47:04,920 INFO  [o.a.s.c.u.RetryUtils          ] 
[event-forwarder-0] - Failed to execute due to java.lang.NullPointerException: 
Target cannot be null!
        at 
com.hazelcast.internal.util.Preconditions.checkNotNull(Preconditions.java:59)
        at 
com.hazelcast.spi.impl.operationservice.impl.OperationServiceImpl.createInvocationBuilder(OperationServiceImpl.java:300)
        at 
org.apache.seatunnel.engine.server.utils.NodeEngineUtil.sendOperationToMasterNode(NodeEngineUtil.java:37)
        at 
org.apache.seatunnel.engine.server.EventService.lambda$initEventForwardService$1(EventService.java:72)
        at 
org.apache.seatunnel.common.utils.RetryUtils.retryWithException(RetryUtils.java:48)
        at 
org.apache.seatunnel.engine.server.EventService.lambda$initEventForwardService$2(EventService.java:70)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
   . Retrying attempt (2/2) after backoff of 0 ms
   2025-01-02 16:47:04,920 WARN  [o.a.s.e.s.EventService        ] 
[event-forwarder-0] - Event forward failed, discard events 1
   java.lang.RuntimeException: Execute given execution failed after retry 2 
times
        at 
org.apache.seatunnel.common.utils.RetryUtils.retryWithException(RetryUtils.java:75)
 ~[seatunnel-starter.jar:2.3.8]
        at 
org.apache.seatunnel.engine.server.EventService.lambda$initEventForwardService$2(EventService.java:70)
 ~[seatunnel-starter.jar:2.3.8]
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
[?:1.8.0_181]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) 
[?:1.8.0_181]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) 
[?:1.8.0_181]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) 
[?:1.8.0_181]
        at java.lang.Thread.run(Thread.java:748) [?:1.8.0_181]
   Caused by: java.lang.NullPointerException: Target cannot be null!
        at 
com.hazelcast.internal.util.Preconditions.checkNotNull(Preconditions.java:59) 
~[seatunnel-starter.jar:2.3.8]
        at 
com.hazelcast.spi.impl.operationservice.impl.OperationServiceImpl.createInvocationBuilder(OperationServiceImpl.java:300)
 ~[seatunnel-starter.jar:2.3.8]
        at 
org.apache.seatunnel.engine.server.utils.NodeEngineUtil.sendOperationToMasterNode(NodeEngineUtil.java:37)
 ~[seatunnel-starter.jar:2.3.8]
        at 
org.apache.seatunnel.engine.server.EventService.lambda$initEventForwardService$1(EventService.java:72)
 ~[seatunnel-starter.jar:2.3.8]
        at 
org.apache.seatunnel.common.utils.RetryUtils.retryWithException(RetryUtils.java:48)
 ~[seatunnel-starter.jar:2.3.8]
        ... 6 more
   2025-01-02 16:47:04,931 INFO  [c.h.i.i.NodeExtension         ] [main] - 
[localhost]:5801 [seatunnel-930048] [5.1] Destroying node NodeExtension.
   2025-01-02 16:47:04,931 INFO  [c.h.i.i.Node                  ] [main] - 
[localhost]:5801 [seatunnel-930048] [5.1] Hazelcast Shutdown is completed in 34 
ms.
   2025-01-02 16:47:04,932 INFO  [c.h.c.LifecycleService        ] [main] - 
[localhost]:5801 [seatunnel-930048] [5.1] [localhost]:5801 is SHUTDOWN
   2025-01-02 16:47:04,932 INFO  [s.c.s.s.c.ClientExecuteCommand] [main] - 
Closed HazelcastInstance ......
   2025-01-02 16:47:04,932 INFO  [s.c.s.s.c.ClientExecuteCommand] [main] - 
Closed metrics executor service ......
   2025-01-02 16:47:04,932 ERROR [o.a.s.c.s.SeaTunnel           ] [main] - 
   
   
===============================================================================
   
   
   2025-01-02 16:47:04,932 ERROR [o.a.s.c.s.SeaTunnel           ] [main] - 
Fatal Error, 
   
   2025-01-02 16:47:04,932 ERROR [o.a.s.c.s.SeaTunnel           ] [main] - 
Please submit bug report in https://github.com/apache/seatunnel/issues
   
   2025-01-02 16:47:04,932 ERROR [o.a.s.c.s.SeaTunnel           ] [main] - 
Reason:SeaTunnel job executed failed 
   
   2025-01-02 16:47:04,933 ERROR [o.a.s.c.s.SeaTunnel           ] [main] - 
Exception 
StackTrace:org.apache.seatunnel.core.starter.exception.CommandExecuteException: 
SeaTunnel job executed failed
        at 
org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:213)
        at org.apache.seatunnel.core.starter.SeaTunnel.run(SeaTunnel.java:40)
        at 
org.apache.seatunnel.core.starter.seatunnel.SeaTunnelClient.main(SeaTunnelClient.java:34)
   Caused by: 
org.apache.seatunnel.engine.common.exception.SeaTunnelEngineException: 
org.apache.seatunnel.connectors.seatunnel.file.exception.FileConnectorException:
 ErrorCode:[FILE-08], ErrorDescription:[File read failed] - Read data from this 
file 
[dw.app_enterprise_info_hdfs://nameservice1/user/hive/warehouse/app/app_enterprise_info/dt=2024-12-31/part-00009-7f77f045-c85d-4e50-9f98-51d2874ecce2.c000]
 failed
        at 
org.apache.seatunnel.connectors.seatunnel.hive.source.reader.MultipleTableHiveSourceReader.pollNext(MultipleTableHiveSourceReader.java:87)
        at 
org.apache.seatunnel.engine.server.task.flow.SourceFlowLifeCycle.collect(SourceFlowLifeCycle.java:159)
        at 
org.apache.seatunnel.engine.server.task.SourceSeaTunnelTask.collect(SourceSeaTunnelTask.java:127)
        at 
org.apache.seatunnel.engine.server.task.SeaTunnelTask.stateProcess(SeaTunnelTask.java:168)
        at 
org.apache.seatunnel.engine.server.task.SourceSeaTunnelTask.call(SourceSeaTunnelTask.java:132)
        at 
org.apache.seatunnel.engine.server.TaskExecutionService$BlockingWorker.run(TaskExecutionService.java:693)
        at 
org.apache.seatunnel.engine.server.TaskExecutionService$NamedTaskWrapper.run(TaskExecutionService.java:1018)
        at org.apache.seatunnel.api.tracing.MDCRunnable.run(MDCRunnable.java:39)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
   Caused by: java.lang.NullPointerException
   
        at 
org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:205)
        ... 2 more
    
   2025-01-02 16:47:04,934 ERROR [o.a.s.c.s.SeaTunnel           ] [main] - 
   
===============================================================================
   
   
   
   Exception in thread "main" 
org.apache.seatunnel.core.starter.exception.CommandExecuteException: SeaTunnel 
job executed failed
        at 
org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:213)
        at org.apache.seatunnel.core.starter.SeaTunnel.run(SeaTunnel.java:40)
        at 
org.apache.seatunnel.core.starter.seatunnel.SeaTunnelClient.main(SeaTunnelClient.java:34)
   Caused by: 
org.apache.seatunnel.engine.common.exception.SeaTunnelEngineException: 
org.apache.seatunnel.connectors.seatunnel.file.exception.FileConnectorException:
 ErrorCode:[FILE-08], ErrorDescription:[File read failed] - Read data from this 
file 
[dw.app_enterprise_info_hdfs://nameservice1/user/hive/warehouse/app/app_enterprise_info/dt=2024-12-31/part-00009-7f77f045-c85d-4e50-9f98-51d2874ecce2.c000]
 failed
        at 
org.apache.seatunnel.connectors.seatunnel.hive.source.reader.MultipleTableHiveSourceReader.pollNext(MultipleTableHiveSourceReader.java:87)
        at 
org.apache.seatunnel.engine.server.task.flow.SourceFlowLifeCycle.collect(SourceFlowLifeCycle.java:159)
        at 
org.apache.seatunnel.engine.server.task.SourceSeaTunnelTask.collect(SourceSeaTunnelTask.java:127)
        at 
org.apache.seatunnel.engine.server.task.SeaTunnelTask.stateProcess(SeaTunnelTask.java:168)
        at 
org.apache.seatunnel.engine.server.task.SourceSeaTunnelTask.call(SourceSeaTunnelTask.java:132)
        at 
org.apache.seatunnel.engine.server.TaskExecutionService$BlockingWorker.run(TaskExecutionService.java:693)
        at 
org.apache.seatunnel.engine.server.TaskExecutionService$NamedTaskWrapper.run(TaskExecutionService.java:1018)
        at org.apache.seatunnel.api.tracing.MDCRunnable.run(MDCRunnable.java:39)
        at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
        at java.util.concurrent.FutureTask.run(FutureTask.java:266)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
        at java.lang.Thread.run(Thread.java:748)
   Caused by: java.lang.NullPointerException
   
        at 
org.apache.seatunnel.core.starter.seatunnel.command.ClientExecuteCommand.execute(ClientExecuteCommand.java:205)
        ... 2 more
   2025-01-02 16:47:04,935 INFO  [s.c.s.s.c.ClientExecuteCommand] 
[ForkJoinPool.commonPool-worker-100] - run shutdown hook because get close 
signal
   ```
   
   
   ### Zeta or Flink or Spark Version
   
   Zeta
   
   ### Java or Scala Version
   
   java version "1.8.0_181"
   
   ### Screenshots
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@seatunnel.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to