In the issue, it states that it should be sufficient to append the hbase-protocol.jar file to the Hadoop classpath. Flink respects the Hadoop classpath and will append it to its own classpath upon launching a cluster.
To do that, you need to modify the classpath with one of the commands below. Note that this has to be performed on all cluster nodes. export HADOOP_CLASSPATH="${HADOOP_CLASSPATH}:/path/to/hbase-protocol.jar" export HADOOP_CLASSPATH="${HADOOP_CLASSPATH}:$(hbase mapredcp)" export HADOOP_CLASSPATH="${HADOOP_CLASSPATH}:$(hbase classpath)" Alternatively, you can build a fat jar from your project with the missing dependency. Flink will then automatically distribute the jar file upon job submission. Just add this Maven dependency to your fat-jar pom: <dependency> <groupId>org.apache.hbase</groupId> <artifactId>hbase-protocol</artifactId> <version>1.1.2</version> </dependency> Let me know if any of the two approaches work for you. After all, this is a workaround because of an HBase optimzation.. Cheers, Max On Wed, Sep 23, 2015 at 11:16 AM, Aljoscha Krettek <aljos...@apache.org> wrote: > It might me that this is causing the problem: > https://issues.apache.org/jira/browse/HBASE-10304 > > In your log I see the same exception. Anyone has any idea what we could do > about this? > > > On Tue, 22 Sep 2015 at 22:40 Lydia Ickler <ickle...@googlemail.com> wrote: > >> Hi, >> >> I am trying to get the HBaseReadExample to run. I have filled a table >> with the HBaseWriteExample and purposely split it over 3 regions. >> Now when I try to read from it the first split seems to be scanned (170 >> rows) fine and after that the Connections of Zookeeper and RCP are suddenly >> closed down. >> >> Does anyone has an idea why this is happening? >> >> Best regards, >> Lydia >> >> >> 22:28:10,178 DEBUG org.apache.flink.runtime.operators.DataSourceTask >> - Opening input split Locatable Split (2) at [grips5:60020]: >> DataSource (at createInput(ExecutionEnvironment.java:502) >> (org.apache.flink.HBaseReadExample$1)) (1/1) >> 22:28:10,178 INFO org.apache.flink.addons.hbase.TableInputFormat >> - opening split [2|[grips5:60020]|aaaaaaaa|-] >> 22:28:10,189 DEBUG org.apache.zookeeper.ClientCnxn >> - Reading reply sessionid:0x24ff6a96ecd000a, packet:: >> clientPath:null serverPath:null finished:false header:: 3,4 replyHeader:: >> 3,51539607639,0 request:: '/hbase/meta-region-server,F response:: >> #ffffffff0001a726567696f6e7365727665723a363030$ >> 22:28:10,202 DEBUG org.apache.zookeeper.ClientCnxn >> - Reading reply sessionid:0x24ff6a96ecd000a, packet:: >> clientPath:null serverPath:null finished:false header:: 4,4 replyHeader:: >> 4,51539607639,0 request:: '/hbase/meta-region-server,F response:: >> #ffffffff0001a726567696f6e7365727665723a363030$ >> 22:28:10,211 DEBUG LocalActorRefProvider(akka://flink) >> - resolve of path sequence [/temp/$b] failed >> 22:28:10,233 DEBUG org.apache.hadoop.hbase.util.ByteStringer >> - Failed to classload HBaseZeroCopyByteString: >> java.lang.IllegalAccessError: class >> com.google.protobuf.HBaseZeroCopyByteString cannot access its superclass >> com.google.protobuf.LiteralByteString >> 22:28:10,358 DEBUG org.apache.hadoop.ipc.RpcClient >> - Use SIMPLE authentication for service ClientService, sasl=false >> 22:28:10,370 DEBUG org.apache.hadoop.ipc.RpcClient >> - Connecting to grips1/130.73.20.14:60020 >> 22:28:10,380 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: starting, connections 1 >> 22:28:10,394 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: got response header call_id: 0, totalSize: 469 bytes >> 22:28:10,397 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: wrote request header call_id: 0 method_name: "Get" >> request_param: true >> 22:28:10,413 DEBUG org.apache.zookeeper.ClientCnxn >> - Reading reply sessionid:0x24ff6a96ecd000a, packet:: >> clientPath:null serverPath:null finished:false header:: 5,4 replyHeader:: >> 5,51539607639,0 request:: '/hbase/meta-region-server,F response:: >> #ffffffff0001a726567696f6e7365727665723a363030$ >> 22:28:10,424 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: wrote request header call_id: 1 method_name: "Scan" >> request_param: true priority: 100 >> 22:28:10,426 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: got response header call_id: 1 cell_block_meta { length: 480 >> }, totalSize: 497 bytes >> 22:28:10,432 DEBUG org.apache.hadoop.hbase.client.ClientSmallScanner >> - Finished with small scan at {ENCODED => 1588230740, NAME => >> 'hbase:meta,,1', STARTKEY => '', ENDKEY => ''} >> 22:28:10,434 DEBUG org.apache.hadoop.ipc.RpcClient >> - Use SIMPLE authentication for service ClientService, sasl=false >> 22:28:10,434 DEBUG org.apache.hadoop.ipc.RpcClient >> - Connecting to grips5/130.73.20.16:60020 >> 22:28:10,435 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: wrote request header call_id: 2 method_name: "Scan" >> request_param: true >> 22:28:10,436 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: starting, connections 2 >> 22:28:10,437 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: got response header call_id: 2, totalSize: 12 bytes >> 22:28:10,438 DEBUG org.apache.flink.runtime.operators.DataSourceTask >> - Starting to read input from split Locatable Split (2) at >> [grips5:60020]: DataSource (at createInput(ExecutionEnvironment.java:502) >> (org.apache.flink.HBaseReadExample$1)) (1/1) >> 22:28:10,438 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: wrote request header call_id: 3 method_name: "Scan" >> request_param: true >> 22:28:10,457 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: got response header call_id: 3 cell_block_meta { length: 4679 >> }, totalSize: 4899 bytes >> 22:28:10,476 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: wrote request header call_id: 4 method_name: "Scan" >> request_param: true >> 22:28:10,480 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: got response header call_id: 4 cell_block_meta { length: 3306 >> }, totalSize: 3466 bytes >> 22:28:10,482 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: got response header call_id: 5, totalSize: 8 bytes >> 22:28:10,482 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: wrote request header call_id: 5 method_name: "Scan" >> request_param: true >> 22:28:10,487 DEBUG org.apache.flink.runtime.operators.DataSourceTask >> - Closing input split Locatable Split (2) at [grips5:60020]: >> DataSource (at createInput(ExecutionEnvironment.java:502) >> (org.apache.flink.HBaseReadExample$1)) (1/1) >> 22:28:10,489 INFO >> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation >> - Closing zookeeper sessionid=0x24ff6a96ecd000a >> 22:28:10,489 DEBUG org.apache.zookeeper.ZooKeeper >> - Closing session: 0x24ff6a96ecd000a >> 22:28:10,489 DEBUG org.apache.zookeeper.ClientCnxn >> - Closing client for session: 0x24ff6a96ecd000a >> 22:28:10,499 DEBUG org.apache.zookeeper.ClientCnxn >> - Reading reply sessionid:0x24ff6a96ecd000a, packet:: >> clientPath:null serverPath:null finished:false header:: 6,-11 >> replyHeader:: 6,51539607640,0 request:: null response:: null >> 22:28:10,499 DEBUG org.apache.zookeeper.ClientCnxn >> - Disconnecting client for session: 0x24ff6a96ecd000a >> 22:28:10,499 INFO org.apache.zookeeper.ClientCnxn >> - EventThread shut down >> 22:28:10,499 INFO org.apache.zookeeper.ZooKeeper >> - Session: 0x24ff6a96ecd000a closed >> 22:28:10,499 DEBUG org.apache.hadoop.ipc.RpcClient >> - Stopping rpc client >> 22:28:10,501 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: closed >> 22:28:10,502 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips1/130.73.20.14:60020 >> from hduser: stopped, connections 0 >> 22:28:10,502 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: closed >> 22:28:10,502 DEBUG org.apache.hadoop.ipc.RpcClient >> - IPC Client (2145423150) connection to grips5/130.73.20.16:60020 >> from hduser: stopped, connections 0 >> 22:28:10,502 INFO org.apache.flink.addons.hbase.TableInputFormat >> - Closing split (scanned 170 rows) >> 22:28:10,508 DEBUG org.apache.flink.runtime.operators.DataSourceTask >> - Opening input split Locatable Split (1) at [grips4:60020]: >> DataSource (at createInput(ExecutionEnvironment.java:502) >> (org.apache.flink.HBaseReadExample$1)) (1/1) >> 22:28:10,509 INFO org.apache.flink.addons.hbase.TableInputFormat >> - opening split [1|[grips4:60020]|55555555|aaaaaaaa] >> 22:28:11,380 DEBUG org.apache.flink.runtime.taskmanager.TaskManager >> - Received message SendHeartbeat at akka://flink/user/taskmanager >> from Actor[akka://flink/deadLetters]. >> 22:28:11,380 DEBUG org.apache.flink.runtime.taskmanager.TaskManager >> - Sending heartbeat to JobManager >> >>