Make sure that each supervisor node can resolve the hostnames of the others via /etc/hosts or DNS, and that it can reach every worker port on all hosts.

Can each node reach Nimbus and ZooKeeper?
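
One way to test this from each node is a small TCP probe. The following is only a sketch, not a Storm tool: the function names are made up for illustration, and the ports in the usage note are the Storm defaults as far as I know (Nimbus 6627, ZooKeeper 2181, worker slots 6700-6703, DRPC invocations 3773), so check them against your own storm.yaml. A failed name lookup is reported the same way as a refused connection.

```python
import socket
import sys


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    DNS failures, refused connections and timeouts all raise OSError
    subclasses, so they are all reported as False.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def parse_endpoint(text):
    """Split a 'host:port' string into (host, port)."""
    host, sep, port = text.rpartition(":")
    if not sep or not host:
        raise ValueError("expected host:port, got %r" % text)
    return host, int(port)


def check_endpoints(endpoints, timeout=3.0):
    """Probe each (host, port) pair and return {(host, port): reachable}."""
    return {(host, port): can_connect(host, port, timeout)
            for host, port in endpoints}


if __name__ == "__main__":
    # Endpoints are given on the command line as host:port arguments.
    targets = [parse_endpoint(arg) for arg in sys.argv[1:]]
    for (host, port), ok in check_endpoints(targets).items():
        print("%s:%d %s" % (host, port, "reachable" if ok else "NOT reachable"))
```

Run it on every node with the endpoints that node has to reach, for example `python check_ports.py nimbus-host:6627 zk-host:2181 supervisor2:6700 drpc-host:3773` (the script name and hostnames are placeholders). The logs quoted below show the supervisor connecting to ZooKeeper at localhost/127.0.0.1:2181 and the worker getting "Connection refused" from the DRPC client, so those two endpoints are worth probing first.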


On 09/18/2015 09:51 PM, researcher cs wrote:
Does anyone have experience with this?

On Wed, Sep 16, 2015 at 7:00 AM, researcher cs <[email protected]> wrote:

    Thanks for your advice and help. I found this in the workers' log file:

    [ERROR] Async loop died!
    java.lang.RuntimeException: org.apache.thrift7.transport.TTransportException: java.net.ConnectException: Connection refused
        at backtype.storm.drpc.DRPCInvocationsClient.<init>(DRPCInvocationsClient.java:23)
        at backtype.storm.drpc.DRPCSpout.open(DRPCSpout.java:69)
        at storm.trident.spout.RichSpoutBatchTriggerer.open(RichSpoutBatchTriggerer.java:41)
        at backtype.storm.daemon.executor$fn__3985$fn__3997.invoke(executor.clj:460)
        at backtype.storm.util$async_loop$fn__465.invoke(util.clj:375)
        at clojure.lang.AFn.run(AFn.java:24)
        at java.lang.Thread.run(Unknown Source)
    Caused by: org.apache.thrift7.transport.TTransportException: java.net.ConnectException: Connection refused

    Supervisor log file:

    supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:00:54 supervisor [ERROR] Error when processing event
    org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss
        at com.netflix.curator.ConnectionState.getZooKeeper(ConnectionState.java:72)
        at com.netflix.curator.CuratorZookeeperClient.getZooKeeper(CuratorZookeeperClient.java:74)
        at com.netflix.curator.framework.imps.CuratorFrameworkImpl.getZooKeeper(CuratorFrameworkImpl.java:353)
        at com.netflix.curator.framework.imps.ExistsBuilderImpl$2.call(ExistsBuilderImpl.java:149)
        at com.netflix.curator.framework.imps.ExistsBuilderImpl$2.call(ExistsBuilderImpl.java:138)
        at com.netflix.curator.RetryLoop.callWithRetry(RetryLoop.java:85)
        at com.netflix.curator.framework.imps.ExistsBuilderImpl.pathInForeground(ExistsBuilderImpl.java:134)
        at com.netflix.curator.framework.imps.ExistsBuilderImpl.forPath(ExistsBuilderImpl.java:125)
        at com.netflix.curator.framework.imps.ExistsBuilderImpl.forPath(ExistsBuilderImpl.java:34)
        at backtype.storm.zookeeper$exists_node_QMARK_.invoke(zookeeper.clj:78)
        at backtype.storm.zookeeper$mkdirs.invoke(zookeeper.clj:88)
        at backtype.storm.cluster$mk_distributed_cluster_state$reify__1996.set_ephemeral_node(cluster.clj:54)
        at backtype.storm.cluster$mk_storm_cluster_state$reify__2415.supervisor_heartbeat_BANG_(cluster.clj:300)
        at sun.reflect.GeneratedMethodAccessor13.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
        at java.lang.reflect.Method.invoke(Unknown Source)
        at clojure.lang.Reflector.invokeMatchingMethod(Reflector.java:93)
        at clojure.lang.Reflector.invokeInstanceMethod(Reflector.java:28)
        at backtype.storm.daemon.supervisor$fn__4757$exec_fn__1228__auto____4758$heartbeat_fn__4760.invoke(supervisor.clj:345)
        at backtype.storm.timer$schedule_recurring$this__1776.invoke(timer.clj:69)
        at backtype.storm.timer$mk_timer$fn__1759$fn__1760.invoke(timer.clj:33)
        at backtype.storm.timer$mk_timer$fn__1759.invoke(timer.clj:26)
        at clojure.lang.AFn.run(AFn.java:24)
        at java.lang.Thread.run(Unknown Source)
    2015-09-15 02:00:54 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:00:55 ClientCnxn [INFO] Client session timed out, have not heard from server in 20020ms for sessionid 0x14fce3996380015, closing socket connection and attempting reconnect
    2015-09-15 02:00:58 ClientCnxn [INFO] Opening socket connection to server localhost/127.0.0.1:2181
    2015-09-15 02:00:58 ClientCnxn [INFO] Socket connection established to localhost/127.0.0.1:2181, initiating session
    2015-09-15 02:00:59 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:01:01 supervisor [INFO] ff6460a5-aafb-44a4-a49c-2de945ffd572 still hasn't started
    2015-09-15 02:00:59 util [INFO] Halting process: ("Error when processing an event")



    On Tue, Sep 15, 2015 at 8:14 PM, Javier Gonzalez
    <[email protected]> wrote:

        They will be in $STORM_HOME/logs on each node.

        I would suggest reading the Storm documentation thoroughly so
        you can familiarize yourself with the concepts and
        terminology. That way it will be easier for you to ask for
        help and for us to help you.

        Regards,
        Javier

        On Sep 15, 2015 2:25 AM, "researcher cs"
        <[email protected]> wrote:

            Sorry for the question, I'm a beginner. How can I check my
            supervisor and worker logs?

            On Tue, Sep 15, 2015 at 8:11 AM, Javier Gonzalez
            <[email protected]> wrote:

                Check your supervisor and worker logs.

                On Mon, Sep 14, 2015 at 8:14 PM, researcher cs
                <[email protected]> wrote:

                    I'm new to Storm and am trying to submit a topology,
                    and I found this in the supervisor.

                    Can anyone help?




-- Javier González Nicolini




