Apache Storm Worker Process dies -
Apache Storm Worker Process dies -
i have installed storm-0.9.2 in 5-node cluster. have simple topology 1 spout , varying number of bolts (4, 9, 22, 31). each configuration have configured (#bolts + 1) workers. 4 bolts, have 5 workers, 22 bolts 23 workers, etc. have observed failed worker processes in worker log files corresponding endofstream exception in zookeeper.out log file. when clean test run number of tuples processed each bolt evenly distributed on each worker. on non-clean test run, workers failed effort reconnect, since number of tuples finite there no more tuples process. possible causes worker process die?
excerpt zookeeper.out log file: 2014-10-27 17:40:33,198 [myid:] - warn [nioservercxn.factory:0.0.0.0/0.0.0.0:2181:nioservercnxn@357] - caught end of stream exception endofstreamexception: unable read additional info client sessionid 0x1495431347c001e, client has closed socket @ org.apache.zookeeper.server.nioservercnxn.doio(nioservercnxn.java:228) @ org.apache.zookeeper.server.nioservercnxnfactory.run(nioservercnxnfactory.java:208) @ java.lang.thread.run(thread.java:744) 2014-10-27 17:40:33,201 [myid:] - info [nioservercxn.factory:0.0.0.0/0.0.0.0:2181:nioservercnxn@1007] - closed socket connection client /192.168.0.1:45693 had sessionid 0x1495431347c001e
cluster environment: storm 0.9.2 zookeeper 3.4.6 ubuntu 13.10
thanks!
to me, looks problem zookeeper. there couple of ideas:
your zookeeper timeout configuration small. your zookeeper instance doesn't has plenty children (slaves) handle workload.for diagnosing, start increasing default time out zookeeper instance. if not working, seek expand zookeeper cluster.
you can consolidate zookeeper documentation. please, allow know if solves problem.
apache-storm apache-zookeeper
Comments
Post a Comment