Bob, Currently, the metadata request needs to do at least one ZK read per partition. So the more topics/partitions you have, the longer the request takes. So, you need to increase the request timeout. Try something like 60 * 1000 ms.
Thanks, Jun On Thu, Mar 21, 2013 at 12:46 PM, Bob Jervis <bjer...@gmail.com> wrote: > We are seeing horrible problems. We cannot move data through our 0.8 > borker because we are getting socket timeout exceptions and I cannot figure > out what settings should be. The fetch metadata stuff is throwing these > exceptions and no matter how I tweak the timeouts, I still get horrible > timeouts and no progress on moving data. > > On test environments where there are only 12 topics there are no problems. > > When the number of topics goes to ~75, then we can't move anything because > the fetch metadata requests time out. > > What can we do to fix this????????? > > I am desperate. >