Thanks for the pointer Tim.

I will try to run jemalloc profiling and post back on the Github issue.

- Dheeraj

On Mon, Dec 13, 2021 at 11:18 PM Tim Wojtulewicz <t...@corelight.com> wrote:

> We also have another report of the same in
> https://github.com/zeek/zeek/issues/1856. Is it possible for you to
> rebuild with jemalloc support and run the jemalloc profiling plugin on your
> logger node? That should give more information about what’s causing the
> bloat. We can use that issue to discuss more in depth what’s going on with
> it, if that’s easier than email.
>
> Tim
>
> On Dec 12, 2021, at 11:15 PM, Dheeraj Gupta <dheeraj.gup...@gmail.com>
> wrote:
>
> Hi,
>
> We have a Zeek node that sees high volumes on working days. Due to our
> internal network configuration a lot of connections for our internal DNS
> servers are generated by certain endpoints (because our DNS does not
> resolve any external domains and certain applications keep repeating the
> DNS requests at astronomical rates). The node is a 16 core, 128GB VM and we
> use ASCII logger.
>
> We have observed that under high loads (~40k writes/s), the logger process
> starts lagging behind and its memory usage goes up. Once the machine is
> using >60% of its memory, Zeek starts dropping packets and a general drop
> in performance is observed. Only solution is to restart the zeek process.
>
> My understanding is that logger is buffering the unwritten lines in memory
> and so memory usage is going up.
>
> To work around this, I split the output files so that all connections to
> the DNS server and all DNS requests to high velocity domains are logged to
> separate files (conn-noise.log and dns-noise.log). These two files consume
> nearly 80% of the disk usage under the current directory (E.g. in 30
> minutes the current directory use is 4.9G out of which these two files use
> 4.0G). Doing this, I hoped that any lags would be limited to these two
> files and I will lose less data on a restart. Also by using separate
> threads for heavily written files, I may be able to get better performance.
> The idea has worked partially as lags for other files are generally low now
> although we do need to restart zeek if memory usage goes beyond 55%.
>
> The problem is that I have observed that logger memory usage does not
> decrease on its own when the loads reduce (e.g. at night). E.g. If Zeek was
> using 40G memory on Friday evening and dns-noise was showing a lag of 1800
> seconds, the memory usage on Monday morning is still 40G although the lag
> is only around 1 second. Has anyone experienced anything similar? I am
> running Zeek-4.1.1.
>
> Thanks,
> Dheeraj
>
> --
> zeek mailing list -- z...@lists.zeek.org
> To unsubscribe send an email to zeek-le...@lists.zeek.org
>
>
>
_______________________________________________
zeek-dev mailing list -- zeek-dev@lists.zeek.org
To unsubscribe send an email to zeek-dev-le...@lists.zeek.org

Reply via email to