Emanuele,
Here is ntopng.conf
-G=/var/run/ntopng.pid
-i=enp2s0
-m=10.12.17.0/24 <http://10.12.17.0/24>
-S=local
I do see unidirectional flows in flows_stats.lua for DNS.
Incidentally, I do also see alerts w/ non-zero replies
(though most alerts are 0):
Host pihole has sent 211 DNS requests but received 7 DNS
replies
I tried 2 different 30 minute PCAP files. In both cases,
right at the 10 minute mark, I got alerts. How can I get
these PCAP files to you?
Thanks,
Aaron
On Tue, May 12, 2020 at 4:13 AM Emanuele Faranda
<[email protected] <mailto:[email protected]>> wrote:
Hi Aaron,
Please see below.
On 5/11/20 9:29 PM, Aaron Scamehorn wrote:
Hi Emanuele,
Thank you again for the detailed responses.
From the interfaces page, I see these stats:
Total Traffic 91.6 GB [103,062,265 Pkts] Dropped
Packets 0 Pkts
I don't see any dropped packets on the NIC either:
ethtool -S enp2s0
NIC statistics:
tx_packets: 0
rx_packets: 106581943
tx_errors: 0
rx_errors: 0
rx_missed: 0
align_errors: 0
tx_single_collisions: 0
tx_multi_collisions: 0
unicast: 105432876
broadcast: 350738
multicast: 1149060
tx_aborted: 0
tx_underrun: 0
As of right now, 2 of the hosts we are discussing are
still in alert, at the original Date/Time of 07:25:01,
and Duration is now "3 Days, 08:06:59".
Given that my replies vs requests ratio is still
configured at 50%, this means that, at every 5 minute
interval for the last 3 Days, 8 hours, said host is
receiving < 50% DNS replies, correct? I find this
difficult to believe, and cannot find ANY missing
packets in my pcap file.
I have captured a 30 minute pcap file captured with
this command:
tcpdump -i enp2s0 -G 1800 -w /tmp/enp2s0.%FT%T.pcap
host edgemax and port 53
This file contains DNS traffic to/from edgemax only.
I can count responses like this:
tshark -t a -r enp2s0.2020-05-11T13:00:02.pcap | grep
-c "Standard query response"
349
And queries like this:
tshark -t a -r enp2s0.2020-05-11T13:00:02.pcap | grep
-c "Standard query 0x"
349
In other words, no missing DNS responses in the 30
minutes spanning 13:00:02 to 13:29:51.
I would think that the alert should "clear" because
the threshold is not exceeded within that 30 minute
pcap file.
In any case, at 13:23, I manually click on the
"Release" button for that alert. 2 minutes later, at
13:25:00, I receive this alert:
Host edgemax has received 62 DNS requests but sent 0
DNS replies [5 Minutes ratio: 0%]
As stated previously, no missing DNS responses in the
30 minutes spanning 13:00:02 to 13:29:51. Why does
ntopng think 62 replies are missing?
Please report your ntopng.conf. If you look at the
active ntopng DNS flows, can you identify
unidirectional flows? You can also try to run ntopng on
the PCAP file (--original-speed -i file.pcap). If you
can reproduce using the PCAP file, please send it to me
privately so that I can troubleshoot the problem.
I exported 10 minutes of PCAP from if_stats.lua.
Using the filter "(ip.dst_host == "10.12.17.1" or
ip.src_host == "10.12.17.1") and dns" I am not able to
find any missing DNS responses in wireshark.
Interestingly, If I specify a BPF Filter ("port 53"),
the downloaded PCAP file seems to only have 1 side
(ie. edgemax is only a source, never a dest. Without
a BPF Filter, the download is fine.
This is probably a bug, please open an issue at
https://github.com/ntop/ntopng .
Regards,
Emanuele
On Mon, May 11, 2020 at 8:59 AM Emanuele Faranda
<[email protected] <mailto:[email protected]>> wrote:
Hi Aaron,
Please see below:
On 5/8/20 10:27 PM, Aaron Scamehorn wrote:
Thank you for your response. In the screenshot
below, can you please explain the significance of
the "Date/Time" and the "Duration" columns? What
do they mean in this context?
Date/Time: the time when the alert was triggered.
Ntopng performs periodic checks in order to
trigger alerts. In this particular case, the check
on the requests/reply ratio is performed every 5
minutes. So this means that problem started
between 07:20 and 07:25 .
Duration: the total time in which the problem was
active. Again, the check is performed every 5
minutes for this alert so 5 minutes is the
granularity.
Do I understand correctly that all 3 hosts
triggered the alert at 07:25:01 (OR 07:30:01)
this morning? And that all three alerts are
active for the past 07:28:53 hours? Does this
mean that there have been no new additional DNS
Reply/Request issues have been detected?
As explained above, the problem started between
07:20 and 07:25 . For 07:28:53 hours the problem
was active on all the three hosts (the
requests/reply ratio threshold was exceeded for
07:28:53 hours).
I notice in "Past Alerts" tab, that there are
many Reply/Request Alerts for the same host with
very short durations (screen shot #2). When/how
does an alert move from the "Engaged" to "Past" tab?
In this case, the engaged alert becomes "past"
alert when, after the check performed every 5
minutes, the requests/reply ratio threshold is not
exceed anymore. This can happen as soon as the
next check is performed (5 minutes).
So in the 2nd screenshot, fire-TV had an alert at
06:20:00 for 05:00 minutes where 18 requests
received 0 replies. Then another alert at
06:50:00 for 05:00 minutes. Were the 18 replies
from the first alert ultimately received? And
they were received 5 minutes the alert occurred?
The check is performed on the DNS packet counters.
A DNS request cannot take 5 minutes to be replied.
The fact that the alert was closed after 5/10
minutes could be related to one of these events:
- The host went idle
- The host did not send enough DNS requests
- The new DNS requests made by the host were
successfully replied.
Context here is that 99% of the traffic is
Internet traffic. Almost all of the pihole
traffic is to forwarders. BTW, the way pihole
works (by default) is it replies 0.0.0.0 for
blocked hosts. It should respond to every query.
I tried the live_pcap_download.html
<https://www.ntop.org/guides/ntopng/advanced_features/live_pcap_download.html>
lua, but couldn't figure out the bpf_filter:
curl --cookie "user=admin; password=xxxxx"
"http://10.12.17.25:3000/lua/live_traffic.lua?ifid=0&duration=600&bpf_filter=\
<http://10.12.17.25:3000/lua/live_traffic.lua?ifid=0&duration=600&bpf_filter=%5C>"port
53\""
I also tried the download pcap on the
if_stats.lua page. The downloaded pcap file
seems to only contain incoming data (see wireshark)?
This is consistent with the above alerts, please
ensure that ntopng is not dropping packets as this
would explain this behavior.
If I just do a tshark on the same interface that
ntopng is listening on, I see all of the expected
DNS query & replies. I am not able to correlate
the alerts to any missing packets.
See response above.
Regards,
Emanuele
On Fri, May 8, 2020 at 2:53 AM Emanuele Faranda
<[email protected] <mailto:[email protected]>> wrote:
Hi Aaron,
The alerts that you are reporting basically
tell you that such hosts receive DNS requests
but do not send a reply. In order to
troubleshoot possible problems you should
augment such information with the knowledge
of your network.
The first question to answer is, are that
hosts expected to accept DNS requests? If
not, are the requests generated from the
internet or from the LAN? In the first case a
firewall to block such DNS requests may be a
good idea . In the latter case some hosts in
the LAN may be misconfigured. In case of the
pihole hosts, I expect pihole to block some
DNS requests for advertisement sites so this
could be a normal behaviour. The following
ntopng features may also help you:
https://www.ntop.org/guides/ntopng/advanced_features/live_pcap_download.html
https://www.ntop.org/guides/ntopng/using_with_other_tools/n2disk.html
https://www.ntop.org/guides/ntopng/historical_flows.html
Regards,
Emanuele
On 5/7/20 5:57 PM, Aaron Scamehorn wrote:
Hello,
I'm trying to understand how/why I am
getting the "Replies / Requests Ratio"
warnings for DNS.
I am suspect of these alerts, and would like
to know how/why they are being generated. I
am suspect for for the following reasons:
1) If it really is as bad as indicated, I
should notice problems. 2) the "events'
occur immediately after I clear the alerts,
and tend to persist for hours.
In any case, I cleared the alerts last
night, and this is what they look like:
06/05/2020 22:15:00 12:31:28 Warning
Replies / Requests Ratio Host
edgemax.example.net
<http://xps-630i.scamlan.net:3000/lua/host_details.lua?ifid=2&host=10.12.17.1@1&page=historical&epoch_begin=1588864588&epoch_end=1588868188>
has received 54 DNS requests but sent 0 DNS
replies [5 Minutes ratio: 0%]
06/05/2020 22:15:00 12:31:28 Warning
Replies / Requests Ratio Host
pihole.example.net
<http://xps-630i.scamlan.net:3000/lua/host_details.lua?ifid=2&host=10.12.17.3@1&page=historical&epoch_begin=1588864588&epoch_end=1588868188>
has sent 93 DNS requests but received 3 DNS
replies [5 Minutes ratio: 3.2%]
06/05/2020 22:15:00 12:31:28 Warning
Replies / Requests Ratio Host
pihole-2.example.net
<http://xps-630i.scamlan.net:3000/lua/host_details.lua?ifid=2&host=10.12.17.4@1&page=historical&epoch_begin=1588864588&epoch_end=1588868188>
has sent 97 DNS requests but received 1 DNS
reply [5 Minutes ratio: 1.0%]
_______________________________________________
Ntop mailing list
[email protected]
<mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop
_______________________________________________
Ntop mailing list
[email protected]
<mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop
_______________________________________________
Ntop mailing list
[email protected] <mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop
_______________________________________________
Ntop mailing list
[email protected]
<mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop
_______________________________________________
Ntop mailing list
[email protected] <mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop
_______________________________________________
Ntop mailing list
[email protected]
<mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop
_______________________________________________
Ntop mailing list
[email protected] <mailto:[email protected]>
http://listgateway.unipi.it/mailman/listinfo/ntop