Denis,

I don't sure that I understand what is expected behavior should be.
There are local and aggregated cluster wide metrics. I don't know
which one used by Visor because I never used it :)

Also it would be great to know what version of Apache Ignite used in
described case. I remember some bug with metrics aggregation during
discovery metrics message round trip.

On Wed, Mar 11, 2020 at 12:05 AM Denis Magda <dma...@apache.org> wrote:
>
> @Nikolay Izhikov <nizhi...@apache.org>, @Andrey Gura <ag...@apache.org>,
> could you folks check out this thread?
>
> I have a feeling that what Dominik is describing was talked out before and
> rather some sort of a limitation than an issue with the current
> implementation.
>
> -
> Denis
>
>
> On Tue, Mar 3, 2020 at 11:41 PM Dominik Przybysz <alien11...@gmail.com>
> wrote:
>
> > Hi,
> > I am trying to use partitioned cache on server nodes to which I connect
> > with client node. Statistics of cache in the cluster are updated, but only
> > for hits metric - misses metric is always 0.
> >
> > To reproduce this problem I created cluster of two nodes:
> >
> > Server node 1 adds 100 random test cases and prints cache statistics
> > continuously:
> >
> > public class IgniteClusterNode1 {
> >     public static void main(String[] args) throws InterruptedException {
> >         IgniteConfiguration igniteConfiguration = new
> > IgniteConfiguration();
> >
> >         CacheConfiguration cacheConfiguration = new CacheConfiguration();
> >         cacheConfiguration.setName("test");
> >         cacheConfiguration.setCacheMode(CacheMode.PARTITIONED);
> >         cacheConfiguration.setAtomicityMode(CacheAtomicityMode.ATOMIC);
> >         cacheConfiguration.setStatisticsEnabled(true);
> >         igniteConfiguration.setCacheConfiguration(cacheConfiguration);
> >
> >         TcpCommunicationSpi communicationSpi = new TcpCommunicationSpi();
> >         communicationSpi.setLocalPort(47500);
> >         igniteConfiguration.setCommunicationSpi(communicationSpi);
> >
> >         TcpDiscoverySpi discoverySpi = new TcpDiscoverySpi();
> >         discoverySpi.setLocalPort(47100);
> >         discoverySpi.setLocalPortRange(100);
> >         TcpDiscoveryVmIpFinder ipFinder = new TcpDiscoveryVmIpFinder();
> >         ipFinder.setAddresses(Arrays.asList("127.0.0.1:47100..47200",
> > "127.0.0.1:48100..48200"));
> >         igniteConfiguration.setDiscoverySpi(discoverySpi);
> >
> >         try (Ignite ignite = Ignition.start(igniteConfiguration)) {
> >             try (IgniteCache<String, String> cache =
> > ignite.getOrCreateCache("test")) {
> >                 new Random().ints(1000).map(i -> Math.abs(i %
> > 1000)).distinct().limit(100).forEach(i -> {
> >                             String key = "data_" + i;
> >                             String value = UUID.randomUUID().toString();
> >                             cache.put(key, value);
> >                         }
> >                 );
> >             }
> >             while (true) {
> >                 System.out.println(ignite.cache("test").metrics());
> >                 Thread.sleep(5000);
> >             }
> >         }
> >     }
> > }
> >
> > Server node 2 only prints cache statistics continuously:
> >
> > public class IgniteClusterNode2 {
> >     public static void main(String[] args) throws InterruptedException {
> >         IgniteConfiguration igniteConfiguration = new
> > IgniteConfiguration();
> >
> >         CacheConfiguration cacheConfiguration = new CacheConfiguration();
> >         cacheConfiguration.setName("test");
> >         cacheConfiguration.setCacheMode(CacheMode.PARTITIONED);
> >         cacheConfiguration.setStatisticsEnabled(true);
> >         igniteConfiguration.setCacheConfiguration(cacheConfiguration);
> >
> >         TcpCommunicationSpi communicationSpi = new TcpCommunicationSpi();
> >         communicationSpi.setLocalPort(48500);
> >         igniteConfiguration.setCommunicationSpi(communicationSpi);
> >
> >         TcpDiscoverySpi discoverySpi = new TcpDiscoverySpi();
> >         discoverySpi.setLocalPort(48100);
> >         discoverySpi.setLocalPortRange(100);
> >         TcpDiscoveryVmIpFinder ipFinder = new TcpDiscoveryVmIpFinder();
> >         ipFinder.setAddresses(Arrays.asList("127.0.0.1:47100..47200",
> > "127.0.0.1:48100..48200"));
> >         igniteConfiguration.setDiscoverySpi(discoverySpi);
> >
> >         try (Ignite ignite = Ignition.start(igniteConfiguration)) {
> >             while (true) {
> >                 System.out.println(ignite.cache("test").metrics());
> >                 Thread.sleep(5000);
> >             }
> >         }
> >     }
> > }
> >
> > Next I start a client node which continuously read data from the cluster:
> >
> > public class CacheClusterReader {
> >     public static void main(String[] args) throws InterruptedException {
> >         IgniteConfiguration cfg = new IgniteConfiguration();
> >         cfg.setClientMode(true);
> >
> >         TcpDiscoverySpi spi = new TcpDiscoverySpi();
> >         TcpDiscoveryVmIpFinder tcMp = new TcpDiscoveryVmIpFinder();
> >         tcMp.setAddresses(Arrays.asList("127.0.0.1:47100..47200",
> > "127.0.0.1:48100..48200"));
> >         spi.setIpFinder(tcMp);
> >         cfg.setDiscoverySpi(spi);
> >
> >         CacheConfiguration<String, String> cacheConfig = new
> > CacheConfiguration<>("test");
> >         cacheConfig.setStatisticsEnabled(true);
> >         cacheConfig.setCacheMode(CacheMode.PARTITIONED);
> >         cfg.setCacheConfiguration(cacheConfig);
> >
> >         try (Ignite ignite = Ignition.start(cfg)) {
> >             System.out.println(ignite.cacheNames());
> >
> >             while (true) {
> >                 try (IgniteCache<String, String> cache =
> > ignite.getOrCreateCache(cacheConfig)) {
> >                     cache.enableStatistics(true);
> >
> >                     new Random().ints(1000).map(i -> Math.abs(i %
> > 2000)).distinct().limit(100).forEach(i -> {
> >                                 String key = "data_" + i;
> >                                 String value = cache.get(key);
> >                                 System.out.println("Found " + key + " with
> > value " + value);
> >                             }
> >                     );
> >
> >                     System.out.println(cache.metrics());
> >                     System.out.println(cache.localMetrics());
> >                 }
> >                 Thread.sleep(3000);
> >             }
> >         }
> >     }
> > }
> >
> > On server and client nodes cluster statistics are synchronized, but misses
> > counter is always equal to 0, e. g.
> >
> > CacheMetricsSnapshot [reads=93, puts=100, entryProcessorPuts=0,
> > entryProcessorReadOnlyInvocations=0,
> > entryProcessorAverageInvocationTime=0.0, entryProcessorInvocations=0,
> > entryProcessorRemovals=0, entryProcessorMisses=0, entryProcessorHits=0,
> > entryProcessorMissPercentage=0.0, entryProcessorHitPercentage=0.0,
> > entryProcessorMaxInvocationTime=0.0, entryProcessorMinInvocationTime=0.0, 
> > *hits=93,
> > misses=0*, txCommits=0, txRollbacks=0, evicts=0, removes=0,
> > putAvgTimeNanos=259.4794, getAvgTimeNanos=0.0, rmvAvgTimeNanos=0.0,
> > commitAvgTimeNanos=0.0, rollbackAvgTimeNanos=0.0, cacheName=test,
> > offHeapGets=0, offHeapPuts=0, offHeapRemoves=0, offHeapEvicts=0,
> > offHeapHits=0, offHeapMisses=0, offHeapEntriesCnt=100, heapEntriesCnt=0,
> > offHeapPrimaryEntriesCnt=100, offHeapBackupEntriesCnt=0,
> > offHeapAllocatedSize=0, size=56, cacheSize=56, keySize=56, isEmpty=false,
> > dhtEvictQueueCurrSize=-1, txThreadMapSize=0, txXidMapSize=0,
> > txCommitQueueSize=0, txPrepareQueueSize=0, txStartVerCountsSize=0,
> > txCommittedVersionsSize=0, txRolledbackVersionsSize=0,
> > txDhtThreadMapSize=0, txDhtXidMapSize=-1, txDhtCommitQueueSize=0,
> > txDhtPrepareQueueSize=0, txDhtStartVerCountsSize=0,
> > txDhtCommittedVersionsSize=-1, txDhtRolledbackVersionsSize=-1,
> > isWriteBehindEnabled=false, writeBehindFlushSize=-1,
> > writeBehindFlushThreadCnt=-1, writeBehindFlushFreq=-1,
> > writeBehindStoreBatchSize=-1, writeBehindTotalCriticalOverflowCnt=-1,
> > writeBehindCriticalOverflowCnt=-1, writeBehindErrorRetryCnt=-1,
> > writeBehindBufSize=-1, totalPartitionsCnt=1024, rebalancingPartitionsCnt=0,
> > rebalancedKeys=44, estimatedRebalancingKeys=44, keysToRebalanceLeft=0,
> > rebalancingKeysRate=0, rebalancingBytesRate=0, rebalanceStartTime=0,
> > rebalanceFinishTime=0, rebalanceClearingPartitionsLeft=0,
> > keyType=java.lang.Object, valType=java.lang.Object, isStoreByVal=true,
> > isStatisticsEnabled=true, isManagementEnabled=false, isReadThrough=false,
> > isWriteThrough=false, isValidForReading=true, isValidForWriting=true]
> >
> > In comparision, I also have also a thin client which connects to server
> > and read some random data once:
> >
> > public class CacheClusterThinClientReader {
> >     public static void main(String[] args) throws Exception {
> >         ClientConfiguration clientConfiguration = new ClientConfiguration()
> >                 .setAddresses("127.0.0.1:10800");
> >
> >         try (IgniteClient ignite =
> > Ignition.startClient(clientConfiguration)) {
> >
> >             ClientCache<String, String> cache =
> > ignite.getOrCreateCache("test");
> >
> >             Set<String> misses = new HashSet<>();
> >             new Random().ints(1000).map(i -> Math.abs(i %
> > 2000)).distinct().limit(100).forEach(i -> {
> >                         String key = "data_" + i;
> >                         String value = cache.get(key);
> >                         System.out.println("Found " + key + " with value "
> > + value);
> >                         if (value == null) {
> >                             misses.add(key);
> >                         }
> >                     }
> >             );
> >             System.out.println("Misses count: " + misses.size());
> >         }
> >     }
> > }
> >
> > and finally prints (in my try):
> >
> > Misses count: 97
> >
> > On the server nodes misses were counted, but only for one server node:
> >
> > CacheMetricsSnapshot [reads=51, puts=100, entryProcessorPuts=0,
> > entryProcessorReadOnlyInvocations=0,
> > entryProcessorAverageInvocationTime=0.0, entryProcessorInvocations=0,
> > entryProcessorRemovals=0, entryProcessorMisses=0, entryProcessorHits=0,
> > entryProcessorMissPercentage=0.0, entryProcessorHitPercentage=0.0,
> > entryProcessorMaxInvocationTime=0.0, entryProcessorMinInvocationTime=0.0, 
> > *hits=3,
> > misses=48*, txCommits=0, txRollbacks=0, evicts=0, removes=0,
> > putAvgTimeNanos=276.13132, getAvgTimeNanos=850.90735, rmvAvgTimeNanos=0.0,
> > commitAvgTimeNanos=0.0, rollbackAvgTimeNanos=0.0, cacheName=test,
> > offHeapGets=0, offHeapPuts=0, offHeapRemoves=0, offHeapEvicts=0,
> > offHeapHits=0, offHeapMisses=0, offHeapEntriesCnt=100, heapEntriesCnt=0,
> > offHeapPrimaryEntriesCnt=100, offHeapBackupEntriesCnt=0,
> > offHeapAllocatedSize=0, size=53, cacheSize=53, keySize=53, isEmpty=false,
> > dhtEvictQueueCurrSize=-1, txThreadMapSize=0, txXidMapSize=0,
> > txCommitQueueSize=0, txPrepareQueueSize=0, txStartVerCountsSize=0,
> > txCommittedVersionsSize=0, txRolledbackVersionsSize=0,
> > txDhtThreadMapSize=0, txDhtXidMapSize=-1, txDhtCommitQueueSize=0,
> > txDhtPrepareQueueSize=0, txDhtStartVerCountsSize=0,
> > txDhtCommittedVersionsSize=-1, txDhtRolledbackVersionsSize=-1,
> > isWriteBehindEnabled=false, writeBehindFlushSize=-1,
> > writeBehindFlushThreadCnt=-1, writeBehindFlushFreq=-1,
> > writeBehindStoreBatchSize=-1, writeBehindTotalCriticalOverflowCnt=-1,
> > writeBehindCriticalOverflowCnt=-1, writeBehindErrorRetryCnt=-1,
> > writeBehindBufSize=-1, totalPartitionsCnt=1024, rebalancingPartitionsCnt=0,
> > rebalancedKeys=47, estimatedRebalancingKeys=47, keysToRebalanceLeft=0,
> > rebalancingKeysRate=0, rebalancingBytesRate=0, rebalanceStartTime=0,
> > rebalanceFinishTime=0, rebalanceClearingPartitionsLeft=0,
> > keyType=java.lang.Object, valType=java.lang.Object, isStoreByVal=true,
> > isStatisticsEnabled=true, isManagementEnabled=false, isReadThrough=false,
> > isWriteThrough=false, isValidForReading=true, isValidForWriting=true]
> >
> > Visor confirms my observation:
> >
> > Nodes for: test(@c0)
> >
> > +==================================================================================================================+
> > |      Node ID8(@), IP       | CPUs | Heap Used | CPU Load |   Up Time
> >  | Size (Primary / Backup)  | Hi/Mi/Rd/Wr |
> >
> > +==================================================================================================================+
> > | D09713CD(@n1), 10.10.10.14 | 8    | 2.81 %    | 0.53 %   | 00:15:00.291
> > | Total: 47 (47 / 0)       | Hi: 0       |
> > |                            |      |           |          |
> >  |   Heap: 0 (0 / <n/a>)    | Mi: 0       |
> > |                            |      |           |          |
> >  |   Off-Heap: 47 (47 / 0)  | Rd: 0       |
> > |                            |      |           |          |
> >  |   Off-Heap Memory: <n/a> | Wr: 0       |
> >
> > +----------------------------+------+-----------+----------+--------------+--------------------------+-------------+
> > | B8CCBCAB(@n0), 10.10.10.14 | 8    | 1.15 %    | 0.50 %   | 00:15:04.122
> > | Total: 53 (53 / 0)       | Hi: 3       |
> > |                            |      |           |          |
> >  |   Heap: 0 (0 / <n/a>)    | Mi: 48      |
> > |                            |      |           |          |
> >  |   Off-Heap: 53 (53 / 0)  | Rd: 51      |
> > |                            |      |           |          |
> >  |   Off-Heap Memory: <n/a> | Wr: 100     |
> >
> > +------------------------------------------------------------------------------------------------------------------+
> >
> > I have seen the same behaviour with rest api as with the thin client, but
> > I don't have prepared test case for this.
> >
> > Is there a good way to correctly configure collecting statistics on server
> > nodes?
> > Should I create an issue for this problem?
> >
> > --
> > Pozdrawiam / Regards,
> > Dominik Przybysz
> >

Reply via email to