[ https://issues.apache.org/jira/browse/SOLR-15859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17633390#comment-17633390 ]
Ben Manes commented on SOLR-15859: ---------------------------------- I believe many of Solr's usages switched to {{computeIfAbsent}} to avoid cache stampedes, but you only instrumented {{put}} for inserting into the {{hitCounter}} map. You are also never removing from the counters, so this could become a memory leak as evicted keys are retained. It isn't clear if you want the historic hit count of all keys or only the live working set. If you only want the live set, then I would use Map computations for writes so that the {{hitCounter}} map is always kept consistent with the cache (use {{Caffeine.evictionListener}} for removal in this case). Otherwise I would wrap the cached value with additional metadata to avoid the extra map. In either case, the global synchronized lock should not be used as that will destroy the performance by synchronizing all operations. > Add handler to dump filter cache > -------------------------------- > > Key: SOLR-15859 > URL: https://issues.apache.org/jira/browse/SOLR-15859 > Project: Solr > Issue Type: Improvement > Reporter: Andy Lester > Assignee: Shawn Heisey > Priority: Major > Labels: FQ, cache, filtercache, metrics > Attachments: cacheinfo.patch > > > It would be very helpful to be able to inspect the contents of the > filterCache. > I'd like to be able to query something like > {{/admin/caches?type=filter&nentries=1000&sort=numHits+DESC}} > nentries would be allowed to be -1 to get everything. > It would be nice to see these data items for each entry. I don't know which > are available, but I'm thinking blue sky here: > * cache key, exactly as stored > * Timestamp when the entry was inserted > * Whether the insertion of the entry evicted another entry, and if so which > one > * Timestamp of when this entry was last hit > * Number of hits on this entry forever > * Number of hits on this entry over some time period > * Number of documents matched by the filter > * Number of bytes of memory used by the filter > These are the sorts of questions I'd like to be able answer: > * "I just did a query that I expect will have added a cache entry. Did it?" > * "Are my queries hitting existing cache entries?" > * "How big should I set my filterCache size? Should I limit it by number of > entries or RAM usage?" > * "Which of my FQs are getting used the most? These are the ones I want in > my firstSearcher queries." (I currently determine this by processing my old > solr logs) > * "Which filters give me the most bang for the buck in terms of RAM usage?" > * "I have filter X and filter Y, but would it be beneficial if I made a > filter X AND Y?" > * "Which FQs are used more at certain times of the day? (Assuming I take > regular snapshots throughout the day)" > I imagine a response might look like: > {{{}} > {{ "responseHeader": {}} > {{ "status": 0,}} > {{ "QTime": 961}} > {{ },}} > {{ "response": {}} > {{ "numFound": 12104,}} > {{ "filterCacheKeys": {}} > {{ [}} > {{ "language:eng": {}} > {{ "inserted": "2021-12-04T07:34:16Z",}} > {{ "lastHit": "2021-12-04T18:17:43Z",}} > {{ "numHits": 15065,}} > {{ "numHitsInPastHour": 2319,}} > {{ "evictedKey": "agelevel:4 shippable:Y",}} > {{ "numRecordsMatchedByFilter": 24328753,}} > {{ "bytesUsed": 3041094}} > {{ }}} > {{ ],}} > {{ [}} > {{ "is_set:N": {}} > {{ ...}} > {{ }}} > {{ ],}} > {{ [}} > {{ "language:spa": {}} > {{ ...}} > {{ }}} > {{ ]}} > {{ }}} > {{}}} -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org For additional commands, e-mail: issues-h...@solr.apache.org