Re: Sequenced Collections

Stuart Marks Wed, 21 Sep 2022 17:39:15 -0700

Hi, yes, this is the right place to discuss Sequenced Collections. I'm glad you findit promising.

Note that Sequenced Collections actually has very little implementation in it, asidefrom various reversed views of things. The actual data is still stored in existingconcrete collections such as ArrayList, ArrayDeque, LinkedHashMap, and TreeMap.

I think Sequenced Collections has the right set of abstractions in it as it stands,and I don't want to expand its scope by talking about additional concepts like sizelimits or eviction policy.

However, those things are quite reasonable to discuss independently of the currentSequenced Collections proposal. Having a maximum size on a collection seemsindependent of sequencing. An eviction policy *might* be based on the sequence, butit might not; consider the various eviction policies available for a cache librarysuch as Caffeine [1].


[1] https://github.com/ben-manes/caffeine/wiki

However, I'm somewhat skeptical of trying to build things like eviction policiesdirectly into collections. It's tempting to add a simple thing like a size and justthrow away things in some well-defined order whenever the size is exceeded. Theproblem is that if this policy doesn't do *exactly* what you want to do, then you'reout of luck.

The current (pre Sequenced Collections) LinkedHashMap is a good example of this.It's suitable for a least-recently-inserted expiration policy; there's a methodremoveEldestEntry() that programs can use to implement a simple policy, size-basedor otherwise. (Unfortunately they have to subclass-and-override, but whatever.) Theproblem is that it allows removal of only one element -- the eldest (first) element.

If you want to change the policy of insertion order of an LHM, you have only onealternative: access order. Enabling this has some weird side effects though. Forexample, get() now rearranges the order of entries in the map, and is thus astructural modification -- which means that it spoils any iterators over the map'sviews.

These are both fairly common cases, which is probably why they were added. Butthey're not very flexible, and if you want to do something slightly different,you're on your own -- and it's pretty hard to implement your own policy, because LHMlacks a bunch of essential operations.

Where the Sequenced Collections proposal helps is that instead of adding morepolicies, it adds the missing primitive operations. You can add/get/remove at eitherend, and you can reposition mappings to either end. If you have some differentrecent-usage policy or some unusual cache eviction policy that I've never heard of,you can use the primitives to implement it yourself. That's much better than tryingto bake a few more specific cases into LinkedHashMap or other collections.

Is there a discussion about making the SynchronizedCollection family of classes public?

No. Synchronizing on every collection operation is the wrong level of abstraction.Typical collection usage involves too much external iteration and too muchcheck-then-act logic. Callers would have to wrap those in synchronized blocks, andin general they don't know when that's necessary. Certain transaction-styleoperations (like Map::computeIfAbsent) can be made to work, but those are all prettylow level.


s'marks



On 9/21/22 9:32 AM, Ernie Rael wrote:

 > I don't see why you think a general collection...
I thought the Subject would be sufficient to indicate that I was not talking aboutcollections in general. I should have been more precise with my words; guess I wasjust excited by a bi-directional ordered set.
The MRU _example_ is useful; the current collections handle it poorly and SequencedCollections is ideal. Caches with an eviction policy are common; I suspect cacheswill be a common use for SequencedSet family. Note there are fixed sized Collectionsand SequencedCollection borrows heavily from that family. Perhaps this issue shouldbe considered in the context of adding an **Eviction Policy** to appropriatecollections.
MRU is a Collection; for example, I pass an MRU to a persistence mechanism thattakes a collection. Saying "all methods offered by `Collection` should [not] even bepart of an `MRU` interface" is innacurate, especially when considered in the contextof a SequencedCollection.
-ernie
PS - Loosely related is extending a Collection and providing a synchronized version.Is there a discussion about making the SynchronizedCollection family of classes public?
On 9/21/22 4:22 AM, John Hendrikx wrote:
I don't see why you think a general collection, that is in 99.9% of the cases notused to implement an MRU, should burden every call to #add with a check to see ifit isn't exceeding its maximum size or to see if a maximum size has been set.
This is much better done by composition, as I don't think all methods offered by`Collection` should even be part of an `MRU` interface.
--John

On 20/09/2022 21:08, Ernie Rael wrote:
(There may be a better place to send this, let me know where)
Suggesting an option to limit the size of the collection, e.g "setMaxSize(int)",default of zero means no limit.
I put together "interface MRU<E> extends Collection<E>" some months ago, it hastwo implementations based on LinkedList and LinkedHashSet. The code can be seenathttps://sourceforge.net/p/jvi/raelity-lib/ci/default/tree/lib/src/main/java/com/raelity/lib/
A SequencedCollection, as outlined in the JEP draft 2020/09/01, would be almostperfect to implement MRU; I've run into most of the problems/issues discussed inthe JEP draft.
The MRU is a cache, as I use it; it typically has a max size for the collection.Handling this natively in the collection would be ideal; if an add operationwould overflow, remove the item at the other end. Note that addAll() is used wheninitializing from backing store.
FYI, I use a "Supplier<Integer>" to the constructor to provide maxSize, but aproperty makes much more sense. I'll make that change in MRU for sanity, and getrid of the trim() method. setMaxSize can do the trim.
-ernie

Re: Sequenced Collections

Reply via email to