Re: Time Routed Alias

Matt Kuiper Tue, 24 Aug 2021 07:17:25 -0700

Thanks Gus!  I appreciate this information.  It is very helpful.  From my
POC, I can see that TRAs are very powerful and very helpful.  I am excited
to build out a fuller implementation within our use case, which is right
in line with the boundaries you set for standard TRA use.


Matt

On Sat, Aug 21, 2021 at 1:16 PM Gus Heck <gus.h...@gmail.com> wrote:

> Hi Matt,
>
> TRA's were put into use almost immediately by at least one organization as
> soon as Dave and I implemented them, and the CRA and DRA's followed because
> the same organization wanted to further subdivide their data. They have
> been around for a while, and I'm currently helping a client move to use
> them, and past clients have adopted them. Always hard to know exactly how
> much any feature is used, but I have heard other mentions of folks using
> them, and not heard a failure story yet, so I think so long as your use
> case is a good fit for them (non-trivial amounts of data, never re-index
> the same doc with a different routing date, typically data flows in over
> time, optionally old data needs periodic removal, etc) then they are good.
> Of course every particular case is individual and there's always the chance
> that YOU are the lucky one who discovers something subtle, or find a scale
> at which things break down, but you aren't the first to use it, that's for
> sure :). They definitely were meant to make it easier to handle large
> amounts of temporal data.
>
> Also, it's open source so if something needs tweaking the process for that
> is open and well defined :) As with everything technical, test often and
> test well.
>
> -Gus
>
> On Fri, Aug 20, 2021 at 10:07 AM Matt Kuiper <kuipe...@gmail.com> wrote:
>
> > Sending this question out again to learn about how well Time Routed
> Aliases
> > have worked out for others.
> >
> > Would like to know if a number of others have used this approach
> > successfully as our team is planning for the use of TRAs in a very large
> > SolrCloud deployment.
> >
> > Thanks,
> > Matt
> >
> > On Fri, Aug 13, 2021, 2:56 PM Matt Kuiper <kuipe...@gmail.com> wrote:
> >
> > > Thanks David, this test link is helpful.
> > >
> > > @David @Gus - From your viewpoint do you see TRAs as an accepted/proven
> > > technique within SolrCloud?  My small POC works great.  Would like to
> > hear
> > > if others are using TRA in production deployments successfully at
> scale.
> > >
> > > Thanks,
> > > Matt
> > >
> > > On Wed, Aug 11, 2021 at 8:10 PM David Smiley <dsmi...@apache.org>
> wrote:
> > >
> > >> I hope you have success with TRAs!
> > >>
> > >> You can delete some number of collections from the rear of the chain,
> > but
> > >> you must first update the TRA to exclude these collections.  This is
> > >> tested:
> > >>
> > >>
> >
> https://github.com/apache/solr/blob/f6c4f8a755603c3049e48eaf9511041252f2dbad/solr/core/src/test/org/apache/solr/update/processor/TimeRoutedAliasUpdateProcessorTest.java#L184
> > >> It'd be nice if it would remove itself from the alias.
> > >>
> > >> ~ David Smiley
> > >> Apache Lucene/Solr Search Developer
> > >> http://www.linkedin.com/in/davidwsmiley
> > >>
> > >>
> > >> On Tue, Aug 10, 2021 at 9:26 PM Matt Kuiper <kuipe...@gmail.com>
> wrote:
> > >>
> > >> > I found some helpful information while testing TRAs:
> > >> >
> > >> > For our use-case I am hesitant to set up an autoDeleteAge (unless it
> > >> can be
> > >> > modified - still need to test).  So I wondered about a little more
> > >> manual
> > >> > delete management approach.
> > >> >
> > >> > I confirmed that I cannot simply delete a collection that is
> > registered
> > >> as
> > >> > part of a TRA.  The delete collection api call will fail with a
> > message
> > >> > that the collection is a part of the alias.
> > >> >
> > >> > I did learn that I could use the same create TRA api call I used to
> > >> create
> > >> > the TRA, but modify the router.start to a date more recent than one or
> > >> more
> > >> > of the older collections associated with the TRA. Then when I
> queried
> > >> the
> > >> > TRA, I only received documents from the collections after the new
> > >> > router.start date. Also, I was now able to successfully delete the
> > older
> > >> > collections with a standard collection delete command.
> > >> >
> > >> > I think this satisfies my initial use-case requirements to be able
> to
> > >> > modify an existing TRA and delete older collections.
> > >> >
> > >> > Matt
> > >> >
> > >> > On Mon, Aug 9, 2021 at 11:27 AM Matt Kuiper <kuipe...@gmail.com>
> > wrote:
> > >> >
> > >> > > Hi Gus, Jan,
> > >> > >
> > >> > > I am considering implementing TRA for a large-scale Solr
> deployment.
> > >> > Your
> > >> > > Q&A is helpful!
> > >> > >
> > >> > > I am curious if you have experience/ideas regarding modifying the TR
> > >> > > Alias when one desires to manually delete old collections or modify the
> > >> > > router.autoDeleteAge to shorten or extend the delete age.  Here are a
> > >> > > few specific questions:
> > >> > >
> > >> > > 1) Can you manually delete an old collection (via collection api)
> > and
> > >> > then
> > >> > > edit the start date (to a more recent date) of the TRA so that it
> no
> > >> > longer
> > >> > > sees/processes the deleted collection?
> > >> > > 2) Is the only way to manage the deletion of collections within a
> > TRA
> > >> > > using the automatic deletion configuration? The
> router.autoDeleteAge
> > >> > > parameter.
> > >> > > 3) If you can only manage deletes using the router.autoDeleteAge
> > >> > > parameter, are you able to update this parameter to either:
> > >> > >
> > >> > >    - Set the delete age earlier so that older collections are
> > >> triggered
> > >> > >    for automatic deletion sooner?
> > >> > >    - Set the delete age to a larger value to extend the life of a
> > >> > >    collection?  Say you originally  would like the collections to
> > stay
> > >> > around
> > >> > >    for 5 years, but then change your mind to 7 years.
> > >> > >
> > >> > > I will likely do some experimentation, but am interested to learn
> if
> > >> you
> > >> > > have covered these use-cases with TRA.
> > >> > >
> > >> > > Thanks,
> > >> > > Matt
> > >> > >
> > >> > >
> > >> > > On Fri, Aug 6, 2021 at 8:08 AM Gus Heck <gus.h...@gmail.com>
> wrote:
> > >> > >
> > >> > >> Hi Jan,
> > >> > >>
> > >> > >> The key thing to remember about TRA's (or any Routed Alias) is
> that
> > >> it
> > >> > >> only
> > >> > >> actively does two things:
> > >> > >> 1) Routes document updates to the correct collection by
> inspecting
> > >> the
> > >> > >> routed field in the document
> > >> > >> 2) Detects when a new collection is required and creates it.
> > >> > >>
> > >> > >> If you don't send it data *nothing* happens. The collections are
> > not
> > >> > >> created until data requires them (with an async create possible
> > when
> > >> it
> > >> > >> sees an update that has a timestamp "near" the next interval, see
> > >> docs
> > >> > for
> > >> > >> router.preemptiveCreateMath )
> > >> > >>
> > >> > >> A) Dave's half of our talk at Activate 2018 covers it:
> > >> > >> https://youtu.be/RB1-7Y5NQeI?t=839
> > >> > >> B) Time Routed Aliases are a means by which to automate creation
> of
> > >> > >> collections and route documents to the created collections.
> Sizing,
> > >> and
> > >> > >> performance of the individual collections is not otherwise
> special,
> > >> and
> > >> > >> you
> > >> > >> can interact with the collections individually after they are
> > >> created,
> > >> > >> with
> > >> > >> the obvious caveats that you probably don't want to be doing
> things
> > >> that
> > >> > >> get them out of sync schema wise unless your client programs know
> > >> how to
> > >> > >> handle documents of both types etc. A less obvious consequence of
> > the
> > >> > >> routing is that your data must not ever republish the same
> document
> > >> > with a
> > >> > >> different route key (date for TRA), since that can lead to
> > duplicate
> > >> > id's
> > >> > >> across collections. The "normal" use case is event data, things
> > that
> > >> > >> happened and are done, and are correctly recorded (or at least
> > their
> > >> > time
> > >> > >> is correctly recorded) the first time
> > >> > >> C) Configure the higher number of replicas, remove old ones
> > manually
> > >> if
> > >> > >> not
> > >> > >> needed. At query time it's "just an alias". Managing collections
> > >> based
> > >> > on
> > >> > >> recency could be automated here, before autoscaling was
> deprecated
> > I
> > >> was
> > >> > >> thinking that adding a couple of hooks into autoscaling such that
> > it
> > >> > could
> > >> > >> react to collection creation by a TRA specifically would get us
> to
> > a
> > >> > place
> > >> > >> much like Elastic's Hot/Warm architecture. I haven't kept track
> of
> > >> > what's
> > >> > >> being done to replace auto scaling however. I think Atri was
> > >> interested
> > >> > in
> > >> > >> that at one point as well.
> > >> > >> D) TRA's create collections under the hood with a CREATE command
> > just
> > >> > like
> > >> > >> you would manually (based on the config in the TRA). Anything in
> > Solr
> > >> > that
> > >> > >> would influence that placement should apply.
> > >> > >> E) See D above for fill rate. Utilizing new nodes over time should be as
> > >> > >> simple as adding new nodes and waiting for new collections to be created.
> > >> > >> One could also manually move replicas as with any other
> collection,
> > >> > >> (aside:
> > >> > >> be sure to refer to a current version of MOVEREPLICA docs, prior
> to
> > >> > >> something like 8.6 they were incomplete and even wrong in a few
> > >> places).
> > >> > >> F) If you are talking about router.autoDeleteAge here, old collection
> > >> > >> removal is a regular DELETE (just automatically issued). Not sure what
> > >> > >> you mean by rotation interval.
> > >> > >> G) They are just collections with special names that can be
> parsed
> > >> > during
> > >> > >> update to select a destination for the incoming document.
> > >> > >> H) They are just collections, and there's nothing to prevent you
> > from
> > >> > >> upgrading the schema, and new collections will begin using that,
> > >> > >> individual
> > >> > >> collections would need to be reloaded, non-safe schema changes
> (in
> > >> the
> > >> > >> usual sense) require a re-index as usual. In a cloud environment
> > >> where
> > >> > you
> > >> > >> can temporarily add machines or disk this is not so bad aside
> from
> > >> the
> > >> > >> time
> > >> > >> to re-index of course. If you are on-prem then plan to have a
> > >> > significant
> > >> > >> level of spare disk to handle this case without running yourself
> > into
> > >> > the
> > >> > >> danger zone for segment merging.
> > >> > >> H.2) TRA is just an alias with fancy collection creation (and
> > >> naming).
> > >> > >> Once
> > >> > >> the collections exist, it's just an alias. All the action (at
> this
> > >> > point)
> > >> > >> happens at update. So long as the collection is listed in the TRA
> > in
> > >> > >> zookeeper in aliases.json ***in the correct, (chronological,
> desc)
> > >> > >> order***
> > >> > >> and the naming of the collection can be parsed by the TRA code
> you
> > >> > should
> > >> > >> be fine. Incoming updates iterate down the list of collections during an
> > >> > >> update, and stop at the first one where the collection name matches the
> > >> > >> date in the routing field for the document. For a normal TRA the vast
> > >> > >> majority of updates hit one of the most recent two or three collections.
> > >> > >> Frequent updates to old data in a TRA with very many time slices (sub
> > >> > >> collections) might suffer some since this is a simple linear iteration;
> > >> > >> optimizing that was deferred until it seemed important to someone's less
> > >> > >> normal use case :).
> > >> > >>
> > >> > >>
> > >> > >>
> > >> > >> Otherwise it's just an alias of collections with funky looking
> > names
> > >> > >> (unless someone added something when I wasn't looking ;) ).
> > >> > >>
> > >> > >> -Gus
> > >> > >>
> > >> > >> On Fri, Aug 6, 2021 at 4:13 AM Jan Høydahl <
> jan....@cominvent.com>
> > >> > wrote:
> > >> > >>
> > >> > >> > Hi,
> > >> > >> >
> > >> > >> > I have never used TRA, but a client of mine is considering it.
> A
> > >> few
> > >> > >> > questions.
> > >> > >> >
> > >> > >> > A) Do you have links to talks (slides/video) on the feature? Or
> > >> blog
> > >> > >> posts
> > >> > >> > going into more detail than the RefGuide?
> > >> > >> > B) For ingestion performance, sharding may make sense. But only
> > for
> > >> > the
> > >> > >> > current collection. Has anyone tried merging "static" shards?
> > >> > >> > C) Is there a trick to have more replicas on recent collections
> > than
> > >> > old
> > >> > >> > ones?
> > >> > >> > D) Is there a way to manage which nodes get selected for
> new
> > >> > >> > collections, or you need to rely on replica placement policies?
> > >> > >> > E) How do you guys ensure you get a good fill-rate on the
> nodes,
> > >> and
> > >> > >> what
> > >> > >> > procedure do you use when adding more nodes in the cluster?
> > >> > >> >     * I.e. do you simply add a few new nodes and let Solr
> > >> > automatically
> > >> > >> > place new collections onto those?
> > >> > >> > F) How many sub-collections/cores do you plan for on a single
> > node?
> > >> > >> >     * You could try to configure the "rotation interval" such
> > that
> > >> a
> > >> > >> node
> > >> > >> > gets filled by a single core, but that seems hard to predict
> > >> > >> >     * Having a too rapid "rotation interval" will leave behind
> > too
> > >> > many
> > >> > >> > cores per node, causing inefficiencies?
> > >> > >> >     * Have you found a strategy to balance this? I'd likely try
> > to
> > >> > plan
> > >> > >> > for 10 cores per node, and monitor fill-rate such that I
> > (manually)
> > >> > add
> > >> > >> > more HW once a threshold is reached.
> > >> > >> > G) Has anyone tried backup of a TRA? Does it even work, or do
> > you
> > >> > need
> > >> > >> to
> > >> > >> > run the command for each single collection?
> > >> > >> > H) A typical requirement is to migrate all data from one
> cluster
> > >> to a
> > >> > >> new
> > >> > >> > cluster on a newer version or with a new schema. Have you tried
> > >> doing
> > >> > >> that
> > >> > >> > with a TRA?
> > >> > >> >     * Would you need to migrate each sub collection at a time?
> > >> > >> >     * Will TRA on the new cluster accept that someone
> "external"
> > >> adds
> > >> > >> > collections, and how it is initialized/bootstrapped to fill the
> > >> > internal
> > >> > >> > collection registry?
> > >> > >> >
> > >> > >> > That's what I could think of before trying the feature. I'm
> sure
> > >> there
> > >> > >> > would be other questions after some trial and error :)
> > >> > >> >
> > >> > >> > Jan
> > >> > >>
> > >> > >>
> > >> > >>
> > >> > >> --
> > >> > >> http://www.needhamsoftware.com (work)
> > >> > >> http://www.the111shift.com (play)
> > >> > >>
> > >> > >
> > >> >
> > >>
> > >
> >
>
>
> --
> http://www.needhamsoftware.com (work)
> http://www.the111shift.com (play)
>
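
A minimal sketch of the linear routing Gus describes in H.2, for anyone who wants to reason about it. This is not Solr's actual code. The "__TRA__" name separator, the day-granularity timestamp in the collection name, and the function names parse_start and route are assumptions made for illustration only.

```python
from datetime import datetime, timezone

# Assumed separator between the alias name and the time slice in a
# collection name, e.g. "events__TRA__2021-08-02" (illustrative only).
SEPARATOR = "__TRA__"


def parse_start(collection_name):
    """Return the UTC start instant encoded in a collection name."""
    stamp = collection_name.split(SEPARATOR, 1)[1]
    return datetime.strptime(stamp, "%Y-%m-%d").replace(tzinfo=timezone.utc)


def route(collections_desc, doc_time):
    """Pick the destination collection for a document.

    collections_desc must be in descending chronological order, as the
    thread says the alias list is. The walk is a plain linear scan that
    stops at the first collection whose start is not after the document's
    routed timestamp. Returns None if the document predates every slice.
    """
    for name in collections_desc:
        if parse_start(name) <= doc_time:
            return name
    return None


collections = [
    "events__TRA__2021-08-03",
    "events__TRA__2021-08-02",
    "events__TRA__2021-08-01",
]
doc_time = datetime(2021, 8, 2, 15, 30, tzinfo=timezone.utc)
print(route(collections, doc_time))
```

Because the scan starts at the newest slice, recent documents return after one or two comparisons, while a document for a very old slice walks most of the list. That is the cost Gus mentions for frequent updates to old data.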

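A rough sketch of the manual cleanup described earlier in the thread: re-issue the create-alias call with a later router.start, then delete the older collections with a standard collection delete. The code only builds the Collections API URLs and sends nothing. The host, alias name, routed field, configset name, and collection name are hypothetical, and the parameter set is an assumption that should be checked against the Ref Guide for your Solr version.

```python
from urllib.parse import urlencode

SOLR = "http://localhost:8983/solr"  # hypothetical host


def create_alias_url(alias, field, start, interval, config):
    """Build a CREATEALIAS request for a time routed alias.

    Re-issuing this with a more recent `start` is the approach reported
    in the thread for excluding older collections from the alias.
    """
    params = {
        "action": "CREATEALIAS",
        "name": alias,
        "router.name": "time",
        "router.field": field,
        "router.start": start,
        "router.interval": interval,
        "create-collection.collection.configName": config,
        "create-collection.numShards": 1,
    }
    return f"{SOLR}/admin/collections?{urlencode(params)}"


def delete_collection_url(collection):
    """Build a standard collection DELETE request."""
    params = {"action": "DELETE", "name": collection}
    return f"{SOLR}/admin/collections?{urlencode(params)}"


# Step 1: move router.start forward so the oldest slice is excluded.
update_url = create_alias_url(
    "events", "timestamp_dt", "2021-08-02T00:00:00Z", "+1DAY", "events_config"
)
# Step 2: only after step 1, delete the collection that is no longer listed.
delete_url = delete_collection_url("events__TRA__2021-08-01")
print(update_url)
print(delete_url)
```

The order matters: per the thread, the delete call fails while the collection is still registered as part of the alias.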