Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-06-21 Thread ткаленко кирилл
" :) >>  In any case, I think we're on the same page. >> >>>   On 11 May 2021, at 13:18, Andrey Gura wrote: >>> >>>   Stan >>> >>>>   If archive size is less than min or more than max then the system >>>> functi

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-06-17 Thread ткаленко кирилл
Created the first task by this discussion IGNITE-14923. 13.05.2021, 18:37, "Stanislav Lukyanov" : > What I mean by degradation when archive size < min is that, for example, > historical rebalance is available for a smaller timespan than expected by the > system design. &g

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-13 Thread Stanislav Lukyanov
What I mean by degradation when archive size < min is that, for example, historical rebalance is available for a smaller timespan than expected by the system design. It may not be an issue of course, especially for a new cluster. If "degradation" is the wrong word we can call i

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-11 Thread Andrey Gura
Stan > If archive size is less than min or more than max then the system > functionality can degrade (e.g. historical rebalance may not work as > expected). Why does the condition "archive size is less than min" lead to system degradation? Actually, the described case is a n

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-09 Thread Stanislav Lukyanov
e=2GB. Say, under normal load on stable topology 2 hours of WAL use 1 GB of space. Now, say we're doing historical rebalance and reserve the WAL archive. The WAL archive starts growing and soon it occupies 2 GB. Now what? We're supposed to give up WAL reservations and start agressively r

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-07 Thread ткаленко кирилл
to name the property geMintWalArchiveSize. I think that this is >> exactly what it is - the minimal size of the archive that we want to have. >>  The archive size at all times should be between min and max. >>  If archive size is less than min or more than max then the system >

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-06 Thread Stanislav Lukyanov
WalArchiveSize. I think that this is > exactly what it is - the minimal size of the archive that we want to have. > The archive size at all times should be between min and max. > If archive size is less than min or more than max then the system > functionality can degrade (e.g. histori

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-06 Thread Stanislav Lukyanov
grade (e.g. historical rebalance may not work as expected). I think these rules are intuitively understood from the "min" and "max" names. Ilya's suggestion about throttling is great although I'd do this in a different ticket. Thanks, Stan > On 5 May 2021, at 19:25,

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-05 Thread Maxim Muzafarov
> > >> Hello everybody! > >> > >> At the moment, if there are partitions for the rebalance for which the > >> historical rebalance will be used, then we reserve segments in the WAL > >> archive (we do not allow cleaning the WAL archive) until the

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-04 Thread ткаленко кирилл
ing? > > So we will be throttling for both checkpoint page buffer and WAL limit. > > Regards, > -- > Ilya Kasnacheev > > вт, 4 мая 2021 г. в 11:29, ткаленко кирилл : > >>  Hello everybody! >> >>  At the moment, if there are partitions for the rebalance f

Re: Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-04 Thread Ilya Kasnacheev
if there are partitions for the rebalance for which the > historical rebalance will be used, then we reserve segments in the WAL > archive (we do not allow cleaning the WAL archive) until the rebalance for > all cache groups is over. > > If a cluster is under load during the rebalan

Exceeding the DataStorageConfiguration#getMaxWalArchiveSize due to historical rebalance

2021-05-04 Thread ткаленко кирилл
Hello everybody! At the moment, if there are partitions for the rebalance for which the historical rebalance will be used, then we reserve segments in the WAL archive (we do not allow cleaning the WAL archive) until the rebalance for all cache groups is over. If a cluster is under load during

[jira] [Created] (IGNITE-14524) Historical rebalance doesn't work if cache has configured rebalanceDelay

2021-04-12 Thread Dmitry Lazurkin (Jira)
Dmitry Lazurkin created IGNITE-14524: Summary: Historical rebalance doesn't work if cache has configured rebalanceDelay Key: IGNITE-14524 URL: https://issues.apache.org/jira/browse/IGNITE-

[jira] [Created] (IGNITE-14138) Historical rebalance kills cluster

2021-02-08 Thread Vladislav Pyatkov (Jira)
Vladislav Pyatkov created IGNITE-14138: -- Summary: Historical rebalance kills cluster Key: IGNITE-14138 URL: https://issues.apache.org/jira/browse/IGNITE-14138 Project: Ignite Issue Type

Re: Choosing historical rebalance heuristics

2020-07-17 Thread Alexei Scherbakov
so > > 1) Exclude partitions by threshold (IGNITE_PDS_WAL_REBALANCE_THRESHOLD - > > reduce it to 500) > > 2) Select only that partition for historical rebalance where difference > > between counters less that partition size. > > Agreed, let's go this way. >

Re: Choosing historical rebalance heuristics

2020-07-16 Thread Ivan Rakov
> > I think we can modify the heuristic so > 1) Exclude partitions by threshold (IGNITE_PDS_WAL_REBALANCE_THRESHOLD - > reduce it to 500) > 2) Select only that partition for historical rebalance where difference > between counters less that partition size. Agreed, let's go

Re: Choosing historical rebalance heuristics

2020-07-16 Thread Vladislav Pyatkov
I completely forget about another promise to favor of using historical rebalance where it is possible. When cluster decided to use a full balance, demander nodes should clear not empty partitions. This can to consume a long time, in some cases that may be compared with a time of rebalance. It also

Re: Choosing historical rebalance heuristics

2020-07-15 Thread Vladislav Pyatkov
e for generic case, this should be fixed in bound of protocol. I think we can modify the heuristic so 1) Exclude partitions by threshold (IGNITE_PDS_WAL_REBALANCE_THRESHOLD - reduce it to 500) 2) Select only that partition for historical rebalance where difference between counters less that part

Re: Choosing historical rebalance heuristics

2020-07-15 Thread Ivan Rakov
itely makes the situation better, but due to possible corner cases we should keep the fallback lever to restrict or limit historical rebalance as before. Probably, it would be handy to keep IGNITE_PDS_WAL_REBALANCE_THRESHOLD property with a low default value (1000, 500 or even 0) and apply your heuristic

Choosing historical rebalance heuristics

2020-07-14 Thread Vladislav Pyatkov
Hi guys, I want to implement a more honest heuristic for historical rebalance. Before, a cluster makes a choice between the historical rebalance or not it only from a partition size. This threshold more known by a name of property IGNITE_PDS_WAL_REBALANCE_THRESHOLD. It might prevent a historical

[jira] [Created] (IGNITE-13254) Historical rebalance iterator may skip checkpoint if it not contains updates

2020-07-14 Thread Vladislav Pyatkov (Jira)
Vladislav Pyatkov created IGNITE-13254: -- Summary: Historical rebalance iterator may skip checkpoint if it not contains updates Key: IGNITE-13254 URL: https://issues.apache.org/jira/browse/IGNITE-13254

[jira] [Created] (IGNITE-13253) Advanced heuristics for historical rebalance

2020-07-14 Thread Vladislav Pyatkov (Jira)
Vladislav Pyatkov created IGNITE-13253: -- Summary: Advanced heuristics for historical rebalance Key: IGNITE-13253 URL: https://issues.apache.org/jira/browse/IGNITE-13253 Project: Ignite

[jira] [Created] (IGNITE-13168) Retrigger historical rebalance if it was cancelled in case WAL history is still available

2020-06-19 Thread Vladislav Pyatkov (Jira)
Vladislav Pyatkov created IGNITE-13168: -- Summary: Retrigger historical rebalance if it was cancelled in case WAL history is still available Key: IGNITE-13168 URL: https://issues.apache.org/jira/browse/IGNITE

[jira] [Created] (IGNITE-12935) Disadvantages in log of historical rebalance

2020-04-24 Thread Vladislav Pyatkov (Jira)
Vladislav Pyatkov created IGNITE-12935: -- Summary: Disadvantages in log of historical rebalance Key: IGNITE-12935 URL: https://issues.apache.org/jira/browse/IGNITE-12935 Project: Ignite

[jira] [Created] (IGNITE-12495) Make historical rebalance wokring per-partition level

2019-12-25 Thread Maxim Muzafarov (Jira)
Maxim Muzafarov created IGNITE-12495: Summary: Make historical rebalance wokring per-partition level Key: IGNITE-12495 URL: https://issues.apache.org/jira/browse/IGNITE-12495 Project: Ignite

[jira] [Created] (IGNITE-12429) Rework bytes-based WAL archive size management logic to make historical rebalance more predictable

2019-12-09 Thread Ivan Rakov (Jira)
Ivan Rakov created IGNITE-12429: --- Summary: Rework bytes-based WAL archive size management logic to make historical rebalance more predictable Key: IGNITE-12429 URL: https://issues.apache.org/jira/browse/IGNITE

[jira] [Created] (IGNITE-12117) Historical rebalance should not be processed in striped way

2019-08-28 Thread Anton Vinogradov (Jira)
Anton Vinogradov created IGNITE-12117: - Summary: Historical rebalance should not be processed in striped way Key: IGNITE-12117 URL: https://issues.apache.org/jira/browse/IGNITE-12117 Project

[jira] [Created] (IGNITE-11607) Historical rebalance is not possible from partition which was recently rebalanced itself

2019-03-22 Thread Alexei Scherbakov (JIRA)
Alexei Scherbakov created IGNITE-11607: -- Summary: Historical rebalance is not possible from partition which was recently rebalanced itself Key: IGNITE-11607 URL: https://issues.apache.org/jira/browse/IGNITE

Re: Historical rebalance

2018-12-03 Thread Vladimir Ozerov
t;> perhaps, we are focused too much on update counters. This feature was > >> designed for the continuous queries and it may not be suited well for > >> the historical rebalance. What if we would track updates on > >> per-transaction basis instead of per-update basis? Let&

Re: Historical rebalance

2018-12-03 Thread Roman Kondakov
n Kondakov wrote: Igor, Vladimir, Ivan, perhaps, we are focused too much on update counters. This feature was designed for the continuous queries and it may not be suited well for the historical rebalance. What if we would track updates on per-transaction basis instead of per-update basis? Let

Re: Historical rebalance

2018-12-03 Thread Vladimir Ozerov
t; Igor, Vladimir, Ivan, > > perhaps, we are focused too much on update counters. This feature was > designed for the continuous queries and it may not be suited well for > the historical rebalance. What if we would track updates on > per-transaction basis instead of per-update bas

Re: Historical rebalance

2018-12-03 Thread Roman Kondakov
Igor, Vladimir, Ivan, perhaps, we are focused too much on update counters. This feature was designed for the continuous queries and it may not be suited well for the historical rebalance. What if we would track updates on per-transaction basis instead of per-update basis? Let's conside

Re: Historical rebalance

2018-12-02 Thread Павлухин Иван
consistency is possible then we might be able to fix historical rebalance for other cache modes as well. Main point here is that having only one common mechanism is much better. Of course if it is feasible. пт, 30 нояб. 2018 г. в 02:02, Seliverstov Igor : > > Vladimir, > > Look at my

Re: Historical rebalance

2018-11-29 Thread Seliverstov Igor
e: cp1 [current=uc1, backpointer=uc1] > > > > Example 2: one active transaction: > > --- > > | | > > uc1op1uc2op2op3uc3cp1 > >

Re: Historical rebalance

2018-11-29 Thread Vladimir Ozerov
2: one active transaction: --- | | uc1op1uc2op2op3uc3cp1 ^ | -- state: tx1 -> op3 -> uc2 cp1 [current=uc3, backpointer=uc2] 7) Historical rebalance: 7.1) Demander finds latest checkpoint, get it's

Re: Historical rebalance

2018-11-29 Thread Vladimir Ozerov
Example 1: no active transactions. > uc1cp1 > ^ | > > > state: cp1 [current=uc1, backpointer=uc1] > > Example 2: one active transaction: > --- > | | > uc1op1----uc2op2op3uc3cp1 >^ | >

Re: Historical rebalance

2018-11-29 Thread Seliverstov Igor
; > > > > > > > > > > > > > > вт, 27 нояб. 2018 г. в 11:19, Vladimir Ozerov < > voze...@gridgain.com>: > > > > > > > > > >> Igor, > > > > >> > > > > >> Could you please elabor

Re: Historical rebalance

2018-11-28 Thread Павлухин Иван
> >> are > > > >> going to save at checkpoint time? From what I understand this should > > > >> be: > > > >> 1) List of active transactions with WAL pointers of their first writes > > > >> 2) List of prepared transactions with their up

Re: Historical rebalance

2018-11-28 Thread Павлухин Иван
are > > >> going to save at checkpoint time? From what I understand this should be: > > >> 1) List of active transactions with WAL pointers of their first writes > > >> 2) List of prepared transactions with their update counters > > >> 3) Partition counte

Re: Historical rebalance

2018-11-28 Thread Павлухин Иван
t; before which there are no prepared transactions. > >> > >> And the we send to supplier node a message: "Give me all updates starting > >> from that LWM plus data for that transactions which were active when I > >> failed". > >> > >>

Re: Historical rebalance

2018-11-27 Thread Seliverstov Igor
repared transactions. >> >> And the we send to supplier node a message: "Give me all updates starting >> from that LWM plus data for that transactions which were active when I >> failed". >> >> Am I right? >> >> On Fri, Nov 23, 2018 at 1

Re: Historical rebalance

2018-11-27 Thread Seliverstov Igor
message: "Give me all updates starting > from that LWM plus data for that transactions which were active when I > failed". > > Am I right? > > On Fri, Nov 23, 2018 at 11:22 AM Seliverstov Igor > wrote: > > > Hi Igniters, > > > > Currently I’m wo

Re: Historical rebalance

2018-11-27 Thread Vladimir Ozerov
ll updates starting > from that LWM plus data for that transactions which were active when I > failed". > > Am I right? > > On Fri, Nov 23, 2018 at 11:22 AM Seliverstov Igor > wrote: > >> Hi Igniters, >> >> Currently I’m working on possible approaches how

Re: Historical rebalance

2018-11-27 Thread Vladimir Ozerov
v 23, 2018 at 11:22 AM Seliverstov Igor wrote: > Hi Igniters, > > Currently I’m working on possible approaches how to implement historical > rebalance (delta rebalance using WAL iterator) over MVCC caches. > > The main difficulty is that MVCC writes changes on tx active phase whil

Re: Historical rebalance

2018-11-26 Thread Павлухин Иван
node to which we are applying WAL records to? I mean the state before applying WAL records. Do we apply all records simply one by one or filter out some of them? пт, 23 нояб. 2018 г. в 11:22, Seliverstov Igor : > > Hi Igniters, > > Currently I’m working on possible approaches how to i

Historical rebalance

2018-11-23 Thread Seliverstov Igor
Hi Igniters, Currently I’m working on possible approaches how to implement historical rebalance (delta rebalance using WAL iterator) over MVCC caches. The main difficulty is that MVCC writes changes on tx active phase while partition update version, aka update counter, is being applied on tx

[jira] [Created] (IGNITE-10117) Node is mistakenly excluded from history suppliers preventing historical rebalance.

2018-11-01 Thread Alexei Scherbakov (JIRA)
Alexei Scherbakov created IGNITE-10117: -- Summary: Node is mistakenly excluded from history suppliers preventing historical rebalance. Key: IGNITE-10117 URL: https://issues.apache.org/jira/browse/IGNITE-10117

[jira] [Created] (IGNITE-9827) Assertion error due historical rebalance

2018-10-09 Thread ARomantsov (JIRA)
ARomantsov created IGNITE-9827: -- Summary: Assertion error due historical rebalance Key: IGNITE-9827 URL: https://issues.apache.org/jira/browse/IGNITE-9827 Project: Ignite Issue Type: Bug

[jira] [Created] (IGNITE-8946) AssertionError can occur during reservation of WAL history for historical rebalance

2018-07-05 Thread Ivan Rakov (JIRA)
Ivan Rakov created IGNITE-8946: -- Summary: AssertionError can occur during reservation of WAL history for historical rebalance Key: IGNITE-8946 URL: https://issues.apache.org/jira/browse/IGNITE-8946

[GitHub] ignite pull request #3791: IGNITE-8116 Historical rebalance fixes

2018-05-18 Thread Jokser
Github user Jokser closed the pull request at: https://github.com/apache/ignite/pull/3791 ---

[jira] [Created] (IGNITE-8390) WAL historical rebalance is not able to process cache.remove() updates

2018-04-25 Thread Pavel Kovalenko (JIRA)
Pavel Kovalenko created IGNITE-8390: --- Summary: WAL historical rebalance is not able to process cache.remove() updates Key: IGNITE-8390 URL: https://issues.apache.org/jira/browse/IGNITE-8390 Project

[GitHub] ignite pull request #3791: IGNITE-8116 Historical rebalance fixes

2018-04-10 Thread Jokser
GitHub user Jokser opened a pull request: https://github.com/apache/ignite/pull/3791 IGNITE-8116 Historical rebalance fixes You can merge this pull request into a Git repository by running: $ git pull https://github.com/gridgain/apache-ignite ignite-8116-8122 Alternatively