Oh, I see now. Using currentRow gives the correlation per ID within the
group based on its ordering, and using unbounded on both ends gives the
overall correlation value for the whole group?
On Mon, Feb 28, 2022 at 16:33, Sean Owen wrote:
The results make sense then. You want a correlation per group, right?
Because it's over the sums by ID within the group. Then currentRow is
wrong; the frame needs to be unbounded preceding and unbounded following.
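
To make the difference between the two frames concrete, here is a minimal plain-Scala sketch that emulates them without Spark. The object name, the helper `pearson`, and the four (x, y) rows are all hypothetical and chosen only for illustration; they are not the data from this thread.

```scala
// Plain-Scala emulation of two window frames over one group; no Spark needed.
// All names and numbers here are hypothetical illustration data.
object FrameDemo {
  // Pearson correlation of two equally long series.
  // Returns NaN for fewer than two points or when a variance is zero.
  def pearson(xs: Seq[Double], ys: Seq[Double]): Double = {
    val n = xs.length
    if (n < 2) Double.NaN
    else {
      val mx = xs.sum / n
      val my = ys.sum / n
      val cov = xs.zip(ys).map { case (x, y) => (x - mx) * (y - my) }.sum
      val vx = xs.map(x => (x - mx) * (x - mx)).sum
      val vy = ys.map(y => (y - my) * (y - my)).sum
      cov / math.sqrt(vx * vy)
    }
  }

  // One group, already sorted by x (the ordering column).
  val xs: Seq[Double] = Seq(1.0, 2.0, 3.0, 4.0)
  val ys: Seq[Double] = Seq(2.0, 1.0, 4.0, 3.0)

  // Frame "unbounded preceding .. current row":
  // row i only sees rows 0..i, so the value changes from row to row.
  val running: Seq[Double] =
    xs.indices.map(i => pearson(xs.take(i + 1), ys.take(i + 1)))

  // Frame "unbounded preceding .. unbounded following":
  // every row sees the whole group, so every row gets the same value.
  val whole: Seq[Double] = xs.map(_ => pearson(xs, ys))
}
```

With these rows, `FrameDemo.running` starts with NaN (one element), then -1.0 (two elements), and only reaches the group value on the last row, while `FrameDemo.whole` is 0.6 on every row. In Spark terms the second frame corresponds to `.rowsBetween(Window.unboundedPreceding, Window.unboundedFollowing)`.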
On Mon, Feb 28, 2022 at 9:22 AM Edgar H wrote:
Yes, the window is defined as you said: unboundedPreceding to currentRow,
ordered by orderCountSum.
import org.apache.spark.sql.expressions.Window

val initialSetWindow = Window
  .partitionBy("group")
  .orderBy("orderCountSum")
  .rowsBetween(Window.unboundedPreceding, Window.currentRow)
I'm trying to obtain the correlation for each of the m
How are you defining the window? It looks like it's something like "rows
unbounded preceding, current" or the reverse, as the correlation varies
across the elements of the group as if it's computing them on 1, then 2,
then 3 elements. Don't you want the correlation across the group? Otherwise
this
My bad completely; I missed the example by a mile, sorry about that. Let me
change a couple of things.
- I need to add "id" to the initial grouping and also add more elements to
the initial set:
val sampleSet = Seq(
  ("group1", "id1", 1, 1, 6),
  ("group1", "id1", 4, 4, 6),
  ("group1", "id2", 2, 2, 5),
You're computing correlations of two series of values, but each series has
only one value, a sum. Correlation is not defined in this case (both
variances are undefined). Note that this is sample correlation.
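
As a small illustration of why a single value leaves the variance undefined, here is a hedged plain-Scala sketch of the sample variance with its (n - 1) denominator. The object name, the helper `sampleVariance`, and the input values are hypothetical; this is not Spark's own implementation.

```scala
// Sample variance with the (n - 1) denominator, in plain Scala.
// Names and inputs are hypothetical, for illustration only.
object SingleValueDemo {
  def sampleVariance(xs: Seq[Double]): Double = {
    val n = xs.length
    val mean = xs.sum / n
    // For n == 1 this is 0.0 / 0, which is NaN in floating point.
    xs.map(x => (x - mean) * (x - mean)).sum / (n - 1)
  }

  // A "series" holding a single sum, as in the case described above.
  val singleSum: Seq[Double] = Seq(6.0)
  val varianceOfOne: Double = sampleVariance(singleSum)
}
```

Because the sample correlation divides by the square root of both variances, an undefined variance on either side leaves the correlation undefined as well.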
On Mon, Feb 28, 2022 at 7:06 AM Edgar H wrote:
> Morning all, been struggling with this for a whi