Hi Ping,
FYI, we just merged Feynman's PR:
https://github.com/apache/spark/pull/6997 that adds sequential pattern
support. Please check out the master branch and help test it. Thanks!
Best,
Xiangrui
On Wed, Jun 24, 2015 at 2:16 PM, Feynman Liang wrote:
> There is a JIRA for this which I just submitted
This is on the wish list for Spark 1.5. Assuming that the items from
the same transaction are distinct, we can still follow FP-Growth's
steps:
1. find frequent items
2. filter transactions and keep only frequent items
3. do NOT order by frequency
4. use suffix to partition the transactions (whethe
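
For illustration only, steps 1-3 above could look roughly like the
plain-Python sketch below. It is not the MLlib implementation: the
function name keep_frequent_items, the min_count parameter, and the
sample clickpaths are made up for this example, and step 4 is left out
because its description is cut off above.

```python
from collections import Counter


def keep_frequent_items(sequences, min_count):
    """Hypothetical helper: find frequent items, then filter each
    sequence down to those items without reordering it."""
    # Step 1: support of an item = number of sequences that contain it.
    # set(seq) ensures an item is counted at most once per sequence.
    support = Counter()
    for seq in sequences:
        support.update(set(seq))
    frequent = {item for item, count in support.items() if count >= min_count}

    # Steps 2 and 3: drop infrequent items but keep the original order,
    # i.e. do NOT sort by frequency as FP-Growth would.
    filtered = [[item for item in seq if item in frequent] for seq in sequences]
    return frequent, filtered


# Made-up clickpaths; "C" and "D" each appear in only one sequence.
clickpaths = [["A", "B", "C"], ["B", "A"], ["A", "B", "D"]]
frequent, filtered = keep_frequent_items(clickpaths, min_count=2)
print(sorted(frequent))
print(filtered)
```

Because the order inside each sequence is untouched, A -> B and
B -> A stay distinguishable after filtering.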
Hi,
I have a use case where I'd like to mine frequent sequential patterns
(consider the clickpath scenario). Transaction A -> B doesn't equal
Transaction B -> A.
From what I understand about FP-growth in general and the MLlib
implementation of it, the order is not preserved. Can anyone provide