Interesting future idea. My current goal is to simply find outliers by 
value. Eg: if my electricity bill is twice what it normally is, to flag 
that. 

But as you observed, fraud detection might be possible by simply grabbing 
all data that an institution provides, stuffing it into metadata, and 
exposing it to the learning algorithm.

On Wednesday, January 24, 2024 at 4:55:29 PM UTC-8 timothy.n....@gmail.com 
wrote:

> Maybe take a page from fraud detection 
> <https://towardsdatascience.com/anomaly-fraud-detection-a-quick-overview-28641ec49ec1>
>  
> and try to annotate each transaction with the hour of the day, day of the 
> week, day of the month, and accumulated transaction volume/value when the 
> transaction occurred. Even throw in a biweekly time period if you're 
> feeling fancy? The best thing would be to include the location if you have 
> it, but I don't think you do.
>
>
> Sincerely,
> Timothy Jesionowski
>
>
> On Wed, Jan 24, 2024 at 4:34 PM Red S <redst...@gmail.com> wrote:
>
>> Very interesting, thank you all. I wonder if a single user's journal 
>> would suffice for a learning dataset in this case. For me, expenses across 
>> categories of interest are those have been stable for years. Plus, I’m 
>> willing to deal with false positives (but preferably not false negatives).
>>
>> There is a kind of machine learning problem called outlier detection. I 
>> think sciki-learn library is a good starting point
>>
>> Excellent, thank you for the helpful pointers! A quick search brought up 
>> these, which I’ve noted down to look into when I have time(TM):
>> https://scikit-learn.org/stable/modules/outlier_detection.html
>>
>> https://scikit-learn.org/stable/auto_examples/neighbors/plot_lof_outlier_detection.html
>> ​
>>
>>
>> On Wednesday, January 24, 2024 at 10:09:54 AM UTC-8 erical...@gmail.com 
>> wrote:
>>
>> This would probably be more useful if users can provide their own 
>> examples of abnormal and normal expenses.  In that case, the model itself 
>> is probably not very difficult; I imagine a variety of off the shelf 
>> toolkits would work.  To me, the harder part seems like making the workflow 
>> smooth and robust -- deciding how users would flag outliers, run the 
>> classifier, correct misclassifications, cause retraining to happen, etc.
>>
>> On Wed, Jan 24, 2024 at 10:05 AM Yichu Zhou <flyaw...@gmail.com> wrote:
>>
>> There is a kind of machine learning problem called outlier detection. I 
>> think sciki-learn library is a good starting point if we want to use ML 
>> techniques. But in our case, I feel the definition of “abnormal” varies on 
>> different personal situations. It might be tricky to formulate the problem 
>> properly. 
>>
>> On Tue, Jan 23, 2024 at 21:24 Red S <redst...@gmail.com> wrote:
>>
>> Definitely! That's what I had in mind. Would you or others on this list 
>> have experience in how to frame the problem from a deep learning 
>> classification problem, what tools/libraries to use, and such? Pointers 
>> appreciated.
>>
>> On Tuesday, January 23, 2024 at 8:08:31 AM UTC-8 char...@gmail.com wrote:
>>
>> Sounds like a good opportunity for deep learning classification problem.
>>
>> On Friday, January 19, 2024 at 11:45:35 AM UTC+1 Red S wrote:
>>
>> I'm curious, has anyone setup Beancount scripts or reports to flag 
>> expenses that might need further attention? The situation that made me 
>> think about this is a quarterly bill that doubled multiple times after 
>> years of being stable, which is an obvious red flag.
>>
>> Unlike in the past, virtually of my Beancount interactions are highly 
>> automated, which combined with the fact that time is at a premium these 
>> days, causes me to miss details like this.
>>
>> In this particular case, a rule to flag expenses that deviate from their 
>> norm over a certain time period (monthly, annually) might be simple to 
>> write, but I was wondering for a more general, perhaps fancier solution 
>> that would learn to distinguish what's normal and call attention to what's 
>> not, as rules based solutions tend to be incomplete and require constant 
>> fiddling.
>>
>> -- 
>>
> You received this message because you are subscribed to the Google Groups 
>> "Beancount" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to beancount+...@googlegroups.com.
>>
> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/beancount/d1cffa4f-4c32-484c-b4f8-c6d4cbf748cen%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/beancount/d1cffa4f-4c32-484c-b4f8-c6d4cbf748cen%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Beancount" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to beancount+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/beancount/3dcea2d6-1090-490d-a65a-3a0919ce24bdn%40googlegroups.com.

Reply via email to