On 2024-06-24 at 17:18:11 UTC-0400 (Mon, 24 Jun 2024 17:18:11 -0400)
Mark London <m...@psfc.mit.edu>
is rumored to have said:
I received a spam email with the text below that wasn't caught by
SpamAssassin (at least by mine). The text actually looks like something
that was generated using ChatGPT. In any event, I put the text
through ChatGPT and asked if it looked like spam. At the bottom of
this email is its analysis. I've not been fully reading this
group. Has there been any work to allow SpamAssassin to use AI?
"Artificial intelligence" does not exist. It is a misnomer.
Large language models like ChatGPT have a provenance problem. There's no
way to know exactly why the model "says" anything. In a single
paragraph, ChatGPT is capable of making completely and directly
inconsistent assertions. The only way to explain that is that despite
appearances, a request to answer the ham/spam question generates text
with no semantic connection to the original, but which seems like an
explanation.
SpamAssassin's code and rules all come from ASF committers, and the
scores are determined by examining the scan results from contributors
and optimizing them to a threshold of 5.0. Every scan of a message
results in a list of hits against documented rules. The results can be
analyzed and understood.
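For illustration, a scan's verdict is recorded in an X-Spam-Status
header that names every rule that hit. The rule names and scores below
are a hypothetical example, not drawn from the message under
discussion:

    X-Spam-Status: Yes, score=7.3 required=5.0
        tests=BAYES_99,HTML_MESSAGE,URIBL_BLACK autolearn=no
        version=4.0.0

Each test name corresponds to a documented rule, so any final score can
be traced back to specific, inspectable causes.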
We know that ChatGPT and other publicly available LLMs have been
trained on data to which they had no license. There is no way to remove
any particular ingested data, no way to know where any particular LLM
will have problems, and no way to fix those problems. This all puts
them outside of the boundaries we have as an ASF project.
However, we do have a plugin architecture, so it is possible for third
parties to create a plugin for LLM integration; a rough sketch of what
that could look like follows.
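A minimal sketch, assuming a stock SpamAssassin install: plugins are
Perl modules subclassing Mail::SpamAssassin::Plugin that register eval
rules for the configuration to reference. The plugin name (LLMCheck),
rule name, and score here are all hypothetical, and the actual call out
to an LLM service is left as a stub:

    package LLMCheck;
    use strict;
    use warnings;
    use Mail::SpamAssassin::Plugin;
    our @ISA = qw(Mail::SpamAssassin::Plugin);

    sub new {
        my ($class, $mailsa) = @_;
        my $self = $class->SUPER::new($mailsa);
        bless($self, $class);
        # Make the eval rule visible to rule definitions.
        $self->register_eval_rule("llm_says_spam");
        return $self;
    }

    sub llm_says_spam {
        my ($self, $pms) = @_;
        # Stub: send $pms->get_message()->get_pristine() to an
        # external LLM service and return 1 if it calls it spam.
        return 0;
    }

    1;

It would be wired up in local.cf with something like:

    loadplugin LLMCheck /etc/mail/spamassassin/LLMCheck.pm
    header   LLM_SAYS_SPAM eval:llm_says_spam()
    describe LLM_SAYS_SPAM Flagged as spam by an external LLM
    score    LLM_SAYS_SPAM 3.0

Note that the LLM's verdict would then be just one weighted rule among
many, so the rest of the scan stays analyzable even if that one
verdict is opaque.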
--
Bill Cole
b...@scconsult.com or billc...@apache.org
(AKA @grumpybozo@toad.social and many *@billmail.scconsult.com
addresses)
Not Currently Available For Hire