On Mon, May 12, 2025 at 11:33 AM Matt Mahoney <mattmahone...@gmail.com>
wrote:

>
> Training and prediction costs about the same.
>

In the limit of a large number of predictions per model-revision that is
false for the obvious reason that predictions are conditional
decompressions and the conditions (ie: "prompts") overlap hence the
predictions overlap -- so tabling/memoization/caching not only works, but
is an inevitable cost savings.

------------------------------------------
Artificial General Intelligence List: AGI
Permalink: 
https://agi.topicbox.com/groups/agi/Tdc5c19d0f38aacd6-M9ed2badb276cdd06da046713
Delivery options: https://agi.topicbox.com/groups/agi/subscription

Reply via email to