On Mon, May 12, 2025 at 11:33 AM Matt Mahoney <mattmahone...@gmail.com> wrote:
> > Training and prediction costs about the same. > In the limit of a large number of predictions per model-revision that is false for the obvious reason that predictions are conditional decompressions and the conditions (ie: "prompts") overlap hence the predictions overlap -- so tabling/memoization/caching not only works, but is an inevitable cost savings. ------------------------------------------ Artificial General Intelligence List: AGI Permalink: https://agi.topicbox.com/groups/agi/Tdc5c19d0f38aacd6-M9ed2badb276cdd06da046713 Delivery options: https://agi.topicbox.com/groups/agi/subscription