[nexa] Evidence For (and Against) Emergent World Models in LLM

Daniela Tafani Fri, 14 Feb 2025 00:54:25 -0800

Melanie Mitchell, LLMs and World Models, Part 2
Evidence For (and Against) Emergent World Models in LLMs
Feb 13, 2025


[...]
Conclusion
The claims of emergent abstract world models in LLMs are not yet supported by 
strong evidence. There is some evidence of such world models arising in 
transformers trained on narrow domains (Othello, chess, mazes, etc.) but also 
evidence that their abilities arise not from human-like internal models but 
from large “bags of heuristics”. Moreover, the notion of “world model” itself 
is not rigorously defined; when considering whether an agent has a particular 
kind of world model, we should ask what kinds of questions such a model should 
be able to answer, how easy or hard it should be for the agent to get answers 
from the model, and to what extent we would expect that the model would allow 
the agent to adapt to novel situations.

<https://aiguide.substack.com/p/llms-and-world-models-part-2>

Part 1 is here:
<https://aiguide.substack.com/p/llms-and-world-models-part-1>
