UCLA揭穿AI读心术幻觉：用极简无脑模型击败GPT-2，破除大模型类脑神话 | DAST Papers

对应论文

Spurious alignment between large language models and brains can emerge from non-robust methods and overlooked confounds

视频简介

这篇 Nature Communications 论文直接质疑 LLM-brain alignment 研究中的方法学稳健性。作者跨多个模型、方法和三个常用神经数据集分析 neural predictivity，发现 shuffled train-test splits 曾导致有影响力但虚假的结论。他们还显示 LLM activation extraction 选择会偏向特定模型类别，而 position signal 和 word rate 等混杂变量可与训练好的 LLM 竞争，甚至解释 untrained LLM 的神经预测性。它值得收录，因为它为 NeuroAI 和认知神经科学中的模型-大脑相似性分析提供了重要方法学边界，防止把 confound 当成智能机制。

外部视频链接

论文链接

论文详情页