LLMは専門家を超えるか

この研究では、大規模言語モデル（LLMs）が人間の専門家を上回って神経科学の実験結果を予測できるかどうかを評価している。研究の一環として、BrainBenchという神経科学の結果を予測するためのベンチマークを作成し、LLMsが専門家よりも優れた予測を行うことを発見した。特に、神経科学の文献に基づいて調整されたBrainGPTというLLMがさらに優れた性能を示している。LLMsは高い信頼度を示すときに正確な予測をする傾向があり、将来的には人間の科学者が発見を行う際にLLMsが支援する可能性が示唆されている。

Large language models surpass human experts in predicting neuroscience results – Nature Human Behaviour

Large language models (LLMs) can synthesize vast amounts of information. Luo et al. show that LLMs—especially BrainGPT, an LLM the authors tuned on the neuroscience literature—outperform experts in predicting neuroscience results and could assist scientists in making future discoveries.