您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 综合性期刊 > Science > 2024 > 6682期

Grounded language acquisition through the eyes and ears of a single child

成果类型：

Article

署名作者：

Vong, Wai Keen; Wang, Wentao; Orhan, A. Emin; Lake, Brenden M.

署名单位：

New York University; New York University

刊物名称：

SCIENCE

ISSN/ISSBN：

0036-9620

DOI：

10.1126/science.adi1374

发表日期：

2024-02-02

页码：

504-511

关键词：

model words

摘要：

Starting around 6 to 9 months of age, children begin acquiring their first words, linking spoken words to their visual counterparts. How much of this knowledge is learnable from sensory input with relatively generic learning mechanisms, and how much requires stronger inductive biases? Using longitudinal head-mounted camera recordings from one child aged 6 to 25 months, we trained a relatively generic neural network on 61 hours of correlated visual-linguistic data streams, learning feature-based representations and cross-modal associations. Our model acquires many word-referent mappings present in the child's everyday experience, enables zero-shot generalization to new visual referents, and aligns its visual and linguistic conceptual systems. These results show how critical aspects of grounded word meaning are learnable through joint representation and associative learning from one child's input.