Data-driven models and computational tools for neurolinguistics: a language technology perspective

Year
2020
Volume 21
Issue 1
Pages
15-52
Authors
Ekaterina Artemova, Amir Bakarov, Aleksey Artemov, Evgeny Burnaev, Maxim Sharaev
Abstract
In this paper, we examine how language technologies connect to and influence research in neurolinguistics. We review brain imaging-based neurolinguistic studies, focusing on natural language representations such as word embeddings and pre-trained language models. The mutual enrichment of neurolinguistics and language technologies leads to the development of brain-aware natural language representations. Medical applications underscore the importance of this research area.

Keywords: neurolinguistics, neuroimaging data, EEG, fMRI, natural language representations, word embeddings, distributional semantics models, word2vec, GloVe, BERT, brain-aware embeddings