The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts

De Deyne, S.; Verheyen, Steven; Storms, Gert

doi:10.1080/17470218.2014.994098

De Deyne, S., S. Verheyen (Steven) and G. Storms (Gert)

2015

The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts

The Quarterly Journal of Experimental Psychology , Volume 68 - Issue 8 p. 1643- 1664

One of the most significant recent advances in the study of semantic processing is the advent of models based on text and other corpora. In this study, we address what impact both the quantitative and qualitative properties of corpora have on mental representations derived from them. More precisely, we evaluate models with different linguistic and mental constraints on their ability to predict semantic relatedness between items from a vast range of domains and categories. We find that a model based on syntactic dependency relations captures significantly less of the variability for all kinds of words, regardless of the semantic relation between them or their abstractness. The largest difference was found for concrete nouns, which are commonly used to assess semantic processing. For both models we find that limited amounts of data suffice to obtain reliable predictions. Together, these findings suggest new constraints for the construction of mental models from corpora, both in terms of the corpus size and in terms of the linguistic properties that contribute to mental representations.

Additional Metadata
Keywords	semantic memory, text corpora, syntactic dependency, word associations, similarity.
Persistent URL	doi.org/10.1080/17470218.2014.994098, hdl.handle.net/1765/125386
Journal	The Quarterly Journal of Experimental Psychology
Organisation	Department of Psychology
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	De Deyne, S., Verheyen, S., & Storms, G. (2015). The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts. The Quarterly Journal of Experimental Psychology, 68(8), 1643–1664. doi:10.1080/17470218.2014.994098

View at Publisher

The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts

Publication

Publication

About

The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts

Publication

Publication

Workflow

Workflow

Add Content