2011-10-19
Word sense disambiguation for automatic taxonomy construction from text-based Web corpora
Publication
Publication
In this paper, we propose the Automatic Taxonomy Construction from Text (ATCT) framework for building taxonomies from text-based Web corpora. The framework is composed of multiple processing steps. Firstly, domain terms are extracted using a filtering method. Subsequently, Word Sense Disambiguation (WSD) is optionally applied in order to determine the senses of these terms. Then, by means of a subsumption technique, the resulting concepts are arranged in a hierarchy. We construct taxonomies with and without WSD and we investigate the effect of WSD on the quality of concept type-of relations using an evaluation framework that uses a golden taxonomy. We find that WSD improves the quality of the built taxonomy in terms of the taxonomic F-Measure.
Additional Metadata | |
---|---|
doi.org/10.1007/978-3-642-24434-6_18, hdl.handle.net/1765/53694 | |
Organisation | Erasmus School of Economics |
de Knijff, J., Meijer, K., Frasincar, F., & Hogenboom, F. (2011). Word sense disambiguation for automatic taxonomy construction from text-based Web corpora. doi:10.1007/978-3-642-24434-6_18 |