2012-12-01
Mapping product taxonomies in e-commerce
Publication
Publication
In this paper we propose SCHEMA, an algorithm that automatically maps heterogeneous product taxonomiesin the domain of e-commerce. SCHEMA employs a custom word sense disambiguation technique,based on the Lesk algorithm, in combination with the semantic lexicon WordNet. For findingcandidate target categories and determining the path-similarity we propose a semantic category matchingalgorithm that takes into account the disambiguation process of a category. The mapping quality score iscalculated using the Damerau-Levenshtein distance and a node-dissimilarity penalty. The performance ofSCHEMA was tested on three real-life datasets and compared to PROMPT and the algorithm proposedby Park & Kim. The comparison shows that SCHEMA improves considerably recall and F1-score, whilemaintaining similar precision.
Additional Metadata | |
---|---|
hdl.handle.net/1765/82835 | |
24th Benelux Conference on Artificial Intelligence, BNAIC 2012 | |
Organisation | Department of Econometrics |
Aanen, S., Nederstigt, L., Vandic, D., & Frasincar, F. (2012). Mapping product taxonomies in e-commerce. Presented at the 24th Benelux Conference on Artificial Intelligence, BNAIC 2012. Retrieved from http://hdl.handle.net/1765/82835 |