Over the last few years, we have experienced a steady growth in e-commerce. This growth introduces many problems for services that want to aggregate product information and offerings. One of the problems that aggregation services face is the matching of product categories from different Web shops. This paper proposes an algorithm to perform this task automatically, making it possible to aggregate product information from multiple Web sites, in order to deploy it for search, comparison, or recommender systems applications. The algorithm uses word sense disambiguation techniques to address varying denominations between different taxonomies. Path similarity is assessed between source and candidate target categories, based on lexical relatedness and structural information. The main focus of the proposed solution is to improve the disambiguation procedure in comparison to an existing state-of-the-art approach, while coping with product taxonomy-specific characteristics, like composite categories, and re-examining lexical similarity and similarity aggregation in this context. The performance evaluation based on data from three real-world Web shops demonstrates that the proposed algorithm improves the benchmarked approach by 62% on average F1-measure.

, , , , , , , ,
doi.org/10.1016/j.eswa.2014.09.032, hdl.handle.net/1765/88931
ERIM Top-Core Articles
Expert Systems with Applications
Erasmus University Rotterdam

Aanen, S.S, Vandic, D, & Frasincar, F. (2015). Automated product taxonomy mapping in an e-commerce environment. Expert Systems with Applications, 42(3), 1298–1313. doi:10.1016/j.eswa.2014.09.032