Obtaining a useful complete overview of Web-based product information has become difficult nowadays due to the ever-growing amount of information available on online shops. Findings from previous studies suggest that better search capabilities, such as the exploitation of annotated data, are needed to keep online shopping transparent for the user. Annotations can, for example, help present information from multiple sources in a uniform manner. In order to support the product data integration process, we propose an algorithm that can autonomously map heterogeneous product taxonomies from different online shops. The proposed approach uses word sense disambiguation techniques, approximate lexical matching, and a mechanism that deals with composite categories. Our algorithm’s performance compared favorably against two other state-of-the-art taxonomy mapping algorithms on three real-life datasets. The results show that the F1-measure for our algorithm is on average 60% higher than a state-of-the-art product taxonomy mapping algorithm.

Additional Metadata
Keywords Lexical matching, Product taxonomies, Schema mapping, Word sense disambiguation
Persistent URL hdl.handle.net/1765/92014
Journal Journal of Web Engineering
Citation
Nederstigt, L.J, Vandic, D, & Frasincar, F. (2016). A lexical approach for taxonomy mapping. Journal of Web Engineering, 15(1-2), 84–109. Retrieved from http://hdl.handle.net/1765/92014