Many of the existing cloud tagging systems are unable to cope with the syntactic and semantic tag variations during user search and browse activities. As a solution to this problem, in this paper, we propose the Semantic Tag Clustering Search, a framework able to cope with these needs. The framework consists of three parts: removing syntactic variations, creating semantic clusters, and utilizing the obtained clusters to improve search and exploration of tag spaces. For removing syntactic variations, we use the normalized Levenshtein distance, and the cosine similarity measure based on tag co-occurrences. For creating semantic clusters, we improve an existing non-hierarchical clustering technique. Using our framework, we are able to find more clusters and achieve a higher precision than the original method. The advantages of a cluster-based approach for searching and browsing through tag spaces have been exploited in, the implementation of our framework.

, , ,,
Erasmus School of Economics

Vandic, D., Van Dam, J. W., Hogenboom, F., & Frasincar, F. (2011). A semantic clustering-based approach for searching and browsing tag spaces. doi:10.1145/1982185.1982538