ABSTRACT
Content- and knowledge-based recommender systems often rely on a measure of (dis)similarity between items, and this measure is frequently based on the attributes of the items. Which attributes are important to the users of the system, however, remains an important question. In this paper, we present an approach to determine attribute weights in a dissimilarity measure using clickstream data from an e-commerce website. We count how many times each product is sold and use these counts to estimate a Poisson regression model. The estimates of this model are then used to determine the attribute weights in the dissimilarity measure. We apply this approach to a product catalog of MP3 players provided by Compare Group, owner of the Dutch price comparison site http://www.vergelijk.nl, and show how the dissimilarity measure can be used to improve 2D product catalog visualizations.
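The pipeline outlined in the abstract (sales counts, a Poisson regression, then attribute weights in a dissimilarity measure) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy data, the function names `fit_poisson`, `attribute_weights` and `dissimilarity`, the use of plain gradient ascent, the choice of normalized absolute coefficients as weights, and the weighted absolute-difference form of the dissimilarity. The abstract does not state how the model estimates are mapped to weights.

```python
import math

def fit_poisson(X, y, lr=0.05, iters=5000):
    """Fit log E[y] = b0 + sum_k b_k * x_k by gradient ascent
    on the Poisson log-likelihood. Returns [b0, b1, ..., bp]."""
    n, p = len(X), len(X[0])
    beta = [0.0] * (p + 1)  # beta[0] is the intercept
    for _ in range(iters):
        grad = [0.0] * (p + 1)
        for xi, yi in zip(X, y):
            eta = beta[0] + sum(b * v for b, v in zip(beta[1:], xi))
            resid = yi - math.exp(eta)  # observed minus expected count
            grad[0] += resid
            for k, v in enumerate(xi):
                grad[k + 1] += resid * v
        beta = [b + lr * g / n for b, g in zip(beta, grad)]
    return beta

def attribute_weights(beta):
    """ASSUMPTION: weight = |coefficient| normalized to sum to 1.
    The intercept is skipped. This mapping is illustrative only."""
    mags = [abs(b) for b in beta[1:]]
    total = sum(mags)
    if total == 0:
        return [1.0 / len(mags)] * len(mags)  # fall back to equal weights
    return [m / total for m in mags]

def dissimilarity(a, b, weights):
    """Weighted sum of absolute attribute differences.
    Assumes every attribute is already scaled to [0, 1]."""
    return sum(w * abs(u - v) for w, u, v in zip(weights, a, b))

# Hypothetical toy catalog: two attributes scaled to [0, 1].
# Sales counts rise with attribute 0 and do not depend on attribute 1.
X = [(0.0, 0.0), (0.0, 1.0), (0.5, 0.0), (0.5, 1.0), (1.0, 0.0), (1.0, 1.0)]
y = [1, 1, 3, 3, 8, 8]

beta = fit_poisson(X, y)
weights = attribute_weights(beta)
print("weights:", weights)
print("d(item 0, item 5):", dissimilarity(X[0], X[5], weights))
```

In this toy setup nearly all of the weight goes to attribute 0, since the sales counts vary only with that attribute. A production version would more likely use an established GLM routine and would need to handle categorical attributes and missing values, which this sketch ignores.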
Index Terms
- Choosing attribute weights for item dissimilarity using clickstream data with an application to a product catalog map