Preprocessing and mining web log data for web personalization
Miriam Baglioni,U. Ferrara,Andrea Romei,Salvatore Ruggieri,Franco Turini +4 more
- 23 Sep 2003
- pp 237-249
TL;DR: The web usage mining activities of an on-going project, called ClickWorld, that aims at extracting models of the navigational behaviour of a web site users by means of data and web mining techniques are described.
read more
Abstract: We describe the web usage mining activities of an on-going project, called ClickWorld, that aims at extracting models of the navigational behaviour of a web site users. The models are inferred from the access logs of a web server by means of data and web mining techniques. The extracted knowledge is deployed to the purpose of offering a personalized and proactive view of the web services to users. We first describe the preprocessing steps on access logs necessary to clean, select and prepare data for knowledge extraction. Then we show two sets of experiments: the first one tries to predict the sex of a user based on the visited web pages, and the second one tries to predict whether a user might be interested in visiting a section of the site.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Automated profiling of resource usage
Michael David Marr,Matthew D. Klein +1 more
- 20 Sep 2012
TL;DR: In this paper, operating profiles for consumers of computing resources may be automatically determined based on an analysis of actual resource usage measurements and other operating metrics and assignment decisions may be made based on the profiles, and computing resources can be reallocated or oversubscribed if the profiles indicate that the consumers are unlikely to fully utilize the resources reserved for them.
232
Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage
Zdravko Markov,Daniel T. Larose +1 more
TL;DR: The author revealed how the A Priori Algorithm and the MDL-Based Model Evaluation techniques improved the efficiency of the Web Usage Mining process and also revealed new ideas about how to improve the quality of the search process.
181
Patent
Request routing using network computing components
Swaminathan Sivasubramanian,David R. Richardson,Christopher L. Scofield,Bradley E. Marshall +3 more
- 18 Jun 2009
TL;DR: In this article, a DNS server at a content delivery network service provider obtains a DNS query corresponding to a resource requested from a client computing device and associated with a first resource identifier.
166
Patent
Managing content delivery network service providers by a content broker
David R. Richardson,Bradley E. Marshall,Swaminathan Sivasubramanian,Tal Saraf,Imran S. Patel +4 more
- 25 Jan 2012
TL;DR: In this paper, a system, method, and computer readable medium for managing network storage provider and CDN service providers is provided, where a content broker component obtains client computing device requests for content provided by a content provider.
162
References
•Book
Data Mining: Concepts and Techniques
Jiawei Han,Micheline Kamber,Jian Pei +2 more
- 08 Sep 2000
TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Instance-Based Learning Algorithms
TL;DR: This paper describes how storage requirements can be significantly reduced with, at most, minor sacrifices in learning rate and classification accuracy and extends the nearest neighbor algorithm, which has large storage requirements.
Data Mining: Concepts and Techniques
G. Thamaraiselvi,A. Kaliammal +1 more
TL;DR: This article explains What is data mining?
4.4K