A generalization-based approach to clustering of Web usage sessions

Yongjian Fu, Kanwalpreet Sandhu, Ming-Yi Shih

Research output: Chapter in Book/Report/Conference proceedingConference contribution

72 Citations (Scopus)

Abstract

The clustering of Web usage sessions based on the access patterns is studied. Access patterns of Web users are extracted from Web server log files, and then organized into sessions which represent episodes of interaction between the Web users and the Web server. Using attribute-oriented induction, the sessions are then generalized according to a page hierarchy which organizes pages based on their contents. These generalized sessions are finally clustered using a hierarchical clustering method. Our experiments on a large real data set show that the approach is efficient and practical for Web mining applications.

Original languageEnglish
Title of host publicationWeb Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers
EditorsBrij Masand, Brij Masand, Myra Spiliopoulou
PublisherSpringer Verlag
Pages21-38
Number of pages18
ISBN (Electronic)9783540678182
Publication statusPublished - 2000 Jan 1
EventInternational Workshop on Web Usage Analysis and User Profiling, WEBKDD 1999 - San Diego, United States
Duration: 1999 Aug 151999 Aug 15

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume1836
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

OtherInternational Workshop on Web Usage Analysis and User Profiling, WEBKDD 1999
CountryUnited States
CitySan Diego
Period99-08-1599-08-15

Fingerprint

Servers
Web Server
Clustering
Web Mining
Hierarchical Clustering
Clustering Methods
Proof by induction
Attribute
Experiments
Interaction
Experiment
Generalization
Hierarchy

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Fu, Y., Sandhu, K., & Shih, M-Y. (2000). A generalization-based approach to clustering of Web usage sessions. In B. Masand, B. Masand, & M. Spiliopoulou (Eds.), Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers (pp. 21-38). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1836). Springer Verlag.
Fu, Yongjian ; Sandhu, Kanwalpreet ; Shih, Ming-Yi. / A generalization-based approach to clustering of Web usage sessions. Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers. editor / Brij Masand ; Brij Masand ; Myra Spiliopoulou. Springer Verlag, 2000. pp. 21-38 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{490e959d46424c6baa02c6f495b27eee,
title = "A generalization-based approach to clustering of Web usage sessions",
abstract = "The clustering of Web usage sessions based on the access patterns is studied. Access patterns of Web users are extracted from Web server log files, and then organized into sessions which represent episodes of interaction between the Web users and the Web server. Using attribute-oriented induction, the sessions are then generalized according to a page hierarchy which organizes pages based on their contents. These generalized sessions are finally clustered using a hierarchical clustering method. Our experiments on a large real data set show that the approach is efficient and practical for Web mining applications.",
author = "Yongjian Fu and Kanwalpreet Sandhu and Ming-Yi Shih",
year = "2000",
month = "1",
day = "1",
language = "English",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "21--38",
editor = "Brij Masand and Brij Masand and Myra Spiliopoulou",
booktitle = "Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers",
address = "Germany",

}

Fu, Y, Sandhu, K & Shih, M-Y 2000, A generalization-based approach to clustering of Web usage sessions. in B Masand, B Masand & M Spiliopoulou (eds), Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 1836, Springer Verlag, pp. 21-38, International Workshop on Web Usage Analysis and User Profiling, WEBKDD 1999, San Diego, United States, 99-08-15.

A generalization-based approach to clustering of Web usage sessions. / Fu, Yongjian; Sandhu, Kanwalpreet; Shih, Ming-Yi.

Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers. ed. / Brij Masand; Brij Masand; Myra Spiliopoulou. Springer Verlag, 2000. p. 21-38 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1836).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - A generalization-based approach to clustering of Web usage sessions

AU - Fu, Yongjian

AU - Sandhu, Kanwalpreet

AU - Shih, Ming-Yi

PY - 2000/1/1

Y1 - 2000/1/1

N2 - The clustering of Web usage sessions based on the access patterns is studied. Access patterns of Web users are extracted from Web server log files, and then organized into sessions which represent episodes of interaction between the Web users and the Web server. Using attribute-oriented induction, the sessions are then generalized according to a page hierarchy which organizes pages based on their contents. These generalized sessions are finally clustered using a hierarchical clustering method. Our experiments on a large real data set show that the approach is efficient and practical for Web mining applications.

AB - The clustering of Web usage sessions based on the access patterns is studied. Access patterns of Web users are extracted from Web server log files, and then organized into sessions which represent episodes of interaction between the Web users and the Web server. Using attribute-oriented induction, the sessions are then generalized according to a page hierarchy which organizes pages based on their contents. These generalized sessions are finally clustered using a hierarchical clustering method. Our experiments on a large real data set show that the approach is efficient and practical for Web mining applications.

UR - http://www.scopus.com/inward/record.url?scp=23044519893&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=23044519893&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:23044519893

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 21

EP - 38

BT - Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers

A2 - Masand, Brij

A2 - Masand, Brij

A2 - Spiliopoulou, Myra

PB - Springer Verlag

ER -

Fu Y, Sandhu K, Shih M-Y. A generalization-based approach to clustering of Web usage sessions. In Masand B, Masand B, Spiliopoulou M, editors, Web Usage Analysis and User Profiling - International WEBKDD 1999 Workshop, Revised Papers. Springer Verlag. 2000. p. 21-38. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).