Computer Science and Information Systems
The international journal published by ComSIS Consortium 

Improving Categorical Data Clustering Algorithm by Weighting Uncommon Attribute Value Matches

 

UDC 004.421.2

Zengyou He1, Xiaofei Xu1, Shenchun Deng1


1 Department of Computer Science and Engineering,
Harbin Institute of Technology,
92 West Dazhi Street, P.O Box 315, China, 150001
zengyouhe@yahoo.com, {xiaofei,dsc} @hit.edu.cn

Abstract. This paper presents an improved Squeezer algorithm for categorical data clustering by giving greater weight to uncommon attribute value matches in similarity computations. Experimental results on real life datasets show that, the modified algorithm is superior to the original Squeezer algorithm and other clustering algorithm with respect to clustering accuracy.

Volume 03 , Issue 01 (June 2006) table of contents
Year of Publication: 2006
ISSN:
Publisher ComSIS Consortium
Full text available: Pdf
 
Home 
ComSIS Consortium
Aims and Scope 
Editorial Board
Editorial Council
Managing Board
Information for Contributors
Copyright Transfer Form
Current Issue
Archive
Forthcoming Articles
Subscription
Contact Info