Cluster validity functions for categorical data: a solution-space perspective
Data Mining and Knowledge Discovery, Volume 29, Issue 6, 2014.
Cluster analysisCluster validity functionGeneralizationEffectivenessNormalization
For categorical data, there are three widely-used internal validity functions: the $k$k-modes objective function, the category utility function and the information entropy function, which are defined based on within-cluster information only. Many clustering algorithms have been developed to use them as objective functions and find their...More
Full Text (Upload PDF)