"An Application of a New meta-Heuristic for Optimizing the Classification Accuracy When Analyzing Some Medical Datasets"

Expert Systems with Applications, Vol. 36, No. 5, pp. 9240-9249, July 2009.

Pham, H.N.A., and E. Triantaphyllou

Medical data mining has recently become one of the most popular topics in the data mining community. This is due to the societal importance of the field and also the particular computational challenges posed in this domain of data mining. However, current medical data mining approaches oftentimes use identical costs or just ignore them for the different cases of classification errors. Thus, their outcome may be unexpected. This paper applies a new meta-heuristic approach, called the Homogeneity-Based Algorithm (or HBA), for optimizing the classification accuracy when analyzing some medical datasets. The HBA first expresses the objective as an optimization problem in terms of the error rates and the associated penalty costs. These costs may be dramatically different in medical applications as the implications of having a false-positive and a false-negative case may be tremendously different. When the HBA is combined with traditional classification algorithms, it enhances their prediction accuracy. It does so by using the concept of homogenous sets. Five medical datasets, obtained from the machine learning data repository at the University of California, Irvine (UCI), USA, were tested. Some computational results indicate that the HBA, when it is combined with traditional methods, can significantly outperform current stand-alone data mining approaches.

Key Words:
Optimization, Medical Data Mining, HBA, Genetic Algorithms, Classification Errors.

Download this paper as a PDF file. (Size = 1,645 KB)

Visit Dr. Triantaphyllou's homepage.