Optimization Online


A Mixed-Integer Programming Approach to Multi-Class Data Classification Problem

Fadime Uney (funey***at***ku.edu.tr)
Metin Turkay (mturkay***at***ku.edu.tr)

Abstract: This paper presents a new data classification method based on mixed-integer programming. Traditional approaches based on partitioning a data set into two groups perform poorly on multi-class data classification problems. The proposed approach uses hyper-boxes to define the boundaries of the classes, where each hyper-box includes all or some of the points in its class. A mixed-integer programming model is developed to represent the existence of the hyper-boxes and their boundaries. In addition, the relationships among the discrete decisions in the model are expressed in propositional logic and then converted to equivalent integer constraints using Boolean algebra. The approach is illustrated on an example problem, and its efficiency is tested on the well-known IRIS data set. The computational results on the illustrative example and the IRIS data set show that the proposed method is accurate and efficient on multi-class data classification problems.
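
The hyper-box idea in the abstract can be illustrated with a minimal sketch. It assumes the box bounds have already been determined (in the paper, by the mixed-integer programming model, which is not reproduced here) and only shows how fixed boxes would label a new point. The names `HyperBox` and `classify`, and the box coordinates, are hypothetical and chosen for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class HyperBox:
    """Axis-aligned box for one class (hypothetical helper, not from the paper)."""
    label: str
    lower: Sequence[float]  # lower bound on each attribute
    upper: Sequence[float]  # upper bound on each attribute

    def contains(self, point: Sequence[float]) -> bool:
        # A point lies in the box if every attribute is within its bounds.
        return all(lo <= x <= hi
                   for lo, x, hi in zip(self.lower, point, self.upper))


def classify(point: Sequence[float],
             boxes: Sequence[HyperBox]) -> Optional[str]:
    """Return the label of the first box enclosing the point, or None."""
    for box in boxes:
        if box.contains(point):
            return box.label
    return None


# Made-up two-attribute boxes for two classes (illustrative values only).
boxes = [
    HyperBox("A", (0.0, 0.0), (1.0, 1.0)),
    HyperBox("B", (2.0, 0.0), (3.0, 1.5)),
]
```

A point that falls outside every box is left unlabeled (`None`) in this sketch; how such points are assigned in the actual method is not stated in the abstract.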

Keywords: Data Mining, Data Classification, Mixed-Integer Programming, Boolean Algebra

Category 1: Applications -- Science and Engineering (Data-Mining)

Category 2: Integer Programming ((Mixed) Integer Linear Programming)

Citation: College of Engineering, Koç University, Rumelifeneri Yolu, 34450, Sariyer, Istanbul, Turkey, November 2004

Entry Submitted: 11/11/2004
Entry Accepted: 11/11/2004
Entry Last Modified: 11/11/2004
