Optimization Online


Clustering Categories in Support Vector Machines

Emilio Carrizosa (ecarrizosa***at***us.es)
Amaya Nogales-Gómez (amayanogales***at***us.es)
Dolores Romero Morales (dolores.romero-morales***at***sbs.ox.ac.uk)

Abstract: Support Vector Machines (SVM) are the state of the art in supervised classification. This paper proposes the Cluster Support Vector Machines (CLSVM) methodology, which aims to reduce the complexity of the SVM classifier in the presence of categorical features. The CLSVM methodology lets categories cluster around their peers and builds an SVM classifier on the clustered dataset. Four strategies for building the CLSVM classifier are presented, based on solving the original SVM formulation, a Quadratically Constrained Quadratic Programming formulation, a Mixed Integer Quadratic Programming formulation, and the continuous relaxation of the latter. The computational study illustrates the performance of the CLSVM classifier using two clusters. On the tested datasets, the methodology achieves accuracy comparable to that of the SVM trained on the original data, with a dramatic decrease in complexity.
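To make the idea of clustering categories concrete, here is a minimal sketch in plain Python. It is not the authors' implementation and does not reproduce the QCQP or MIQP formulations. It assumes, purely for illustration, that categories are split into two clusters by the sign of their weight in a linear SVM trained on one-hot data, and that the SVM is fitted by simple subgradient descent. The function names (one_hot, train_linear_svm, cluster_categories) and the toy records are hypothetical.

```python
def one_hot(value, categories):
    """Encode one categorical value as a 0/1 vector over `categories`."""
    return [1.0 if value == c else 0.0 for c in categories]

def train_linear_svm(X, y, C=1.0, epochs=200, lr=0.01):
    """Subgradient descent on 0.5*||w||^2 + C * sum of hinge losses."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            # Regularizer step: shrink w slightly on every sample.
            w = [wj * (1.0 - lr / n) for wj in w]
            # Hinge step: only applied when the margin is violated.
            if margin < 1.0:
                w = [wj + lr * C * yi * xj for wj, xj in zip(w, xi)]
                b += lr * C * yi
    return w, b

def cluster_categories(categories, w):
    """Illustrative assumption: cluster 1 if the weight is positive, else 0."""
    return {c: (1 if wk > 0 else 0) for c, wk in zip(categories, w)}

# Toy data (made up): one categorical feature with six categories.
records = [("a", 1), ("d", -1), ("b", 1), ("e", -1), ("c", 1), ("f", -1)] * 5
categories = sorted({value for value, _ in records})
y = [label for _, label in records]

# Step 1: SVM on the original data, one weight per category.
X_full = [one_hot(value, categories) for value, _ in records]
w_full, b_full = train_linear_svm(X_full, y)

# Step 2: group the categories into two clusters.
clusters = cluster_categories(categories, w_full)

# Step 3: SVM on the clustered data, one weight for the whole feature.
X_clustered = [[float(clusters[value])] for value, _ in records]
w_clustered, b_clustered = train_linear_svm(X_clustered, y)

print(clusters)
print(len(w_full), "weights before clustering,", len(w_clustered), "after")
```

In this toy setting the classifier goes from one weight per category to a single weight for the clustered feature, which is the kind of complexity reduction the abstract refers to.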

Keywords: Support vector machines, Categorical features, Classifier complexity, Clustering, Quadratically constrained programming, 0-1 programming

Category 1: Applications -- OR and Management Sciences



Entry Submitted: 06/23/2014
Entry Accepted: 06/23/2014
Entry Last Modified: 06/23/2014
