Optimization Online


Estimating L1-Norm Best-Fit Lines for Data

J.P. Brooks(jpbrooks***at***vcu.edu)
J.H. Dula(jdula***at***vcu.edu)

Abstract: The general formulation for finding the L1-norm best-fit subspace for a point set in $m$-dimensions is a nonlinear, nonconvex, nonsmooth optimization problem. In this paper we present a procedure to estimate the L1-norm best-fit one-dimensional subspace (a line through the origin) to data in $\Re^m$ based on an optimization criterion involving linear programming but which can be performed using simple ratios and sortings. The procedure has distinct advantages in that it does not depend on any initializations, is deterministic and replicable, and is scalable. The estimated line is sharp in that it satisfies a well-defined optimization criterion, and is often tight, meaning that it is sometimes a globally optimal solution. We show how the method can be extended to a procedure for an L1-norm principal component analysis by iteratively approximating higher-order best-fit subspaces. In a comprehensive computational study involving synthetic and real data, the procedure is shown to be more robust to outlier observations than competing approaches.

Keywords: analytics, L1-norm, line location, principal component analysis

Category 1: Applications -- Science and Engineering (Data-Mining )

Category 2: Applications -- OR and Management Sciences

Category 3: Applications -- Science and Engineering (Statistics )

Citation: Virginia Commonwealth University, June, 2017

Download: [PDF]

Entry Submitted: 06/07/2017
Entry Accepted: 06/07/2017
Entry Last Modified: 06/07/2017

Modify/Update this entry

  Visitors Authors More about us Links
  Subscribe, Unsubscribe
Digest Archive
Search, Browse the Repository


Coordinator's Board
Classification Scheme
Give us feedback
Optimization Journals, Sites, Societies
Mathematical Optimization Society