Hybrid MPI/OpenMP Parallel Linear Support Vector Machine Training

Kristian Woodsend, Jacek Gondzio

Research output: Contribution to journalArticlepeer-review

Abstract

Support vector machines are a powerful machine learning technology, but the training process involves a dense quadratic optimization problem and is computationally challenging. A parallel implementation of linear Support Vector Machine training has been developed, using a combination of MPI and OpenMP. Using an interior point method for the optimization and a reformulation that avoids the dense Hessian matrix, the structure of the augmented system matrix is exploited to partition data and computations amongst parallel processors efficiently. The new implementation has been applied to solve problems from the PASCAL Challenge on Large-scale Learning. We show that our approach is competitive, and is able to solve problems in the Challenge many times faster than other parallel approaches. We also demonstrate that the hybrid version performs more efficiently than the version using pure MPI.

Original languageEnglish
Pages (from-to)1937-1953
Number of pages17
JournalJournal of Machine Learning Research
Volume10
DOIs
Publication statusPublished - Aug 2009

Keywords

  • linear SVM training
  • hybrid parallelism
  • largescale learning
  • interior point method
  • INTERIOR-POINT SOLVER
  • QUADRATIC PROGRAMS
  • PERFORMANCE
  • BLAS

Fingerprint

Dive into the research topics of 'Hybrid MPI/OpenMP Parallel Linear Support Vector Machine Training'. Together they form a unique fingerprint.

Cite this