An iterative undersampling of extremely imbalanced data using CSVM

Jong Bum Lee, Jee Hyong Lee

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Semiconductor is a major component of electronic devices and is required very high reliability and productivity. If defective chip predict in advance, the product quality will be improved and productivity will increases by reduction of test cost. However, the performance of the classifiers about defective chips is very poor due to semiconductor data is extremely imbalance, as roughly 1:1000. In this paper, the iterative undersampling method using CSVM is employed to deal with the class imbalanced. The main idea is to select the informative majority class samples around the decision boundary determined by classify. Our experimental results are reported to demonstrate that our method outperforms the other sampling methods in regard with the accuracy of defective chip in highly imbalanced data.

Original languageEnglish
Title of host publicationSeventh International Conference on Machine Vision, ICMV 2014
EditorsBranislav Vuksanovic, Jianhong Zhou, Antanas Verikas, Petia Radeva
PublisherSPIE
ISBN (Electronic)9781628415605
DOIs
StatePublished - 2015
Event7th International Conference on Machine Vision, ICMV 2014 - Milan, Italy
Duration: 19 Nov 201421 Nov 2014

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume9445
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference7th International Conference on Machine Vision, ICMV 2014
Country/TerritoryItaly
CityMilan
Period19/11/1421/11/14

Keywords

  • Cost-sensitive Support Vector Machin
  • Heuristic undersampling method
  • imbalanced data
  • semiconductor

Fingerprint

Dive into the research topics of 'An iterative undersampling of extremely imbalanced data using CSVM'. Together they form a unique fingerprint.

Cite this