中文核心期刊
CSCD来源期刊
中国科技核心期刊
RCCSE中国核心学术期刊

Journal of Chongqing Jiaotong University(Natural Science) ›› 2014, Vol. 33 ›› Issue (2): 154-158.DOI: 10.3969/j.issn.1674-0696.2014.02.35

Previous Articles    

Reduction and Improvement Strategy of Training Sample Set for SVM Based on Class-Center

Pang Shouyan1,2,Chen Song1,Wei Jianmeng1,Zhang Yuansheng1   

  1. 1. School of Information Science & Engineering,Chongqing Jiaotong University,Chongqing 400074,China; 2. Chongqing University of Education,Chongqing 400065,China
  • Received:2012-09-14 Revised:2012-12-03 Online:2014-04-15 Published:2015-01-22

基于类中心的SVM 训练样本集缩减改进策略

庞首颜1,2,陈松1,魏建猛1,张元胜1   

  1. 1. 重庆交通大学信息科学与工程学院,重庆 400074; 2. 重庆第二师范学院,重庆 400065
  • 作者简介:庞首颜( 1987—) ,女,广西玉林人,硕士研究生,主要从事图形图像处理方面的研究。E-mail: PSY1239@126.com。

Abstract: SVM has practical applications in many fields for its superior performance at present. However,it has become a bottleneck to use SVM due to the problems including slow learning speed,large save requirement,and low generalization performance. Then an improvement reduction strategy was proposed based on the class-center,which significantly cut the amount of training samples and improved the training speed by determining the boundary training sample set. In addition,for the question that in the nonlinear space the center of the feature space class can not be directly obtained by calculating,an alternative strategy was proposed. In the alternative strategy,the sample spots that could generate the smallest hypersphere in the feature space which approximately substituted the center of feature samples were found. It improves the learning speed without reducing the classification accuracy.

Key words: information technology, Support Vector Machine( SVM) , class-center, boundary sample, alternative strategy

摘要: 针对SVM 训练样本集规模较大引发的学习速度慢、存储需求量大、泛化能力降低等问题,通过改进的样本点 到类中心的方法来确定边界样本,从而大量缩减训练样本,提高训练速度。此外,针对非线性空间无法直接通过计 算得到特征空间类中心的问题,提出了一种通过在特征空间中,寻找能生成最小超球的样本点来近似代替特征样本 的替代策略,使得在保证分类精度的同时,提高了训练速度。

关键词: 信息技术, SVM, 类中心, 边界样本, 替代策略

CLC Number: