文档库 最新最全的文档下载
当前位置:文档库 › 结合切空间及特征空间校准的增量流形学习正则优化算法

结合切空间及特征空间校准的增量流形学习正则优化算法

ISSN1004‐9037,CODEN SCYCE4

Journal of Data Acquisition and Processing Vol.32,No.6,Nov.2017,pp.1141-1152DOI:10.16337/j.1004‐9037.2017.06.009

眗2017by Journal of Data Acquisition and Processing

http://sjcj.nuaa.edu.cn E‐mail:sjcj@nuaa.edu.cn Tel/Fax:+86‐025‐84892742

结合切空间及特征空间校准的增量流形学习正则优化算法

谈超1,2吉根林2赵斌2

(1.东南大学计算机科学与工程学院,南京,211189;2.南京师范大学计算机科学与技术学院,南京,210023)

摘要:高维流式大数据的产生与发展对传统机器学习和数据挖掘算法提出了诸多挑战。本文结合流式大数据流式到达的特性,首先建立自适应增量特征提取算法模型。然后,针对噪声环境,建立基于特征空间校准的增量流形学习算法模型,解决小样本问题。最后,构造流形学习的正则化优化框架,解决高维数据流特征提取过程中产生的降维误差问题,并得到最终的最优解。实验结果表明本文提出的算法框架符合流形学习算法的3个评价指标:稳定性、提高性以及学习曲线能迅速增加到一个相对稳定的水平;从而实现了高维数据流的高效学习。

关键词:高维流式大数据;自适应增量特征提取;特征空间校准;正则化优化

中图分类号:T P181文献标志码:A

Incremental Manifold Learning Regular Optimization Algorithm on Tangent Space and Feature Space Alignment

Tan Chao1,2,Ji Genlin2,Zhao Bin2

(1.School of Computer Science and Engineering,Southeast University,Nanjing,211189,China;2.School of Computer Science and Technology,Nanjing Normal University,Nanjing,210023,China)

Abstract:The emergence and development of high dimensional big data streams have presented a great challenge to the traditional machine learning and data mining algorithms.Based on the characteristics of data flow,first we construct an adaptive incremental feature extraction algorithm model.Then,accord‐ing to the environment with noise,we establish an incremental manifold learning algorithm model based on feature space alignment to solve the small size sample problem.Finally,the regularization optimiza‐tion framework of manifold learning is constructed to solve the problem of dimensionality reduction errors of high‐dimensional data flow in feature extraction process,and then the optimal solutions are obtained.Experimental results show that the proposed algorithm framework conforms to the three evaluation crite‐rions of manifold learning algorithm:Stability,enhancement,and the learning curve can rapidly increase to a relative stable level.Thus the efficient learning of high‐dimensional data streams can be realized.Key words:high dimensional big data streams;adaptive incremental feature extraction;feature space a‐lignment;regularization optimization

基金项目:国家自然科学基金(41471371,61702270)资助项目;江苏省高校自然科学基金(15KJB520022)资助项目;中国博士后科学基金(2017M621592)资助项目。

收稿日期:2017‐06‐05;修订日期:2017‐06‐30

万方数据

相关文档