个人信息
王东生
学位:理学博士学位

个人信息 Personal information

学历:南京大学 所在单位:理学院 电子邮箱:

Data cleaning and classification in the presence of label noise with class-specific autoencoder

点击次数: 所属单位:计算机科学与技术学院/人工智能学院/软件学院 发表刊物:Lect. Notes Comput. Sci. 摘要:We present a simple but effective method for data cleaning and classification in the presence of label noise. The key idea is to treat the data points with label noise as outliers of the class indicated by the corresponding noisy label. However, finding such dubious observations is challenging in general. We therefore propose to reduce their potential influence using feature learning method by class-specific autoencoder. Particularly, we learn for each class a feature space using all the samples labeled as that class, including those with noisy labels. Furthermore, in the case of high label noise, we propose a weighted class-specific autoencoder by considering the effect of each data point. To fully exploit the advantage of the learned feature space, we use a minimum reconstruction error based method for testing. Experiments on several datasets show that the proposed method achieves state-of-the-art performance on the related tasks with noisy labels. © Springer International Publishing AG, part of Springer Nature 2018. ISSN号:0302-9743 是否译文: 发表时间:2018-01-01 合写作者:Zhang, Weining,谭晓阳 通讯作者:Zhang, Weining,王东生