个人信息
王东生
学位:理学博士学位

个人信息 Personal information

学历:南京大学 所在单位:理学院 电子邮箱:

Robust Class-Specific Autoencoder for Data Cleaning and Classification in the Presence of Label Noise

点击次数: 所属单位:计算机科学与技术学院/人工智能学院/软件学院 发表刊物:NEURAL PROCESSING LETTERS 关键字:Class-specific autoencoder Label noise Classification Data cleaning Outliers 摘要:We present a simple but effective method for data cleaning and classification in the presence of label noise. The fundamental idea is to treat the data points with label noise as outliers of the class indicated by the corresponding noisy label. This essentially allows us to deal with the traditional supervised problem of classification with label noise as an unsupervised one, i.e., identifying outliers from each class. However, finding such dubious observations (outliers) from each class is challenging in general. We therefore propose to reduce their potential influence using class-specific feature learning by autoencoder. Particularly, we learn for each class a feature space using all the samples labeled as that class, including those with noisy (but unknown to us) labels. Furthermore, in order to solve the situation when the noise is relatively high, we propose a weighted class-specific autoencoder by considering the effect of each data point on the postulated model. To fully exploit the advantage of the learned class-specific feature space, we use a minimum reconstruction error based method for finding out the outliers (label noise) and solving the classification task. Experiments on several datasets show that the proposed method achieves state of the art performance on the task of data cleaning and classification with noisy labels. ISSN号:1370-4621 是否译文: 发表时间:2019-10-01 合写作者:Zhang, Weining,Wang, Dong,谭晓阳 通讯作者:谭晓阳,王东生