Personal homepage: http://faculty.nuaa.edu.cn/huangsj/zh_CN/index.htm
Affiliation: College of Computer Science and Technology / College of Artificial Intelligence / College of Software
Published in: Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min.
Abstract: Missing features are a serious problem in many applications: they lower the quality of training data and can significantly degrade learning performance. Because feature acquisition usually involves special devices or complex processes, it is expensive to acquire all feature values for the whole dataset. On the other hand, features may be correlated with each other, so some values can be recovered from the others. It is thus important to decide which features are most informative for recovering the other features as well as for improving the learning performance. In this paper, we try to train an effective classification model with the least acquisition cost by jointly performing active feature querying and supervised matrix completion. When completing the feature matrix, a novel objective function is proposed to simultaneously minimize the reconstruction error on observed entries and the supervised loss on training data. When querying a feature value, the most uncertain entry is actively selected based on the variance across previous iterations. In addition, a bi-objective optimization method is presented for cost-aware active selection when features bear different acquisition costs. The effectiveness of the proposed approach is validated by both theoretical analysis and experimental study. © 2018 Association for Computing Machinery.
Translated work: No
Publication date: 2018-07-19
Co-authors: Sugiyama, Masashi; Xu, Miao; Niu, Gang; Xie, Ming-Kun; 陈松灿 (Chen, Songcan)
Corresponding authors: 黄圣君 (Huang, Sheng-Jun); Sugiyama, Masashi; Xu, Miao
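
Illustrative sketch (not from the paper): the abstract states that the most uncertain entry is selected based on the variance of previous iterations. One possible reading is to keep the completed feature matrices produced in earlier iterations and query the still-unobserved entry whose imputed value varied most. The function name `select_query`, its arguments, and the use of plain per-entry variance are assumptions made for illustration only; the paper's actual criterion and its cost-aware bi-objective variant may differ.

```python
import numpy as np


def select_query(completions, observed_mask):
    """Return (row, col) of the unobserved entry with the highest variance.

    completions   : list of (n, d) arrays, the completed feature matrix
                    from each previous iteration (hypothetical input).
    observed_mask : (n, d) boolean array, True where the true feature
                    value has already been acquired.
    """
    observed_mask = np.asarray(observed_mask, dtype=bool)
    if observed_mask.all():
        raise ValueError("no unobserved entries left to query")

    # Per-entry variance of the imputed values across iterations.
    stack = np.stack(completions)            # shape (T, n, d)
    variance = stack.var(axis=0)             # shape (n, d)

    # Already-observed entries must never be selected.
    variance = np.where(observed_mask, -np.inf, variance)

    row, col = np.unravel_index(int(np.argmax(variance)), variance.shape)
    return int(row), int(col)


# Usage: two earlier completions of a 2 x 3 feature matrix.
history = [
    np.array([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]),
    np.array([[1.0, 2.5, 3.0], [4.0, 5.0, 9.0]]),
]
mask = np.array([[True, False, True], [True, True, False]])
print(select_query(history, mask))
```

In this toy input the two unobserved entries are (0, 1) and (1, 2); the second one changed more between iterations, so it is the one proposed for acquisition.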