The Power Of Localization For Efficiently Learning Linear Separators With Noise

CoRR(2014)

引用 190|浏览27
暂无评分
摘要
We introduce a new approach for designing computationally efficient and noise tolerant algorithms for learning linear separators. We consider the malicious noise model of Valiant [41, 32] and the adversarial label noise model of Kearns, Schapire, and Sellie [34]. For malicious noise, where the adversary can corrupt an I) of fraction both the label part and the feature part, we provide a polynomial-time algorithm for learning linear separators in Rd under the uniform distribution with nearly information-theoretically optimal noise tolerance of I) = Q(e), improving on the Q noise-tolerance of [31] and the Q lg(d/c) of [35]. For the adversarial label noise model, where the distribution over the feature vectors is unchanged, and the overall probability of a noisy label is constrained to be at most 7), we give a polynomial-time algorithm for learning linear separators in Rd under the uniform distribution that can also handle a noise rate of I) = Q N. This improves over the results of [31] which either required runtime super-exponential in 1/ (ours is polynomial in 1/0 or tolerated less noise. In the case that the distribution is isotropic log-concave, we present a polynomial-time algorithm for the malicious noise model that tolerates Q 1g2 (1/)) noise, and a polynomial-time algorithm for the adversarial label noise model that also handles Q log2 (1/c) noise. Both of these also improve on results from [35]. In particular, in the case of malicious noise, unlike previous results, our noise tolerance has no dependence on the dimension d of the space. Our algorithms are also efficient in the active learning setting, where learning algorithms only receive the classifications of examples when they ask for them. We show that, in this model, our algorithms achieve a label complexity whose dependence on the error parameter is polylogarithmic (and thus exponentially better than that of any passive algorithm). This provides the first polynomial time active learning algorithm for learning linear separators in the presence of malicious noise or adversarial label noise. Our algorithms and analysis combine several ingredients including aggressive localization, minimization of a progressively rescaled hinge loss, and a novel localized and soft outlier removal procedure. We use localization techniques (previously used for obtaining better sample complexity results) in order to obtain better noise tolerant polynomial-time algorithms.
更多
查看译文
关键词
Noise tolerant learning,Adversarial label noise,Malicious noise,Passive and active learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要