Logo image
Building text classifiers using positive and unlabeled examples
Conference proceeding

Building text classifiers using positive and unlabeled examples

B. Liu, Y. Dai, X. Li, W.S. Lee and P.S. Yu
3rd IEEE International Conference on Data Mining, pp.179-186
2003

Abstract

Biomedical engineering Computer science Iterative algorithms Labeling Niobium Performance evaluation Sun Support vector machine classification Support vector machines Text categorization
We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques.

Metrics

1 Record Views

Details

Logo image