2006/031 - Reducing the annotation burden in text classification
- Anastasia Krithara, Cyril Goutte, Amini Massih, Jean-Michel Renders
Conference on Multidisciplinary Information Sciences and Technologies, InSciT2006, Mérida, Spain, October 25-28, 2006.
In this paper we describe a method that combines semi-supervised and active learning for classification. In particular, we propose a semi-supervised PLSA (Probabilistic Latent Semantic Analysis) algorithm combined with a certainty-based active learning method to classify text documents.
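
The abstract does not spell out the selection criterion, so the following is only a minimal sketch of one common reading of certainty-based active learning: the learner asks an annotator to label the documents whose predicted class it is least certain about. The function name `least_certain` and the posterior values are hypothetical illustrations, not the authors' implementation or data; in the paper's setting the posteriors would presumably come from the semi-supervised PLSA model.

```python
def least_certain(posteriors, k=1):
    """Return indices of the k documents with the lowest top-class probability.

    posteriors: one list of class probabilities per unlabeled document.
    Assumption: a low maximum probability means low certainty.
    """
    ranked = sorted(range(len(posteriors)), key=lambda i: max(posteriors[i]))
    return ranked[:k]


# Illustrative class posteriors for three unlabeled documents (made-up values).
posteriors = [
    [0.95, 0.05],  # confident prediction
    [0.55, 0.45],  # close to the decision boundary
    [0.70, 0.30],
]

# Index of the document to send to the annotator next.
query = least_certain(posteriors, k=1)
```

In an active learning loop, the selected documents would be labeled, added to the labeled set, and the model retrained before the next selection round.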