High-reproducibility and high-accuracy method for automated topic classification
Physical Review X, 011007 (2015).
Much of human knowledge sits in large databases of unstructured text. Leveraging this knowledge requires algorithms that extract and record metadata on unstructured text documents. Assigning topics to documents will enable intelligent search, statistical characterization, and meaningful classification. Latent Dirichlet allocation (LDA) ...