An overview of Microsoft web N-gram corpus and applications
NAACL (Demos), pp. 45-48, 2010.
EI
Keywords:
Abstract:
This document describes the properties and some applications of the Microsoft Web N-gram corpus. The corpus is designed to have the following characteristics. First, in contrast to static data distribution of previous corpus releases, this N-gram corpus is made publicly available as an XML Web Service so that it can be updated as deemed n...More
Code:
Data:
Tags
Comments