N-Grams for Translation and Retrieval in CL-SDR
CLEF, pp. 658-663, 2003.
We report on a first attempt to perform cross-language spoken document retrieval. Without prior monolingual speech retrieval experience we applied the same general approach we use for bilingual retrieval that is typified by the use of overlapping character n-grams for tokenization and a statistical language model of retrieval. An inno...More
Full Text (Upload PDF)
PPT (Upload PPT)