I Preliminaries 1
1 Introduction 3
2 Mathematical Foundations 39
3 Linguistic Essentials 81
4 Corpus-Based Work 117
II Words 149
5 Collocations 151
6 Statistical Inference: n-gram Models over Sparse Data 191
7 Word Sense Disambiguation 229
8 Lexical Acquisition 265
III Grammar 315
9 Markov Models 317
10 Part-of-Speech Tagging 341
11 Probabilistic Context Free Grammars
12 Probabilistic Parsing 407
381
I v Applications and Techniques 461
13 Statistical Alignment and Machine Translation
14 Clustering 495
15 Topics in Information Retrieval 529
16 Text Categorization 575
1