Speech and Language Processing (3rd ed. draft)
Dan Jurafsky and James H. Martin

This is just an informal pre-release (dated Jan 5, 2024) of our 2024 release. The final draft of the 2024 release won't come til probably February, with more details.

Individual chapters are below; here is a single pdf of all the chapters in the Jan 5, 2024 pre-relase draft of the book so far!

As always, typos and comments very welcome: please email slp3edbugs@gmail.com

When will the whole book be finished? Don't ask.

 
Part I: Fundamental Algorithms
1:Introduction
2: Regular Expressions, Text Normalization, Edit Distance
3: N-gram Language Models
4: Naive Bayes, Text Classification, and Sentiment
5: Logistic Regression
6: Vector Semantics and Embeddings
7: Neural Networks and Neural Language Models
8: Sequence Labeling for Parts of Speech and Named Entities
9: RNNs and LSTMs
10: Transformers and Large Language Models
11: Fine-tuning and Masked Language Models
12: Prompting, In-Context Learning, and Instruct Tuning
 
Part II: NLP Applications
13: Machine Translation
14: Question Answering and Information Retrieval
15: Chatbots and Dialogue Systems
16: Automatic Speech Recognition and Text-to-Speech
 
Part III: Annotating Linguistic Structure
17: Context-Free Grammars and Constituency Parsing
18: Dependency Parsing
19: Information Extraction: Relations, Events, and Time
20: Semantic Role Labeling and Argument Structure
21: Lexicons for Sentiment, Affect, and Connotation
22: Coreference Resolution
23: Discourse Coherence
 
Appendix Chapters (will be just on the web)
A: Hidden Markov Models
B: Spelling Correction and the Noisy Channel
C: Statistical Constituency Parsing
D: Context-Free Grammars
E: Combinatory Categorial Grammar
F: Logical Representations of Sentence Meaning
G: Word Senses and WordNet
H: Phonetics