Linguistic Databases explains the increasing use of databases in linguistics. The enormous potential in linguistic data—billions of utterances and messages daily—has been difficult to exploit. Data must be archived and organized. Many linguists have had to concentrate on introspective data with its inevitable blinders toward frequency, variation, and naturalness. Applications of linguistics have been handicapped. This volume explores the potential advantages of database applications to linguistics.
Databases not only store large amounts of data, but also impose an organization in data, which facilitates access for researchers and applications developers. Linguistics Databases reports on database activities in phonetics, phonology, lexicography and syntax, comparative grammar, second-language acquisition, linguistic fieldwork and language pathology. The book presents the specialized problems of multi-media (especially audio) and multilingual texts, including those in exotic writing systems. Implemented solutions are discussed. The opportunities to use existing, minimally structured text repositories are presented.
is Professor of Computational Linguistics and Chair of Humanities Computing Groningen.
- 1 Introduction
- 2 TSNLP — Test Suites for Natural Language
- 3 From Annotated Corpora to Databases: the SgmlQL Language
- 4 An Markup of a Test Suite with SGML
- 5 An Open Systems Approach for an Acoustic-Phonetic Continuous Speech Database: The S_Tools Database-Management System (STDBMS)
- 6 The Reading Database of Syllable Structure
- 7 A Database Application for the Generation of Phonetic Atas Maps
- 8 Swiss French PolyPhone and PolyVar: Telephone Speech Databases to Model Inter- and Intra-speaker Variability
- 9 Investigating Arguemnt Structure: The Russian Nominalization Database
- 10 The Use of a Psycholinguistic Database in the Simplification of Text for Aphasic Readers
- 11 The Computer Learner Corpus: A Testbed for Electronic EFL Tools
- 12 Linking WordNet to a Corpus Querey System
- 13 Mulitilingual Data Processing in the CELLAR Environment
- Name Index
- Subject Index