Corpus Linguistics and Linguistically Annotated Corpora / Sandra Kübler and Heike Zinsmeister.

Kübler, Sandra
  • London, UK ; New York, NY : Bloomsbury, [2015]
  • ©2015
viii, 312 pages ; 24 cm


Summary note
"Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading, together with an integral companion website that contains lists and guidance on contemporary annotated corpora and query tools"-- Provided by publisher.
Bibliographic references
Includes bibliographical references and index.
  • Machine generated contents note:
  • Preface
  • Part I Introduction: 1. Corpus Linguistics; 2. Corpora and Linguistic Annotation
  • Part II Linguistic Annotation: 3. Linguistic Annotation on the Word Level; 4. Syntactic Annotation; 5. Semantic Annotation; 6. Discourse Annotation
  • Part III Using Linguistic Annotation in Corpus Linguistics: 7. Advantages and Limitations of Using Linguistically Annotated Corpora; 8. Corpus Linguistics Using Linguistically Annotated Corpora
  • Part IV Querying Linguistically Annotated Corpora: 9. Concordances; 10. Regular Expressions; 11. Searching on the Word Level; 12. Querying Syntactic Structures; 13. Searching for Semantic and Discourse Phenomena
  • Appendix A. Penn Treebank POS Tagset
  • Appendix B. ICE POS Tagset
  • Bibliography
  • Index
  • 9781441164476 (hardback)
  • 1441164472
  • 9781441116758 (paperback)
  • 1441116753