Text Analysis with R for Students of Literature

Matthew L. Jockers (Author)

Book | Softcover

XVI, 194 pages

2016 | Softcover reprint of the original 1st ed. 2014
Springer International Publishing (Publisher)
9783319349190 (ISBN)

This practical introduction explores core R procedures and processes and offers a thorough understanding of the possibilities of computational text analysis at both micro and macro scales. Each chapter concludes with a set of practice exercises.

Text Analysis with R for Students of Literature is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological tool kit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that we simply cannot gather using traditional qualitative methods of close reading and human synthesis. Text Analysis with R for Students of Literature provides a practical introduction to computational text analysis using the open-source programming language R. R is extremely popular throughout the sciences and, because of its accessibility, is now used increasingly in other research areas. Readers begin working with text right away, and each chapter works through a new technique or process so that readers gain broad exposure to core R procedures and a basic understanding of the possibilities of computational text analysis at both the micro and macro scale. Each chapter builds on the previous one as readers move from small-scale "microanalysis" of single texts to large-scale "macroanalysis" of text corpora, and each chapter concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book's focus is on making the technical palatable, useful, and immediately gratifying.

The author, Matthew L. Jockers, is Associate Professor of English and Director of the Nebraska Literary Lab at the University of Nebraska in Lincoln. Jockers's text mining research has been featured in the New York Times, Nature, the Chronicle of Higher Education, Wired, New Scientist, Smithsonian, NBC News and many others. Jockers blogs about his research at www.matthewjockers.net.

R Basics.- First Foray into Text Analysis with R.- Accessing and Comparing Word Frequency Data.- Token Distribution Analysis.- Correlation.- Measures of Lexical Variety.- Hapax Richness.- Do It KWIC.- Do It KWIC (Better).- Text Quality, Text Variety, and Parsing XML.- Clustering.- Classification.- Topic Modeling.- Appendix A: Variable Scope Example.- Appendix B: The LDA Buffet.- Appendix C: Code Repository.- Appendix D: R Resources.- Practice Exercise Solutions.- Index.

"The aim of this book is ... to give the Literature students just the most basic tools needed to do some relatively straightforward textual analysis. ... Even though this is primarily a book intended for literature students, I would actually strongly recommend it to anyone interested in text mining, text analysis and natural language processing. It is a very gentle and approachable introduction to the whole world of textual analysis." (Bojan Tunguz, tunguzreview.com, July, 2015)

"This is a well written book on the topic of Text Analysis. There is enough information to give you a good start using R. Followed by easy to understand details about text analysis. ... This is a good book to have if you are doing text analysis." (Mary Anne, Cats and Dogs with Data, maryannedata.com, August, 2014)

"A remarkably well-crafted book that will allow students to get a quick start and progress toward quite sophisticated text mining tasks. ... exercises provided at the end of each chapter, with solutions at the end of the book, should serve well to help students solidify their knowledge and gain more confidence in their text mining skills. ... a great addition to the libraries of digital humanists and natural language enthusiasts who wish to expand their programming literacy ... ." (Denilson Barbosa, Computing Reviews, August, 2014)

Publication date	16 September 2016
Series	Quantitative Methods in the Humanities and Social Sciences
Additional info	XVI, 194 p. 40 illus., 10 illus. in color.
Place of publication	Cham
Language	English
Dimensions	155 x 235 mm
Subject areas	Humanities ► Linguistics / Literary Studies ► Linguistics
	Mathematics / Computer Science ► Mathematics ► Computer Programs / Computer Algebra
	Mathematics / Computer Science ► Mathematics ► Probability / Combinatorics
	Social Sciences ► Sociology ► Empirical Social Research
Keywords	Computational Linguistics • Computational Literary Studies • Corpus Linguistics and R • digital humanities • Linguistic Computing • Mathematical and statistical software • mathematics and statistics • Programming and Literature • R • Social research and statistics • Statistics and Computing/Statistics Programs • Statistics for Social Science, Behavioral Science, • text analysis • text classification • Text Clustering • Text Mining
ISBN-13	9783319349190
Condition	New