Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Text Analysis with R for Students of Literature - Matthew L. Jockers

Text Analysis with R for Students of Literature

Buch | Softcover
XVI, 194 Seiten
2016 | 1. Softcover reprint of the original 1st ed. 2014
Springer International Publishing (Verlag)
978-3-319-34919-0 (ISBN)
CHF 89,85 inkl. MwSt
This practical introduction explores core R procedures and processes and offers a thorough understanding of the possibilities of computational text analysis at both micro and macro scales. Each chapter concludes with a set of practice exercises.
Text Analysis with R for Students of Literature is written with students and scholars of literature in mind but will be applicable to other humanists and social scientists wishing to extend their methodological tool kit to include quantitative and computational approaches to the study of text. Computation provides access to information in text that we simply cannot gather using traditional qualitative methods of close reading and human synthesis. Text Analysis with R for Students of Literature provides a practical introduction to computational text analysis using the open source programming language R. R is extremely popular throughout the sciences and because of its accessibility, R is now used increasingly in other research areas. Readers begin working with text right away and each chapter works through a new technique or process such that readers gain a broad exposure to core R procedures and a basic understanding of the possibilities of computational text analysis atboth the micro and macro scale. Each chapter builds on the previous as readers move from small scale "microanalysis" of single texts to large scale "macroanalysis" of text corpora, and each chapter concludes with a set of practice exercises that reinforce and expand upon the chapter lessons. The book's focus is on making the technical palatable and making the technical useful and immediately gratifying.

The author, Matthew L. Jockers, is Associate Professor of English and Director of the Nebraska Literary Lab at the University of Nebraska in Lincoln. Jockers's text mining research has been featured in the New York Times, Nature, the Chronicle of Higher Education, Wired, New Scientist, Smithsonian, NBC News and many others. Jockers blogs about his research at www.matthewjockers.net.

R Basics.- First Foray into Text Analysis with R.- Accessing and Comparing Word Frequency Data.- Token Distribution Analysis.- Correlation.- Measures of Lexical Variety.- Hapax Richness.- Do It KWIC.- Do It KWIC (Better).- Text Quality, Text Variety, and Parsing XML.- Clustering.- Classification.- Topic Modeling.- Appendix A: Variable Scope Example.- Appendix B: The LDA Buffet.- Appendix C: Code Repository.- Appendix D: R Resources.- Practice Exercise Solutions.- Index.

"The aim of this book is ... to give the Literature students just the most basic tools needed to do some relatively straightforward textual analysis. ... Even though this is primarily a book intended for literature students, I would actually strongly recommend it to anyone interested in text mining, text analysis and natural language processing. It is a very gentle and approachable introduction to the whole world of textual analysis." (Bojan Tunguz, tunguzreview.com, July, 2015)

"This is a well written book on the topic of Text Analysis. There is enough information to give you a good start using R. Followed by easy to understand details about text analysis. ... This is a good book to have if you are doing text analysis." (Mary Anne, Cats and Dogs with Data, maryannedata.com, August, 2014)

"A remarkably well-crafted book that will allow students to get a quick start and progress toward quite sophisticated text mining tasks. ... exercises provided at the end of each chapter, withsolutions at the end of the book, should serve well to help students solidify their knowledge and gain more confidence in their text mining skills. ... a great addition to the libraries of digital humanists and natural language enthusiasts who wish to expand their programming literacy ... ." (Denilson Barbosa, Computing Reviews, August, 2014)

Erscheinungsdatum
Reihe/Serie Quantitative Methods in the Humanities and Social Sciences
Zusatzinfo XVI, 194 p. 40 illus., 10 illus. in color.
Verlagsort Cham
Sprache englisch
Maße 155 x 235 mm
Themenwelt Geisteswissenschaften Sprach- / Literaturwissenschaft Sprachwissenschaft
Mathematik / Informatik Mathematik Computerprogramme / Computeralgebra
Mathematik / Informatik Mathematik Wahrscheinlichkeit / Kombinatorik
Sozialwissenschaften Soziologie Empirische Sozialforschung
Schlagworte Computational Linguistics • Computational Literary Studies • Corpus Linguistics and R • digital humanities • Linguistic Computing • Mathematical and statistical software • mathematics and statistics • Programming and Literature • R • Social research and statistics • Statistics and Computing/Statistics Programs • Statistics for Social Science, Behavorial Science, • text analysis • text classification • Text Clustering • Text Mining
ISBN-10 3-319-34919-8 / 3319349198
ISBN-13 978-3-319-34919-0 / 9783319349190
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich