Analyzing Textual Information
From Words to Meanings through Numbers
- Johannes Ledolter - The University of Iowa, USA
- Lea S. VanderVelde - The University of Iowa, USA
Volume:
188
Courses:
Big Data | Intermediate/Advanced Research Methods | Quantitative Methods | Research Methods & Statistics in Sociology | Research Methods in Mass Communication | Research Methods in Political Science | Research Methods in Political Science | Research Methods in Sociology | Statistics - General Interest
Big Data | Intermediate/Advanced Research Methods | Quantitative Methods | Research Methods & Statistics in Sociology | Research Methods in Mass Communication | Research Methods in Political Science | Research Methods in Political Science | Research Methods in Sociology | Statistics - General Interest
May 2021 | 192 pages | SAGE Publications, Inc
Researchers in the social sciences and beyond are dealing more and more with massive quantities of text data requiring analysis, from historical letters to the constant stream of content in social media. Traditional texts on statistical analysis have focused on numbers, but this book will provide a practical introduction to the quantitative analysis of textual data. Using up-to-date R methods, this book will take readers through the text analysis process, from text mining and pre-processing the text to final analysis. It includes two major case studies using historical and more contemporary text data to demonstrate the practical applications of these methods. Currently, there is no introductory how-to book on textual data analysis with R that is up-to-date and applicable across the social sciences. Code and a variety of additional resources to enrich the use of this book are available on an accompanying website at: https://www.biz.uiowa.edu/faculty/jledolter/analyzing-textual-information/. These resources include data files from the 39th Congress, and also the collection of tweets of President Trump, now no longer available to researchers via Twitter itself.
Series Editor’s Introduction
Preface
Acknowledgments
About the Authors
Chapter 1: Introduction
Chapter 2: A Description of the Studied Text Corpora and A Discussion of Our Modeling Strategy
Chapter 3: Preparing Text for Analysis: Text Cleaning and Formatting
Chapter 4: Word Distributions: Document-Term Matrices of Word Frequencies and the “Bag of Words” Representation
Chapter 5: Metavariables and Text Analysis Stratified on Metavariables
Chapter 6: Sentiment Analysis
Chapter 7: Clustering of Documents
Chapter 8: Classification of Documents
Chapter 9: Modeling Text Data: Topic Models
Chapter 10: n-Grams and Other Ways of Analyzing Adjacent Words
Chapter 11: Concluding Remarks
Appendix: Listing of Website Resources
Sample Materials & Chapters
Chapter 2: A Description of the Studied Text Corpora and A Discussion of Our Mod