This three-part series will take a deeper dive into text analysis and natural language processing tasks using R. In this first session, we will start by formatting and inputting source texts, creating a corpus, structuring metadata, processing and preparing text for analysis, and generating descriptive statistics about a corpus. We'll spend time considering conceptual models of text, focusing on a "bag-of-words" approach. Prerequisites: basic experience using R at least at the level of our Introduction to R workshop.
Instructor: Michele Claibourn