Introduction to Text Mining


  • Use unnest_tokens to tokenize the text in the format of a data frame
  • Use anti_join to exclude stop words from the text data set
  • Use count to calculate the word frequencies

Sentiment Analysis


  • Define sentiment analysis
  • Use sentiments dataset
  • Use general-purpose lexicons
  • Analyze sentiment in text

Using RMarkdown


  • Use .md files for episodes when you want static content
  • Use .Rmd files for episodes when you need to generate output
  • Run sandpaper::check_lesson() to identify any issues with your lesson
  • Run sandpaper::build_lesson() to preview your lesson locally