This meeting included an introductory tutorial by Mario covering the tm package and sentiment analysis. The slides are available here.

The tutorial was structured as follows:

  • Data Reading
  • Data Structures
  • Preprocessing Pipeline (removePunctuation, tolower, removeWords, stripWhitespace, stemDocument)
  • Examples
  • Known Weaknesses and Outlook
  • Plans for SentimentAnalysis (tm.plugin.sentiment 2.0)
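
For readers who missed the session, the preprocessing steps named in the outline can be sketched roughly as follows. This is a minimal illustration and not the code from the slides: the two example sentences are made up, the English stop word list is an assumed choice for removeWords, and stemDocument needs the SnowballC package to be installed alongside tm.

```r
library(tm)  # stemDocument additionally relies on the SnowballC package

# Hypothetical example documents (not taken from the tutorial)
docs <- c("The cats are running, quickly!",
          "A dog was barking at the mailman...")

# Data reading: wrap the character vector in a corpus
corpus <- VCorpus(VectorSource(docs))

# Preprocessing pipeline, in the order listed in the outline
corpus <- tm_map(corpus, removePunctuation)
# tolower is a base R function, so it is wrapped with content_transformer
corpus <- tm_map(corpus, content_transformer(tolower))
# Assumption: drop the built-in English stop words
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)
corpus <- tm_map(corpus, stemDocument)

# Data structure: documents as rows, terms as columns
dtm <- DocumentTermMatrix(corpus)
inspect(dtm)
```

Lowercasing before removeWords matters here, because the built-in stop word list is lowercase and would otherwise miss capitalized words such as "The".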

Further talks covering advanced text mining topics in R are planned for the next meeting(s), so stay tuned…