Here’s a list of a few tools I put together for my work (all written in R):

  • TATE
    • TATE (Text Analysis Tools for English) is an R package that wraps a few functions for text quantification. It uses external norms of valence, arousal, dominance, concreteness, humor, extremity, and emotionality. Each function takes a string as input, lemmatizes it, and returns a value.
  • Passive Voice
    • This code snippet takes a vector of strings and calculates the percentage of passive voice in the input text. It relies on the Stanford CoreNLP tools through the coreNLP package for R.
  • cheatR
    • A mini R package that compares PDF and Word documents to see how similar they are. Designed to catch homework cheaters and celebrate Pokémon. Co-written with Mattan S. Ben-Shachar. Also available as a Shiny app over here.
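
To give a feel for the norm-based scoring described under TATE, here is a minimal base-R sketch of the general idea: tokenize a string, reduce words to lemmas, look each lemma up in a table of ratings, and return a summary value. This is not TATE's actual code or API. The function name `score_text`, the toy lexicon, the toy lemma map, and the choice of the mean as the summary are all assumptions made for illustration, and the ratings are made-up numbers rather than values from any published norms.

```r
# Toy valence lexicon: words and ratings are invented for illustration only.
toy_norms <- c(happy = 8, sad = 2, dog = 7, rain = 4)

# Toy lemma map standing in for a real lemmatizer (hypothetical).
toy_lemmas <- c(dogs = "dog", rains = "rain", happier = "happy")

# Hypothetical helper: average the ratings of all words found in the lexicon.
score_text <- function(text, norms = toy_norms, lemmas = toy_lemmas) {
  # Lowercase the input and split on anything that is not a letter.
  tokens <- strsplit(tolower(text), "[^a-z]+")[[1]]
  tokens <- tokens[nzchar(tokens)]

  # Replace inflected forms with their lemma where the map has an entry.
  mapped <- lemmas[tokens]
  tokens <- ifelse(is.na(mapped), tokens, unname(mapped))

  # Look up each lemma; words without a rating come back as NA.
  ratings <- norms[tokens]
  if (all(is.na(ratings))) return(NA_real_)

  # Assumption: summarize by the mean of the ratings that were found.
  mean(ratings, na.rm = TRUE)
}

score_text("Happy dogs")
```

Words missing from the lexicon are simply skipped, and a string with no rated words returns `NA`. A real implementation would swap the toy map for a proper lemmatizer and the toy lexicon for published norms.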