The Political Twitter Discourse Corpus (PTDC)The Political Twitter Discourse Corpus (PTDC)
The Political Twitter Discourse Corpus (PTDC) has been created for use as a discourse reference corpus tool in the focused analysis of political discourse occurring on Twitter - particularly for comparative keyword analyses. It is comprised of the most recent original tweets (i.e. not inclusive of retweets, up to a maximum of 3000 per user) of all current US state governors, members of congress and senators. At present, the PTDC consists of 205,303 individual tweets and 4,659,381 words. The corpus is in '.txt' file format, and can be accessed via email request to either Dr. Andrew Ross (andrew.ross[AT]scu.edu.au) or Dr. Damian Rivers (rivers[AT]fun.ac.jp).