## Prerequisites
- An up-to-date Java installation (openjdk-16)
- sbt installed and available on the `PATH`
- An up-to-date version of Python 3
- `$ pip install praw` to install PRAW, the Python Reddit API wrapper library
## Downloading reddit submissions
- Generate an API token at [https://www.reddit.com/prefs/apps](https://www.reddit.com/prefs/apps)
- Insert the token in `downloads.py`
- `$ python3 ./downloads.py -s -c askreddit iama all`
- The `-s` flag tells the script to fetch the submission body, while the `-c` flag tells it to download the top comments. The remaining arguments are the subreddits to download from.
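
To make the two flags more concrete, here is a rough sketch of how a script like this could parse them and collect text with PRAW. It is not the actual download script: the function names, the long option names (`--submissions`, `--comments`), the `max_comments` limit, the `limit=10` value and the credential placeholders are all assumptions made for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse flags resembling the command above (hypothetical long names)."""
    parser = argparse.ArgumentParser(description="Download reddit text samples (sketch)")
    parser.add_argument("-s", "--submissions", action="store_true",
                        help="fetch the submission body")
    parser.add_argument("-c", "--comments", action="store_true",
                        help="download the top comments")
    parser.add_argument("subreddits", nargs="+",
                        help="subreddit names, e.g. askreddit iama all")
    return parser.parse_args(argv)


def collect_text(submission, want_body, want_comments, max_comments=5):
    """Return the text pieces of one PRAW-style submission object."""
    parts = [submission.title]
    if want_body and submission.selftext:
        parts.append(submission.selftext)
    if want_comments:
        # Drop "load more comments" placeholders so only real comments remain.
        submission.comments.replace_more(limit=0)
        parts.extend(c.body for c in list(submission.comments)[:max_comments])
    return parts


def main(argv=None):
    # Imported here so the helpers above can be used without PRAW installed.
    import praw

    args = parse_args(argv)
    reddit = praw.Reddit(
        client_id="YOUR_CLIENT_ID",          # placeholder, from reddit.com/prefs/apps
        client_secret="YOUR_CLIENT_SECRET",  # placeholder
        user_agent="text-sample-downloader (sketch)",
    )
    for name in args.subreddits:
        # limit=10 is an arbitrary choice for this sketch.
        for submission in reddit.subreddit(name).top(limit=10):
            print("\n".join(collect_text(submission, args.submissions, args.comments)))
```

Calling `main(["-s", "-c", "askreddit"])` would print the title, body and first few top-level comments of the top submissions in that subreddit, provided valid credentials are filled in.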
## Usage
- `$ sbt "run /path/to/text-samples/ askreddit iama all"`

Thanks to [https://www.wordfrequency.info/coca.asp](https://www.wordfrequency.info/coca.asp) for providing the corpus data.