The URLs used in the code and documentation should be checked for availability (as done for the README in #87).

Places to check:

* URLs in the CorpusDownloader classes
* URLs in the code documentation of the CorpusReader classes