CMSW - www.scottishcorpus.ac.uk/cmsw/

This zip file contains files for each document.
All documents are identified by their document id.

The folder structure is as follows:

plaintext: Document as plain text (UTF-8)

file names are in this format:

cmsw-[documentid]-[yeargroup]-[genre]-[title]

where yeargroup is:
y1: 1700-1750
y2: 1750-1800
y3: 1800-1850
y4: 1850-1900
y5: 1900-1950

where genre is:
g1: Administrative prose
g2: Expository prose
g3: Personal writing
g4: Instructional prose
g5: Religious prose
g6: Verse/drama
g7: Imaginative prose
g8: Journalism
g9: Orthoepists
