This step involves preparing your text, or “corpus”, to be read by Voyant, and making sure it is the right material for the question you are trying to ask.
To find this key term and others used in digital projects, visit the “My Data” page above and see the “Data Dictionary” entry, which defines these terms and explains them in more detail.
Voyant will help you do this, but it’s best to use a really careful refinement process. OpenRefine is my favorite tool for this step, and I explore it on the “My Data” page of this site.
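If you would rather script this kind of cleanup than do it by hand, here is a minimal Python sketch of the sort of tidying I mean: trimming stray spaces, collapsing repeated whitespace, and dropping empty lines. It is only an illustration and not part of Voyant or OpenRefine. The function name `clean_text` and the sample text are made up for this example, and your own corpus may need different steps.

```python
import re


def clean_text(raw: str) -> str:
    """Tidy plain text before loading it into a text-analysis tool.

    Hypothetical helper for illustration only: collapses runs of spaces
    and tabs, trims each line, and drops lines that end up empty.
    """
    cleaned_lines = []
    for line in raw.splitlines():
        # Replace any run of spaces or tabs with one space, then trim the ends.
        line = re.sub(r"[ \t]+", " ", line).strip()
        if line:  # skip lines that are empty after trimming
            cleaned_lines.append(line)
    return "\n".join(cleaned_lines)


if __name__ == "__main__":
    # Made-up sample text with messy spacing.
    sample = "  Call me   Ishmael.  \n\n\tSome years ago...  "
    print(clean_text(sample))
```

You could save the cleaned text to a plain-text file and upload that file to Voyant in place of the messy original.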
In my case, I found the data I needed and cleaned it myself. Download my data now from the “My Data” page to follow along in Voyant with step two.
