Entire transcriptions in string form can be split into words, (see example in make-test-dataset.rb)
If the size of the split isn't the same as the number of words either:
- There's a bug in the splitting
- An utterance was produced twice and transcribed
- The speaker skipped a word
- Error in transcription or in spacing to signify word boundaries