Contains indices of the sequence positions which immediately follow breaks (e.
Contains indices of the sequence positions which immediately follow breaks (e.g. removed stopwords)
Read from the BufferedReader, filling the document with words and Z assignments, but allow map function to alter the Z assignment, and skips word/z pairs for which the map function returns a value less than 0.
The abstract document variable required by LDA.