If there are more than this number of characters between the end of the previous token and the beginning of this one, force a sentence start.
If there are more than this number of characters between the end of the previous token and the beginning of this one, force a sentence start. If negative, don't break sentences according to this criteria at all.
Matches the Token.
Matches the Token.string of tokens that may extend a sentence, such as quotes, closing parentheses, and even additional periods.
Matches the Token.
Matches the Token.string of punctuation that always indicates the end of a sentence. It does not include possible additional tokens that may be appended to the sentence such as quotes and closing parentheses.
How the annotation of this DocumentAnnotator should be printed as extra information after a one-word-per-line (OWPL) format.
How the annotation of this DocumentAnnotator should be printed as extra information after a one-word-per-line (OWPL) format. If there is no document annotation, return the empty string. Used in Document.owplString.
If true every double newline causes a sentence break.
If true every double newline causes a sentence break.
If true, every newline causes a sentence break.
If true, every newline causes a sentence break.
Matches the Token.
Matches the Token.string of tokens that might possibility indicate the end of a sentence, such as an mdash. The sentence segmenter will only actually create a sentence end here if possibleSentenceStart is true for the following token.
Returns true for strings that probably start a sentence after a word that ends with a period.
Returns true for strings that probably start a sentence after a word that ends with a period.
Whitespace that should not be allowed between a closingRegex and closingContinuationRegex for a sentence continuation.
Whitespace that should not be allowed between a closingRegex and closingContinuationRegex for a sentence continuation. For example: He ran. "You shouldn't run!"
How the annotation of this DocumentAnnotator should be printed in one-word-per-line (OWPL) format.
How the annotation of this DocumentAnnotator should be printed in one-word-per-line (OWPL) format. If there is no per-token annotation, return null. Used in Document.owplString.