cc.factorie.app.nlp.segment

DeterministicSentenceSegmenter

object DeterministicSentenceSegmenter extends DeterministicSentenceSegmenter

Linear Supertypes
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. DeterministicSentenceSegmenter
  2. DeterministicSentenceSegmenter
  3. DocumentAnnotator
  4. AnyRef
  5. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. val charOffsetBoundary: Int

    If there are more than this number of characters between the end of the previous token and the beginning of this one, force a sentence start.

    If there are more than this number of characters between the end of the previous token and the beginning of this one, force a sentence start. If negative, don't break sentences according to this criteria at all.

    Definition Classes
    DeterministicSentenceSegmenter
  8. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. val closingContinuationRegex: Regex

    Matches the Token.

    Matches the Token.string of tokens that may extend a sentence, such as quotes, closing parentheses, and even additional periods.

    Definition Classes
    DeterministicSentenceSegmenter
  10. val closingRegex: Regex

    Matches the Token.

    Matches the Token.string of punctuation that always indicates the end of a sentence. It does not include possible additional tokens that may be appended to the sentence such as quotes and closing parentheses.

    Definition Classes
    DeterministicSentenceSegmenter
  11. def documentAnnotationString(document: Document): String

    How the annotation of this DocumentAnnotator should be printed as extra information after a one-word-per-line (OWPL) format.

    How the annotation of this DocumentAnnotator should be printed as extra information after a one-word-per-line (OWPL) format. If there is no document annotation, return the empty string. Used in Document.owplString.

    Definition Classes
    DocumentAnnotator
  12. var doubleNewlineBoundary: Boolean

    If true every double newline causes a sentence break.

    If true every double newline causes a sentence break.

    Definition Classes
    DeterministicSentenceSegmenter
  13. val emoticonRegex: Regex

  14. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  15. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  16. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  17. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  18. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  19. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  20. def main(args: Array[String]): Unit

  21. def mentionAnnotationString(mention: Mention): String

    Definition Classes
    DocumentAnnotator
  22. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  23. var newlineBoundary: Boolean

    If true, every newline causes a sentence break.

    If true, every newline causes a sentence break.

    Definition Classes
    DeterministicSentenceSegmenter
  24. final def notify(): Unit

    Definition Classes
    AnyRef
  25. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  26. def phraseAnnotationString(phrase: Phrase): String

    Definition Classes
    DocumentAnnotator
  27. val possibleClosingRegex: Regex

    Matches the Token.

    Matches the Token.string of tokens that might possibility indicate the end of a sentence, such as an mdash. The sentence segmenter will only actually create a sentence end here if possibleSentenceStart is true for the following token.

    Definition Classes
    DeterministicSentenceSegmenter
  28. def possibleSentenceStart(s: String): Boolean

    Returns true for strings that probably start a sentence after a word that ends with a period.

    Returns true for strings that probably start a sentence after a word that ends with a period.

    Definition Classes
    DeterministicSentenceSegmenter
  29. def postAttrs: Iterable[Class[_]]

  30. def prereqAttrs: Iterable[Class[_]]

  31. def process(document: Document): Document

  32. def processParallel(documents: Iterable[Document], nThreads: Int = ...): Iterable[Document]

    Definition Classes
    DocumentAnnotator
  33. def processSequential(documents: Iterable[Document]): Iterable[Document]

    Definition Classes
    DocumentAnnotator
  34. val spaceRegex: Regex

    Whitespace that should not be allowed between a closingRegex and closingContinuationRegex for a sentence continuation.

    Whitespace that should not be allowed between a closingRegex and closingContinuationRegex for a sentence continuation. For example: He ran. "You shouldn't run!"

    Definition Classes
    DeterministicSentenceSegmenter
  35. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  36. def toString(): String

    Definition Classes
    AnyRef → Any
  37. def tokenAnnotationString(token: Token): Null

    How the annotation of this DocumentAnnotator should be printed in one-word-per-line (OWPL) format.

    How the annotation of this DocumentAnnotator should be printed in one-word-per-line (OWPL) format. If there is no per-token annotation, return null. Used in Document.owplString.

    Definition Classes
    DeterministicSentenceSegmenterDocumentAnnotator
  38. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  39. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  40. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from DocumentAnnotator

Inherited from AnyRef

Inherited from Any

Ungrouped