cc.factorie.app.nlp.segment

MSRChainChineseWordSegmenter

class MSRChainChineseWordSegmenter extends ChainChineseWordSegmenter

A linear-chain CRF model for Chinese word segmentation with four companion objects, each pre-trained on a different corpus that corresponds to a different variety of written Mandarin.

Linear Supertypes
Known Subclasses
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. MSRChainChineseWordSegmenter
  2. ChainChineseWordSegmenter
  3. DocumentAnnotator
  4. AnyRef
  5. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new MSRChainChineseWordSegmenter(url: URL)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. object SegmentationFeaturesDomain extends CategoricalVectorDomain[String]

    Definition Classes
    ChainChineseWordSegmenter
  7. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  8. def characterToFeatures(i: Int, labeledCharacters: IndexedSeq[(String, String)]): Seq[String]

    Definition Classes
    ChainChineseWordSegmenter
  9. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. def deserialize(stream: InputStream): Unit

    Definition Classes
    ChainChineseWordSegmenter
  11. def deserialize(filePath: String): Unit

    Definition Classes
    ChainChineseWordSegmenter
  12. def documentAnnotationString(document: Document): String

    How the annotation of this DocumentAnnotator should be printed as extra information after a one-word-per-line (OWPL) format.

    How the annotation of this DocumentAnnotator should be printed as extra information after a one-word-per-line (OWPL) format. If there is no document annotation, return the empty string. Used in Document.owplString.

    Definition Classes
    DocumentAnnotator
  13. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  14. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  15. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  16. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  17. def getF1Score(filePath: String): Double

    Definition Classes
    ChainChineseWordSegmenter
  18. def getSegmentables(labeledCharacters: IndexedSeq[(String, String)]): IndexedSeq[Segmentable]

    Definition Classes
    ChainChineseWordSegmenter
  19. def getSegmentables(document: Document): IndexedSeq[Segmentable]

    Definition Classes
    ChainChineseWordSegmenter
  20. def getSegmentables(corpus: File): IndexedSeq[Segmentable]

    Definition Classes
    ChainChineseWordSegmenter
  21. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  22. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  23. def mentionAnnotationString(mention: Mention): String

    Definition Classes
    DocumentAnnotator
  24. val model: ChainModel[SegmentationLabel, SegmentationFeatures, Character]

    Definition Classes
    ChainChineseWordSegmenter
  25. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  26. final def notify(): Unit

    Definition Classes
    AnyRef
  27. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  28. def phraseAnnotationString(phrase: Phrase): String

    Definition Classes
    DocumentAnnotator
  29. def postAttrs: Seq[Class[_ >: Sentence with Token <: Attr]]

  30. def prereqAttrs: Seq[Nothing]

  31. def process(document: Document): Document

  32. def processParallel(documents: Iterable[Document], nThreads: Int = ...): Iterable[Document]

    Definition Classes
    DocumentAnnotator
  33. def processSequential(documents: Iterable[Document]): Iterable[Document]

    Definition Classes
    DocumentAnnotator
  34. def segment(segmentables: IndexedSeq[Segmentable]): IndexedSeq[SegmentationLabel]

    Definition Classes
    ChainChineseWordSegmenter
  35. def segment(document: Document): IndexedSeq[SegmentationLabel]

    Definition Classes
    ChainChineseWordSegmenter
  36. def segment(filePath: String): IndexedSeq[SegmentationLabel]

    Definition Classes
    ChainChineseWordSegmenter
  37. def serialize(stream: OutputStream): Unit

    Definition Classes
    ChainChineseWordSegmenter
  38. def serialize(filePath: String): Unit

    Definition Classes
    ChainChineseWordSegmenter
  39. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  40. def toString(): String

    Definition Classes
    AnyRef → Any
  41. def tokenAnnotationString(token: Token): String

    How the annotation of this DocumentAnnotator should be printed in one-word-per-line (OWPL) format.

    How the annotation of this DocumentAnnotator should be printed in one-word-per-line (OWPL) format. If there is no per-token annotation, return null. Used in Document.owplString.

    Definition Classes
    ChainChineseWordSegmenterDocumentAnnotator
  42. def train(filePath: String): Unit

    Definition Classes
    ChainChineseWordSegmenter
  43. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  44. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  45. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from ChainChineseWordSegmenter

Inherited from DocumentAnnotator

Inherited from AnyRef

Inherited from Any

Ungrouped