cc.factorie.app.nlp.lexicon

Lexicon

trait Lexicon extends AnyRef

The general interface to a lexicon. Both WordLexicon and PhraseLexicon are subclasses.

Linear Supertypes
AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. Lexicon
  2. AnyRef
  3. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract def contains[T <: Observation[T]](query: Seq[T]): Boolean

  2. abstract def contains[T <: Observation[T]](query: T): Boolean

    Is this Token (or more generally Observation) a member of a phrase in the lexicon (including single-word phrases)? The query.

    Is this Token (or more generally Observation) a member of a phrase in the lexicon (including single-word phrases)? The query.string will be processed by the lemmatizer. For example if query.string is "New" and query.next.string is "York" and the two-word phrase "New York" is in the lexicon, then this method will return true. But if query.next.string is "shoes" (and "New shoes" is not in the lexicon) this method will return false.

  3. abstract def containsLemmatizedWord(word: String): Boolean

    Is this single word in the lexicon? The input String will not be processed by tokenizer, but will be processed by the lemmatizer.

  4. abstract def containsLemmatizedWords(words: Seq[String]): Boolean

    Is the pre-tokenized sequence of words in the lexicon? The input words are expected to already be processed by the lemmatizer.

  5. abstract def lemmatizer: Lemmatizer

    The string lemmatizer that simplifies lexicon entries and queries before searching for a match.

    The string lemmatizer that simplifies lexicon entries and queries before searching for a match. For example, a common lemmatizer is one that lowercases all strings.

  6. abstract def name: String

    An identifier for this lexicon, suitable for adding as a category to a FeatureVectorVariable[String].

  7. abstract def tokenizer: StringSegmenter

    The string segmenter that breaks a lexicon entries and queries into (potentially) multi-word phrases.

Concrete Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  8. def contains(untokenizedString: String): Boolean

    Is the input String in the lexicon.

    Is the input String in the lexicon. The input is tokenized and lemmatized; if the tokenizer indicates that it is a multi-word phrase, it will be processed by containsWords, otherwise containsWord.

  9. def contains(span: TokenSpan): Boolean

  10. def containsWord(word: String): Boolean

    Is this single word in the lexicon? The input String will not be processed by tokenizer, but will be processed by the lemmatizer.

  11. def containsWords(words: Seq[String]): Boolean

    Is the pre-tokenized sequence of words in the lexicon? Each of the input words will be processed by the lemmatizer.

  12. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  13. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  14. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  15. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  16. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  17. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  18. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  19. final def notify(): Unit

    Definition Classes
    AnyRef
  20. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  21. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  22. def toString(): String

    Definition Classes
    AnyRef → Any
  23. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  24. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped