The interface common to objects that create Documents from a data source, such as plain text files or labeled data from OntoNotes.
The interface common to objects that create Documents from the files in a directory.
Create Documents from plain text files.
Load a Document from a single NYTimes article in the XML format released by NYTimes and described in Evan Sandhaus (2008), "The New York Times Annotated Corpus," Linguistic Data Consortium, Philadelphia.
Author: martin Date: 2/25/12
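The relationship between the loader interfaces described above can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the library's actual API: the `Document` dataclass, the method names `from_file` and `from_directory`, and the `demo` helper are all hypothetical names introduced here, and the choice to read files as UTF-8 and to visit them in sorted order is an assumption.

```python
import tempfile
from abc import ABC, abstractmethod
from dataclasses import dataclass
from pathlib import Path
from typing import List


@dataclass
class Document:
    """Hypothetical stand-in for a Document: a name plus its raw text."""
    name: str
    text: str


class Load(ABC):
    """Common interface: create Documents from one data source (assumed signature)."""

    @abstractmethod
    def from_file(self, path: Path) -> List[Document]:
        ...


class LoadDirectory(ABC):
    """Common interface: create Documents from the files in a directory (assumed signature)."""

    @abstractmethod
    def from_directory(self, directory: Path) -> List[Document]:
        ...


class LoadPlainText(Load, LoadDirectory):
    """Create one Document per plain text file."""

    def from_file(self, path: Path) -> List[Document]:
        # Assumption: files are UTF-8 encoded.
        return [Document(name=path.name, text=path.read_text(encoding="utf-8"))]

    def from_directory(self, directory: Path) -> List[Document]:
        docs: List[Document] = []
        # Sorted so the result order is deterministic; subdirectories are skipped.
        for path in sorted(directory.iterdir()):
            if path.is_file():
                docs.extend(self.from_file(path))
        return docs


def demo() -> List[Document]:
    """Write two small files to a temporary directory and load them."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "a.txt").write_text("First document.", encoding="utf-8")
        (root / "b.txt").write_text("Second document.", encoding="utf-8")
        return LoadPlainText().from_directory(root)
```

Calling `demo()` returns one `Document` per file. A loader for a structured format such as the NYTimes XML would implement the same `Load` interface but parse the file contents instead of reading them verbatim.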