Package opennlp.tools.formats
package opennlp.tools.formats
Experimental package for converting various corpora to the OpenNLP format.
Class descriptions:
- Base class for sample stream factories.
- A sample stream for the training files of the BioNLP/NLPBA 2004 shared task.
- Factory producing OpenNLP ChunkSampleStreams.
- Parser for the Dutch and Spanish NER training files of the CONLL 2002 shared task.
- Note: Do not use this class, internal use only!
- An import stream which can parse the CONLL03 data.
- Parses the data from the CONLL 06 shared task into POS Samples.
- Note: Do not use this class, internal use only!
- Note: Do not use this class, internal use only!
- Note: Do not use this class, internal use only!
- Base class for factories which need a Detokenizer.
- The directory sample stream allows for creating an ObjectStream<File> from a directory listing of files.
- Factory producing OpenNLP DocumentSampleStreams.
- Parser for the Italian NER training files of the Evalita 2007 and 2009 NER shared tasks.
- Note: Do not use this class, internal use only!
- Factory producing OpenNLP lang detector sample streams.
- Stream factory for those streams which carry language.
- Factory producing OpenNLP LemmaSampleStreams.
- This class helps to read the US Census data from the files to build a StringList for each dictionary entry in the name-finder dictionary.
- Factory producing OpenNLP NameSampleDataStreams.
- Factory producing OpenNLP ParseSampleStreams.
- Factory producing OpenNLP SentenceSampleStreams.
- Factory producing OpenNLP TokenSampleStreams.
- An ObjectStream implementation for the Twenty Newsgroups text corpus.
- Note: Do not use this class, internal use only!
- Note: Do not use this class, internal use only!
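The directory sample stream described above turns a directory listing into a stream of File objects. The following is a minimal sketch of that idea using only the Java standard library. It is not the OpenNLP implementation: the names SimpleStream and DirectoryListingStream are hypothetical, and the convention that read() returns null once the stream is exhausted is an assumption made for this illustration.

```java
import java.io.File;
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;

/** Hypothetical pull-style stream: read() returns null when exhausted (assumed convention). */
interface SimpleStream<T> {
    T read();
    void reset();
}

/** Illustrative only; a stand-in for the idea of a directory sample stream. */
final class DirectoryListingStream implements SimpleStream<File> {
    private final File root;
    private final boolean recursive;
    // Entries still to be visited; the head of the deque is visited next.
    private final Deque<File> pending = new ArrayDeque<>();

    DirectoryListingStream(File root, boolean recursive) {
        if (!root.isDirectory()) {
            throw new IllegalArgumentException("Not a directory: " + root);
        }
        this.root = root;
        this.recursive = recursive;
        reset();
    }

    /** Returns the next regular file, or null when the listing is exhausted. */
    @Override
    public File read() {
        while (!pending.isEmpty()) {
            File next = pending.pop();
            if (next.isFile()) {
                return next;
            }
            if (recursive && next.isDirectory()) {
                pushChildren(next); // depth-first descent into the subdirectory
            }
        }
        return null;
    }

    /** Restarts the listing from the root directory. */
    @Override
    public void reset() {
        pending.clear();
        pushChildren(root);
    }

    private void pushChildren(File dir) {
        File[] children = dir.listFiles();
        if (children == null) {
            return; // I/O error or not a directory
        }
        Arrays.sort(children); // File is Comparable; sorts by pathname
        // Push in reverse so that pop() yields the children in sorted order.
        for (int i = children.length - 1; i >= 0; i--) {
            pending.push(children[i]);
        }
    }
}
```

A caller would loop with `for (File f = stream.read(); f != null; f = stream.read())` and hand each file to a corpus-specific parser. Sorting the children makes the iteration order deterministic, which helps when training runs need to be reproducible.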