| Interface | Description |
|---|---|
| CharPredicate | Predicate for testing a character. |
| Encounter<Collector> | |
| Handler | Handler that receives streaming events when processing a Lect syntax tree. |
| HandlerFactory<Collector> | Interface for creating instances of a Handler. |
| LanguageEncounter | Encounter used while a LanguageParser extracts natural language details. |
| LanguageFactory | Factory for creating instances of LanguageParser. |
| LanguageParser | Language parser used for parsing text within paragraph-level content. |
| Pipeline<Collector> | Pipeline over a TextSource that will run Handlers on the text in the source. |
| PipelineAssembly<Collector,Self extends PipelineAssembly<Collector,Self>> | Assembly interface for creating pipelines, used to support both PipelineBuilder and PipelineRunner. |
| PipelineBuilder<Collector> | Builder for instances of Pipeline. |
| TextSource | Abstraction over sources of text, used to support different formats such as plain text and HTML. |
| TextSourceEncounter | Encounter given to TextSources when they should parse their content. |

| Class | Description |
|---|---|
| ChunkedLanguageParser | Abstract implementation of LanguageParser that allows an implementation to handle a single CharSequence without worrying about mapping to the correct location or original text in the source. |
| ICULanguage | LanguageParser that uses BreakIterator from ICU4J to split text into sentences and tokens. |
| ParagraphLanguage | LanguageFactory that just emits entire paragraphs as a Token. |
| PipelineRunner<Collector> | |
| TokenizingLanguage | Implementation of LanguageParser that uses Tokenizers to split text into sentences and tokens. |
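
The ICULanguage entry above describes splitting text into sentences and tokens with a BreakIterator. The following is a minimal sketch of that general technique, not the library's implementation. It assumes the JDK's `java.text.BreakIterator` as a stand-in for the ICU4J class so that it runs without extra dependencies. The class name `BreakIteratorSketch` and its methods are hypothetical and do not appear in this package.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration; not part of the documented package.
public class BreakIteratorSketch {

    // Collect the trimmed, non-empty segments between consecutive boundaries.
    static List<String> segments(BreakIterator it, String text) {
        List<String> out = new ArrayList<>();
        it.setText(text);
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String piece = text.substring(start, end).trim();
            if (!piece.isEmpty()) {
                out.add(piece);
            }
        }
        return out;
    }

    // Split text into sentences using sentence boundaries.
    static List<String> sentences(String text) {
        return segments(BreakIterator.getSentenceInstance(Locale.ENGLISH), text);
    }

    // Word boundaries also delimit punctuation and whitespace, so keep
    // only the segments that begin with a letter or digit.
    static List<String> tokens(String sentence) {
        List<String> out = new ArrayList<>();
        for (String piece : segments(BreakIterator.getWordInstance(Locale.ENGLISH), sentence)) {
            if (Character.isLetterOrDigit(piece.codePointAt(0))) {
                out.add(piece);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        for (String s : sentences("Hello world. This is a test.")) {
            System.out.println(s + " -> " + tokens(s));
        }
    }
}
```

A LanguageParser built this way would presumably also need to report each sentence and token with its offset in the original source. The sketch drops offsets by trimming and returning plain strings, which keeps it short but is a simplification.
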
Copyright © 2018. All rights reserved.