Part of Speech (English)
Description
Takes in a stream of tokens and marks each token with a part-of-speech tag The list of used suffixes can be found here
Required input
A stream with a list property which contains the tokens.
Configuration
Simply assign the correct output of the previous stream to the part of speech detector input. To use this component you have to download or train an openNLP model: https://opennlp.apache.org/models.html
Output
Appends two list properties to the stream:
- String list: The tag for each token
- Double list: The confidence for each tag that it is indeed the given tag (between 0 and 1)
Example:
Input: (tokens: ["Hi", "Joe"])
Output: (tokens: ["Hi", "Joe"], tags: ["UH", "NNP"], confidence: [0.82, 0.87])