Package org.carrot2.text.preprocessing
Class LabelFilterProcessor
java.lang.Object
org.carrot2.attrs.AttrComposite
org.carrot2.text.preprocessing.LabelFilterProcessor
All Implemented Interfaces:
AcceptingVisitor
Applies basic filtering to words and phrases to produce candidates for cluster labels. Filtering is applied to PreprocessingContext.AllWords and PreprocessingContext.AllPhrases, and the results are saved to PreprocessingContext.AllLabels. The filters currently applied are the ones declared as fields of this class.
This class requires that InputTokenizer, CaseNormalizer, StopListMarker and PhraseExtractor be invoked first.
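The data flow described above can be pictured with a small self-contained sketch. It does not use the Carrot2 classes: `LabelCandidateSketch`, `collectLabels` and the single length-based rule are hypothetical stand-ins, chosen only to show words and phrases being filtered into one list of label candidates.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for illustration; not part of Carrot2. */
public class LabelCandidateSketch {
    /**
     * Collects the words and phrases that pass a simple length rule into one
     * list, loosely mirroring AllWords + AllPhrases -> AllLabels.
     */
    public static List<String> collectLabels(
            List<String> words, List<String> phrases, int minLength) {
        List<String> labels = new ArrayList<>();
        // Words are checked first, then phrases, so the result keeps that order.
        for (String word : words) {
            if (word.length() >= minLength) {
                labels.add(word);
            }
        }
        for (String phrase : phrases) {
            if (phrase.length() >= minLength) {
                labels.add(phrase);
            }
        }
        return labels;
    }

    public static void main(String[] args) {
        List<String> words = List.of("of", "data", "mining");
        List<String> phrases = List.of("data mining", "a b");
        // prints [data, mining, data mining]
        System.out.println(collectLabels(words, phrases, 4));
    }
}
```

In the real class the candidates come from a PreprocessingContext that earlier pipeline stages have already populated, which is why those stages must be invoked first.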
-
Field Summary
Fields
Field                   Description
completeLabelFilter     Truncated phrase filter for this processor.
genitiveLabelFilter     Genitive length label filter.
minLengthLabelFilter    Min length label filter.
numericLabelFilter      Numeric label filter for this processor.
queryLabelFilter        Query word label filter for this processor.
stopLabelFilter         Stop label filter.
stopWordLabelFilter     Stop word label filter for this processor.
Fields inherited from class org.carrot2.attrs.AttrComposite
attributes
-
Constructor Summary
Constructors
LabelFilterProcessor()
-
Method Summary
Modifier and Type    Method                                   Description
void                 process(PreprocessingContext context)    Processes all filters declared as fields of this class.
Methods inherited from class org.carrot2.attrs.AttrComposite
accept
-
Field Details
-
queryLabelFilter
Query word label filter for this processor.
-
stopWordLabelFilter
Stop word label filter for this processor.
-
stopLabelFilter
Stop label filter.
-
numericLabelFilter
Numeric label filter for this processor.
-
completeLabelFilter
Truncated phrase filter for this processor.
-
minLengthLabelFilter
Min length label filter.
-
genitiveLabelFilter
Genitive length label filter.
-
-
Constructor Details
-
LabelFilterProcessor
public LabelFilterProcessor()
-
-
Method Details
-
process
public void process(PreprocessingContext context)
Processes all filters declared as fields of this class.
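As a rough illustration of what "processes all filters declared as fields" can look like, here is a minimal sketch with stand-in types. `FilterFieldsSketch`, `ToggleFilter`, the `enabled` flag and the two example rules are all assumptions made for this example, not the Carrot2 implementation. The sketch keeps a candidate only if no enabled filter rejects it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

/** Hypothetical stand-in for illustration; not the Carrot2 implementation. */
public class FilterFieldsSketch {
    /** A filter that can be switched off; name and shape are assumptions. */
    public static final class ToggleFilter {
        public boolean enabled = true;
        private final Predicate<String> accepts;

        public ToggleFilter(Predicate<String> accepts) {
            this.accepts = accepts;
        }

        /** A disabled filter never rejects anything. */
        boolean rejects(String label) {
            return enabled && !accepts.test(label);
        }
    }

    // Filters are public fields so callers can configure them before process().
    public final ToggleFilter numericFilter = new ToggleFilter(s -> !s.matches("\\d+"));
    public final ToggleFilter minLengthFilter = new ToggleFilter(s -> s.length() >= 3);

    /** Keeps a candidate only if no enabled filter rejects it. */
    public List<String> process(List<String> candidates) {
        List<ToggleFilter> filters = List.of(numericFilter, minLengthFilter);
        List<String> kept = new ArrayList<>();
        for (String candidate : candidates) {
            boolean rejected = false;
            for (ToggleFilter filter : filters) {
                if (filter.rejects(candidate)) {
                    rejected = true;
                    break;
                }
            }
            if (!rejected) {
                kept.add(candidate);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        FilterFieldsSketch sketch = new FilterFieldsSketch();
        List<String> input = List.of("2024", "ai", "clustering");
        System.out.println(sketch.process(input)); // prints [clustering]
        sketch.numericFilter.enabled = false;
        System.out.println(sketch.process(input)); // prints [2024, clustering]
    }
}
```

Exposing each filter as a field lets a caller adjust one filter without touching the others, while a single `process` call still runs them all.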
-