A token filter of type stop that removes stop words from token streams.
The following settings can be configured for a stop token filter:
Setting | Description |
---|---|
stopwords | A list of stop words to use. Defaults to English stop words. |
stopwords_path | A path (either relative to the config location, or absolute) to a stop words file. |
enable_position_increments | Set to true if token positions should record the removed stop words, false otherwise. Defaults to true. |
ignore_case | Set to true to lowercase all words before comparing them, so that stop words are matched case-insensitively. Defaults to false. |
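
To make the effect of enable_position_increments and ignore_case concrete, here is a minimal Python sketch of a stop filter. It is a toy model and not the actual implementation: the function name `stop_filter` and the sample tokens are hypothetical, and ignore_case is modeled as case-insensitive matching that leaves the emitted token text unchanged, which is an assumption about the intended behavior.

```python
from typing import Iterable, List, Tuple


def stop_filter(
    tokens: Iterable[str],
    stopwords: Iterable[str],
    enable_position_increments: bool = True,
    ignore_case: bool = False,
) -> List[Tuple[str, int]]:
    """Toy stop filter (hypothetical helper): returns (token, position) pairs."""
    # With ignore_case, both the stop list and the tokens are lowercased
    # before comparison; the emitted token keeps its original text.
    stop = {w.lower() for w in stopwords} if ignore_case else set(stopwords)
    kept: List[Tuple[str, int]] = []
    position = -1
    for token in tokens:
        key = token.lower() if ignore_case else token
        if key in stop:
            if enable_position_increments:
                position += 1  # the removed word still occupies a position
            continue
        position += 1
        kept.append((token, position))
    return kept


tokens = ["The", "quick", "and", "the", "dead"]
stopwords = ["the", "and"]

default_result = stop_filter(tokens, stopwords)
case_insensitive = stop_filter(tokens, stopwords, ignore_case=True)
no_gaps = stop_filter(
    tokens, stopwords, enable_position_increments=False, ignore_case=True
)

print(default_result)    # "The" survives because matching is case-sensitive
print(case_insensitive)  # gaps remain where stop words were removed
print(no_gaps)           # positions are contiguous
```

With position increments enabled, the gaps left by removed words are visible in the positions, which matters for phrase queries that depend on word distance.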
The stopwords setting also accepts the name of a predefined, language-specific stop word list, which expands to the default stop words for that language. The name follows the _lang_ notation (for example, _english_), and the following languages are supported: arabic, armenian, basque, brazilian, bulgarian, catalan, danish, dutch, english, finnish, french, galician, german, greek, hindi, hungarian, indonesian, italian, norwegian, persian, portuguese, romanian, russian, spanish, swedish, turkish.
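
As an illustration of the _lang_ notation, the following Python sketch builds an index settings body that defines a stop filter using a predefined language list and wires it into a custom analyzer. The names `my_stop`, `my_analyzer`, and the helper `stop_filter_settings` are hypothetical, and the choice of the standard tokenizer and lowercase filter is an assumption made for the example.

```python
import json


def stop_filter_settings(language: str) -> dict:
    """Build index settings with a stop filter for a predefined language list.

    `my_stop` and `my_analyzer` are placeholder names chosen for this sketch.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    "my_stop": {
                        "type": "stop",
                        # _lang_ notation, e.g. "_spanish_"
                        "stopwords": f"_{language}_",
                    }
                },
                "analyzer": {
                    "my_analyzer": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "my_stop"],
                    }
                },
            }
        }
    }


body = stop_filter_settings("spanish")
print(json.dumps(body, indent=2))
```

The printed JSON can be supplied as the settings body when creating an index. Placing the lowercase filter before the stop filter is one way to make matching case-insensitive without setting ignore_case.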