Implements a feature set based off of dictionary lookup.
Parameters: |
|
---|
int : A count of the number of dictionary words in the revision
int : A count of the number of non-dictionary words in the revision
int : A count of the number of dictionary words added
int : A count of the number of dictionary words removed
int : A count of the number of non-dictionary words added
int : A count of the number of non-dictionary words removed
int : The sum of word frequency deltas for dictionary words
int : The sum of word frequency delta increases for dictionary words
int : The sum of word frequency delta decreases for dictionary words
int : The sum of word frequency deltas for non-dictionary words
int : The sum of word frequency delta increases for non-dictionary words
int : The sum of word frequency delta decreases for non-dictionary words
float : The sum of word frequency proportional delta for dictionary words
float : The sum of word frequency proportional delta increases for dictionary words
float : The sum of word frequency proportional delta decreases for dictionary words
float : The sum of word frequency proportional delta for non-dictionary words
float : The sum of word frequency proportional delta increase for non-dictionary words
float : The sum of word frequency proportional delta decrease for non-dictionary words
Implements a feature set based off of a set of regexes applied to strings.
Parameters: |
|
---|
int : A count of the number of matches found in the text
int : The number of matches added in the edit
int : The number of matches removed in the edit
int : The sum of frequency delta for matched strings
int : The sum of frequency delta increases for matched strings
int : The sum of frequency delta decreases for matched strings
int : The sum of proportional frequency delta for matched strings
int : The sum of proportional frequency delta increases for matched strings
int : The sum of proportional frequency delta decreases for matched strings
Implements a feature set based off of filtering words for stopwords
Parameters: |
|
---|
int : A count of the number of stopwords in the content
int : A count of the number of non-stopwords in the content
int : A count of stopwords added
int : A count of stopwords removed
int : A count of non-stopwords added
int : A count of non-stopwords removed
int : The sum of word frequency deltas for stopwords
int : The sum of word frequency delta increases for stopwords
int : The sum of word frequency delta decreases for stopwords
int : The sum of word frequency deltas for non-stopwords
int : The sum of word frequency delta increases for non-stopwords
int : The sum of word frequency delta decreases for non-stopwords
float : The sum of proportional word frequency deltas for stopwords
float : The sum of proportional word frequency delta increases for stopwords
float : The sum of proportional word frequency delta decreases for stopwords
float : The sum of proportional word frequency deltas for non-stopwords
float : The sum of proportional word frequency delta increases for non-stopwords
float : The sum of proportional word frequency delta decreases for non-stopwords
Implements a feature set based off of stemmer applied to words.
Parameters: |
|
---|
int : A count of unique stemmed words.
int : A count of characters in stemmed words.
int : The sum of frequency deltas for stemmed words
int : The sum of frequency delta increases for stemmed words
int : The sum of frequency delta decreases for stemmed words
int : The sum of proportional frequency deltas for stemmed words
int : The sum of proportional frequency delta increases for stemmed words
int : The sum of proportional frequency delta decreases for stemmed words