This module implements a set of functions for extracting labeling events and predicting article quality scores.
Extracts a set of feature values from a text.
Parameters: |
|
---|---|
Returns: | A list of extracted feature values |
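
As a rough illustration of turning a text into a list of feature values, here is a minimal sketch. The feature set (`FEATURES`) and the function name `extract_features` are hypothetical, chosen only for the example; the features this module actually extracts are not listed in this section.

```python
import re

# Hypothetical features for illustration only; not the module's real feature set.
FEATURES = [
    ("chars", lambda text: len(text)),
    ("headings", lambda text: len(re.findall(r"^=+.+?=+[ \t]*$", text, re.M))),
    ("ref_tags", lambda text: len(re.findall(r"<ref\b", text))),
    ("wikilinks", lambda text: text.count("[[")),
]


def extract_features(text):
    """Return one value per entry in FEATURES, in the same order."""
    return [extract(text) for _, extract in FEATURES]


sample = "== History ==\nFounded in [[1900]].<ref>Source</ref>\n"
values = extract_features(sample)
```

Keeping the values in a fixed order matters because a downstream model would typically identify each feature by its position in the list.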
Extracts labeling events from an mwxml.Dump.
Parameters: |
|
---|---|
Returns: | An iterator of dicts containing:
|
Extracts article text and metadata for labelings from an XML dump.
Parameters: |
|
---|---|
Returns: | An iterator of labelings augmented with ‘page_id’, ‘rev_id’ and ‘text’. Note that labelings of articles that can’t be looked up will not be included. |
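
The skip-on-missing behaviour described above can be sketched as a generator. This is not the module's implementation: `attach_text`, the `page_title` key, and the in-memory `revisions_by_title` mapping (standing in for the XML dump) are all assumptions made for the example.

```python
def attach_text(labelings, revisions_by_title):
    """Yield labelings augmented with 'page_id', 'rev_id' and 'text'.

    `revisions_by_title` is a hypothetical stand-in for the dump: it maps a
    page title to a dict holding 'page_id', 'rev_id' and 'text'.
    """
    for labeling in labelings:
        revision = revisions_by_title.get(labeling["page_title"])
        if revision is None:
            # The article cannot be looked up, so the labeling is dropped.
            continue
        yield {
            **labeling,
            "page_id": revision["page_id"],
            "rev_id": revision["rev_id"],
            "text": revision["text"],
        }


labelings = [
    {"page_title": "Anarchism", "label": "B"},
    {"page_title": "Missing page", "label": "Stub"},
]
revisions_by_title = {
    "Anarchism": {"page_id": 12, "rev_id": 1001, "text": "Some wikitext"},
}
augmented = list(attach_text(labelings, revisions_by_title))
```

Because the function is a generator, callers that need to know how many labelings were dropped have to compare input and output counts themselves.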
Fetches article text and metadata for labelings from a MediaWiki API.
Parameters: |
|
---|---|
Returns: | An iterator of labelings augmented with ‘page_id’, ‘rev_id’ and ‘text’. Note that labelings of articles that aren’t found will not be included. |
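
For orientation, the sketch below shows one way such a lookup could be expressed against the MediaWiki Action API, under the assumption that each labeling carries a page title and a timestamp. The helper names `build_query` and `parse_revision` are hypothetical, and no network request is made; the response is a hand-written sample in the `formatversion=2` layout.

```python
from urllib.parse import urlencode


def build_query(title, timestamp):
    """Build a query string asking for the latest revision at or before `timestamp`."""
    params = {
        "action": "query",
        "prop": "revisions",
        "titles": title,
        "rvprop": "ids|content",
        "rvslots": "main",
        "rvstart": timestamp,  # revisions are listed newest first from this point
        "rvlimit": 1,
        "format": "json",
        "formatversion": 2,
    }
    return urlencode(params)


def parse_revision(response):
    """Return page_id, rev_id and text from a response, or None if not found."""
    pages = response.get("query", {}).get("pages", [])
    if not pages:
        return None
    page = pages[0]
    if page.get("missing") or not page.get("revisions"):
        return None
    revision = page["revisions"][0]
    return {
        "page_id": page["pageid"],
        "rev_id": revision["revid"],
        "text": revision["slots"]["main"]["content"],
    }


query = build_query("Anarchism", "2015-01-01T00:00:00Z")
found = parse_revision({
    "query": {"pages": [{
        "pageid": 12,
        "title": "Anarchism",
        "revisions": [{"revid": 1001, "slots": {"main": {"content": "Some wikitext"}}}],
    }]}
})
not_found = parse_revision({"query": {"pages": [{"title": "Nope", "missing": True}]}})
```

Returning `None` for a missing page lets the caller skip that labeling, which matches the note that articles that aren't found are left out of the results.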
Scores a chunk of Wikitext markup.
Parameters: |
|
---|---|
Returns: | A dict of score information. |