This module implements a set of functions for extracting labeling events and predicting article quality scores.
Extracts a set of feature values from a text.
| Parameters: |
|
|---|---|
| Returns: | A list of extracted feature values |
Extracts labeling events from mwxml.Dump.
| Parameters: |
|
|---|---|
| Returns: | An iterator of dicts containing:
|
Extracts article text and metadata for labelings from an XML dump.
| Parameters: |
|
|---|---|
| Returns: | An iterator of labelings augmented with ‘page_id’, ‘rev_id’ and ‘text’. Note that labelings of articles that can’t be looked up will not be included. |
Fetches article text and metadata for labelings from a MediaWiki API.
| Parameters: |
|
|---|---|
| Returns: | An iterator of labelings augmented with ‘page_id’, ‘rev_id’ and ‘text’. Note that labelings of articles that aren’t found will not be included. |
Scores a chunck of Wikitext markup
| Parameters: |
|
|---|---|
| Returns: | A dict of score information. |