7.1.3. mclearn.classifier.learning_curve¶

mclearn.classifier.learning_curve(data, feature_cols, target_col, classifier, train_sizes, test_sizes=200000, random_state=None, balanced=True, normalise=True, degree=1, pickle_path=None)[source]¶

Compute the learning curve of a classiifer.

Parameters:	data (DataFrame) – The DataFrame containing all the data. feature_cols (array) – A list of column names in data that are used as features. target_col (str) – The column name of the target. classifier (Classifier object) – A classifier object that will be used to train and test the data. It should have the same interface as scikit-learn classifiers. train_sizes (array) – The list of the sample sizes that the classifier will be trained on. test_sizes (int or list of ints) – The sizes of the test set. random_state (int) – The value of the Random State (used for reproducibility). normalise (boolean) – Whether we should first normalise the data to zero mean and unit variance. degree (int) – If greater than 1, the data will first be polynomially transformed with the given degree. pickle_path (str) – The path where the values of the learning curve will be saved.
Returns:	lc_accuracy_test – The list of balanced accuracy scores for the given sample sizes.
Return type:	array

Table Of Contents

Search

7.1.3. mclearn.classifier.learning_curve¶