Use metrics to tune the performance of your machine learning model.
- Create ML
- Create MLUI
Evaluating and improving your model starts with looking at its performance across different data sets. Metrics from each dataset inform which changes have the most impact on your model’s accuracy.
No single metric can tell you everything about your model’s performance. To improve your model, you compare the metrics (
MLRegressor depending on your model type) among your training, validation, and testing data sets. For example, the accuracy discussed in the Creating an Image Classifier Model article is derived from the
classification metric for each data set.
You can also access these values programmatically after creating a model and loading your testing data:
In this case, you see output for several different metrics, including
confusion for classifiers and
root for regressors. Use these values from each data set to determine where your model needs to improve.
Improve Your Model’s Training Accuracy
If the training accuracy of your model is low, it’s an indication that your current model configuration can’t capture the complexity of your data.
Try adjusting the training parameters. When working with image data, double the maximum number of iterations in the
MLImage playground UI (the default value is 10).
For natural language data, try a different underlying algorithm (see
MLText). For more general tasks, use a different underlying model than the type determined by
MLClassifier (see Supporting Classifier Types) or
MLRegressor (see Supporting Regressor Types).
Improve Your Model’s Validation Accuracy
If your model’s accuracy on the validation set is low or fluctuates between low and high each time you train the model, you need more data. You can generate more input data from the examples you already collected, a technique known as data augmentation. For image data, you can combine operations like cropping, rotation, blurring, and exposure adjustment to make one image into many examples.
It’s also possible for you to have lots of data and validation accuracy that is still significantly lower than your training accuracy. In this case, your model is overfitting, meaning that it’s learning too many specific details about your training set that don’t generally apply to other examples. In this case, you need to reduce the number of training iterations to prevent the model from learning too much about your training data.
Improve Your Model’s Evaluation Accuracy
If your model’s accuracy on your testing data is lower than your training or validation accuracy, it usually indicates that there are meaningful differences between the kind of data you trained the model on and the testing data you’re providing for evaluation.
For example, suppose you train your
MLImage on many images of indoor cats, but then test only on images of outdoor cats. Because of the differences in lighting, exposure, and background, it’s unlikely that your testing data will yield good results. Differences between images that seem obvious to humans can be difficult for a model to resolve without sufficient training data.
To correct for this, provide more diverse data in your training set. In general, more examples lead to higher performance, but it’s also important to show your model examples that are as varied your testing data.