Comprehensive and Detailed Explanation From Exact AWS AI documents:
The F1 score is a standard evaluation metric that represents the harmonic mean of precision and recall.
In AWS ML evaluation guidance:
Precision measures correctness of positive predictions
Recall measures coverage of actual positive cases
F1 score balances both metrics into a single performance indicator
This makes the F1 score particularly useful when evaluating classification performance of foundation models.
Why the other options are incorrect:
Speed (B) is a latency metric.
Cost (C) measures operational efficiency.
Energy efficiency (D) is unrelated to predictive accuracy.
AWS AI document references:
Model Evaluation Metrics on AWS
Classification Performance Measurement
Amazon SageMaker Evaluation Best Practices