3.3. Metrics and scoring: quantifying the quality of predictions — scikit-learn 1.3.2 documentation
scikit learn - What's the difference between Sklearn F1 score 'micro' and 'weighted' for a multi class classification problem? - Data Science Stack Exchange
An Introduction to Inter-Annotator Agreement and Cohen's Kappa Statistic