fix `_multiclass_stat_scores_update` in classification #3078

ved1beta · 2025-04-30T16:25:35Z

What does this PR do?

This PR fixes an issue with the multiclass accuracy calculation when using top_k > 1 with average="micro". The bug caused incorrect accuracy calculations in scenarios where predictions were provided as logits/probabilities and the correct class needed to be identified among the top-k predictions.

The fix ensures that:

The one-hot encoding path is always used when top_k > 1, regardless of the averaging method
A special case is added in multiclass_accuracy to properly handle top-k with micro averaging
Top-k selection is consistently applied across all evaluation scenarios

The PR includes test cases that demonstrate the issue and verify the fix works correctly. These tests show the alignment between manual calculations of top-k accuracy and the results from the metric implementation.

Before submitting

Was this discussed/agreed via a Github issue? (no need for typos and docs improvements)
Did you read the contributor guideline, Pull Request section?
Did you make sure to update the docs?
Did you write any new necessary tests?

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

📚 Documentation preview 📚: https://torchmetrics--3078.org.readthedocs.build/en/3078/

fixes #3068

for more information, see https://pre-commit.ci

rittik9

Hi @ved1beta thanks for opening this pr,
but I think these issues are still there even with the updated code

>>> logits
tensor([[[0.0000, 0.1000, 0.5000, 0.4000],
         [0.0000, 0.2000, 0.7000, 0.1000]],

        [[0.0000, 0.4000, 0.3000, 0.3000],
         [1.0000, 0.0000, 0.0000, 0.0000]]])
>>> code
tensor([[3, 2],
        [1, 0]])
>>> logits.shape
torch.Size([2, 2, 4])
>>> code.shape
torch.Size([2, 2])
>>> acc = Accuracy(task="multiclass", ignore_index=0, num_classes=4, multidim_average="global", average="micro", top_k=4)
>>> acc(logits.transpose(2, 1), code)
tensor(0.6667)
>>> acc = Accuracy(task="multiclass", ignore_index=0, num_classes=4, multidim_average="global", average="micro", top_k=3)
>>> acc(logits.transpose(2, 1), code)
tensor(0.6667)
>>> acc = Accuracy(task="multiclass", ignore_index=0, num_classes=4, multidim_average="global", average="micro", top_k=2)
>>> acc(logits.transpose(2, 1), code)
tensor(0.6667)
>>> acc = Accuracy(task="multiclass", ignore_index=0, num_classes=4, multidim_average="global", average="micro", top_k=1)
>>> acc(logits.transpose(2, 1), code)
tensor(0.6667)

can you pls add them as unittests and recheck your implementation...

fix _multiclass_stat_scores_update

f900b97

ved1beta requested review from SkafteNicki, Borda and justusschock as code owners April 30, 2025 16:25

github-actions bot added the topic: Classif label Apr 30, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

44b9ed5

for more information, see https://pre-commit.ci

rittik9 reviewed May 8, 2025

View reviewed changes

Borda added the waiting on author label May 14, 2025

Borda changed the title ~~fix _multiclass_stat_scores_update~~ fix _multiclass_stat_scores_update in classification May 20, 2025

Borda marked this pull request as draft June 10, 2025 10:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix `_multiclass_stat_scores_update` in classification #3078

fix `_multiclass_stat_scores_update` in classification #3078

Uh oh!

ved1beta commented Apr 30, 2025 •

edited

Loading

Uh oh!

rittik9 left a comment

Uh oh!

Uh oh!

fix _multiclass_stat_scores_update in classification #3078

Are you sure you want to change the base?

fix _multiclass_stat_scores_update in classification #3078

Uh oh!

Conversation

ved1beta commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Did you have fun?

Uh oh!

rittik9 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fix `_multiclass_stat_scores_update` in classification #3078

fix `_multiclass_stat_scores_update` in classification #3078

ved1beta commented Apr 30, 2025 •

edited

Loading