
Unable to extract confusion matrix as a metric from trainer #19835

Open
lathashree01 opened this issue May 1, 2024 · 1 comment
Labels
bug (Something isn't working) · needs triage (Waiting to be triaged by maintainers)

Comments


lathashree01 commented May 1, 2024

Bug description

Hi team,

During testing, I would like to extract a BinaryConfusionMatrix as a metric from the trainer.

I can see the value being computed successfully, but it fails inside the trainer's scalar conversion because the result tensor contains more than one element:

from typing import Any, Union

from lightning_utilities.core.apply_func import apply_to_collection
from torch import Tensor


def convert_tensors_to_scalars(data: Any) -> Any:
    """Recursively walk through a collection and convert single-item tensors to scalar values.

    Raises:
        ValueError:
            If tensors inside ``metrics`` contain multiple elements, hence preventing conversion to a scalar.

    """

    def to_item(value: Tensor) -> Union[int, float, bool]:
        if value.numel() != 1:
            raise ValueError(
                f"The metric `{value}` does not contain a single element, thus it cannot be converted to a scalar."
            )
        return value.item()

    return apply_to_collection(data, Tensor, to_item)
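The failure can be reproduced in isolation with the tensor from the error log below (a minimal sketch; `to_item` is re-stated here outside Lightning purely for illustration):

```python
import torch


def to_item(value: torch.Tensor):
    # Mirrors the check inside convert_tensors_to_scalars.
    if value.numel() != 1:
        raise ValueError(
            f"The metric `{value}` does not contain a single element, "
            "thus it cannot be converted to a scalar."
        )
    return value.item()


cm = torch.tensor([[343, 497], [0, 4]])  # a 2x2 confusion matrix: 4 elements
try:
    to_item(cm)
except ValueError:
    print("a 2x2 matrix cannot be logged as a scalar")  # this branch runs

print(to_item(torch.tensor([0.5])))  # single-element tensors convert fine
```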

PS: I am using the Anomalib library, which is built on PyTorch Lightning.

How do I resolve this, or is there another way to retrieve the matrix? Any help would be greatly appreciated.

Thanks

What version are you seeing the problem on?

v2.2

How to reproduce the bug

Extract BinaryConfusionMatrix as a metric during trainer.test

Error messages and logs

ValueError: The metric `tensor([[343, 497],
        [  0,   4]])` does not contain a single element, thus it cannot be converted to a scalar.

Environment

Current environment
#- Lightning Component (e.g. Trainer, LightningModule, LightningApp, LightningWork, LightningFlow):
#- PyTorch Lightning Version (e.g., 1.5.0):
#- Lightning App Version (e.g., 0.5.2):
#- PyTorch Version (e.g., 2.0):
#- Python version (e.g., 3.9):
#- OS (e.g., Linux):
#- CUDA/cuDNN version:
#- GPU models and configuration:
#- How you installed Lightning(`conda`, `pip`, source):
#- Running environment of LightningApp (e.g. local, cloud):

More info

No response

lathashree01 added the bug and needs triage labels May 1, 2024
Contributor

ryan597 commented May 3, 2024

As the error suggests, a logged metric must be a single value. You could instead log the individual elements of the confusion matrix, e.g. the true positives, false positives, true negatives, and false negatives:

self.log_dict({"bcm/tp": torch.rand(1),
               "bcm/fp": torch.rand(1),
               "bcm/tn": torch.rand(1),
               "bcm/fn": torch.rand(1),
               })
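To fill those entries with real values rather than `torch.rand` placeholders, the four cells of the 2x2 matrix can be unpacked directly (a sketch using the matrix from the error log; torchmetrics' `BinaryConfusionMatrix` lays the matrix out as `[[tn, fp], [fn, tp]]`):

```python
import torch

# Matrix taken from the error log above; layout is [[tn, fp], [fn, tp]].
matrix = torch.tensor([[343, 497], [0, 4]])
tn, fp, fn, tp = matrix.flatten().tolist()

scalars = {"bcm/tp": tp, "bcm/fp": fp, "bcm/tn": tn, "bcm/fn": fn}
# Inside a LightningModule you would then call: self.log_dict(scalars)
```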

or you could log the confusion matrix as an image

from lightning.pytorch.loggers import TensorBoardLogger

logger = TensorBoardLogger("tb_logs")  # save_dir is required
logger.experiment.add_image("Confusion Matrix", torch.rand(1, 2, 2), global_step=self.global_step)

using the BinaryConfusionMatrix from torchmetrics

bcm = BinaryConfusionMatrix(normalize='all')
matrix = bcm(torch.randint(0, 2, (1, 50)), torch.randint(0, 2, (1, 50)))  # just making some random predictions and labels to compute BCM
logger.experiment.add_image("bcm", matrix[None], global_step=self.global_step)

or you could use other functions of your logger directly to save the tensor
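For example, the raw tensor can simply be written to disk from a hook such as `on_test_epoch_end` (a sketch using plain `torch.save`; the directory and filename are arbitrary):

```python
import os
import tempfile

import torch

# Persist the full matrix to disk; no scalar conversion is needed.
matrix = torch.tensor([[343, 497], [0, 4]])
path = os.path.join(tempfile.mkdtemp(), "confusion_matrix.pt")
torch.save(matrix, path)

loaded = torch.load(path)  # round-trips the exact tensor
```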
