
Support for 3D Conv-Net #466

Merged
merged 5 commits into from
May 28, 2024

Conversation

kevinkevin556
Contributor

Hi all,

Thank you for developing such a nice repo. I've been using it in many of my projects for network explainability, and it has been incredibly convenient!

Recently, I've been working with medical datasets using 3D-UNet. However, I noticed that 3D convolution is not yet supported in this library, and there are issues such as #351 requesting this feature. Therefore, I made several changes to GradCAM and BaseCAM to extend GradCAM to support 3D images.

Please let me know if you have any questions or suggestions regarding the changes I've implemented. I'm excited to contribute to this project and look forward to your feedback!

@jacobgil
Owner

jacobgil commented Dec 9, 2023

Hey, sorry for the late reply.
Thanks a lot for this functionality, this will be great to merge.

Is there a way to share an example use case for this: maybe some model and an input image example,
or an image example for the readme?

weights = self.get_cam_weights(input_tensor,
                               target_layer,
                               targets,
                               activations,
                               grads)
weighted_activations = weights[:, :, None, None] * activations
w_shape = (slice(None), slice(None)) + (None,) * (len(activations.shape) - 2)
Owner

This line is a bit less straightforward to understand.
Can you please explain what's going on here?
Do you think there is a way to rewrite it to be clearer?

Contributor Author

That line does exactly the same thing as

# 2D conv
if len(activations.shape) == 4:
  weighted_activations = weights[:, :, None, None] * activations

# 3D conv
elif len(activations.shape) == 5:   
  weighted_activations = weights[:, :, None, None, None] * activations

But I think you are right: it does hurt readability.
I will rewrite the code here.
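The equivalence between the dynamic `w_shape` index and the explicit 2D/3D versions can be checked with a small NumPy sketch (random arrays stand in for the real weights and activations; the shapes are illustrative):

```python
import numpy as np

# Hypothetical 3D-conv activations: (batch, channels, depth, height, width)
activations = np.random.rand(2, 8, 4, 6, 6)
# Per-channel CAM weights: (batch, channels)
weights = np.random.rand(2, 8)

# Dynamic version: append one broadcast (None) axis per spatial dimension,
# so the same line works for both 2D (4-dim) and 3D (5-dim) activations.
w_shape = (slice(None), slice(None)) + (None,) * (activations.ndim - 2)
dynamic = weights[w_shape] * activations

# Explicit 3D version, as written out in the comment above
explicit = weights[:, :, None, None, None] * activations

assert np.allclose(dynamic, explicit)
```

Indexing with a tuple of `slice(None)` and `None` entries is exactly what `weights[:, :, None, None, None]` desugars to, which is why the two forms agree for any number of spatial dimensions.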

@kevinkevin556
Contributor Author

@jacobgil Thanks for your reply!

> Is there a way to share an example use case for this: maybe some model and an input image example, or an image example for the readme?

I added an animation of gradcam-visualized CT scans in the readme.
Hope this can make it clearer.

@Syax19

Syax19 commented Jan 17, 2024

@kevinkevin556 Thanks for providing the code for applying Grad-Cam on 3D CNN!

I have used your code to get the Grad-CAM outputs. My input 3D image tensor has size (1, 1, 24, 224, 224), representing (batch, channel, depth, height, width), and the resulting grayscale_cam output has size (1, 24, 224, 224).
If I take one slice of the output, for example depth=11, i.e. outputs[0][11, :, :] in (height, width), does it correspond to input_image[:, 11, :, :] in (channel, depth, height, width) order?
I ask because every depth slice of the output heatmap looks the same.

Looking forward to your reply, thanks!

@kevinkevin556
Contributor Author

> I have used your code to get the Grad-CAM outputs. My input 3D image tensor has size (1, 1, 24, 224, 224), representing (batch, channel, depth, height, width), and the resulting grayscale_cam output has size (1, 24, 224, 224). If I take one slice of the output, for example depth=11, i.e. outputs[0][11, :, :] in (height, width), does it correspond to input_image[:, 11, :, :] in (channel, depth, height, width) order? I ask because every depth slice of the output heatmap looks the same.

@Syax19 Sorry for the late reply. I'm glad to hear that someone is using it 😄

Although I followed MONAI's convention to assign each dimension in the order of (height, width, depth), the output dimensions should still correspond with your input tensor, as there is no dimension swap when calculating Grad-CAM.

Therefore, the grayscale_cam of size (1, 24, 224, 224) represents dimensions (batch, depth, height, width) in your case.
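A minimal shape check illustrates the correspondence, using zero arrays as stand-ins for the actual tensors discussed above (the sizes match the example in this thread; no Grad-CAM computation is performed here):

```python
import numpy as np

# Stand-ins for the tensors in the discussion above
input_image = np.zeros((1, 1, 24, 224, 224))   # (batch, channel, depth, height, width)
grayscale_cam = np.zeros((1, 24, 224, 224))    # (batch, depth, height, width)

# The heatmap slice at depth index 11 ...
cam_slice = grayscale_cam[0, 11]               # shape (224, 224)

# ... overlays the input slice at the same depth index
img_slice = input_image[0, :, 11]              # shape (1, 224, 224)

print(cam_slice.shape, img_slice.shape)
```

Since no dimension swap happens during the CAM computation, depth index 11 of the heatmap always maps to depth index 11 of the input volume.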

@MoH-assan

@jacobgil
Any update on this feature?

@jacobgil
Owner

This is incredible functionality, thank you so much for contributing this, and sorry for being so late with my reply.
I really want to merge this.
The .gif file weighs 24 MB, which is a bit much; I will look into resizing it.

@jacobgil jacobgil changed the base branch from master to 3d May 28, 2024 18:41
@jacobgil jacobgil changed the base branch from 3d to master May 28, 2024 18:43
@jacobgil jacobgil merged commit 3f6b14d into jacobgil:master May 28, 2024
1 check failed
@jacobgil
Owner

@kevinkevin556 merged!! better late than never. Thank you so much for this contribution!
