
importing xformers.ops implicitly initializes CUDA context #1030

Open
function2-llx opened this issue Apr 20, 2024 · 3 comments

Comments

@function2-llx
Currently, importing xformers.ops implicitly initializes the CUDA context. This has the unpleasant side effect that we can no longer use the "fork" multiprocessing start method.

The line of code that initializes the CUDA context is:

if torch.cuda.get_device_capability("cuda") < (8, 0):
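For context, here is a minimal sketch (not from the issue itself) of why this matters: a module-level capability query like the line above is enough to create a CUDA context in the parent process, after which children created with the "fork" start method cannot use CUDA.

```python
import multiprocessing as mp

import torch

# A module-level capability query, like the one above, is enough to create a
# CUDA context in the importing (parent) process.
torch.cuda.get_device_capability("cuda")
assert torch.cuda.is_initialized()


def worker(_):
    # A child created with "fork" inherits the already-initialized CUDA context;
    # PyTorch then raises "Cannot re-initialize CUDA in forked subprocess" on the
    # first CUDA call in the child.
    return torch.ones(1, device="cuda").item()


if __name__ == "__main__":
    ctx = mp.get_context("fork")
    with ctx.Pool(1) as pool:
        pool.map(worker, [0])  # fails because CUDA was initialized before the fork
```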

@danthe3rd
Contributor

Hi,
Thanks for reporting this issue. Unfortunately it might take more effort than fixing just this line, as we check for device capabilities in multiple other places as well...
@fmassa @bottler any ideas?

@bottler
Contributor

bottler commented May 2, 2024

Fixing this would be good for cutting import times.

We need _is_triton_available to be called only when a public function is called, not at import time of public modules. I think we could do that.
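A sketch of the deferral pattern being described, assuming a cached helper; this is not xformers' actual code, and _device_has_sm80 / some_public_op are made-up names:

```python
import functools

import torch


@functools.lru_cache(maxsize=None)
def _device_has_sm80() -> bool:
    # Hypothetical lazy helper: the CUDA context is created only the first time
    # a public function needs the answer, never at import time.
    return torch.cuda.is_available() and torch.cuda.get_device_capability("cuda") >= (8, 0)


def some_public_op(*args, **kwargs):
    # Illustrative public entry point: the capability check runs here, on the
    # first call, instead of at module import.
    if _device_has_sm80():
        ...  # dispatch to the sm80+ path
    else:
        ...  # dispatch to the fallback path
```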

facebook-github-bot pushed a commit that referenced this issue May 7, 2024
See #1030

(I am not updating xformers/components here.)

ghstack-source-id: 56136b9a2d20448cbd9546fa4182e10c0ae83d51
Pull Request resolved: fairinternal/xformers#1094

__original_commit__ = fairinternal/xformers@df89e3c
@bottler
Contributor

bottler commented May 7, 2024

It's possible that commit 737c2e6, which just landed, fixes this.
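A quick way to check whether the fix works (assuming no other code has touched CUDA before the import):

```python
import torch
import xformers.ops  # noqa: F401

# If the fix landed, importing the public module no longer creates a CUDA context.
assert not torch.cuda.is_initialized()
```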
