
fp8 support #2304

Open
vince62s opened this issue Feb 2, 2023 · 6 comments
vince62s (Member) commented Feb 2, 2023

If someone is motivated, there could be some adaptation work to support FP8 (on some hardware) using this new library:

https://github.com/NVIDIA/TransformerEngine

cc: @guillaumekln @francoishernandez

vince62s (Member, Author) commented Feb 2, 2023

Well, no hurry for the RTX 4090; it is not ready yet.

> Hi All,
>
> First of all, I'm really sorry for the prolonged silence on this issue - I did not want to communicate anything before getting a full alignment internally.
>
> As noted in the RTX 4090 announcement and Ada whitepaper, Ada has FP8 TensorCore hardware. However, the software support for them is not currently available - e.g. there is no support for it exposed in cuBLASLt currently. The reason for it is that both the FP8 TC instruction as well as other features used in the fast FP8 GEMM kernels are different between Hopper and Ada (meaning a different set of kernels is required for both architectures) and the Hopper support was prioritized. Once the FP8 support lands in CUDA and its libraries (tentatively scheduled for CUDA 12.1 in Q2), Transformer Engine will also fully support Ada.

Read from here: NVIDIA/TransformerEngine#15

oscarbg commented Mar 1, 2023

@vince62s CUDA 12.1 has been released; can Ada support be worked on now?

AaronZLT commented Mar 7, 2023

@oscarbg It didn't work. `cublasLtMatmul` and `cublasLtMatrixTransform` still fail for `__nv_fp8_e4m3` operands on a 4090 with the newest CUDA 12.1 (as of 2023-03-07). Can anyone get this working, or is it simply not supported in CUDA yet?
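For readers following along, here is a minimal Python sketch of the E4M3 number format behind `__nv_fp8_e4m3`: 1 sign bit, 4 exponent bits (bias 7), 3 mantissa bits, no infinities, and a maximum finite value of 448. The helper names (`e4m3_decode`, `e4m3_quantize`) are made up for illustration; this does not touch cuBLASLt or any CUDA API.

```python
import math

def e4m3_decode(bits: int) -> float:
    """Decode an 8-bit E4M3 pattern to a Python float (NVIDIA/OCP variant:
    no infinities; exponent=15 with mantissa=7 encodes NaN)."""
    sign = -1.0 if bits & 0x80 else 1.0
    exp = (bits >> 3) & 0xF
    man = bits & 0x7
    if exp == 0xF and man == 0x7:
        return math.nan
    if exp == 0:  # subnormal: no implicit leading 1, exponent fixed at -6
        return sign * (man / 8.0) * 2.0 ** -6
    return sign * (1.0 + man / 8.0) * 2.0 ** (exp - 7)

# All representable finite values, used for nearest-value quantization below.
FINITE = sorted({e4m3_decode(b) for b in range(256)
                 if not math.isnan(e4m3_decode(b))})

def e4m3_quantize(x: float) -> float:
    """Round x to the nearest representable E4M3 value (saturates at +/-448)."""
    return min(FINITE, key=lambda v: abs(v - x))

print(e4m3_decode(0x38))   # 1.0   (exponent field 7 -> 2^0, mantissa 0)
print(max(FINITE))         # 448.0 (largest finite E4M3 value: 1.75 * 2^8)
print(e4m3_quantize(0.3))  # 0.3125 (nearest representable value to 0.3)
```

The narrow dynamic range and coarse mantissa are why FP8 training relies on per-tensor scaling, which is what Transformer Engine manages on supported hardware.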

sbhavani commented

vince62s (Member, Author) commented

Great, I'll give it a try when I get some time.

vince62s (Member, Author) commented

I tried it, but it is clearly not so easy to make it work in our scenario:
NVIDIA/TransformerEngine#230
