-
Notifications
You must be signed in to change notification settings - Fork 816
Issues: NVIDIA/cutlass
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Tiled copy misaligned, how to solve it?
? - Needs Triage
question
Question
#1561
opened May 30, 2024 by
4grass
Warp Group MMA vs Warp MMA
? - Needs Triage
question
Question
#1560
opened May 30, 2024 by
OrenLeung
[QST] Best way to tell which methods are called
? - Needs Triage
question
Question
#1558
opened May 29, 2024 by
dscotthunter
[QST] why cute kernel transfers so much data between L2 and gmen than cublas kernel
? - Needs Triage
question
Question
#1556
opened May 29, 2024 by
irasin
[QST]How to implement different type between D0(D1) and D2 based on 45_dual_gemm example
? - Needs Triage
question
Question
#1555
opened May 29, 2024 by
Sunny-bot1
[QST] The best way to do D = func(A x B) x C.
? - Needs Triage
question
Question
#1551
opened May 27, 2024 by
amazingyyc
[QST] Hopper mixed precision gemm always worse than FP8
? - Needs Triage
question
Question
#1549
opened May 24, 2024 by
divchenko
[BUG] Cutlass Python API silently fails in (suspected) unsupported case
? - Needs Triage
bug
Something isn't working
#1547
opened May 23, 2024 by
LucasWilkinson
[QST] Row major for int8 matrix multiplications?
? - Needs Triage
question
Question
#1533
opened May 10, 2024 by
ken012git
[QST] Question
cutlass::Array
and cute::Tensor
--- using CUTLASS utility structs/classes with CUTE (such as NumericArrayConverter
)
? - Needs Triage
question
#1532
opened May 10, 2024 by
HanGuo97
[QST/BUG] Should shared memory usage be checked for multistage pipeline?
? - Needs Triage
question
Question
#1525
opened May 7, 2024 by
wzhcz8902
[BUG] Composition between Something isn't working
Tensor
and Layout
as shown in 03_tensor.md
does not compile
? - Needs Triage
bug
#1519
opened Apr 30, 2024 by
armbuster
[QST] use FastLinearCombinationClamp to convert half accumulator to int8_t output?
? - Needs Triage
inactive-30d
question
Question
#1516
opened Apr 30, 2024 by
ken012git
[BUG] Autovectorized copy fails with shape (_2, _3)
bug
Something isn't working
inactive-30d
#1499
opened Apr 23, 2024 by
YichengDWu
[QST] 2D Convolution for NCHW Row-Major images, kernels and output
? - Needs Triage
inactive-30d
question
Question
#1496
opened Apr 19, 2024 by
chart21
[QST] make error caused by glibc 2.32 in ubuntu 18.04 "//usr/local/lib/libpthread.so.0: undefined reference to"
? - Needs Triage
inactive-30d
question
Question
#1490
opened Apr 17, 2024 by
Zmy6
[QST] StreamK ReductionStrategy: "Atomic" or "Mixed"
inactive-30d
question
Question
#1488
opened Apr 16, 2024 by
HanGuo97
[BUG] Disordered header files: detail::is_prefetch used before declaration.
bug
Something isn't working
inactive-30d
#1484
opened Apr 15, 2024 by
rchardx
[FEA] FP8 grouped gemm kernel without TMA
feature request
New feature or request
inactive-30d
#1483
opened Apr 15, 2024 by
masahi
[QST] What is the easiest way to partition/slice a tensor in CUTE to get a subtensor?
inactive-30d
question
Question
#1476
opened Apr 12, 2024 by
srikantvv
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-04-29.