NVIDIA / cutlass Public

Notifications You must be signed in to change notification settings
Fork 816
Star 4.7k

Code
Issues 70
Pull requests 30
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Issues: NVIDIA/cutlass

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

70 Open 808 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

Tiled copy misaligned, how to solve it? ? - Needs Triage question

Question

#1561 opened May 30, 2024 by 4grass

Warp Group MMA vs Warp MMA ? - Needs Triage question

Question

#1560 opened May 30, 2024 by OrenLeung

[QST] Best way to tell which methods are called ? - Needs Triage question

Question

#1558 opened May 29, 2024 by dscotthunter

[QST] why cute kernel transfers so much data between L2 and gmen than cublas kernel ? - Needs Triage question

Question

#1556 opened May 29, 2024 by irasin

[QST]How to implement different type between D0(D1) and D2 based on 45_dual_gemm example ? - Needs Triage question

Question

#1555 opened May 29, 2024 by Sunny-bot1

[QST] The best way to do D = func(A x B) x C. ? - Needs Triage question

Question

#1551 opened May 27, 2024 by amazingyyc

[QST] epilogue in HGEMM ? - Needs Triage question

Question

#1550 opened May 27, 2024 by irasin

[QST] Hopper mixed precision gemm always worse than FP8 ? - Needs Triage question

Question

#1549 opened May 24, 2024 by divchenko

[BUG] Cutlass Python API silently fails in (suspected) unsupported case ? - Needs Triage bug

Something isn't working

#1547 opened May 23, 2024 by LucasWilkinson

[QST] Row major for int8 matrix multiplications? ? - Needs Triage question

Question

#1533 opened May 10, 2024 by ken012git

[QST] cutlass::Array and cute::Tensor --- using CUTLASS utility structs/classes with CUTE (such as NumericArrayConverter) ? - Needs Triage question

Question

#1532 opened May 10, 2024 by HanGuo97

[QST/BUG] Should shared memory usage be checked for multistage pipeline? ? - Needs Triage question

Question

#1525 opened May 7, 2024 by wzhcz8902

[BUG] Composition between Tensor and Layout as shown in 03_tensor.md does not compile ? - Needs Triage bug

Something isn't working

#1519 opened Apr 30, 2024 by armbuster

[QST] Epilogue Reduction ? - Needs Triage question

Question

#1518 opened Apr 30, 2024 by jeromeku

[QST] use FastLinearCombinationClamp to convert half accumulator to int8_t output? ? - Needs Triage inactive-30d question

Question

#1516 opened Apr 30, 2024 by ken012git

two files are included in each other inactive-30d

#1514 opened Apr 29, 2024 by wzhcz8902

typo in comment inactive-30d

#1513 opened Apr 29, 2024 by wzhcz8902

[BUG] Broken copy.hpp bug

Something isn't working

inactive-30d

#1508 opened Apr 28, 2024 by kroburg

[BUG] Autovectorized copy fails with shape (_2, _3) bug

Something isn't working

inactive-30d

#1499 opened Apr 23, 2024 by YichengDWu

[QST] 2D Convolution for NCHW Row-Major images, kernels and output ? - Needs Triage inactive-30d question

Question

#1496 opened Apr 19, 2024 by chart21

[QST] make error caused by glibc 2.32 in ubuntu 18.04 "//usr/local/lib/libpthread.so.0: undefined reference to" ? - Needs Triage inactive-30d question

Question

#1490 opened Apr 17, 2024 by Zmy6

[QST] StreamK ReductionStrategy: "Atomic" or "Mixed" inactive-30d question

Question

#1488 opened Apr 16, 2024 by HanGuo97

[BUG] Disordered header files: detail::is_prefetch used before declaration. bug

Something isn't working

inactive-30d

#1484 opened Apr 15, 2024 by rchardx

[FEA] FP8 grouped gemm kernel without TMA feature request

New feature or request

inactive-30d

#1483 opened Apr 15, 2024 by masahi

[QST] What is the easiest way to partition/slice a tensor in CUTE to get a subtensor? inactive-30d question

Question

#1476 opened Apr 12, 2024 by srikantvv

Previous 1 2 3 Next

Previous Next

ProTip! What’s not been updated in a month: updated:<2024-04-29.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly