Pull requests: vllm-project/vllm

[Kernel] Add flash-attn back
#4907 opened May 19, 2024 by WoosukKwon Draft
[CI/Build] Make marlin kernel build conditional.
#4905 opened May 19, 2024 by esmeetu
[Kernel] Add marlin_24 unit tests
#4901 opened May 18, 2024 by alexm-neuralmagic
Update test_ignore_eos
#4898 opened May 18, 2024 by simon-mo
[Core] Fix scheduler considering "no LoRA" as "LoRA"
#4897 opened May 18, 2024 by Yard1
[Misc] Load FP8 kv-cache scaling factors from checkpoints
#4893 opened May 17, 2024 by comaniac
1 task
[Core] Sharded State Loader download from HF
#4889 opened May 17, 2024 by aurickq
[Model] Add Phi-2 LoRA support
#4886 opened May 17, 2024 by Isotr0py
[Bugfix] Fix with verifying model max len
#4885 opened May 17, 2024 by dimaioksha
[Build/CI] Extending AMD Tests
#4875 opened May 17, 2024 by Alexei-V-Ivanov-AMD
[CI/Build] Add health check
#4868 opened May 16, 2024 by pseudotensor
[Doc] Add page for PoolingParams
#4800 opened May 14, 2024 by DarkLight1337