Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] got error when pip install. docker img works though, python ver3.11
#1633
opened May 21, 2024 by
fyxc
2 tasks done
使用KV cache(int8或int4)量化internvl-v1.5后,显存反而增加了
#1626
opened May 21, 2024 by
qingchunlizhi
1 of 2 tasks
[Feature] Layer Wise Calibration and Quantization of Models (To quantize model on Low VRAM GPU)
#1625
opened May 21, 2024 by
Tushar-ml
[Bug] Unrecognized configuration class when quantizing llava
#1601
opened May 16, 2024 by
zjysteven
2 tasks done
AsyncEngine 的 stream_infer 函数增加手动传入session_id,实现多次调用 stream_infer 时的并行推理[Feature]
#1590
opened May 14, 2024 by
NagatoYuki0943
[Bug] change h_input_length_buf_ before synchronization
#1584
opened May 11, 2024 by
mengmeexix
2 tasks done
[Feature] Support for LLaVA-NeXT Qwen1.5-110, Qwen1.5-72B, LLaMA3-8B
awaiting response
#1583
opened May 11, 2024 by
Iven2132
[Feature] 是否支持enc-dec类型模型中decoder的persistent batch
awaiting response
#1581
opened May 10, 2024 by
Oldpan
[Bug] 使用docker部署internlm/internlm-xcomposer-vl-7b和internlm/internlm-xcomposer2-vl-7b均报错
#1577
opened May 10, 2024 by
ye7love7
2 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.