Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: [TensorRT-LLM Server not work] #2834

Closed
tmape opened this issue Apr 26, 2024 · 1 comment
Closed

bug: [TensorRT-LLM Server not work] #2834

tmape opened this issue Apr 26, 2024 · 1 comment
Labels
type: bug Something isn't working

Comments

@tmape
Copy link

tmape commented Apr 26, 2024

Describe the bug
I am attempting to connect Open Web-UI to access Jan's server for utilizing the TensorRT-LLM model (Mistral 7B Instructions v0.1 INT4). However, I am experiencing issues and it does not work as expected. I have tried switching to a standard gguf model (Meta-Llama-3-8B-Instruct.Q8_0), which functions correctly. When chatting with Jan, both the TensorRT-LLM and gguf models work as expected.

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your issue.

Environment details

  • Operating System: [Windows 11]
  • Jan Version: [0.4.12]
  • Processor: [Intel Core i7]
  • RAM: [64GB]
  • Any additional relevant hardware specifics: [RTX 4060]

Logs

2024-04-26T09:19:08.690Z [SERVER]::{"reqId":"req-t","res":{},"req":{"method":"POST","url":"/v1/chat/completions","hostname":"192.168.0.101:1337"},"msg":"incoming request"}

2024-04-26T09:19:08.832Z [SERVER]::{"reqId":"req-t","res":{"statusCode":500},"req":{},"msg":"request completed","responseTime":133.75729999993928}

Additional context

  • Server Options: 0.0.0.0, 1337
  • API Prefix: /v1
  • Cross-Origin-Resource-Sharing (CORS): Disable
  • Verbose Server Logs: Enable
@tmape tmape added the type: bug Something isn't working label Apr 26, 2024
@louis-jan
Copy link
Contributor

Duplicated
#2373

@louis-jan louis-jan closed this as not planned Won't fix, can't repro, duplicate, stale May 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: bug Something isn't working
Projects
Status: Done
Development

No branches or pull requests

2 participants