Releases: xorbitsai/inference

v0.11.3

31 May 09:28
69c09cd

What's new in 0.11.3 (2024-05-31)

These are the changes in inference v0.11.3.

New features

Enhancements

Bug fixes

  • BUG: fix model launch error when using torch 2.3.0 by @amumu96 in #1543
  • BUG: fix vl-model img path error by @amumu96 in #1559
  • BUG: Fix validation errors when defining a custom baichuan-chat LLM model by @buptzyf in #1557

Documentation

  • DOC: update readme and fix description about model engine by @qinxuye in #1566

Others

New Contributors

Full Changelog: v0.11.2...v0.11.3

v0.11.2.post1

24 May 11:52
ac8f334

What's new in 0.11.2.post1 (2024-05-24)

These are the changes in inference v0.11.2.post1, a hotfix version of v0.11.2.

Bug fixes

  • BUG: fix model launch error when using torch 2.3.0 by @amumu96 in #1543

Full Changelog: v0.11.2...v0.11.2.post1

v0.11.2

24 May 09:10
77e79f8

What's new in 0.11.2 (2024-05-24)

These are the changes in inference v0.11.2.

New features

Enhancements

Bug fixes

  • BUG: Fix worker failing to start due to None device name by @codingl2k1 in #1539
  • BUG: Fix gpu_idx allocation error when replica > 1 is set by @amumu96 in #1528

Others

Full Changelog: v0.11.1...v0.11.2

v0.11.1

17 May 07:17
55a0200

What's new in 0.11.1 (2024-05-17)

These are the changes in inference v0.11.1.

New features

  • FEAT: support Yi-1.5 series by @qinxuye in #1489
  • FEAT: [UI] embedding and rerank models support specifying GPU or CPU by @yiboyasss in #1491

Enhancements

Bug fixes

Documentation

New Contributors

Full Changelog: v0.11.0...v0.11.1

v0.11.0

11 May 09:41
21be5ab

What's new in 0.11.0 (2024-05-11)

These are the changes in inference v0.11.0.

Breaking Changes

v0.11.0 introduces a breaking change: model_engine must now be specified when launching a model. Refer to Model Engine for more information.
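As a rough illustration of what this means for calling code, the sketch below assembles launch arguments and refuses to proceed without model_engine. The helper build_launch_kwargs, the model name "my-llm", and the engine value "transformers" are hypothetical placeholders, not part of the project's API; only the model_engine parameter name comes from the note above. Check the Model Engine documentation for the engine names actually accepted.

```python
from typing import Any, Dict, Optional


def build_launch_kwargs(
    model_name: str,
    model_engine: Optional[str],
    **extra: Any,
) -> Dict[str, Any]:
    """Hypothetical helper: collect launch arguments and insist on model_engine.

    This does not call any library; it only mirrors the v0.11.0 requirement
    that model_engine be given explicitly when launching a model.
    """
    if not model_engine:
        raise ValueError(
            "model_engine must be specified when launching a model (v0.11.0+)"
        )
    return {"model_name": model_name, "model_engine": model_engine, **extra}


# Placeholder values for illustration only.
kwargs = build_launch_kwargs("my-llm", "transformers", model_format="pytorch")
print(kwargs)
```

Code written against earlier releases that omitted model_engine would need the argument added before the resulting dictionary is passed to whatever launch call is in use.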

New features

Enhancements

Bug fixes

Tests

  • TST: Pin huggingface-hub to pass CI since it has some breaking changes by @ChengjieLi28 in #1427

Documentation

Others

  • BUG: Fix metrics being empty when calling /v1/chat/completions by @amumu96 in #1406

New Contributors

Full Changelog: v0.10.3...v0.11.0

v0.10.3

24 Apr 02:57
2ba72b0

What's new in 0.10.3 (2024-04-24)

These are the changes in inference v0.10.3.

New features

Enhancements

Bug fixes

  • BUG: Fix launching embedding or reranking models from the command line failing due to PEFT by @hainaweiben in #1343
  • BUG: Fix extra parameters issue when auto-recovering models by @ChengjieLi28 in #1348
  • BUG: Fix old rerank models use flag rerank issue by @codingl2k1 in #1350

Documentation

New Contributors

Full Changelog: v0.10.2.post1...v0.10.3

v0.10.2.post1

19 Apr 06:48
5001715

What's new in 0.10.2.post1 (2024-04-19)

These are the changes in inference v0.10.2.post1.

Bug fixes

Full Changelog: v0.10.2...v0.10.2.post1

v0.10.2

19 Apr 06:19
f19e85b

What's new in 0.10.2 (2024-04-19)

These are the changes in inference v0.10.2.

New features

Enhancements

Others

  • Revert "REF: support query for engine feature" by @qinxuye in #1329

Full Changelog: v0.10.1...v0.10.2

v0.10.1

12 Apr 02:47
e3a947e

What's new in 0.10.1 (2024-04-12)

These are the changes in inference v0.10.1.

New features

Enhancements

Bug fixes

Tests

Documentation

New Contributors

Full Changelog: v0.10.0...v0.10.1

v0.10.0

29 Mar 04:56
2857ec4

What's new in 0.10.0 (2024-03-29)

These are the changes in inference v0.10.0.

New features

Enhancements

Bug fixes

  • BUG: Fix the incorrect model interface address that caused a 307 redirect to HTTP, blocking the request and preventing the model list from being displayed. by @wertycn in #1182
  • BUG: fix doc fail introduced by #1171 & update readme by @qinxuye in #1203
  • BUG: Increase validator types for the 'input' parameter of embeddings to match OpenAI API by @Minamiyama in #1201
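For context on the last fix: the OpenAI embeddings API accepts 'input' as a string, a list of strings, a list of integer tokens, or a list of integer-token lists. The sketch below is one way such a check could look; it is not the project's validator. The function names are hypothetical, and rejecting empty lists is an assumption of this sketch.

```python
from typing import Any


def _is_token(value: Any) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def is_valid_embedding_input(value: Any) -> bool:
    """Hypothetical check: True if value has one of the four accepted shapes."""
    if isinstance(value, str):
        return True
    # Assumption of this sketch: empty lists are rejected.
    if not isinstance(value, list) or not value:
        return False
    if all(isinstance(item, str) for item in value):
        return True
    if all(_is_token(item) for item in value):
        return True
    # Remaining accepted shape: a list of non-empty token lists.
    return all(
        isinstance(item, list) and item and all(_is_token(t) for t in item)
        for item in value
    )


print(is_valid_embedding_input("hello"))
print(is_valid_embedding_input([[1, 2], [3]]))
print(is_valid_embedding_input(["a", 1]))
```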

Documentation

New Contributors

Full Changelog: v0.9.4...v0.10.0