Skip to content

llamafile v0.7.4

Compare
Choose a tag to compare
@jart jart released this 24 Apr 17:08
· 123 commits to main since this release
73bf13d
  • Display prompt eval tokens per second in web gui e4d97b2
  • Add ability to override chat template in web gui ebd096e
  • Simply and optimize the sgemm code more ef1c524