Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

add_special option for server tokenize endpoint
#7059 opened May 3, 2024 by JohanAR Loading…
tests : add test-tokenizer-0.sh high priority Very important issue
#7036 opened May 2, 2024 by ggerganov Loading…
Disable benchmark on forked repo
#7034 opened May 2, 2024 by CISC Loading…
Add BPE pre-tokenization for Command-R.
#7033 opened May 2, 2024 by dranger003 Loading…
convert-hf : reduce repeated boilerplate from write_tensors need feedback Testing and feedback with results are needed refactoring Refactoring
#7031 opened May 1, 2024 by compilade Loading…
3 of 18 tasks
Add token healing example
#7028 opened May 1, 2024 by mare5x Draft
chore: Add hashsum for stablelm models
#7018 opened May 1, 2024 by teleprint-me Loading…
Tidy Android Instructions README.md
#7016 opened Apr 30, 2024 by Jeximo Loading…
Fix flash attention for ROCm
#7011 opened Apr 30, 2024 by jdecourval Draft
Attempt at OpenElm
#6986 opened Apr 29, 2024 by joshcarp Draft
llama3 custom regex split
#6965 opened Apr 28, 2024 by jaime-m-p Loading…
move ndk code to a new library
#6951 opened Apr 27, 2024 by eltonkola Loading…
Updated server_queue to delete tasks from queue when server is shutdown. Feature Request #6421 demo Demonstrate some concept or idea, not intended to be merged
#6941 opened Apr 27, 2024 by rahsuri Loading…
ProTip! Add no:assignee to see everything that’s not assigned.