{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":612354784,"defaultBranch":"master","name":"llama.cpp","ownerLogin":"ggerganov","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-03-10T18:58:00.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/1991296?v=4","public":true,"private":false,"isOrgOwned":false},"refInfo":{"name":"","listCacheKey":"v0:1718004394.0","currentOid":""},"activityList":{"items":[{"before":"448ca042ca60cc9018b1d6b684fa7ed80a77a109","after":"195430cbcc2ed1d486e12a28c793a27cd28f479a","ref":"refs/heads/gg/server-debug-win","pushedAt":"2024-06-10T08:13:50.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"increase timeout","shortMessageHtmlLink":"increase timeout"}},{"before":null,"after":"448ca042ca60cc9018b1d6b684fa7ed80a77a109","ref":"refs/heads/gg/server-debug-win","pushedAt":"2024-06-10T07:26:34.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"tmp","shortMessageHtmlLink":"tmp"}},{"before":"d0b09468d0252bdae25abe187004f2ce6cfab6b0","after":"9e4d62e6abb9096fa93f6d7756547ec495888eb8","ref":"refs/heads/gg/server-fix-prompt","pushedAt":"2024-06-10T06:31:55.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"server : improve \"prompt\" handling","shortMessageHtmlLink":"server : improve \"prompt\" handling"}},{"before":null,"after":"d0b09468d0252bdae25abe187004f2ce6cfab6b0","ref":"refs/heads/gg/server-fix-prompt","pushedAt":"2024-06-10T06:12:30.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"server : improve \"prompt\" handling","shortMessageHtmlLink":"server : improve \"prompt\" handling"}},{"before":null,"after":"956bb14595c1c6864bd890b961cd68ea83bbb434","ref":"refs/heads/gg/remove-instruct","pushedAt":"2024-06-10T05:38:11.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"examples : remove --instruct remnants","shortMessageHtmlLink":"examples : remove --instruct remnants"}},{"before":"e95beeb1fc4621826ddd616776dbdf717366bf5c","after":"10ceba354a3b152ff425e9fa97f9caaef99a46b1","ref":"refs/heads/master","pushedAt":"2024-06-09T23:04:50.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"philiptaron","name":"Philip Taron","path":"/philiptaron","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/43863?s=80&v=4"},"commit":{"message":"flake.lock: Update (#7838)\n\nFlake lock file updates:\r\n\r\n• Updated input 'nixpkgs':\r\n 'github:NixOS/nixpkgs/ad57eef4ef0659193044870c731987a6df5cf56b?narHash=sha256-SzDKxseEcHR5KzPXLwsemyTR/kaM9whxeiJohbL04rs%3D' (2024-05-29)\r\n → 'github:NixOS/nixpkgs/051f920625ab5aabe37c920346e3e69d7d34400e?narHash=sha256-4q0s6m0GUcN7q%2BY2DqD27iLvbcd1G50T2lv08kKxkSI%3D' (2024-06-07)\r\n\r\nCo-authored-by: github-actions[bot] ","shortMessageHtmlLink":"flake.lock: Update (#7838)"}},{"before":"5a21852b0b87b5f9cabca6c05087607c6adaa62f","after":null,"ref":"refs/heads/gg/imatrix-partial-data","pushedAt":"2024-06-09T17:19:37.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"}},{"before":"57bf62ce7cb75cca589943e2050d29bff4026e76","after":"e95beeb1fc4621826ddd616776dbdf717366bf5c","ref":"refs/heads/master","pushedAt":"2024-06-09T17:19:35.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"imatrix : handle partial entries (#7833)","shortMessageHtmlLink":"imatrix : handle partial entries (#7833)"}},{"before":"3e2ee443159724e2d3a0741f6b167e599ec088aa","after":"57bf62ce7cb75cca589943e2050d29bff4026e76","ref":"refs/heads/master","pushedAt":"2024-06-09T15:24:29.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"docs: Added initial PR template with directions for doc only changes and squash merges [no ci] (#7700)\n\nThis commit adds pull_request_template.md and CONTRIBUTING.md . It focuses on explaining to contributors the need to rate PR complexity level, when to add [no ci] and how to format PR title and descriptions.\r\n\r\nCo-authored-by: Brian \r\nCo-authored-by: compilade ","shortMessageHtmlLink":"docs: Added initial PR template with directions for doc only changes …"}},{"before":"42b53d192f4e3abf1b7c8e424628424504ea5dc5","after":"3e2ee443159724e2d3a0741f6b167e599ec088aa","ref":"refs/heads/master","pushedAt":"2024-06-09T10:50:35.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"server: do not remove whitespace at the start of a completion chunk (#7830)","shortMessageHtmlLink":"server: do not remove whitespace at the start of a completion chunk (#…"}},{"before":"2decf57bc6e4a6b45176c3727d964a01161beecc","after":"42b53d192f4e3abf1b7c8e424628424504ea5dc5","ref":"refs/heads/master","pushedAt":"2024-06-09T07:42:25.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"JohannesGaessler","name":"Johannes Gäßler","path":"/JohannesGaessler","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/18492268?s=80&v=4"},"commit":{"message":"CUDA: revise q8_1 data layout for mul_mat_q (#7824)","shortMessageHtmlLink":"CUDA: revise q8_1 data layout for mul_mat_q (#7824)"}},{"before":"3af93718117d4c185bef78cae05898f9881c9c77","after":null,"ref":"refs/heads/compilade/convert-hf-model-part-prefix","pushedAt":"2024-06-09T07:00:11.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"}},{"before":"5795b941827fdec6c1662986de962badff456718","after":"2decf57bc6e4a6b45176c3727d964a01161beecc","ref":"refs/heads/master","pushedAt":"2024-06-09T06:39:25.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"convert-hf : set the model name based on cli arg, if present (#7693)\n\n `--model-name` argument was added a while ago but did not do anything.\r\nThis commit fixes this issue and enables this feature.","shortMessageHtmlLink":"convert-hf : set the model name based on cli arg, if present (#7693)"}},{"before":"32d11dbbe8c839de93bc5562cb85327c40c7ea2c","after":null,"ref":"refs/heads/compilade/gguf-py-decouple-writer-meta","pushedAt":"2024-06-09T04:09:14.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"}},{"before":"ed9f2521185706481501a5e6d5315397b11802ff","after":"5795b941827fdec6c1662986de962badff456718","ref":"refs/heads/master","pushedAt":"2024-06-09T02:47:25.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"convert-hf : match model part name prefix and suffix (#7687)\n\nIn #7075, to fix the conversion of (some) models using model-00001-of-00001.safetensors instead of model.safetensors for a single model part we simply used the same logic as the part count to get the part names. \r\n\r\nBut this doesn't always work correctly, like when unusual additional model files like consolidated.safetensors in https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3 are present.\r\n\r\nThis commit matching both the prefix and the suffix of the model part names should fix this problem without breaking any previously-supported upstream models. But according to report by @teleprint-me there is still some\r\npersistent problem, but shall do in the meantime.","shortMessageHtmlLink":"convert-hf : match model part name prefix and suffix (#7687)"}},{"before":"fe1e3917cfa0f9397a765cfd0aef880674d938d5","after":"ed9f2521185706481501a5e6d5315397b11802ff","ref":"refs/heads/master","pushedAt":"2024-06-09T02:34:29.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"mofosyne","name":"Brian","path":"/mofosyne","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/827793?s=80&v=4"},"commit":{"message":"gguf-py : decouple adding metadata from writing in GGUFWriter (#7827)\n\nMain changes of this PR is to consolidate GGUFWriter.add_key and GGUFWriter.add_val into GGUFWriter.add_key_value. \r\n\r\nIn addition use_temp_file is now opt-in instead of opt-out defaulting to False.\r\n\r\nAlso GGUFWriter now does not require output file name until when actually writing to it.\r\n\r\nAnd GGUFWriter doesn't really need to eagerly prepare the data layout of the metadata","shortMessageHtmlLink":"gguf-py : decouple adding metadata from writing in GGUFWriter (#7827)"}},{"before":null,"after":"09f16b735cb3fee5aa345af10471b657f8a87e38","ref":"refs/heads/update_flake_lock_action","pushedAt":"2024-06-09T00:20:05.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"flake.lock: Update\n\nFlake lock file updates:\n\n• Updated input 'nixpkgs':\n 'github:NixOS/nixpkgs/ad57eef4ef0659193044870c731987a6df5cf56b?narHash=sha256-SzDKxseEcHR5KzPXLwsemyTR/kaM9whxeiJohbL04rs%3D' (2024-05-29)\n → 'github:NixOS/nixpkgs/051f920625ab5aabe37c920346e3e69d7d34400e?narHash=sha256-4q0s6m0GUcN7q%2BY2DqD27iLvbcd1G50T2lv08kKxkSI%3D' (2024-06-07)","shortMessageHtmlLink":"flake.lock: Update"}},{"before":"315c3afe4fcfd62aab24a6b9fc274d1c2287afe9","after":null,"ref":"refs/heads/revert-7682-master","pushedAt":"2024-06-08T23:43:43.000Z","pushType":"branch_deletion","commitsCount":0,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"}},{"before":"d4d915d351d1f1270d56184bdd46672893e8a5d8","after":"fe1e3917cfa0f9397a765cfd0aef880674d938d5","ref":"refs/heads/master","pushedAt":"2024-06-08T23:43:39.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"slaren","name":null,"path":"/slaren","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/2141330?s=80&v=4"},"commit":{"message":"Revert \"[SYCL] Update rpc-server.cpp to include SYCL backend (#7682)\" (#7808)\n\nThis reverts commit 9422c5e34bbd302493b77a8f6d546154a1f4fe82.","shortMessageHtmlLink":"Revert \"[SYCL] Update rpc-server.cpp to include SYCL backend (#7682)\" ("}},{"before":"7a16ce7db2a74a223f0f3b9cee66d4539c5bce8f","after":"d4d915d351d1f1270d56184bdd46672893e8a5d8","ref":"refs/heads/master","pushedAt":"2024-06-08T19:21:08.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ngxson","name":"Xuan Son Nguyen","path":"/ngxson","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/7702203?s=80&v=4"},"commit":{"message":"url: save -mu downloads to new cache location (#7826)\n\n* url: save -mu download to new cache location\r\n\r\n* url: fs_get_cache_file_path util\r\n\r\n* url: tweak sig of fs_get_cache_file","shortMessageHtmlLink":"url: save -mu downloads to new cache location (#7826)"}},{"before":"fe59f20d26b4344ccef3ca538cddedfe30ab0686","after":"32d11dbbe8c839de93bc5562cb85327c40c7ea2c","ref":"refs/heads/compilade/gguf-py-decouple-writer-meta","pushedAt":"2024-06-08T17:03:03.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"compilade","name":null,"path":"/compilade","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/113953597?s=80&v=4"},"commit":{"message":"gguf-py : always defer GGUFWrite output file opening\n\nChanging what happens when the output file is opened will be easier,\nsince this reduces the cases to consider.\n\n* gguf-py : prevent GGUFWriter from writing all tensors multiple times\n\nIt was already checked with an assertion before, but using WriterState\nshould make the error message slightly less cryptic.","shortMessageHtmlLink":"gguf-py : always defer GGUFWrite output file opening"}},{"before":"9fc0f55cd97e12df26543c4e0d0ffecac9934956","after":"02c762477b411a26cbaa406947182afa9766e0bc","ref":"refs/heads/0cc4m/vulkan-rope-update","pushedAt":"2024-06-08T15:33:46.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"0cc4m","name":null,"path":"/0cc4m","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/11707594?s=80&v=4"},"commit":{"message":"Fix segfault when running out of VRAM\n\nCo-authored-by: slaren ","shortMessageHtmlLink":"Fix segfault when running out of VRAM"}},{"before":"175a17950de58c44572ffbcabd5449b76d6be7fb","after":"5a21852b0b87b5f9cabca6c05087607c6adaa62f","ref":"refs/heads/gg/imatrix-partial-data","pushedAt":"2024-06-08T09:40:41.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"imatrix : handle partial entries","shortMessageHtmlLink":"imatrix : handle partial entries"}},{"before":null,"after":"175a17950de58c44572ffbcabd5449b76d6be7fb","ref":"refs/heads/gg/imatrix-partial-data","pushedAt":"2024-06-08T09:32:50.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"imatrix : handle partial entries","shortMessageHtmlLink":"imatrix : handle partial entries"}},{"before":"da799b41891e34aac86ce4e173f9c4c0afd4fab3","after":"7a16ce7db2a74a223f0f3b9cee66d4539c5bce8f","ref":"refs/heads/master","pushedAt":"2024-06-08T07:50:31.000Z","pushType":"pr_merge","commitsCount":1,"pusher":{"login":"ggerganov","name":"Georgi Gerganov","path":"/ggerganov","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/1991296?s=80&v=4"},"commit":{"message":"server : smart slot selection using Longest Common Prefix (#7728)\n\n* server : Smart selection of available slot using Longest Common Substring\r\n\r\n* add usage\r\n\r\n* remove trailing whitespaces\r\n\r\n* Use Longest Common Prefix (LCP) instead of LCS\r\n\r\n* Rename argument","shortMessageHtmlLink":"server : smart slot selection using Longest Common Prefix (#7728)"}},{"before":"9a13c535fd4c6e909f16b83c927f83db0bea6326","after":"9fc0f55cd97e12df26543c4e0d0ffecac9934956","ref":"refs/heads/0cc4m/vulkan-rope-update","pushedAt":"2024-06-08T07:42:03.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"0cc4m","name":null,"path":"/0cc4m","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/11707594?s=80&v=4"},"commit":{"message":"Return nullptr on alloc_buffer when allocation fails, instead of throwing an exception\n\nMinor fixes","shortMessageHtmlLink":"Return nullptr on alloc_buffer when allocation fails, instead of thro…"}},{"before":null,"after":"fe59f20d26b4344ccef3ca538cddedfe30ab0686","ref":"refs/heads/compilade/gguf-py-decouple-writer-meta","pushedAt":"2024-06-08T00:38:10.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"compilade","name":null,"path":"/compilade","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/113953597?s=80&v=4"},"commit":{"message":"gguf-py : decouple adding metadata from writing in GGUFWriter","shortMessageHtmlLink":"gguf-py : decouple adding metadata from writing in GGUFWriter"}},{"before":"7ffe2098ac0d434c2c71c8834d910c52fb8a90f2","after":"2981cc1ae30b45758ad6e3c4738b9112cfe605b2","ref":"refs/heads/fix-min-cmake-cuda","pushedAt":"2024-06-07T19:35:27.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"cebtenzzre","name":"Jared Van Bortel","path":"/cebtenzzre","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/14168726?s=80&v=4"},"commit":{"message":"cmake : fix CMake requirement for CUDA","shortMessageHtmlLink":"cmake : fix CMake requirement for CUDA"}},{"before":"f97381d6b0593ae1f88238d2ea4699ca3d69ee25","after":"7ffe2098ac0d434c2c71c8834d910c52fb8a90f2","ref":"refs/heads/fix-min-cmake-cuda","pushedAt":"2024-06-07T19:25:26.000Z","pushType":"force_push","commitsCount":0,"pusher":{"login":"cebtenzzre","name":"Jared Van Bortel","path":"/cebtenzzre","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/14168726?s=80&v=4"},"commit":{"message":"cmake : fix CMake requirement for CUDA\n\nSigned-off-by: Jared Van Bortel ","shortMessageHtmlLink":"cmake : fix CMake requirement for CUDA"}},{"before":null,"after":"f97381d6b0593ae1f88238d2ea4699ca3d69ee25","ref":"refs/heads/fix-min-cmake-cuda","pushedAt":"2024-06-07T19:20:15.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"cebtenzzre","name":"Jared Van Bortel","path":"/cebtenzzre","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/14168726?s=80&v=4"},"commit":{"message":"cmake : fix CMake requirement for CUDA\n\nSigned-off-by: Jared Van Bortel ","shortMessageHtmlLink":"cmake : fix CMake requirement for CUDA"}}],"hasNextPage":true,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEYNovaAA","startCursor":null,"endCursor":null}},"title":"Activity · ggerganov/llama.cpp"}