Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

server : use different seeds for child completions examples python python script changes server
#18700 opened Jan 8, 2026 by ggerganov Loading…
Improving inference speed for the repack buffer type on NUMA architectures ggml changes relating to the ggml tensor library for machine learning
#18698 opened Jan 8, 2026 by zzjianhui Loading…
common : add --license to display embedded licenses build Compilation issues python python script changes script Script related
#18696 opened Jan 8, 2026 by angt Loading…
scripts : pr2wt.sh reset to remote head script Script related
#18695 opened Jan 8, 2026 by ggerganov Loading…
Webui/file upload examples server
#18694 opened Jan 8, 2026 by ServeurpersoCom Loading…
ggml-cuda: extend concat support for more types ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18690 opened Jan 8, 2026 by Lourdle Loading…
model: try to improve Qwen3 Next model Model specific python python script changes
#18683 opened Jan 8, 2026 by ngxson Draft
vulkan: Use VK_EXT_shader_64bit_indexing to handle large mat_mul(_id) ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#18678 opened Jan 7, 2026 by jeffbolznv Loading…
Autoparser - complete refactoring of parser architecture documentation Improvements or additions to documentation examples model Model specific python python script changes script Script related server testing Everything test related
#18675 opened Jan 7, 2026 by pwilkin Draft
Fix integer overflow in GGUF tensor parsing ggml changes relating to the ggml tensor library for machine learning
#18674 opened Jan 7, 2026 by alexanderkent Loading…
HIP: adjust RDNA3.5 MMQ kernel selction logic ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18666 opened Jan 7, 2026 by JohannesGaessler Loading…
MCP MVP enhancement New feature or request examples server/webui server
#18655 opened Jan 7, 2026 by allozaur Draft
docs: update ops.md for CANN backend documentation Improvements or additions to documentation
#18654 opened Jan 7, 2026 by hipudding Loading…
CANN: support gated linear attn Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#18653 opened Jan 7, 2026 by hipudding Loading…
common: use httplib + boringssl by default build Compilation issues devops improvements to build systems and github actions
#18648 opened Jan 6, 2026 by ngxson Draft
[Do Not Merge] model : LFM2.5-Audio-1.5B examples model Model specific python python script changes server
#18641 opened Jan 6, 2026 by tdakhran Draft
2 of 5 tasks
Remove annoying warnings (unused functions)
#18639 opened Jan 6, 2026 by Nekotekina Loading…
alloc : skip unassigned leafs ggml changes relating to the ggml tensor library for machine learning
#18636 opened Jan 6, 2026 by ggerganov Draft
Added note for compiling on integrated GPUs documentation Improvements or additions to documentation
#18633 opened Jan 6, 2026 by alosslessdev Draft
rpc : implement event and async backend APIs ggml changes relating to the ggml tensor library for machine learning
#18626 opened Jan 5, 2026 by rgerganov Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.