
Comparing changes

base repository: abetlen/llama-cpp-python
base: main
head repository: NimbleEdge/llama-cpp-python
compare: main
  • 1 commit
  • 2 files changed
  • 1 contributor

Commits on Aug 30, 2025

  1. Add support for tensor_buft_overrides for more finegrained control of which layers are offloaded to GPU, and add n_cpu_moe parameter
    
    Signed-off-by: Kira Selby <kaselby@uwaterloo.ca>
    kaselby committed Aug 30, 2025
    c205042