upgrade to b6188 #24

xkong-anaconda · 2025-11-20T20:21:08Z

llama.cpp 0.0.6188

Destination channel: defaults

Links

PKG-10893
Upstream repository
Upstream release b6188
Upstream diff b6082...b6188
Relevant dependency PRs:
- Related to llama-cpp-python 0.3.16 compatibility requirement

CUDA Variants

Explanation of changes:

This PR upgrades llama.cpp from b6082 to b6188 on the main_b6082 preservation branch. This is a special branch structure to provide b6188 packages for llama-cpp-python 0.3.16 compatibility without affecting the current main feedstock

Why b6188?

llama-cpp-python 0.3.16 requires llama.cpp b6173-b6238 (API compatibility range)
The llama_get_kv_self() and related APIs were removed in b6239
b6188 is a stable release within the compatible range

Callek · 2025-11-20T22:22:57Z

FWIW looks like there is an issue, our preprod caught:

            {'ctx': {'error': {}},
             'input': ['--suppress-variables',
                       '--skip-existing',
                       '--error-overlinking',
                       '--variants "{skip_cuda_prefect: True}"'],
             'loc': ['body', 'tasks', 14, 'build', 'build_options'],
             'msg': "Value error, Invalid build_options: ['--variants "
                    '"{skip_cuda_prefect: True}"\']. Allowed: [\'--debug\', '
                    "'--error-overdepending', '--error-overlinking', "
                    "'--no-error-overlinking', '--no-include-recipe', "
                    "'--no-test', '--no-verify', '--quiet', '--skip-existing', "
                    "'--suppress-variables']",
             'type': 'value_error'},

(which might be why we don't see anything triggered in prod)

Looks like that exists as part of the abs.yaml in the root of the repo. I've flagged @markan to help determine from a slack thread in our private team channel if thats a regression or a feature of it being flagged in the cli generation code

xkong-anaconda · 2025-11-21T19:32:50Z

FWIW looks like there is an issue, our preprod caught:
            {'ctx': {'error': {}},
             'input': ['--suppress-variables',
                       '--skip-existing',
                       '--error-overlinking',
                       '--variants "{skip_cuda_prefect: True}"'],
             'loc': ['body', 'tasks', 14, 'build', 'build_options'],
             'msg': "Value error, Invalid build_options: ['--variants "
                    '"{skip_cuda_prefect: True}"\']. Allowed: [\'--debug\', '
                    "'--error-overdepending', '--error-overlinking', "
                    "'--no-error-overlinking', '--no-include-recipe', "
                    "'--no-test', '--no-verify', '--quiet', '--skip-existing', "
                    "'--suppress-variables']",
             'type': 'value_error'},
(which might be why we don't see anything triggered in prod)

Looks like that exists as part of the abs.yaml in the root of the repo. I've flagged @markan to help determine from a slack thread in our private team channel if thats a regression or a feature of it being flagged in the cli generation code

Thank you very much for the help and inestigation.

…12.4

…together

…ackages

cbouss · 2025-12-01T21:55:23Z

recipe/build-llama-cpp.sh

-ctest -L main -C Release --output-on-failure -j${CPU_COUNT} --timeout 900 -E "(test-tokenizers-ggml-vocabs)"
+# Skip test-backend-ops on Metal and CUDA (has test failures in b6188)
+if [[ "${gpu_variant}" == "metal" ]] || [[ "${gpu_variant}" == "cuda-12" ]]; then
+    ctest -L main -C Release --output-on-failure -j${CPU_COUNT} --timeout 900 -E "(test-tokenizers-ggml-vocabs|test-backend-ops)"


What kind of failures were you seeing there? (for metal and cuda)
Do you still have logs?

Unfortunately, I don't have the original test failure logs anymore. But it appeared to be related to backend operations not properly working with CUDA. Metal GPU builds fail test-backend-ops with Flash Attention operations producing "not supported [Metal]" errors. and Win failures with values 20-30x higher than tolerance.
This skip is specific to the b6188 downgrade, not intended for the main feedstock
If you need the exact failure details, I could re-run the build without the skip to capture the errors
Let me know if you need me to investigate further!

Yes, please re-run, as the packages go to the main channel, we better check as much as possible.

You can add this line to the script to capture test-backend-ops logs without failing the whole build:
ctest -L main -C Release --output-on-failure -j${CPU_COUNT} --timeout 900 -R "test-backend-ops" || true

b6188

3788d36

xkong-anaconda self-assigned this Nov 20, 2025

xkong-anaconda marked this pull request as draft November 20, 2025 20:30

xkong-anaconda marked this pull request as ready for review November 20, 2025 20:30

xkong-anaconda added 5 commits November 21, 2025 10:10

Fix abs.yaml: Remove --variants option not supported by PBP

7f1eeeb

Fix build errors: update abs.yaml and add libcurl pin

13e6f42

Fix patches

4300c8a

Update conda_build_config.yaml

af6c9e2

Fix increase-nmse-tolerance-aarch64.patch

24bae3f

xkong-anaconda added 8 commits November 30, 2025 19:58

Add GCC 12 pin for Linux CUDA builds (CUDA 12.4 requires gcc < 13)

e5f38d3

Remove GCC pins - let conda auto-select version compatible with CUDA …

b2cf302

…12.4

Skip test-backend-ops on Metal for b6188 (Flash Attention not supported)

17b5cfa

Add output_set skip conditions to prevent building both package sets …

e8dfcc1

…together

Add Jinja2 workaround for undefined variables when output_set skips p…

af3d2e9

…ackages

Skip test-backend-ops on CUDA builds (has test failures in b6188)

ba5608e

Fix Windows c_stdlib_version in conda_build_config.yaml

15a720f

Fix Windows CUDA build configuration and skip flaky test

0b0c892

cbouss self-requested a review December 1, 2025 21:13

cbouss reviewed Dec 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

upgrade to b6188 #24

upgrade to b6188 #24

Uh oh!

xkong-anaconda commented Nov 20, 2025 •

edited

Loading

Uh oh!

Callek commented Nov 20, 2025 •

edited

Loading

Uh oh!

xkong-anaconda commented Nov 21, 2025

Uh oh!

cbouss Dec 1, 2025

Uh oh!

xkong-anaconda Dec 2, 2025

Uh oh!

cbouss Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

upgrade to b6188 #24

Are you sure you want to change the base?

upgrade to b6188 #24

Uh oh!

Conversation

xkong-anaconda commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Links

CUDA Variants

Explanation of changes:

Uh oh!

Callek commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xkong-anaconda commented Nov 21, 2025

Uh oh!

cbouss Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

xkong-anaconda Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

cbouss Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

xkong-anaconda commented Nov 20, 2025 •

edited

Loading

Callek commented Nov 20, 2025 •

edited

Loading