Skip to content

Commit 922cc38

Browse files
Upgrading bitsandbytes. (#2910)
* Upgrading bitsandbytes. Co-Authored-By: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * Tighter lock. --------- Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
1 parent 120bd3e commit 922cc38

File tree

3 files changed

+9
-15
lines changed

3 files changed

+9
-15
lines changed

launcher/src/main.rs

Lines changed: 1 addition & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -2078,14 +2078,7 @@ fn main() -> Result<(), LauncherError> {
20782078
let cuda_graphs = match (&args.cuda_graphs, &quantize) {
20792079
(Some(cuda_graphs), _) => cuda_graphs.iter().cloned().filter(|&c| c > 0).collect(),
20802080
#[allow(deprecated)]
2081-
(
2082-
None,
2083-
Some(
2084-
Quantization::Bitsandbytes
2085-
| Quantization::BitsandbytesNf4
2086-
| Quantization::BitsandbytesFp4,
2087-
),
2088-
) => {
2081+
(None, Some(Quantization::Bitsandbytes)) => {
20892082
tracing::warn!("Bitsandbytes doesn't work with cuda graphs, deactivating them");
20902083
vec![]
20912084
}

server/poetry.lock

Lines changed: 7 additions & 6 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

server/pyproject.toml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ grpcio-reflection = "^1.51.1"
1616
grpc-interceptor = "^0.15.4"
1717
typer = "^0.12.5"
1818
accelerate = {version = "^1.1.0", optional = true}
19-
bitsandbytes = { version = "^0.43.0", optional = true }
19+
bitsandbytes = { version = "^0.45.0", optional = true }
2020
safetensors = "^0.4.5"
2121
loguru = "^0.7.2"
2222
opentelemetry-api = "^1.27.0"

0 commit comments

Comments
 (0)