Commit 2034224
committed
Merge remote-tracking branch 'origin/compilade/mamba2' into BambaAbstractMemory
This is definitely a surgical merge, so there are likely pieces still
missing, especially from llama-model.cpp
Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
* origin/compilade/mamba2:
convert : fix flake8 lint
ggml : avoid multiply by D in GGML_OP_SSM_SCAN
ggml : remove unused fast broadcast path in GGML_MUL
metal : fix wrong number of tokens per sequence in SSM_SCAN
metal : fix SSM_SCAN state head offset
metal : add back n_seqs to SSM_SCAN args
metal : remove unused arguments for SSM_SCAN
metal : use log and exp instead of log1pf and expf in SSM_SCAN
metal : fix SSM_SCAN pipeline scope
metal : attempt to adapt SSM_SCAN for Mamba-2
llama : avoid redundant state copy for Mamba 1 and 2
convert_hf : prefer SentencePiece tokenizer for Mamba-2 when present
llama : add missing break
llama : remove unused variable
llama : fix Mamba-2 conv state saving
llama : support running Mamba-Codestral-7B-v0.1
ggml : SIMD ggml_ssm_scan for Mamba-2
llama : initial Mamba-2 supportFile tree
20 files changed
+719
-167
lines changed- ggml
- include
- src
- ggml-metal
- gguf-py/gguf
- src
- tests
20 files changed
+719
-167
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4171 | 4171 | | |
4172 | 4172 | | |
4173 | 4173 | | |
| 4174 | + | |
| 4175 | + | |
| 4176 | + | |
| 4177 | + | |
| 4178 | + | |
| 4179 | + | |
| 4180 | + | |
| 4181 | + | |
| 4182 | + | |
| 4183 | + | |
| 4184 | + | |
| 4185 | + | |
| 4186 | + | |
| 4187 | + | |
| 4188 | + | |
| 4189 | + | |
| 4190 | + | |
| 4191 | + | |
| 4192 | + | |
| 4193 | + | |
| 4194 | + | |
| 4195 | + | |
| 4196 | + | |
| 4197 | + | |
| 4198 | + | |
| 4199 | + | |
| 4200 | + | |
| 4201 | + | |
| 4202 | + | |
| 4203 | + | |
| 4204 | + | |
| 4205 | + | |
| 4206 | + | |
| 4207 | + | |
| 4208 | + | |
| 4209 | + | |
| 4210 | + | |
| 4211 | + | |
| 4212 | + | |
| 4213 | + | |
| 4214 | + | |
| 4215 | + | |
| 4216 | + | |
| 4217 | + | |
| 4218 | + | |
| 4219 | + | |
| 4220 | + | |
| 4221 | + | |
| 4222 | + | |
| 4223 | + | |
| 4224 | + | |
| 4225 | + | |
| 4226 | + | |
| 4227 | + | |
| 4228 | + | |
| 4229 | + | |
| 4230 | + | |
| 4231 | + | |
| 4232 | + | |
| 4233 | + | |
| 4234 | + | |
| 4235 | + | |
| 4236 | + | |
| 4237 | + | |
| 4238 | + | |
| 4239 | + | |
| 4240 | + | |
| 4241 | + | |
| 4242 | + | |
| 4243 | + | |
| 4244 | + | |
| 4245 | + | |
| 4246 | + | |
| 4247 | + | |
| 4248 | + | |
| 4249 | + | |
| 4250 | + | |
| 4251 | + | |
| 4252 | + | |
| 4253 | + | |
| 4254 | + | |
| 4255 | + | |
| 4256 | + | |
| 4257 | + | |
| 4258 | + | |
4174 | 4259 | | |
4175 | 4260 | | |
4176 | 4261 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1854 | 1854 | | |
1855 | 1855 | | |
1856 | 1856 | | |
1857 | | - | |
| 1857 | + | |
| 1858 | + | |
1858 | 1859 | | |
1859 | 1860 | | |
1860 | 1861 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
463 | 463 | | |
464 | 464 | | |
465 | 465 | | |
| 466 | + | |
| 467 | + | |
466 | 468 | | |
467 | 469 | | |
468 | | - | |
469 | 470 | | |
470 | 471 | | |
471 | | - | |
| 472 | + | |
472 | 473 | | |
473 | 474 | | |
474 | 475 | | |
475 | | - | |
476 | 476 | | |
477 | 477 | | |
478 | | - | |
479 | 478 | | |
480 | | - | |
481 | 479 | | |
482 | 480 | | |
483 | | - | |
| 481 | + | |
484 | 482 | | |
485 | 483 | | |
| 484 | + | |
486 | 485 | | |
487 | 486 | | |
488 | 487 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
189 | 189 | | |
190 | 190 | | |
191 | 191 | | |
| 192 | + | |
192 | 193 | | |
193 | 194 | | |
194 | 195 | | |
| |||
1125 | 1126 | | |
1126 | 1127 | | |
1127 | 1128 | | |
| 1129 | + | |
1128 | 1130 | | |
1129 | 1131 | | |
1130 | 1132 | | |
| |||
2586 | 2588 | | |
2587 | 2589 | | |
2588 | 2590 | | |
| 2591 | + | |
2589 | 2592 | | |
2590 | 2593 | | |
2591 | 2594 | | |
2592 | 2595 | | |
| 2596 | + | |
2593 | 2597 | | |
2594 | 2598 | | |
2595 | 2599 | | |
2596 | 2600 | | |
| 2601 | + | |
2597 | 2602 | | |
2598 | 2603 | | |
2599 | 2604 | | |
2600 | 2605 | | |
| 2606 | + | |
2601 | 2607 | | |
2602 | | - | |
| 2608 | + | |
2603 | 2609 | | |
2604 | 2610 | | |
2605 | | - | |
| 2611 | + | |
2606 | 2612 | | |
2607 | 2613 | | |
2608 | 2614 | | |
2609 | | - | |
| 2615 | + | |
2610 | 2616 | | |
| 2617 | + | |
2611 | 2618 | | |
2612 | | - | |
| 2619 | + | |
2613 | 2620 | | |
2614 | 2621 | | |
| 2622 | + | |
2615 | 2623 | | |
2616 | 2624 | | |
2617 | 2625 | | |
2618 | 2626 | | |
| 2627 | + | |
2619 | 2628 | | |
2620 | | - | |
| 2629 | + | |
2621 | 2630 | | |
2622 | 2631 | | |
| 2632 | + | |
| 2633 | + | |
| 2634 | + | |
| 2635 | + | |
| 2636 | + | |
2623 | 2637 | | |
2624 | 2638 | | |
2625 | 2639 | | |
2626 | | - | |
2627 | | - | |
| 2640 | + | |
| 2641 | + | |
| 2642 | + | |
| 2643 | + | |
| 2644 | + | |
| 2645 | + | |
2628 | 2646 | | |
2629 | | - | |
| 2647 | + | |
| 2648 | + | |
| 2649 | + | |
| 2650 | + | |
| 2651 | + | |
| 2652 | + | |
2630 | 2653 | | |
2631 | 2654 | | |
2632 | 2655 | | |
2633 | 2656 | | |
| 2657 | + | |
| 2658 | + | |
2634 | 2659 | | |
2635 | 2660 | | |
2636 | | - | |
2637 | 2661 | | |
2638 | 2662 | | |
2639 | | - | |
| 2663 | + | |
2640 | 2664 | | |
2641 | 2665 | | |
2642 | 2666 | | |
2643 | | - | |
2644 | 2667 | | |
2645 | 2668 | | |
2646 | | - | |
2647 | 2669 | | |
2648 | | - | |
2649 | 2670 | | |
2650 | 2671 | | |
2651 | | - | |
| 2672 | + | |
2652 | 2673 | | |
2653 | 2674 | | |
| 2675 | + | |
2654 | 2676 | | |
2655 | 2677 | | |
2656 | 2678 | | |
| |||
2660 | 2682 | | |
2661 | 2683 | | |
2662 | 2684 | | |
2663 | | - | |
2664 | | - | |
| 2685 | + | |
| 2686 | + | |
| 2687 | + | |
| 2688 | + | |
2665 | 2689 | | |
2666 | | - | |
| 2690 | + | |
| 2691 | + | |
| 2692 | + | |
| 2693 | + | |
| 2694 | + | |
| 2695 | + | |
| 2696 | + | |
2667 | 2697 | | |
2668 | 2698 | | |
2669 | 2699 | | |
| |||
0 commit comments