adding-support-for-mamba2 #1009

Open · Goekdeniz-Guelmez wants to merge 75 commits into main

Conversation

@Goekdeniz-Guelmez (Contributor)

No description provided.

@Goekdeniz-Guelmez changed the title from "Create mamba2.py" to "adding-support-for-mamba2" on Oct 2, 2024
@hg0428 commented Oct 22, 2024

Codestral Mamba and other models rely on the Mamba2 architecture. Hopefully we can get this soon.

@Goekdeniz-Guelmez (Contributor, Author) commented Jan 20, 2025

I think it has something to do with the Codestral repo on HF; the layers are not converted correctly. I'll try that later when I'm home.
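
One quick way to check that (a sketch, not the PR's code; the shard filename is a placeholder, and the comparison against the MLX side is left as a comment) is to dump the tensor names and shapes stored in the HF checkpoint and eyeball them against what the MLX mamba2 module expects:

```python
from safetensors.numpy import load_file

# Placeholder filename: substitute an actual shard downloaded from
# mistralai/Mamba-Codestral-7B-v0.1 on the HF hub.
weights = load_file("model-00001-of-00003.safetensors")
for name, tensor in sorted(weights.items()):
    print(name, tensor.shape)
# Compare these names/shapes against the MLX model's parameter tree to
# spot layers that were renamed or dropped during conversion.
```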

Goekdeniz-Guelmez and others added 4 commits January 21, 2025 10:57
…odestral. working:

rokyang/mamba2-130m-hf
rokyang/mamba2-370m-hf
rokyang/mamba2-780m-hf
rokyang/mamba2-1.3b-hf
rokyang/mamba2-2.7b-hf
```
python -m mlx_lm.generate --model /Users/gokdenizgulmez/Desktop/Mamba-Codestral-7B-v0.1-4bit --prompt "# A function that computes fibonacci
def fibonacci(" -m 64
==========
n):
    print(f"{os.path.abspath(".")/data/data/data/com.android.launcher.png)

## 🙌🏼 🙌🙌🙌🙌🙌🙌

class _State(Enum):
    def __init__ (self
==========
Prompt: 16 tokens, 84.547 tokens-per-sec
Generation: 64 tokens, 13.774 tokens-per-sec
Peak memory: 4.139 GB
```

@Goekdeniz-Guelmez (Contributor, Author) commented Jan 21, 2025

Hey @awni, I finished it with Mamba-Codestral. I'll push the quantized version up, but you can also use mistralai/Mamba-Codestral-7B-v0.1:

```
python -m mlx_lm.generate --model /Users/gokdenizgulmez/Desktop/Mamba-Codestral-7B-v0.1-4bit --prompt "Rene Descartes was" -m 12
==========
a French surrealist painting, 2016
==========
Prompt: 7 tokens, 38.813 tokens-per-sec
Generation: 12 tokens, 14.927 tokens-per-sec
Peak memory: 4.122 GB
```

P.S. There is no prompt format (chat template), though; raw text goes straight in as the prompt, as in the sketch below.
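
A minimal sketch of the same call via the mlx_lm Python API (the local path is copied from the CLI example above; `max_tokens` mirrors `-m`):

```python
from mlx_lm import load, generate

# Path copied from the CLI example above; substitute your own checkpoint.
model, tokenizer = load("/Users/gokdenizgulmez/Desktop/Mamba-Codestral-7B-v0.1-4bit")

# No chat template is applied: the raw string itself is the prompt.
text = generate(model, tokenizer, prompt="Rene Descartes was", max_tokens=12)
print(text)
```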

@hey-tommy

@Goekdeniz-Guelmez Thanks for all your hard work on this!

@awni (Member) commented Feb 3, 2025

@Goekdeniz-Guelmez I tried both Codestral and Mamba2 2.7B. Both models generate pretty bad responses; the 2.7B doesn't really work at all, even in 8-bit, so I think there must be a bug there.

The codestral one can generate text but doesn't seem to be able to end correctly. Is that your experience?

@Goekdeniz-Guelmez (Contributor, Author) commented Feb 3, 2025

Yes, I've tried it again with a max-generation (token) limit and got the same problems. I'll look into it tomorrow.

@Goekdeniz-Guelmez (Contributor, Author)

I think it's somewhere in the SSM computation that I got something wrong; see the reference sketch below.
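
For anyone debugging along: here is a minimal, naive reference of what the Mamba2 SSD recurrence should compute (sequential loop, single B/C group, NumPy; all names and shapes here are my own assumptions, not this PR's code). Comparing a fused implementation against a step-by-step loop like this on random inputs is one way to localize that kind of bug:

```python
import numpy as np

def ssd_reference(x, dt, A, B, C, D):
    """Naive sequential Mamba2 SSD recurrence (single B/C group).

    x:  (T, H, P)  per-head inputs       dt: (T, H)  softplus'd step sizes
    A:  (H,)       negative per-head decay (Mamba2 uses a scalar A per head)
    B:  (T, N)     writes into the state  C: (T, N)  reads out of the state
    D:  (H,)       skip connection
    """
    T, H, P = x.shape
    N = B.shape[-1]
    h = np.zeros((H, P, N))                  # hidden state per head
    y = np.zeros_like(x)
    for t in range(T):
        dA = np.exp(dt[t] * A)               # (H,) per-head state decay
        # discretized input: dt * (x_t outer B_t), shape (H, P, N)
        dBx = dt[t][:, None, None] * x[t][:, :, None] * B[t][None, None, :]
        h = dA[:, None, None] * h + dBx      # state update
        y[t] = h @ C[t] + D[:, None] * x[t]  # read-out + skip, (H, P)
    return y
```

This deliberately omits the depthwise convolution, gating, and normalization around the SSM; it only pins down the recurrence h_t = exp(dt_t * A) * h_{t-1} + dt_t * (x_t outer B_t) and the read-out y_t = h_t C_t + D * x_t, so a mismatch against it points at the discretization or read-out rather than the surrounding layers.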
