MLX model support #124

ZachNagengast · 2024-04-23T04:57:06Z

Draft PR for the early stages of supporting MLX based whisper models directly in WhisperKit. (To be updated)

Initial TODOs:

* Added MLX feature extractor implementation * CI fix * added better multiarray conversion * CI fix * CI fix * fixed `asMLMultiArray` implementation, fixed CI * update xcode, trigger pr when targeting not main branch * check if vision os builds * update watch os version * conditional watchos compilation * conditional package.swift * conditional package.swift * ci fix * ci fix * ci fix * ci fix * ci fix * ci fix * ci fix * ci fix * ci fix * ci fix * ci fix * add other tests targest back * package.swift cleanup * general cleanup * revert to xcode 15.2

…into mlx-support

* added mlx audio encoder * fixed model protocols * removed not needed

ZachNagengast · 2024-05-24T19:21:48Z

@jkrukowski relevant: https://ml-explore.github.io/mlx/build/html/install.html#binary-size-minimization

We'll want to minimize the binary size as much as reasonable for iOS deployment

…into mlx-support

* fixes for mlx models * fixed asMLXArray * fixed tests, mlx doesn't run on simulators * fix

* Added more tests for MLX, cleanup * bumped timeout * fixed tests * reverted cache id

* update mlx-swift * - reverted mlx version - updated readme - updated makefile * reversed * fixed tests * updated mlx-swift * updated makefile * remove device change, fft can run on gpu now * updated readme, added tests * updated readme * review changes * review changes * CI model cache path fix * tests failed * Update package.swift * Keep setupModels with adjustments * Test CI skip cache * Test CI package name change * Test CI optional CLI settings * Use correct logits size for MLX --------- Co-authored-by: ZachNagengast <[email protected]>

atiorh · 2024-11-17T07:05:14Z

Sources/WhisperKit/MLX/Attention.swift

+    }
+
+    private func qkvAttention(_ q: MLXArray, _ k: MLXArray, _ v: MLXArray, _ mask: MLXArray?) -> (MLXArray, MLXArray) {
+        let (nBatch, nCtx, nState) = (q.shape[0], q.shape[1], q.shape[2])


https://swiftpackageindex.com/ml-explore/mlx-swift/0.16.0/documentation/mlxfast/scaleddotproductattention(queries:keys:values:scale:mask:stream:)

neo773 · 2025-01-27T05:55:38Z

is this still on the roadmap?

ZachNagengast · 2025-01-27T21:54:49Z

@neo773 Yes it is, we plan to finalize this before v1.0.0 sometime early this year. It is one of the last remaining pieces of feature work to finish up for the v1.0.0 and we will also refine a lot of the docs and do a bit of refactoring for a stable, production-ready release.

Initial mlx integration

0c15804

ZachNagengast marked this pull request as draft April 23, 2024 04:57

ZachNagengast and others added 5 commits April 22, 2024 21:57

Merge branch 'main' into mlx-support

e8e99fb

Merge branch 'main' into mlx-support

1bc914d

Merge branch 'mlx-support' of https://github.com/argmaxinc/WhisperKit …

6b8aaf7

…into mlx-support

Added MLX Audio Encoder (#139)

7cc004b

* added mlx audio encoder * fixed model protocols * removed not needed

ZachNagengast and others added 5 commits May 28, 2024 14:34

Merge branch 'main' into mlx-support

9748793

Merge branch 'mlx-support' of https://github.com/argmaxinc/WhisperKit …

fc6cf9e

…into mlx-support

Updates for merge

941b101

Allow MLX and CoreML to coexist (#156)

470e227

* fixes for mlx models * fixed asMLXArray * fixed tests, mlx doesn't run on simulators * fix

added MLX text decoder (#161)

b88079d

ZachNagengast marked this pull request as ready for review June 12, 2024 11:56

ZachNagengast and others added 16 commits June 15, 2024 10:53

Merge branch 'main' into mlx-support

c677f0e

Fix merge

ce60492

Cleanup and more tests for MLX (#169)

1e12fe2

* Added more tests for MLX, cleanup * bumped timeout * fixed tests * reverted cache id

Merge branch 'main' into mlx-support

674e26b

Formatting

4d24e43

Merge branch 'main' into mlx-support

b44c2ce

Fix merge for makefile function

20549e1

Skip plugin validation in CI

9e9e13a

Fix tests from merge

ca46214

Update model paths

1615d69

Merge branch 'main' into mlx-support

d2c6fd0

Fix model downloads

08eb93e

Fix HF auth

5432a8f

Fix HF login script

48cf8ff

Include hf token in download step

b274e2f

Remove hf login in favor up update model repo permissions

4b35952

Use scheme from run config

a139839

ZachNagengast mentioned this pull request Jul 15, 2024

Noting a macOS 15 Beta 3 crash ml-explore/mlx-swift#114

Closed

atiorh mentioned this pull request Sep 10, 2024

MLX performance improvement with in-place KVCache #201

Open

atiorh reviewed Nov 17, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MLX model support #124

MLX model support #124

ZachNagengast commented Apr 23, 2024 •

edited

Loading

ZachNagengast commented May 24, 2024

atiorh Nov 17, 2024

neo773 commented Jan 27, 2025

ZachNagengast commented Jan 27, 2025

MLX model support #124

Are you sure you want to change the base?

MLX model support #124

Conversation

ZachNagengast commented Apr 23, 2024 • edited Loading

ZachNagengast commented May 24, 2024

atiorh Nov 17, 2024

Choose a reason for hiding this comment

neo773 commented Jan 27, 2025

ZachNagengast commented Jan 27, 2025

ZachNagengast commented Apr 23, 2024 •

edited

Loading