[LFX Contribution]: Integrating MLX framework as a WasmEdge NN Backend #3330

guptaaryan16 · 2024-04-08T15:05:21Z

Thanks for giving me such an important feature to work on. I have been working on this issue for a while, and I feel it is ready for an initial review now.

Now my work is divided into two parts:

Implementing a C++ inference example for MLX using the current C++ API:
I have implemented a simple NN with the current C++ API of MLX, keeping in mind the needs of the plugin and using other projects, such as the PyTorch C++ Frontend API, as a general reference.
Currently the API supports creating layers such as BatchNorm and transformers, among others, and can perform basic inference. I am working on more classes and models and will try to complete them as soon as possible.
Current features:

Creating model classes for the LLMs and other NNs
Getting parameters listed in a hashmap and printing them; also working on loading weights from a SafeTensors file
Completed some classes based on the nn::Module class and working on more

For implementation, you can check: https://github.com/guptaaryan16/mlx/blob/Cpp_api/examples/cpp/nn_inference_example.cpp
https://github.com/guptaaryan16/mlx/blob/Cpp_api/examples/cpp/attention.cpp
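To illustrate the parameter-listing feature above, here is a minimal sketch of a module base class that collects named parameters into a map and prints them. It is not the code in the linked branch: `Tensor`, `Module`, `Linear`, `MLP`, `register_parameter`, `register_module`, `named_parameters` and `print_parameters` are hypothetical names chosen for illustration, and `Tensor` is a plain stand-in for an MLX array so that the example compiles without MLX.

```cpp
#include <cassert>
#include <cstddef>
#include <iostream>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an MLX array: a shape plus flat float storage.
struct Tensor {
  std::vector<std::size_t> shape;
  std::vector<float> data;
};

// Minimal module base class that stores parameters by name.
// std::map (ordered) is used instead of a hash map so printing is stable.
class Module {
public:
  using ParamMap = std::map<std::string, const Tensor *>;

  Module() = default;
  Module(const Module &) = delete;  // children are tracked by address
  Module &operator=(const Module &) = delete;
  virtual ~Module() = default;

  // Flatten own and child parameters into "child.param" style names.
  ParamMap named_parameters() const {
    ParamMap out;
    collect("", out);
    return out;
  }

protected:
  void register_parameter(const std::string &name, Tensor value) {
    assert(params_.count(name) == 0 && "duplicate parameter name");
    params_.emplace(name, std::move(value));
  }
  void register_module(const std::string &name, const Module *child) {
    children_.emplace_back(name, child);
  }

private:
  void collect(const std::string &prefix, ParamMap &out) const {
    for (const auto &kv : params_) out[prefix + kv.first] = &kv.second;
    for (const auto &c : children_) c.second->collect(prefix + c.first + ".", out);
  }
  std::map<std::string, Tensor> params_;
  std::vector<std::pair<std::string, const Module *>> children_;
};

// A layer that only declares its parameters (no forward pass in this sketch).
class Linear : public Module {
public:
  Linear(std::size_t in, std::size_t out) {
    register_parameter("weight", Tensor{{out, in}, std::vector<float>(out * in, 0.0f)});
    register_parameter("bias", Tensor{{out}, std::vector<float>(out, 0.0f)});
  }
};

// A model composed of two layers registered as children.
class MLP : public Module {
public:
  MLP(std::size_t in, std::size_t hidden, std::size_t out)
      : fc1_(in, hidden), fc2_(hidden, out) {
    register_module("fc1", &fc1_);
    register_module("fc2", &fc2_);
  }

private:
  Linear fc1_;
  Linear fc2_;
};

// Print every parameter name followed by its shape.
void print_parameters(const Module &m, std::ostream &os) {
  for (const auto &kv : m.named_parameters()) {
    os << kv.first << " [";
    for (std::size_t i = 0; i < kv.second->shape.size(); ++i) {
      os << (i ? ", " : "") << kv.second->shape[i];
    }
    os << "]\n";
  }
}
```

Constructing `MLP m(4, 8, 2);` and calling `print_parameters(m, std::cout);` lists the four parameters with their shapes. Flat dotted names are convenient here because weight files such as SafeTensors also key tensors by name.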

Implementing mlx.cpp and other files for wasi-nn plugin
I have been studying the implementation of other plugins, especially ggml and pytorch, which has helped me implement some functionality, such as loading weights from the SafeTensors and GGUF formats.
Most of the remaining functionality depends on the implementation of the C++ frontend for the MLX library, which I now plan to complete as soon as possible.
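As a rough illustration of the first step in loading SafeTensors weights, the sketch below splits a file image into its JSON header and tensor data region. It relies only on the published layout of the format: an 8-byte little-endian unsigned header length N, then N bytes of JSON, then the raw tensor bytes. The names `SafeTensorsHeader` and `read_safetensors_header` are hypothetical, this is not the plugin's code, and parsing the JSON itself is left out.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical result type: the raw JSON header and where tensor data starts.
struct SafeTensorsHeader {
  std::string json;         // unparsed JSON metadata (names, dtypes, shapes, offsets)
  std::size_t data_offset;  // byte offset of the tensor data region
};

// Split an in-memory SafeTensors file into header and data offset.
SafeTensorsHeader read_safetensors_header(const std::vector<std::uint8_t> &bytes) {
  if (bytes.size() < 8) {
    throw std::runtime_error("file too small to hold the header length");
  }
  // Decode the 8-byte little-endian header length byte by byte,
  // so the result does not depend on host endianness.
  std::uint64_t n = 0;
  for (int i = 0; i < 8; ++i) {
    n |= static_cast<std::uint64_t>(bytes[i]) << (8 * i);
  }
  if (n > bytes.size() - 8) {
    throw std::runtime_error("header length exceeds file size");
  }
  SafeTensorsHeader h;
  h.json.assign(bytes.begin() + 8,
                bytes.begin() + 8 + static_cast<std::ptrdiff_t>(n));
  h.data_offset = 8 + static_cast<std::size_t>(n);
  assert(h.data_offset <= bytes.size());
  return h;
}
```

The `data_offsets` entries in the JSON are relative to the start of the data region, so a loader would add `data_offset` to them before copying each tensor's bytes into an array.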

My Notes and Info About Completed Milestones

I have faced multiple problems due to my limited knowledge of C++ and of low-level design for a C++-based NN (for loading LLMs), so I have not been able to move as fast as I listed in my milestones. Still, I believe that after a solid baseline implementation of an NN class (which is almost done now), followed by a LLaMA model, I will be able to complete the plugin as soon as possible.
From now on, I will use the currently implemented NN example to create something similar to llama.cpp for MLX (I will try to keep it within the library itself to simplify loading of models) and thus complete more parts of the plugin within a few weeks.

Thank you for helping me so far. I hope to complete the project as soon as possible.

cc @hydai @awni

juntao · 2024-04-08T15:05:25Z

Hello, I am a code review bot on flows.network. Here are my reviews of code commits in this PR.

bmorphism · 2024-04-13T06:35:28Z

wow, this is pretty brutal @juntao -- respect!

Signed-off-by: Aryan Gupta <guptaaryan16@gmail.com>

guptaaryan16 requested a review from ibmibmibm as a code owner April 8, 2024 15:05

github-actions bot added the c-Plugin and c-WASI-NN labels Apr 8, 2024

guptaaryan16 marked this pull request as draft April 8, 2024 15:05

guptaaryan16 force-pushed the wasi-nn-mlx branch 2 times, most recently from ecf74e4 to 84da6d7 Compare May 22, 2024 12:18

guptaaryan16 added 4 commits May 28, 2024 16:40

Initial Commit for Wasi-NN MLX plugin

3e5dcd1

Signed-off-by: Aryan Gupta <guptaaryan16@gmail.com>

Fix a typo

c7e5654

Signed-off-by: Aryan Gupta <guptaaryan16@gmail.com>

Completed implementation for functions of mlx.cpp

ceb6837

Signed-off-by: Aryan Gupta <guptaaryan16@gmail.com>

Rewrite CMake and fix output bugs

bb74fa5

Signed-off-by: Aryan Gupta <guptaaryan16@gmail.com>

guptaaryan16 force-pushed the wasi-nn-mlx branch from 6a708b8 to bb74fa5 Compare May 28, 2024 11:10

github-actions bot added the c-CMake label May 28, 2024
