A basic training example using GGML #707
bssrdf started this conversation in Show and tell
-
Nice job! Thank you for sharing. If you have feedback on what could be improved, please let us know. The training capabilities in
-
Hi, I just want to share what I have been working on recently. This is an example of training an MNIST VAE. The goal is to use only the ggml pipeline and its implementation of the Adam optimizer. There aren't many training examples using ggml; the only one I found is baby-llama, but I think its way of doing optimization is not quite right. I found another training example in llama.cpp which shows a proper way of using Adam.
which shows a proper way of using Adam.Some of the mods I have to add
Below are some samples from the VAE trained on MNIST after each epoch (total 10 epochs).
[sample images: generated digits after each of the 10 epochs]