-
Notifications
You must be signed in to change notification settings - Fork 156
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Trace failing for PyTorch model - [TEN404] Internal tensorizer error: SundaSizeTiling:tuple index out of range #1101
Comments
I took a brief look at the model code, and I see the model expects |
Hi @aws-rishyraj , Thank you for taking the time and the recommendation. I just updated the module locally and changed the signature of the
Tested running the model with the new set up and it worked fine, giving the same output as the initial implementation, however still getting the exact same error when trying the trace method:
Any input or idea what I could try to fix this? |
@aws-rishyraj please let me know if you need more data or debug logs from the command. |
It looks like the underlying tracing process is unable to trace the full model, and is fragmenting the graph which is more susceptible to compilation issues. Generally speaking, models that use pure torch tensor operations and have no conditional logic are the most compatible with This is why I suggest compiling a small piece of the model and slowly add more pieces of the model. kokoro_model.bert = torch_neuronx.trace(kokoro_model.bert, mock_inputs_for_kokoro_bert) |
Hi @aws-rishyraj , That makes a lot of sense. I will give it a shot. Thanks a lot for the guidance! |
Just a quick update on this, after some digging, I found that HuggingFace already provides some models already compiled with AWS Inferentia through HF Optimum Neuron, including ALBERT which is the model the |
Hello,
I am trying to compile a PyTorch model for AWS Inferentia but it's currently failing with status 70. The main error seems to be:
Running on an AWS inf2 instance with Ubuntu 22, installed dependencies as described in the docs
The model I am trying to compile is Kokoro-82M
I found some similar threads but no concrete pointers whether this was a bug or an issue with the setup.
You can find below the data I gathered and steps to reproduce it.
Model Sample Testing:
Tracing Output:
Model Eval:
The text was updated successfully, but these errors were encountered: