The Llama Android example doesn't generate tokens when trying to run local inference. #158
Hi @BakungaBronson, a few ideas on debugging this:

cc @Riandy
Hey @WuhanMonkey,
I suspect it has to do with the missing log you mentioned. Do any of the files in the example folder need to be changed? I left the two main files as they were and ran the app in Android Studio.
You shouldn't need to do anything besides downloading
How can I do this? I have found the AppLogging class, but I do not know how to enable it.
@BakungaBronson It is enabled by default. So in your Logcat, filter the logs by the app's package name, for example as in the sketch below.
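A minimal sketch of that filter, assuming a hypothetical application ID of `com.example.llamastackandroiddemo` (substitute the demo app's actual package name):

```sh
# In Android Studio's Logcat search bar the filter would be:
#   package:com.example.llamastackandroiddemo
# From a terminal, the equivalent is to filter by the running app's PID:
adb logcat --pid="$(adb shell pidof -s com.example.llamastackandroiddemo)"
```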
@WuhanMonkey, I can see the logs now, and the specific log you mentioned earlier does exist.
However, there are also some error lines from ExecuTorch that appear multiple times:

There are also two lone errors:
@BakungaBronson It looks very much like your ExecuTorch model has an issue. Does the same model and tokenizer work with ET itself (e.g. via the runner sketched below)? If so, what is the ET version? By the way, we also launched the Llama Stack Kotlin SDK v0.1 along with an updated demo app. Please try it out.
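One way to test a .pte against ExecuTorch directly is its example Llama runner. A sketch; the build output path, file names, and prompt here are assumptions, not the exact commands from this thread:

```sh
# Run ExecuTorch's example Llama runner against the exported model;
# a healthy model/tokenizer pair should stream tokens for the prompt.
cmake-out/examples/models/llama/llama_main \
  --model_path=llama3_2_1b.pte \
  --tokenizer_path=tokenizer.bin \
  --prompt="Hello, how are you?"
```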
@WuhanMonkey you're right, the model can't run when I use ExecuTorch on it. My ExecuTorch version is executorch-0.6.0a0+e78ed83. I tried running it and got the same long error shown above. What could I be doing wrong? This is the output from me running ExecuTorch on the Llama 3.2 1B Instruct model.

Script to convert:

Output:

Congratulations on the SDK launch. I will definitely give it a go.
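For reference, a Llama 3.2 1B export along these lines usually looks something like the following sketch, based on ExecuTorch's documented Llama export flow rather than the exact script used above (checkpoint paths, flag choices, and the output name are assumptions):

```sh
# Illustrative export of Llama 3.2 1B Instruct to a .pte with the XNNPACK
# backend and KV cache enabled; run from the executorch repository root.
python -m examples.models.llama.export_llama \
  --checkpoint ~/.llama/checkpoints/Llama3.2-1B-Instruct/consolidated.00.pth \
  -p ~/.llama/checkpoints/Llama3.2-1B-Instruct/params.json \
  -kv -X -d fp32 \
  --metadata '{"get_bos_id":128000, "get_eos_ids":[128009, 128001]}' \
  --output_name llama3_2_1b.pte
```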
@BakungaBronson The current Kotlin SDK uses ExecuTorch v0.5.0. For your reference, this is the commit it is pinned against. My guess is that there is some issue in v0.6.0a0. In general, ExecuTorch itself should be able to run your model after export. My suggestions are:
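A natural first step in that direction is re-exporting from the same ExecuTorch release the SDK pins. A sketch, assuming the repository tags releases as `v0.5.0` and ships a root-level `install_requirements.sh` at that release:

```sh
# Check out the ExecuTorch release matching the Kotlin SDK's pin,
# set up its dependencies, then re-run the export from this tree.
git clone https://github.com/pytorch/executorch.git
cd executorch
git checkout v0.5.0
git submodule update --init --recursive
./install_requirements.sh
```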
System Info
Since this is running on a phone, I will share the phone details instead.
Samsung Galaxy S24 Ultra - Running Android 14.
🐛 Describe the bug
The Llama Android example doesn't generate tokens when trying to run local inference.
I have followed the documentation as written, preparing the model using ExecuTorch and pushing the .pte and the renamed tokenizer.bin file with adb, along the lines sketched below.
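A minimal sketch of that push step, assuming the on-device directory is /data/local/tmp/llama (the directory and file names are assumptions; they should match whatever the app's settings point at):

```sh
# Create the on-device folder, then push the exported model and tokenizer.
adb shell mkdir -p /data/local/tmp/llama
adb push llama3_2_1b.pte /data/local/tmp/llama/
adb push tokenizer.bin /data/local/tmp/llama/
```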
For some reason, though, no tokens are produced. I have tried the Llama3.2-1B-Instruct model and the Llama3.2-1B-Instruct-int4-spinquant-eo8 model; both display the same empty message. There are no issues running ExecuTorch on the models.

Error logs
These are not errors, but I'm posting the logs from the app:
Expected behavior
When the model and tokenizer are loaded and the settings point to them, the application should perform on-device inference.