Commit e2d4780

Update README.md
1 parent b9a56cd commit e2d4780

1 file changed (+3, -0 lines)


README.md

+3
@@ -43,6 +43,7 @@ Updated at 11/09/2024
 http://localhost:3000/generate` to prompt the LLM
 - Or use your preferred API clients
 - Run `sam deploy --guided` to deploy to AWS
+- This will deploy a CloudFormation stack consisting of an API Gateway and a Lambda function

 ## Metrics
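
For readers who want to try the "preferred API client" route from the hunk above, here is a minimal Python sketch using only the standard library. The README excerpt does not show the request format, so the POST method and the `{"prompt": ...}` JSON body are assumptions, and `build_request` and `generate` are hypothetical helper names, not part of the project.

```python
import json
import urllib.request

# Local endpoint named in the README.
LOCAL_URL = "http://localhost:3000/generate"


def build_request(prompt: str, url: str = LOCAL_URL) -> urllib.request.Request:
    """Build a JSON POST request.

    Assumption: the endpoint accepts POST with a {"prompt": ...} body.
    Adjust the method and field name to match the actual handler.
    """
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, url: str = LOCAL_URL, timeout_s: float = 30.0) -> str:
    """Send the prompt and return the raw response body as text."""
    request = build_request(prompt, url)
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return response.read().decode("utf-8")
```

Calling `generate("Hello")` with the local API running should return whatever the handler responds with. After `sam deploy --guided`, the same call can be pointed at the deployed API Gateway URL through the `url` argument.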

@@ -82,6 +83,8 @@ http://localhost:3000/generate` to prompt the LLM
 - It may be cheaper to use a hosted LLM on the cloud (AWS Bedrock, etc.), as the pricing structure for Lambda with Qwen does not look competitive compared to Claude 3 Haiku

+- Furthermore, the API Gateway timeout is not easily configurable beyond the 30 s timeout; depending on your use case, this may not be ideal
+
 - Local results depend on your machine specs and may heavily skew your perception (expectation vs. reality)

 - Depending on your use case, the latency per Lambda invocation and response may also result in a poor user experience
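
The cost and timeout caveats above can be checked with some quick arithmetic. The sketch below is illustrative only: Lambda bills duration times configured memory plus a per-request charge, while hosted models typically bill per token. All rates are passed in as parameters so you can fill in current pricing yourself. The function names are hypothetical, and the 30 s limit is simply the figure quoted in the README.

```python
def lambda_cost_usd(duration_s: float, memory_gb: float,
                    gb_second_usd: float, request_usd: float) -> float:
    """Cost of one Lambda invocation: compute (GB-seconds) plus the request fee."""
    return duration_s * memory_gb * gb_second_usd + request_usd


def hosted_cost_usd(input_tokens: int, output_tokens: int,
                    input_usd_per_1k: float, output_usd_per_1k: float) -> float:
    """Cost of one call to a hosted model billed per 1,000 tokens."""
    return (input_tokens / 1000 * input_usd_per_1k
            + output_tokens / 1000 * output_usd_per_1k)


def fits_gateway_timeout(output_tokens: int, tokens_per_s: float,
                         overhead_s: float = 0.0, limit_s: float = 30.0) -> bool:
    """Rough check that generation finishes within the gateway timeout.

    overhead_s covers cold start and model load time (an assumption you must
    measure yourself); limit_s defaults to the 30 s quoted in the README.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return overhead_s + output_tokens / tokens_per_s <= limit_s
```

For example, at a measured 10 tokens/s with 3 s of overhead, a 280-token response needs about 31 s and would not fit, so either cap the output length or avoid the synchronous API Gateway path for that workload.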
