Description
Air-gapped and offline local GPU usage of LLM models: Meta Llama 3.2, Claude Sonnet 3.5, Gemma 2 27b, Qwen2, Mistral-Large-Instruct, DeepSeek R1 / V3.
Compared to OpenAI o1 pro.
- DeepSeek R1: DeepSeek R1 14b on NVIDIA 48G RTX-A6000, Apple M2 Ultra 60-core 64G, or Apple M4 Max 40-core 48G, compared to OpenAI o1 pro (machine-learning#37); see also deepseek-r1 on Ollama on dual RTX-4090, dual RTX-A4500, RTX-A6000, RTX-A4000, RTX-A3500, M4 Max 40-core (#95)
- Qwen2: Qwen2 72b-instruct on NVIDIA 48G RTX-A6000 or Apple M4 Max 40-core 48G, compared to OpenAI o1 pro (#96)
- Llama 3.3: Meta Llama 3.3 70b on NVIDIA 48G RTX-A6000 or Apple M4 Max 40-core 48G, compared to OpenAI o1 pro (#97)
- Mistral-Large-Instruct: Mistral-large 123b-instruct-2411-q2_K (45G) on NVIDIA 48G RTX-A6000 or Apple M4 Max 40-core 48G, compared to OpenAI o1 pro (#98)
- Claude Sonnet 3.5: Claude Sonnet 3.5 (#99)
- Gemma 2: Google Gemma 2 27b on CUDA and Metal (#31)
- GPU/CPU performance from the bottom up: https://github.com/ObrienlabsDev/performance
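The model/GPU pairings above are driven mostly by quantized weight size versus available VRAM (e.g. the 45G q2_K Mistral-Large build against a 48G RTX-A6000). A minimal sketch of that back-of-envelope calculation; the bits-per-weight figures are assumed approximations for common GGUF K-quants, not values from this repo:

```python
def estimate_vram_gb(params_billion: float, bits_per_weight: float) -> float:
    """Rough weight-only footprint in GB; KV cache and activations add several more GB."""
    return params_billion * bits_per_weight / 8

# Assumed approximate bits-per-weight for GGUF quantizations
Q2_K, Q4_K_M = 2.9, 4.8

# Mistral-Large 123b at q2_K: roughly the 45G listed above, so it fits in 48G
print(f"123b q2_K:   {estimate_vram_gb(123, Q2_K):.1f} GB")
# Llama 3.3 70b at q4_K_M also lands under a 48G card
print(f"70b q4_K_M: {estimate_vram_gb(70, Q4_K_M):.1f} GB")
```

This is only a lower bound on memory; headroom for the KV cache grows with context length, which is why the 70b and 123b builds sit close to the 48G limit.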