Qwen2 72b-instruct on NVIDIA 48G RTX-A6000 or Apple M4 Max 40 core 48G compared to OpenAI o1 pro

https://ollama.com/library/qwen2:72b-instruct

see
https://huggingface.co/datasets/princeton-nlp/SWE-bench_Verified
uses yarn
https://arxiv.org/pdf/2309.00071


prompt
Can we go over creating a mandelbrot image using cuda c. I would like to fully utilize all the cores of the GPU. I am using specific GPUs from NVidia including the RTX-3500 Ada mobile with 5120 cores and the RTX-4090 with 16384 cores.


Observation
- Qwen2 seems confused over the RTX-3500 Ada mobile  - confusing it with the Amper generation RTX-3050, OpenAI O1 pro and deepseek-r1 are ok with the 3500


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qwen2 72b-instruct on NVIDIA 48G RTX-A6000 or Apple M4 Max 40 core 48G compared to OpenAI o1 pro #96

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Qwen2 72b-instruct on NVIDIA 48G RTX-A6000 or Apple M4 Max 40 core 48G compared to OpenAI o1 pro #96

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions