Qwen2 72b-instruct on NVIDIA 48G RTX-A6000 or Apple M4 Max 40 core 48G compared to OpenAI o1 pro #96

Open
@obriensystems

Description

https://ollama.com/library/qwen2:72b-instruct

See the SWE-bench Verified dataset:
https://huggingface.co/datasets/princeton-nlp/SWE-bench_Verified
Uses YaRN (context window extension):
https://arxiv.org/pdf/2309.00071

Prompt
Can we go over creating a mandelbrot image using cuda c. I would like to fully utilize all the cores of the GPU. I am using specific GPUs from NVidia including the RTX-3500 Ada mobile with 5120 cores and the RTX-4090 with 16384 cores.
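
For reference when judging the model answers, here is a minimal sketch of the kind of kernel this prompt asks for. It is not taken from any of the model outputs. The names (mandel_iters, mandel_kernel), the 1024x1024 image size, the 16x16 block shape, the 256 iteration cap and the viewport are all assumptions chosen for illustration. It launches one thread per pixel, so the roughly one million threads far exceed the 5120 or 16384 cores mentioned above and every SM should receive work.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Escape-time iteration count for the point c = (cx, cy).
// Hypothetical helper; callable from host and device.
__host__ __device__ inline int mandel_iters(float cx, float cy, int max_iter) {
    float x = 0.0f, y = 0.0f;
    int i = 0;
    while (i < max_iter && x * x + y * y <= 4.0f) {
        float xt = x * x - y * y + cx;  // real part of z^2 + c
        y = 2.0f * x * y + cy;          // imaginary part of z^2 + c
        x = xt;
        ++i;
    }
    return i;
}

// One thread per pixel; threads outside the image exit early.
__global__ void mandel_kernel(int *out, int width, int height, int max_iter) {
    int px = blockIdx.x * blockDim.x + threadIdx.x;
    int py = blockIdx.y * blockDim.y + threadIdx.y;
    if (px >= width || py >= height) return;
    float cx = -2.0f + 3.0f * px / width;   // assumed viewport: real axis [-2, 1)
    float cy = -1.5f + 3.0f * py / height;  // assumed viewport: imaginary axis [-1.5, 1.5)
    out[py * width + px] = mandel_iters(cx, cy, max_iter);
}

int main() {
    const int width = 1024, height = 1024, max_iter = 256;  // assumed sizes
    const size_t bytes = size_t(width) * height * sizeof(int);

    int *d_out = nullptr;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) {
        fprintf(stderr, "cudaMalloc failed\n");
        return 1;
    }

    // Round the grid up so it covers the whole image.
    dim3 block(16, 16);
    dim3 grid((width + block.x - 1) / block.x, (height + block.y - 1) / block.y);
    mandel_kernel<<<grid, block>>>(d_out, width, height, max_iter);

    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
        cudaFree(d_out);
        return 1;
    }

    int *h_out = (int *)malloc(bytes);
    if (h_out == nullptr) {
        cudaFree(d_out);
        return 1;
    }
    cudaMemcpy(h_out, d_out, bytes, cudaMemcpyDeviceToHost);
    printf("iterations at image center: %d\n", h_out[(height / 2) * width + width / 2]);

    free(h_out);
    cudaFree(d_out);
    return 0;
}
```

Build with something like nvcc -O2 mandelbrot.cu -o mandelbrot (the file name is assumed). The same launch configuration works on both GPUs named in the prompt, since the grid is sized from the image rather than from the core count.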

Observation

  • Qwen2 seems confused by the RTX-3500 Ada mobile, mistaking it for the Ampere-generation RTX-3050. OpenAI o1 pro and deepseek-r1 handle the 3500 correctly.
