Update threading configuration for NVL and PTL by sunxiaoxia2022 · Pull Request #36367 · openvinotoolkit/openvino

sunxiaoxia2022 · 2026-06-12T03:07:22Z

Improve hybrid CPU latency heuristics to enable more ALL/AUTO configurations on high LP E-core-share platforms.
Add new model-profile-based AUTO rules using lp_ecore_share, memory tolerance, convolution pressure, and GEMM ratio.
Refine latency stream sizing so model_prefer_threads is honored when building single-stream ALL core configurations.
Extend CPUStreamsExecutor internal stream metadata from a single core type to multiple core types.
Update task arena creation logic to bind streams by the actual set of eligible core types instead of forcing a single-type binding.
Add unit coverage for mixed-core stream metadata and stream type selection behavior.

AI assistance used: yes
If yes, summarize how AI was used and what human validation was performed (build/tests/manual checks).

sunxiaoxia2022 added 4 commits June 10, 2026 15:47

extend core_type to core_types

a94549b

add LPEcore usage on platform with LPEcore

1c5720d

add condition of using LPEcore for PTL

6866d37

remove all core static

59942fa

sunxiaoxia2022 requested review from peterchen-intel and wangleis June 12, 2026 03:07

github-actions Bot added category: inference OpenVINO Runtime library - Inference category: CPU OpenVINO CPU plugin labels Jun 12, 2026

Provide feedback