[Feature Request] Global Threadpool in Python API #23523

Open
alex-halpin opened this issue Jan 28, 2025 · 5 comments
Labels
feature request request for unsupported feature or enhancement

Comments

@alex-halpin

Describe the feature request

Expose the ability to use a global thread pool for inference sessions in the Python API.

Describe scenario use case

My current use case requires instantiating many (thousands of) small ONNX models in memory at once. Doing so causes too many threads to be spawned, which halts the program. The functionality for a global thread pool exists in the C++ source but is not exposed in the Python bindings.
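For context, a minimal sketch of the current behavior that causes the problem (the model list is a stand-in for the thousands of small models described above):

import onnxruntime as ort

# With default SessionOptions, every InferenceSession creates its own
# intra-op and inter-op thread pools, so the total thread count grows
# with the number of resident sessions.
model_blobs: list[bytes] = []  # placeholder: thousands of small serialized ONNX models
sessions = [ort.InferenceSession(blob) for blob in model_blobs]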

@alex-halpin alex-halpin added the feature request request for unsupported feature or enhancement label Jan 28, 2025
@alex-halpin
Author

I have linked a fork that I have tested and confirmed working on my MacBook, with the following implementation as an example use case:

import numpy as np
import onnxruntime as ort

# New functionality from the fork: size the shared global thread pools
# once for the whole process.
ort.set_global_thread_pool_sizes(64, 64)

class OnnxRunner:

    # Newly exposed to the Python API: opt this session out of per-session
    # thread pools so it uses the global pools configured above.
    sess_options = ort.SessionOptions()
    sess_options.use_per_session_threads = False

    def __init__(self, model: bytes):
        self.session = ort.InferenceSession(model, sess_options=self.sess_options)

    def predict(self, x: np.ndarray) -> np.ndarray:
        x = x.astype("float32")
        y = self.session.run(["output"], {"input": x})
        return y[0]
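Usage would then look roughly like the sketch below; the model list and input shape are placeholders, and every runner shares the global pools sized above rather than spawning its own:

# Hypothetical usage: all runners share the process-wide thread pools.
runners = [OnnxRunner(blob) for blob in model_blobs]  # model_blobs: placeholder list of serialized models
result = runners[0].predict(np.zeros((1, 4), dtype=np.float32))  # input shape is a placeholder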

@yuslepukhin
Member

Would you be willing to submit a PR?

@alex-halpin
Author

Would you be willing to submit a PR?

Yes, I have an open PR here.

@khoover

khoover commented Mar 21, 2025

+1 to this. We're in a similar situation, with thousands of small models resident simultaneously, and we don't want that many thread pools spun up and sitting idle.

@alex-halpin
Author

+1 to this. We're in a similar situation, with thousands of small models resident simultaneously, and we don't want that many thread pools spun up and sitting idle.

My PR is still open, but I unfortunately don't have the knowledge or bandwidth to push it further at the moment. I threw it together as a POC for my team, but we ended up just going with:

inter_op_num_threads = 1
intra_op_num_threads = 1

In local testing this worked to pin the inference sessions to a single shared thread, which was sufficient for our use case.
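A minimal sketch of that workaround using the SessionOptions fields that already exist in the released Python API (the model path here is a placeholder):

import onnxruntime as ort

# Force each session to run single-threaded so thousands of resident
# sessions don't each spin up their own thread pools.
sess_options = ort.SessionOptions()
sess_options.intra_op_num_threads = 1
sess_options.inter_op_num_threads = 1

session = ort.InferenceSession("model.onnx", sess_options=sess_options)  # placeholder path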
