Implement new Python APIs #25999

yuslepukhin · 2025-09-09T19:16:17Z

Description

This pull request introduces several enhancements to ONNX Runtime's Python and C++ APIs, focusing on improved device and memory information handling, synchronization stream support, and tensor copy functionality. It adds new Python bindings for device/memory types, exposes more detailed session input/output metadata, and provides a Python-accessible tensor copy API. The changes also refactor and extend the C++ API for better stream and memory info management.

Key changes include:

Device and Memory Information Enhancements

Added Python bindings for OrtMemoryInfoDeviceType, OrtDeviceMemoryType, and expanded OrtDevice to expose the memory type via a new mem_type method. The OrtMemoryInfo Python class now supports both legacy and new V2 constructors and exposes additional properties such as device memory type and vendor ID. [1] [2] [3]
Extended the Python InferenceSession object to provide access to input/output OrtMemoryInfo and OrtEpDevice objects through new properties and methods. [1] [2] [3] [4]

Synchronization Stream and Execution Provider Device Support

Introduced Python bindings for OrtSyncStream, including creation via OrtEpDevice.create_sync_stream() and retrieval of device-specific OrtMemoryInfo via OrtEpDevice.memory_info(). [1] [2]
Refactored the C++ API to generalize SyncStream handling, allowing for unowned streams and improved type safety. [1] [2]

Tensor Copy Functionality

Added a new Python-level copy_tensors function and corresponding C++ binding, enabling efficient copying of tensor data between OrtValue objects, optionally using a synchronization stream. [1] [2] [3]

Miscellaneous Improvements and Fixes

Changed the return type of the OrtValue.data_ptr method in the Python binding from int64_t to uintptr_t for better cross-platform compatibility. [1] [2]
Minor improvements to error messages and device type handling in the Python API (e.g., for OrtDevice). [1] [2]
Included necessary C++ includes for plugin stream support.

These changes collectively improve the flexibility and introspection capabilities of ONNX Runtime's device, memory, and execution provider interfaces, and make advanced features available to Python users.

Motivation and Context

Depends on: #26021

Also: AttributeError: 'InferenceSession' object has no attribute 'inputs_meminfo'

copy_tensors fails no data transfer to copy from CPU to CPU. lintrunner complains OrtSyncStream is undefined.

Copilot

Pull Request Overview

This PR introduces significant enhancements to ONNX Runtime's Python and C++ APIs, focusing on device and memory management, synchronization streams, and tensor operations. The changes provide better introspection capabilities and make advanced features previously only available in C++ accessible from Python.

Adds comprehensive Python bindings for device/memory types and execution provider device handling
Introduces synchronization stream support with Python-accessible APIs
Implements a new tensor copy functionality for efficient data transfer between OrtValue objects

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
onnxruntime/test/python/onnxruntime_test_python_autoep.py	Adds tests for new EP device memory info, sync stream creation, and tensor copy functionality
onnxruntime/test/python/onnxruntime_test_python.py	Adds tests for new session memory info properties and device/memory info APIs
onnxruntime/python/onnxruntime_pybind_state.cc	Core implementation of new Python bindings for device types, memory info, sync streams, and tensor copy
onnxruntime/python/onnxruntime_pybind_ortvalue.cc	Changes OrtValue data_ptr return type from int64_t to uintptr_t for better cross-platform compatibility
onnxruntime/python/onnxruntime_inference_collection.py	Adds Python wrapper methods for accessing session memory info and EP devices, plus tensor copy function
onnxruntime/init.py	Exports new public APIs for device/memory types and tensor copy functionality
include/onnxruntime/core/session/onnxruntime_cxx_inline.h	Refactors sync stream implementation to support templated approach
include/onnxruntime/core/session/onnxruntime_cxx_api.h	Generalizes SyncStream handling with template-based implementation for better type safety

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

onnxruntime/test/python/onnxruntime_test_python_autoep.py

onnxruntime/python/onnxruntime_pybind_state.cc

yuslepukhin added 8 commits September 4, 2025 13:35

Begin

cece80d

Make name an std::string in OrtMemoryInfo

c053030

Implement OrtMemoryInfo interfaces

9d8e5ce

Passing None to copy_tensors does not work.

d58cecf

Also: AttributeError: 'InferenceSession' object has no attribute 'inputs_meminfo'

Address a bug when a number of meminfos requested is always for inputs

37d3ee5

Two issues:

3d416f9

copy_tensors fails no data transfer to copy from CPU to CPU. lintrunner complains OrtSyncStream is undefined.

Test copy_tensors

f061e75

Merge branch 'main' into yuslepukhin/cs_python

543133f

yuslepukhin requested review from skottmckay, adrianlizarraga and Copilot September 9, 2025 19:16

Copilot AI reviewed Sep 9, 2025

View reviewed changes

onnxruntime/test/python/onnxruntime_test_python_autoep.py Outdated Show resolved Hide resolved

onnxruntime/python/onnxruntime_pybind_state.cc Outdated Show resolved Hide resolved

yuslepukhin added the release:1.23.0 label Sep 9, 2025

jywu-msft removed the release:1.23.0 label Sep 10, 2025

yuslepukhin added 2 commits September 11, 2025 13:58

Address co-pilot comments

0660951

Merge branch 'main' into yuslepukhin/cs_python

f2e9213

adrianlizarraga reviewed Sep 16, 2025

View reviewed changes

onnxruntime/python/onnxruntime_pybind_state.cc Outdated Show resolved Hide resolved

onnxruntime/python/onnxruntime_pybind_state.cc Outdated Show resolved Hide resolved

Address review comments

5c7e462

adrianlizarraga approved these changes Sep 17, 2025

View reviewed changes

yuslepukhin merged commit abc63e8 into main Sep 17, 2025
87 of 92 checks passed

yuslepukhin deleted the yuslepukhin/cs_python branch September 17, 2025 18:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement new Python APIs #25999

Implement new Python APIs #25999

Uh oh!

yuslepukhin commented Sep 9, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Implement new Python APIs #25999

Implement new Python APIs #25999

Uh oh!

Conversation

yuslepukhin commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Device and Memory Information Enhancements

Synchronization Stream and Execution Provider Device Support

Tensor Copy Functionality

Miscellaneous Improvements and Fixes

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

yuslepukhin commented Sep 9, 2025 •

edited

Loading