Replies: 1 comment
I'm using libfabric with verbs in my Linux app and am now trying to port it. I have a problem with verbs on Windows 11 too.
Bug Report: fi_pingpong fails with verbs provider on Windows (Error -10038 / WSAENOTCONN)
Summary
The
Environment
Reproduction Steps
cd C:\Users\someusername\libfabric\x64\Debug-v142
fi_pingpong.exe -p verbs -e msg -I 5 -s 192.168.40.144 # Server (background)
fi_pingpong.exe -p verbs -e msg -I 5 -s 192.168.40.144 192.168.40.144 # Client
Expected Behavior
fi_pingpong should complete successfully with verbs provider, similar to tcp provider.
Actual Behavior
TCP Provider (Works)
Verbs Provider (Fails)
With verbose output:
Server-Side Output (Verbs)
The server successfully sends the name but does not proceed to open fabric resources, indicating the client connection is being reset before the domain can be opened.
Available Providers (fi_info.exe -p verbs)
Analysis
Potential Root Causes
Related Code Locations
Workaround
Use the
fi_pingpong.exe -p tcp -e rdm -I 5 # Server
fi_pingpong.exe -p tcp -e rdm -I 5 localhost # Client
Additional Notes
Request
Please investigate:
-
I’m trying to use the libfabric verbs provider on Windows 11 with a Mellanox ConnectX-6 NIC for RDMA communication, but fi_info -p verbs and the fi_pingpong test do not work as expected. fi_info.exe doesn’t show any devices for the verbs provider, although with the -l argument I can see that verbs is listed (version: 203.10). Running the same version of libfabric on Ubuntu 22.04 with identical Mellanox hardware detects verbs correctly, so this doesn’t appear to be a hardware issue.
I'm getting ret=-120 "no message available".
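For anyone decoding that value: libfabric calls return negated errno-style codes, and in the Windows CRT headers ENODATA is 120 (on Linux it is 61), with the message text "no message available". fi_getinfo is documented to return -FI_ENODATA when nothing matches the request, which would be consistent with the verbs provider not finding a usable device. Below is a minimal sketch of the lookup, assuming the Windows build of libfabric reuses the CRT errno values. The table and the helper name `decode_ret` are illustrative only, not part of libfabric.

```python
# Assumption: these numbers come from the POSIX supplement block of the
# Windows CRT <errno.h>; they are NOT the Linux values (ENODATA is 61 there).
MSVC_ERRNO_NAMES = {
    107: "ECONNREFUSED",
    108: "ECONNRESET",
    120: "ENODATA",
}


def decode_ret(ret):
    """Return the errno name for a negative return value, or None if unknown."""
    if ret >= 0:
        return None  # non-negative values are not errors
    return MSVC_ERRNO_NAMES.get(-ret)


if __name__ == "__main__":
    print(decode_ret(-120))
```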
Here is the full log output with FI_LOG_LEVEL=info set:
fi_info -p verbs
Here is the Get-NetAdapterRdma output from Windows PowerShell:
Name                InterfaceDescription                   Enabled  Operational  PFC    ETS
Ethernet            Mellanox ConnectX-6 Dx Adapter #3      True     False        False  False
vEthernet (Default  Hyper-V Virtual Ethernet Adapter       False    False        NA     NA
vEthernet (WSL (H   Hyper-V Virtual Ethernet Adapter #2    False    False        NA     NA
Ethernet 2          Mellanox ConnectX-6 Dx Adapter #4      True     False        False  False
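One detail visible in this output is that both ConnectX-6 ports report Enabled as True but Operational as False. Below is a small sketch for pulling such rows out of pasted Get-NetAdapterRdma text. It assumes the last four columns are always single tokens and splits each row from the right. The function names are hypothetical helpers, not part of any tool.

```python
# Sample rows copied from the Get-NetAdapterRdma output (header omitted).
SAMPLE = """\
Ethernet Mellanox ConnectX-6 Dx Adapter #3 True False False False
vEthernet (Default Hyper-V Virtual Ethernet Adapter False False NA NA
vEthernet (WSL (H Hyper-V Virtual Ethernet Adapter #2 False False NA NA
Ethernet 2 Mellanox ConnectX-6 Dx Adapter #4 True False False False
"""


def parse_rdma_rows(text):
    """Split each row from the right: the last four fields are Enabled,
    Operational, PFC and ETS; everything before is name plus description."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        adapter, enabled, operational, pfc, ets = line.rsplit(None, 4)
        rows.append({
            "adapter": adapter,
            "enabled": enabled == "True",
            "operational": operational == "True",
            "pfc": pfc,
            "ets": ets,
        })
    return rows


def enabled_but_not_operational(rows):
    """Adapters where RDMA is enabled but not reported as operational."""
    return [r["adapter"] for r in rows if r["enabled"] and not r["operational"]]


if __name__ == "__main__":
    for name in enabled_but_not_operational(parse_rdma_rows(SAMPLE)):
        print(name)
```

Because the Name and InterfaceDescription columns both contain spaces, the sketch keeps them together as one string rather than guessing where one ends and the other begins.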
System Configuration:
- OS: Windows 11 Pro (24H2)
- Hardware: Mellanox ConnectX-6 Dx 100 GbE
- Driver: WinOF-2 version 25.7.50000 for Windows Client 11, https://network.nvidia.com/products/adapter-software/ethernet/windows/winof-2/
- libfabric version: 2.3.1 (built from source using Visual Studio 2022, Release-v142 x64), with header files extracted from the downloaded NetworkDirect_DDK.zip:\NetDirect\include\ into include\windows
- RoCE: Enabled in mlxconfig (ROCE_V2=1, ROCE_ENABLE=2)