avoid too high number of threads on multi socket system and with HT by dtrawins · Pull Request #4223 · openvinotoolkit/model_server

dtrawins · 2026-05-19T15:13:47Z

🛠 Summary

Problems to solve:
on dual socket host with HT, number of threads was too high in latency model. Optimal is number of physical cores on one socket.
Without docker on linux, OV should apply all defaults
When container has quota, number of threads should be like number of allocated cores.

🧪 Checklist

Unit tests added.
The documentation updated.
Change follows security best practices.
``

Copilot

Pull request overview

This PR introduces additional CPU-topology detection utilities and updates OpenVINO CPU default property selection to reduce excessive thread counts (especially on multi-socket / hyper-threaded systems) when running under Linux containers.

Changes:

Added Linux helpers to detect physical CPU cores and socket count via /proc/cpuinfo.
Updated applyDefaultCpuProperties() to apply container-aware defaults (pinning, streams, and thread count) based on performance hint and detected topology.
Changed error handling in applyDefaultCpuProperties() to log warnings instead of failing with an error status.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
`src/systeminfo.hpp`	Declares new Linux CPU-topology helper APIs (physical cores, sockets).
`src/systeminfo.cpp`	Implements CPU-topology detection by parsing `/proc/cpuinfo`.
`src/ov_utils.cpp`	Adjusts default OpenVINO CPU properties in Docker, including thread-count logic based on topology.

+uint16_t getNumberOfPhysicalCores() {
+    std::set<std::string> uniqueCores;
+    std::ifstream cpuInfo("/proc/cpuinfo");
+    if (!cpuInfo.is_open()) {
+        return std::max<uint16_t>(static_cast<uint16_t>(std::thread::hardware_concurrency()), 1);
+    }
+    std::string line;
+    while (std::getline(cpuInfo, line)) {
+        if (line.find("core id") != std::string::npos) {
+            uniqueCores.insert(line);
+        }
+    }
+    if (uniqueCores.empty()) {
+        return std::max<uint16_t>(static_cast<uint16_t>(std::thread::hardware_concurrency()), 1);
+    }
+    return static_cast<uint16_t>(uniqueCores.size());
+}
+
+uint16_t getNumberOfSockets() {
+    std::set<std::string> uniqueSockets;
+    std::ifstream cpuInfo("/proc/cpuinfo");
+    if (!cpuInfo.is_open()) {
+        return 1;
+    }
+    std::string line;
+    while (std::getline(cpuInfo, line)) {
+        if (line.find("physical id") != std::string::npos) {
+            uniqueSockets.insert(line);
+        }
+    }
+    if (uniqueSockets.empty()) {
+        return 1;
+    }
+    return static_cast<uint16_t>(uniqueSockets.size());


    try {
-        const uint16_t coreCount = getCoreCount();
-        const uint16_t sanitizedCoreCount = coreCount > 0 ? coreCount : 1;
-
-        if (properties.find(ov::inference_num_threads.name()) == properties.end()) {
-            properties[ov::inference_num_threads.name()] = static_cast<int>(sanitizedCoreCount);
-            SPDLOG_DEBUG("applyDefaultCpuProperties: setting inference_num_threads to {}", sanitizedCoreCount);
+        if (!isRunningInDocker()) {
+            return StatusCode::OK;
        }
+        const uint16_t coreCount = getCoreCount();


+                const uint16_t sockets = getNumberOfSockets();
+                numThreads = std::max(1, static_cast<int>(getNumberOfPhysicalCores() / sockets));


    } catch (const std::exception& ex) {
-        SPDLOG_ERROR("Exception while applying default CPU properties: {}", ex.what());
-        return StatusCode::INTERNAL_ERROR;
+        SPDLOG_WARN("Exception while applying default CPU properties: {}", ex.what());
    } catch (...) {
-        SPDLOG_ERROR("Unknown exception while applying default CPU properties");
-        return StatusCode::INTERNAL_ERROR;
+        SPDLOG_WARN("Unknown exception while applying default CPU properties");
    }


mzegla · 2026-05-20T09:52:27Z

+        if (!isRunningInDocker()) {
+            return StatusCode::OK;
        }
+        const uint16_t coreCount = getCoreCount();


No need for >0 check?

mzegla · 2026-05-20T09:55:27Z

+        return std::max<uint16_t>(static_cast<uint16_t>(std::thread::hardware_concurrency()), 1);
+    }
+    std::string line;
+    while (std::getline(cpuInfo, line)) {


Do we treat cpu info file as trusted?
We read, iterate and load lines to memory so potentially malicious cpu info could overflow memory.

avoid too high number of threads on multi socket system and with HT

fe6c7c1

dtrawins requested review from Copilot, dkalinowski and mzegla May 19, 2026 15:13

Copilot started reviewing on behalf of dtrawins May 19, 2026 15:14 View session

Copilot AI reviewed May 19, 2026

View reviewed changes

dtrawins added 2 commits May 19, 2026 17:27

fix situation with quota

4265f5e

fix cores detection

3efed84

dtrawins added this to the 2026.2_rc milestone May 19, 2026

style

5674f65

mzegla reviewed May 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

avoid too high number of threads on multi socket system and with HT#4223

avoid too high number of threads on multi socket system and with HT#4223
dtrawins wants to merge 4 commits into
mainfrom
tune_threads

dtrawins commented May 19, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

mzegla May 20, 2026

Uh oh!

mzegla May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		const uint16_t sockets = getNumberOfSockets();
		numThreads = std::max(1, static_cast<int>(getNumberOfPhysicalCores() / sockets));

Conversation

dtrawins commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🛠 Summary

🧪 Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

mzegla May 20, 2026

Choose a reason for hiding this comment

Uh oh!

mzegla May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dtrawins commented May 19, 2026 •

edited

Loading