What is an integer overflow in tensor shape validation?

It occurs when a machine learning model file specifies tensor dimensions so large that multiplying them together exceeds the maximum integer value, wrapping around to a small number and causing undersized memory allocations.

How do you prevent integer overflow in C++ memory allocations?

Validate all externally-sourced dimension values against known safe upper bounds before using them in arithmetic or allocation calls, and use checked multiplication functions to detect overflow.

What CWE is integer overflow?

CWE-190 (Integer Overflow or Wraparound) — where arithmetic on integers produces a value outside the representable range, wrapping to an unexpected value.

Is checking for zero-size allocations enough to prevent this vulnerability?

No. While zero-size checks prevent one attack vector, you must also validate upper bounds to prevent integer overflow when dimensions are multiplied together for allocation size calculations.

Can static analysis detect integer overflow in tensor shapes?

Yes, advanced static analyzers and AI-based scanners can trace data flow from model file parsing (source) to allocation calls (sink) and flag missing validation, as demonstrated by the multi_agent_ai scanner that detected this issue.

Tensor Shape Integer Overflow in C++ Fix

How Integer Overflow in Tensor Shape Validation Happens in C++ with OpenVINO and How to Fix It

Introduction

The file tools/plugin/modules/ov_noise_suppression/noise_suppression_interface.cpp implements an OpenVINO-based noise suppression plugin that loads neural network models and processes audio data. At line 87, the code retrieves input tensor shapes directly from model files using nd->model->input("input").get_shape() — but a critical flaw meant those dimensions were never validated before being used in subsequent memory allocations. A crafted .xml/.bin model file with dimensions set to 0x7FFFFFFFFFFFFFFF or 0 could trigger integer overflow or zero-size allocation, leading to heap corruption and potential arbitrary code execution.

This vulnerability is particularly dangerous because model files are often distributed as external assets, downloaded from repositories, or provided by users — making them a realistic attack vector in production audio processing pipelines.

The Vulnerability Explained

When the noise suppression plugin initializes, it loads an OpenVINO model and retrieves the expected input tensor shape:

nd->inp_shape = nd->model->input("input").get_shape();

This single line at line 87 trusts whatever dimensions the model file declares. The shape is a std::vector of dimension values that are subsequently used to allocate buffers for audio processing. Here's what goes wrong:

Attack Scenario 1: Integer Overflow

If an attacker crafts a model file where the input shape is [9223372036854775807, 1024] (INT64_MAX × 1024), any subsequent multiplication of these dimensions to compute buffer sizes will overflow. For example:

// Hypothetical allocation downstream
size_t buffer_size = shape[0] * shape[1] * sizeof(float);
// 9223372036854775807 * 1024 * 4 = wraps to a tiny value
float* buffer = new float[buffer_size]; // Allocates ~few bytes
// Subsequent writes overflow the heap

The result is a small heap allocation followed by writes that corrupt adjacent memory — a classic heap buffer overflow.

Attack Scenario 2: Zero-Size Dimensions

If dimensions are set to [0, 0], the allocation succeeds (many allocators return a valid pointer for zero-size requests), but subsequent processing code writes data assuming non-zero dimensions, causing out-of-bounds memory access.

Real-World Impact

This plugin runs in audio processing pipelines. An attacker who can influence which model file is loaded — through a compromised model repository, man-in-the-middle on download, or a malicious plugin configuration — gains the ability to corrupt memory in the host process. In audio/media frameworks, this can lead to denial of service or, in worst cases, remote code execution.

The Fix

The fix introduces a compile-time constant NS_MAX_SHAPE_DIM and validates every dimension immediately after retrieval:

Before (vulnerable):

nd->inp_shape = nd->model->input("input").get_shape();
// Dimensions used directly — no validation

After (fixed):

#define NS_MAX_SHAPE_DIM (1u << 24)

// ... inside the initialization function:
nd->inp_shape = nd->model->input("input").get_shape();
for (auto dim : nd->inp_shape) {
    // Reject zero-sized or excessively large dimensions
    if (dim == 0 || dim > NS_MAX_SHAPE_DIM) {
        // Error handling: refuse to proceed with invalid model
        return -EINVAL;
    }
}

The key design decisions:

NS_MAX_SHAPE_DIM set to 1 << 24 (16,777,216): This is generous enough for any legitimate audio processing tensor (typical noise suppression models use dimensions like [1, 480]) while preventing overflow when dimensions are multiplied together. Even 16M × 16M × 4 bytes would be 1 PB — clearly invalid — so capping individual dimensions prevents overflow in any reasonable downstream arithmetic.
Validation immediately after get_shape(): By checking dimensions at the earliest possible point, the fix ensures no downstream code can operate on invalid values. This follows the principle of validating at the trust boundary (model file → application memory).
Both zero and overflow protection: The check rejects both dim == 0 (preventing zero-size allocations) and dim > NS_MAX_SHAPE_DIM (preventing overflow), covering both attack vectors.

The CMakeLists.txt changes add regression tests that verify the validation holds under adversarial inputs:

add_test(NAME ns_shape_zero
    COMMAND test_ns_shape_validation --dim 0 --dim 480)
add_test(NAME ns_shape_overflow
    COMMAND test_ns_shape_validation --dim 9223372036854775807 --dim 1024)

These tests ensure that the security invariant is maintained: models with zero or INT64_MAX dimensions are rejected gracefully rather than causing memory corruption.

Prevention & Best Practices

1. Validate All External Dimensions at Trust Boundaries

Any time your code reads shape, size, or count values from external files (model files, configuration, serialized data), validate them before use:

// Pattern: validate immediately after reading
auto shape = model.get_shape();
for (auto dim : shape) {
    if (dim <= 0 || dim > MAX_SAFE_DIM) {
        return error("Invalid dimension in model file");
    }
}

2. Use Checked Arithmetic for Allocation Sizes

Even with dimension validation, use overflow-safe multiplication when computing allocation sizes:

// GCC/Clang built-in
size_t total;
if (__builtin_mul_overflow(dim_a, dim_b, &total)) {
    return error("Allocation size overflow");
}

3. Treat Model Files as Untrusted Input

ML model files (.xml, .bin, .onnx, .pb) are complex binary formats that should be treated with the same suspicion as user-uploaded files. Validate all metadata before acting on it.

4. Define Maximum Safe Bounds as Constants

Using named constants like NS_MAX_SHAPE_DIM makes the security invariant explicit, documentable, and easy to audit. It's far better than ad-hoc checks scattered throughout the code.

5. Add Regression Tests for Adversarial Inputs

The PR includes tests with INT64_MAX and 0 dimensions — these serve as permanent guards against regression and document the threat model.

Key Takeaways

Never trust tensor dimensions from model files — the get_shape() call at line 87 returned attacker-controlled values that flowed directly into allocation logic without any bounds checking.
NS_MAX_SHAPE_DIM (1u << 24) is the right granularity — validate individual dimensions, not just total allocation size, because overflow can occur during intermediate multiplication steps.
Zero-size dimensions are as dangerous as huge ones — a dimension of 0 passes most "less than MAX" checks but causes zero-size allocations followed by writes, which is undefined behavior.
Audio/ML plugins are high-value targets — they process external model files and run in privileged media pipelines, making input validation critical.
Regression tests with adversarial values (INT64_MAX, 0) permanently encode the security boundary in CI, preventing future developers from accidentally removing the guard.

How Orbis AppSec Detected This

Source: OpenVINO model file (.xml/.bin) loaded via ov::Core::read_model(), with tensor shape dimensions as the tainted data
Sink: nd->model->input("input").get_shape() at noise_suppression_interface.cpp:87, where dimensions flow into memory allocation calculations
Missing control: No validation of dimension values against safe bounds (neither zero-check nor upper-bound check)
CWE: CWE-190 (Integer Overflow or Wraparound)
Fix: Added NS_MAX_SHAPE_DIM constant (1<<24) and a validation loop that rejects any dimension that is zero or exceeds the maximum safe value

Orbis AppSec automatically detected this vulnerability and opened a pull request with the fix. Try Orbis AppSec on your repositories to find and fix issues like this automatically.

Conclusion

This vulnerability demonstrates a subtle but dangerous pattern in ML-integrated applications: trusting metadata from model files without validation. The OpenVINO noise suppression plugin's get_shape() call returned dimensions directly from an external file, and those dimensions were used in allocation arithmetic without any bounds checking. The fix is elegant in its simplicity — a single constant and a validation loop — but it closes both the integer overflow and zero-size allocation attack vectors simultaneously. When working with any external data that influences memory allocation sizes, always validate at the trust boundary, define explicit safe bounds, and test with adversarial values.

cwe	CWE-190
fix	Added `NS_MAX_SHAPE_DIM` (1<<24) bound check on each dimension after `get_shape()` call
risk	Out-of-bounds memory access, heap corruption, potential code execution
language	C++
root cause	Model input tensor dimensions from `get_shape()` used in allocations without range validation
vulnerability	Integer overflow via unchecked tensor shape dimensions

How integer overflow in tensor shape validation happens in C++ with OpenVINO and how to fix it

Answer Summary

Vulnerability at a Glance

How Integer Overflow in Tensor Shape Validation Happens in C++ with OpenVINO and How to Fix It

Introduction

The Vulnerability Explained

The Fix

Prevention & Best Practices

Key Takeaways

How Orbis AppSec Detected This

Conclusion

References

Frequently Asked Questions

What is an integer overflow in tensor shape validation?

How do you prevent integer overflow in C++ memory allocations?

What CWE is integer overflow?

Is checking for zero-size allocations enough to prevent this vulnerability?

Can static analysis detect integer overflow in tensor shapes?

View the Security Fix

Related Articles

How heap buffer overflow happens in C memcpy() with untrusted PDU length and how to fix it

How integer overflow in _MALLOC() happens in C emulator memory allocation and how to fix it

How strcpy buffer overflow happens in C++ debugger command handling and how to fix it

How command injection happens in Python subprocess and how to fix it

How command injection happens in Python subprocess and how to fix it

How buffer overflow happens in C kernel PTY subsystem (tty_ptmx.c) and how to fix it