What is a heap out-of-bounds read?

A heap out-of-bounds read occurs when a program reads memory beyond the allocated boundary of a heap buffer, potentially crashing the process or exposing adjacent memory contents to an attacker.

How do you prevent heap out-of-bounds reads in C++?

Always validate that the source buffer's size is greater than or equal to the number of bytes you intend to copy before calling memcpy, memmove, or similar functions. Prefer safer abstractions like std::span or range-checked container methods when possible.

What CWE is heap out-of-bounds read?

Heap out-of-bounds read is classified as CWE-125: Out-of-bounds Read.

Is address space layout randomization (ASLR) enough to prevent this vulnerability?

No. ASLR raises the bar for exploitation but does not prevent the read itself. An attacker can still trigger a crash (denial of service) and, in certain scenarios, can still leak memory contents even with ASLR enabled.

Can static analysis detect heap out-of-bounds reads like this one?

Yes. Static analysis tools such as Semgrep, Coverity, and CodeQL can flag memcpy calls where the source size is not validated against the copy length. Dynamic tools like AddressSanitizer (ASan) and Valgrind can also catch these at runtime during testing.

Heap Out-of-Bounds Read in GLTF Loader: How a Missing Bounds Check Could Crash Your App or Leak Memory

Introduction

There's a class of vulnerability that has haunted C and C++ codebases for decades — the out-of-bounds read. It doesn't always get the same dramatic attention as a remote code execution bug, but it can be just as dangerous. At best, it crashes your application. At worst, it silently leaks heap memory to an attacker, exposing passwords, tokens, encryption keys, or other sensitive data that happened to be sitting nearby in memory.

This week, we're breaking down exactly that kind of vulnerability: a missing bounds check before a memcpy call in EfkRes.GLTFLoader.cpp, part of the Effekseer resource converter toolchain. If you write C++ that parses any kind of binary file format — GLTF, FBX, custom binary protocols — this one is worth understanding deeply.

The Vulnerability Explained

What Is a GLTF File?

GLTF (GL Transmission Format) is a widely-used 3D asset format. It stores geometry, animations, materials, and more — often with binary buffer data packed tightly into typed arrays. A GLTF accessor describes how to interpret a chunk of that binary data: what type it is, how many components, and where it lives in the buffer.

When a GLTF loader reads one of these accessors, it typically needs to copy raw bytes from the file's binary buffer into a typed destination struct — say, a vec3 (12 bytes) or a mat4 (64 bytes). That's where memcpy comes in.

The Vulnerable Code Pattern

At line 98 of EfkRes.GLTFLoader.cpp, the code performed an operation that looked roughly like this:

// VULNERABLE: No bounds check before memcpy
memcpy(&dst.v, src.data(), sizeof(dst.v));

This copies sizeof(dst.v) bytes from src.data() into dst.v. The problem? There is no validation that src actually contains at least sizeof(dst.v) bytes.

If src is a buffer view that only has, say, 3 bytes of data, but sizeof(dst.v) is 4 (a single float), the memcpy will happily read 1 byte past the end of the source buffer. For a mat4 with sizeof(dst.v) == 64, the overread could be far more severe.

How Could This Be Exploited?

The attack surface here is any crafted GLTF file. GLTF files are routinely loaded from:

User-uploaded assets in game editors and 3D tools
Downloaded asset packs from the internet
Network-streamed game content
Third-party plugin content

An attacker who can supply a GLTF file to a vulnerable application can craft a buffer accessor that declares a component type requiring N bytes, but provides a buffer view with fewer than N bytes. The result:

Application crash (Denial of Service): Reading beyond a heap allocation boundary can trigger a segmentation fault or heap corruption, crashing the application.
Heap memory disclosure: In many heap implementations, the bytes immediately following a small allocation belong to adjacent heap objects — which could contain anything: cached credentials, decrypted file contents, other loaded assets, or internal pointers. A carefully crafted GLTF could cause the loader to copy those bytes into the parsed output, where they might be logged, transmitted, or otherwise exposed.

A Concrete Attack Scenario

Imagine a game editor that lets users import community-made 3D assets. An attacker uploads a GLTF file where:

A mesh accessor claims to have vertex positions (each requiring 12 bytes for a vec3)
The backing buffer view contains only 8 bytes of actual data

When the editor loads this file, the GLTF loader hits the vulnerable memcpy, reads 4 bytes past the end of the buffer, and copies heap memory into the parsed vertex data. If the editor then logs or transmits mesh data for debugging or telemetry, those 4 bytes of heap content go with it.

Scale this up — a mat4 requiring 64 bytes with only 4 provided — and you're leaking 60 bytes of adjacent heap memory per accessor. A crafted GLTF with dozens of such accessors could exfiltrate hundreds of bytes of heap contents per load operation.

The Fix

What Changed

The fix adds a bounds check before the memcpy call, ensuring the source buffer contains at least as many bytes as the destination requires:

// BEFORE (vulnerable):
memcpy(&dst.v, src.data(), sizeof(dst.v));

// AFTER (safe):
if (src.size() < sizeof(dst.v)) {
    // Handle error: truncated buffer data in GLTF file
    return false; // or throw, or log and skip
}
memcpy(&dst.v, src.data(), sizeof(dst.v));

This is a small change — one if statement — but it completely closes the vulnerability. If the source buffer is too small, the operation is rejected before any memory is touched beyond the valid range.

Why This Works

The security property being enforced is simple and absolute:

A copy operation must never read more bytes from the source than the source actually contains.

By checking src.size() < sizeof(dst.v) before the copy, the code ensures this invariant holds for every call site, regardless of what the GLTF file claims about its buffer contents. Malformed or malicious files are caught at the boundary, before they can influence memory access patterns.

The Regression Test

The fix is accompanied by a regression test suite that validates this invariant across a wide range of adversarial inputs:

# Example: truncated buffer must always be rejected
def test_gltf_buffer_truncation_always_rejected(src_size, dst_size):
    src_data = bytes([0xAA] * src_size)

    with pytest.raises((ValueError, IndexError, BufferError, OverflowError)):
        parse_gltf_buffer_accessor(src_data, dst_size)

The test cases include:
- Empty buffers — 0 bytes provided for any destination size
- One byte short — e.g., 3 bytes when 4 are needed (common off-by-one pattern)
- Crafted magic bytes — ELF headers, PNG signatures, and other recognizable patterns in truncated buffers
- Exact boundary — src.size() == dst_size must succeed (no false positives)
- Oversized buffers — src.size() > dst_size must succeed and copy only the needed bytes

This kind of parameterized adversarial testing is exactly what catches regressions if someone accidentally removes or weakens the bounds check in the future.

Prevention & Best Practices

1. Always Validate Before You Copy

Any time you call memcpy, memmove, or equivalent functions with a size derived from a destination type, verify the source:

// Pattern to follow for every binary deserialization:
template<typename T>
bool SafeCopy(T& dst, const std::vector<uint8_t>& src, size_t offset = 0) {
    if (src.size() < offset + sizeof(T)) {
        return false; // Reject truncated data
    }
    memcpy(&dst, src.data() + offset, sizeof(T));
    return true;
}

2. Prefer Safe Abstractions Over Raw memcpy

In modern C++, consider using std::span with explicit size checking, or serialization libraries that handle bounds checking internally:

#include <span>

bool ReadComponent(std::span<const uint8_t> src, float& dst) {
    if (src.size() < sizeof(float)) return false;
    std::memcpy(&dst, src.data(), sizeof(float));
    return true;
}

3. Use Sanitizers During Development

Enable AddressSanitizer (ASan) and MemorySanitizer (MSan) in your debug and CI builds. These tools catch out-of-bounds reads and writes at runtime with precise error reporting:

# Compile with ASan
clang++ -fsanitize=address -fno-omit-frame-pointer -g your_code.cpp

# Or with CMake
set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -fsanitize=address")

ASan would have caught this exact vulnerability the first time a truncated GLTF was loaded in a test environment.

4. Fuzz Your File Parsers

File format parsers are a prime target for fuzzing. Tools like libFuzzer and AFL++ generate thousands of mutated inputs per second, specifically designed to find crashes and memory errors:

// libFuzzer entry point for GLTF loader
extern "C" int LLVMFuzzerTestOneInput(const uint8_t* data, size_t size) {
    // Feed raw bytes as a GLTF file to your loader
    LoadGLTFFromMemory(data, size);
    return 0;
}

Fuzzing is particularly effective at finding exactly this class of bug — truncated binary data that parsers don't expect.

5. Know Your CWEs

This vulnerability maps to:

CWE-125: Out-of-bounds Read — Reading data past the end of an intended buffer
CWE-119: Improper Restriction of Operations within the Bounds of a Memory Buffer — The parent category for buffer safety issues
CWE-20: Improper Input Validation — Failing to validate that input data meets size requirements before use

The OWASP Top 10 also covers this under A03:2021 – Injection, and the OWASP Testing Guide recommends fuzzing and boundary testing for all binary format parsers.

6. Code Review Checklist for Binary Parsers

When reviewing any code that parses binary data, look for these red flags:

[ ] memcpy calls where the size comes from sizeof(destination) — is the source validated?
[ ] Array indexing with values read from untrusted input — is there a bounds check?
[ ] Pointer arithmetic on data from files or network — is the resulting pointer within the valid buffer?
[ ] Loops that iterate based on a count value from the file — is the count capped?

Conclusion

A single missing if statement — one bounds check before a memcpy — was the difference between a safe GLTF loader and one that could be crashed or coerced into leaking heap memory by a malicious file. This is a pattern that has caused real-world vulnerabilities in countless parsers, from image loaders to document processors to network protocol implementations.

The key takeaways:

Never trust size information from untrusted input. A GLTF file claiming its buffer contains 64 bytes doesn't make it true. Always verify.
Validate at the boundary. Check buffer sizes before any copy or access operation, not after.
Use tooling. AddressSanitizer, fuzzing, and static analysis tools exist precisely to catch these bugs before they reach production.
Write regression tests. The adversarial test suite accompanying this fix ensures that if this bounds check is ever accidentally removed or weakened, CI will catch it immediately.

Security vulnerabilities in file parsers are often dismissed as "only a crash" — but as we've seen, an out-of-bounds read can be a stepping stone to memory disclosure, and memory disclosure can be a stepping stone to much more serious compromise. Treat every parser input as hostile, and your users will be safer for it.

This vulnerability was identified and fixed by automated security scanning. The fix was verified by build, re-scan, and LLM code review.

Have a vulnerability you'd like us to break down? Reach out to the Fenny Security team.

cwe	CWE-125
fix	Add a bounds check comparing source buffer size against the number of bytes to be copied before calling memcpy
risk	Application crash or sensitive heap memory disclosure via crafted GLTF file
language	C++
root cause	memcpy reads from a buffer-view without verifying the view contains enough bytes
vulnerability	Heap Out-of-Bounds Read

Heap Out-of-Bounds Read in GLTF Loader: How a Missing Bounds Check Could Crash Your App or Leak Memory

Answer Summary

Vulnerability at a Glance

Heap Out-of-Bounds Read in GLTF Loader: How a Missing Bounds Check Could Crash Your App or Leak Memory

Introduction

The Vulnerability Explained

What Is a GLTF File?

The Vulnerable Code Pattern

How Could This Be Exploited?

A Concrete Attack Scenario

The Fix

What Changed

Why This Works

The Regression Test

Prevention & Best Practices

1. Always Validate Before You Copy

2. Prefer Safe Abstractions Over Raw memcpy

3. Use Sanitizers During Development

4. Fuzz Your File Parsers

5. Know Your CWEs

6. Code Review Checklist for Binary Parsers

Conclusion

Frequently Asked Questions

What is a heap out-of-bounds read?

How do you prevent heap out-of-bounds reads in C++?

What CWE is heap out-of-bounds read?

Is address space layout randomization (ASLR) enough to prevent this vulnerability?

Can static analysis detect heap out-of-bounds reads like this one?

View the Security Fix

Related Articles

How insecure update manifest parsing happens in C++ UpdateHelper.cpp and how to fix it

How integer overflow in buffer size calculation happens in C++ and how to fix it

How buffer overflow via unchecked memcpy offset happens in C++ PCL point cloud parsing and how to fix it

How buffer overflow via sprintf() happens in C++ settings parsing and how to fix it

How integer overflow in tensor shape validation happens in C++ with OpenVINO and how to fix it

How strcpy buffer overflow happens in C++ debugger command handling and how to fix it