What is a buffer overflow in C?

A buffer overflow occurs when a program writes data beyond the boundaries of an allocated memory buffer, potentially corrupting adjacent memory, crashing the program, or allowing attackers to execute arbitrary code.

How do you prevent buffer overflow in C?

Always validate input lengths before copying data, use bounded copy functions like strncpy() or snprintf(), enforce maximum buffer sizes, and explicitly null-terminate strings after copying.
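
As a rough illustration of "validate first, then copy, then terminate", here is a minimal sketch for a fixed-size destination. The helper name copy_checked() is hypothetical and does not come from the codebase discussed in this article; it rejects oversized input instead of truncating it.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: copy src into dst only if it fits, terminator included.
 * Returns false and leaves dst as an empty string when src is too long. */
static bool copy_checked(char *dst, size_t dst_size, const char *src)
{
    if (dst == NULL || dst_size == 0 || src == NULL)
        return false;

    size_t len = strlen(src);
    if (len >= dst_size) {      /* no room for len bytes plus '\0' */
        dst[0] = '\0';
        return false;
    }

    memcpy(dst, src, len);      /* copy the payload only */
    dst[len] = '\0';            /* terminate explicitly */
    return true;
}
```

Whether to reject or truncate oversized input is a design choice; rejecting is the safer default when a truncated value could change meaning.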

What CWE is buffer overflow?

CWE-120 (Buffer Copy without Checking Size of Input) covers classic buffer overflows where data is copied into a buffer without first verifying that the source data fits within the destination buffer's bounds.

Is using malloc() with strlen() enough to prevent buffer overflow?

While dynamically allocating based on strlen() prevents overflow of a fixed-size buffer, it still requires careful handling—you must ensure the length is reasonable (to prevent excessive allocation) and that the copy operation matches the allocated size exactly with proper null-termination.
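
One way to combine dynamic allocation with an upper bound is sketched below. The function dup_bounded() is a hypothetical name used for illustration only, and it uses standard malloc() rather than any custom allocator. It scans for the terminator within the first max_len bytes, so the allocation can never exceed max_len + 1.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: duplicate at most max_len bytes of src on the heap.
 * Returns NULL on bad input or allocation failure; the caller frees. */
static char *dup_bounded(const char *src, size_t max_len)
{
    if (src == NULL || max_len == SIZE_MAX)   /* keep len + 1 from wrapping */
        return NULL;

    /* Look for the terminator within the first max_len bytes only. */
    const char *end = memchr(src, '\0', max_len);
    size_t len = end ? (size_t)(end - src) : max_len;

    char *copy = malloc(len + 1);
    if (copy == NULL)
        return NULL;

    memcpy(copy, src, len);   /* copy exactly what was allocated for */
    copy[len] = '\0';         /* terminate, even when truncated */
    return copy;
}
```

For example, dup_bounded("hello world", 5) yields a new buffer holding "hello".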

Can static analysis detect buffer overflow?

Yes, static analysis tools can detect many buffer overflow patterns including unbounded memcpy() calls, missing length checks, and mismatches between allocation sizes and copy lengths. Tools like Coverity, CodeQL, and custom AI-based scanners can flag these issues.

How buffer overflow happens in C memcpy() without length validation and how to fix it

Introduction

The _set_error_info() function in src/script_engine/core/script_engine_core.c stores error messages generated during script execution. At line 392, a memcpy() call copied len+1 bytes (the message plus its null terminator, with len taken from strlen()) into a heap-allocated buffer. The allocation matched the copy size, but the code had a subtle and dangerous flaw: nothing bounded how large len could be. If an attacker could control the error message content by crafting a malicious script, they could trigger memory corruption scenarios, especially in combination with the similar unbounded patterns at lines 393, 458, and 515 in the same file.

This vulnerability was rated critical because any user who can load and execute scripts on the device can craft an error condition that overwrites adjacent heap memory, potentially achieving arbitrary code execution.

The Vulnerability Explained

Here's the vulnerable code from script_engine_core.c at lines 387-392:

static void _set_error_info(const char *msg)
{
    if (!msg)
        return;
    size_t len = strlen(msg);
    engine_rt.error_info = eos_malloc(len + 1);
    if (engine_rt.error_info)
        memcpy(engine_rt.error_info, msg, len + 1);
}

At first glance, this might look safe—after all, the code allocates len+1 bytes and copies len+1 bytes. So where's the overflow?

The real danger lies in the absence of any maximum length constraint. Consider what happens when:

- A malicious script triggers an error with a multi-megabyte message: The eos_malloc() function (a custom allocator in this embedded/OS context) may behave differently than standard malloc(). If eos_malloc() has internal size limits or uses fixed-size pools, allocating len+1 bytes for an extremely large len could return a buffer smaller than requested, or succeed but corrupt the heap metadata.
- Race conditions or re-entrancy: If engine_rt.error_info is accessed concurrently, the unbounded write could corrupt memory being read by another thread.
- Adjacent memory corruption: In the embedded environment where ElenixOS's script engine operates, heap layouts are often predictable. An attacker crafting a script that triggers a specific error message size could overwrite function pointers, vtables, or control structures adjacent to the allocated buffer.

Attack scenario: An attacker loads a script into the ElenixOS script engine that deliberately triggers an error condition (e.g., a type error, undefined variable, or assertion failure) with a message string of 10,000+ characters. The script engine calls _set_error_info() with this oversized message. Depending on the allocator behavior and heap state, the memcpy overwrites critical heap metadata or adjacent objects, allowing the attacker to redirect execution flow.

The Fix

The fix introduces three key changes to _set_error_info():

Before (vulnerable):

static void _set_error_info(const char *msg)
{
    if (!msg)
        return;
    size_t len = strlen(msg);
    engine_rt.error_info = eos_malloc(len + 1);
    if (engine_rt.error_info)
        memcpy(engine_rt.error_info, msg, len + 1);
}

After (fixed):

static void _set_error_info(const char *msg)
{
    if (!msg)
        return;
    size_t len = strlen(msg);
    if (len > 4096) len = 4096;  /* cap the stored message at 4096 bytes */
    engine_rt.error_info = eos_malloc(len + 1);  /* +1 for the terminator */
    if (engine_rt.error_info) {
        memcpy(engine_rt.error_info, msg, len);  /* copy data bytes only */
        engine_rt.error_info[len] = '\0';        /* terminate explicitly */
    }
}

Three critical changes were made:

- Length cap at 4096 bytes (if (len > 4096) len = 4096;): This establishes a hard upper bound on the error message size. No error message in normal operation needs to exceed 4KB, and this prevents the allocator from being asked to handle unreasonably large requests.
- Copy exactly len bytes, not len+1 (memcpy(engine_rt.error_info, msg, len);): The original code copied the null terminator along with the data, which is only correct while len is the full strlen() of the message. Once len can be capped, msg[len] is no longer guaranteed to be the terminator, so the fix copies only the len data bytes and handles termination separately. This makes the code's intent explicit.
- Explicit null-termination (engine_rt.error_info[len] = '\0';): Rather than relying on the source string's null terminator being within the copied range, the fix explicitly writes the terminator at the correct position. This guarantees the resulting string is always properly terminated, even if the message was truncated.
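
To see the truncation behavior in isolation, the three changes can be sketched as a small standalone program fragment. The article does not show the real engine_rt structure or eos_malloc(), so the versions below are hypothetical stand-ins (eos_malloc() simply forwards to malloc()), and ERROR_INFO_MAX is an illustrative named constant for the 4096 cap.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

#define ERROR_INFO_MAX 4096   /* same cap as the fix, as a named constant */

/* Hypothetical stand-ins: the real definitions are not shown in the article. */
static struct { char *error_info; } engine_rt;
static void *eos_malloc(size_t size) { return malloc(size); }

/* Sketch of the capped variant. Ownership of any previously stored
 * error_info is outside the scope of this example. */
static void set_error_info_capped(const char *msg)
{
    if (!msg)
        return;

    size_t len = strlen(msg);
    if (len > ERROR_INFO_MAX)
        len = ERROR_INFO_MAX;                     /* truncate oversized messages */

    engine_rt.error_info = eos_malloc(len + 1);
    if (engine_rt.error_info) {
        memcpy(engine_rt.error_info, msg, len);   /* payload only */
        engine_rt.error_info[len] = '\0';         /* terminator at the new end */
    }
}
```

Under these assumptions, passing a 10,000-character message stores a 4096-character string, while short messages are stored unchanged.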

Prevention & Best Practices

For C developers working with string buffers:

- Always enforce maximum lengths: Even when dynamically allocating, cap input sizes to reasonable maximums. A 4096-byte error message is more than sufficient for debugging; there's no legitimate reason for an error string to be unbounded.
- Separate copy from termination: Instead of memcpy(dst, src, len+1), which relies on the source having a null terminator at exactly the right position, prefer:
    memcpy(dst, src, len);
    dst[len] = '\0';
- Use bounded string functions where possible: Consider snprintf() or platform-specific alternatives like strlcpy(), which handle truncation and null-termination together. strncpy() also bounds the copy, but it does not null-terminate the destination when the source is truncated, so terminate explicitly after calling it.
- Audit similar patterns: The PR notes that lines 393, 458, and 515 in the same file use similar patterns. When fixing one instance of a vulnerability pattern, always grep for and fix all instances.
- Custom allocators need extra care: When using custom allocators like eos_malloc(), understand their failure modes. Standard malloc() returns NULL on failure, but custom allocators may have different behavior with extreme sizes.
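
As an example of a bounded string function that handles truncation and termination together, the sketch below formats an error line with snprintf() and uses its return value to detect truncation. The function name format_error() and the "file:line: message" layout are illustrative assumptions, not taken from the script engine.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper: format an error line into dst.
 * Returns true if the whole text fit, false on truncation or failure. */
static bool format_error(char *dst, size_t dst_size,
                         const char *file, int line, const char *msg)
{
    if (dst == NULL || dst_size == 0)
        return false;

    /* snprintf() writes at most dst_size - 1 characters plus '\0' and
     * returns the length it would have written without the limit. */
    int needed = snprintf(dst, dst_size, "%s:%d: %s", file, line, msg);
    return needed >= 0 && (size_t)needed < dst_size;
}
```

Checking the return value matters: a silently truncated message is still safe memory-wise, but the caller may want to log or flag that information was lost.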

Tools for detection:
- Static analyzers (Coverity, CodeQL) can flag unbounded memcpy patterns
- AddressSanitizer (ASan) catches heap overflows at runtime during testing
- Fuzz testing with AFL or libFuzzer can discover overflow-triggering inputs
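
For the fuzzing option, a libFuzzer harness could look roughly like the sketch below. LLVMFuzzerTestOneInput() is libFuzzer's standard entry point; store_message() and MSG_MAX are hypothetical stand-ins for whatever bounded-copy routine is under test, since the real script engine API is not shown in this article.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

#define MSG_MAX 4096

/* Hypothetical function under test: bounded copy of an error message. */
static char *store_message(const char *msg)
{
    size_t len = strlen(msg);
    if (len > MSG_MAX)
        len = MSG_MAX;
    char *copy = malloc(len + 1);
    if (copy) {
        memcpy(copy, msg, len);
        copy[len] = '\0';
    }
    return copy;
}

/* libFuzzer entry point: called repeatedly with generated inputs. */
int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size)
{
    /* Fuzzer input is not null-terminated, so build a C string first. */
    char *msg = malloc(size + 1);
    if (msg == NULL)
        return 0;
    if (size > 0)
        memcpy(msg, data, size);
    msg[size] = '\0';

    char *stored = store_message(msg);
    if (stored != NULL && strlen(stored) > MSG_MAX)
        abort();                /* invariant violated: surfaces as a crash */

    free(stored);
    free(msg);
    return 0;
}
```

Assuming a clang toolchain with libFuzzer available, a harness like this is typically built with clang -g -fsanitize=fuzzer,address so that AddressSanitizer reports any out-of-bounds write the fuzzer manages to trigger.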

Key Takeaways

- The _set_error_info() function in script_engine_core.c had no upper bound on error message length, making it exploitable by any script that could trigger a long error message.
- Copying len+1 bytes with memcpy() is fragile: it assumes the null terminator is always within bounds. Explicit null-termination after copying exactly len bytes is safer and clearer.
- Error messages are attacker-controllable input in a script engine context, so they should be treated with the same suspicion as any user input.
- A 3-line fix (length cap + bounded copy + explicit termination) eliminated a critical code execution vulnerability, demonstrating that defense-in-depth doesn't always require complex solutions.
- Similar patterns at lines 393, 458, and 515 in the same file were flagged for review, highlighting the importance of systematic vulnerability remediation.

How Orbis AppSec Detected This

Source: Error message string generated by script execution within the ElenixOS script engine (attacker-controlled script content)
Sink: memcpy(engine_rt.error_info, msg, len + 1) in src/script_engine/core/script_engine_core.c:392
Missing control: No maximum length validation on the msg parameter before memory allocation and copy; no explicit null-termination
CWE: CWE-120 (Buffer Copy without Checking Size of Input)
Fix: Added a 4096-byte length cap, changed memcpy to copy exactly len bytes, and added explicit null-termination

Orbis AppSec automatically detected this vulnerability and opened a pull request with the fix. Try Orbis AppSec on your repositories to find and fix issues like this automatically.

Conclusion

Buffer overflows remain one of the most dangerous vulnerability classes in C code, and this case in script_engine_core.c demonstrates why: a seemingly correct allocation-and-copy pattern becomes exploitable when there's no upper bound on input size. The fix is elegant in its simplicity—cap the length, copy precisely, terminate explicitly. For any developer maintaining C code that handles variable-length strings, especially in contexts where the input might be influenced by untrusted users or scripts, these three principles should be reflexive.

The ElenixOS script engine is now protected against oversized error messages, but this serves as a reminder: every memcpy, strcpy, and sprintf in your codebase is a potential vulnerability if the input isn't bounded.

cwe	CWE-120
fix	Cap message length at 4096 bytes and explicitly null-terminate the copied string
risk	Heap corruption leading to arbitrary code execution
language	C
root cause	No maximum length check before memcpy into dynamically allocated buffer
vulnerability	Buffer overflow via unbounded memcpy
