ropchain = "A"*1038 # junk padding to reach the return address # ROP chain to execute /bin//sh ropchain += pack('<I', 0x080e9101) # pop edx ; pop ebx ; pop esi ; pop edi ; pop ebp ; ret # ... intermediate gadgets ... ropchain += pack('<I', 0x080c861f) # int 0x80 (execve syscall)

The primary remediation for CVE-2017-20229 is updating the MAWK interpreter to a patched release. Upstream maintainers addressed the flaw in version 1.3.4-20171017. Users must ensure their systems run this version or a more recent iteration.

Administrators can verify their installed version by executing mawk -W version. If the reported build date precedes 20171017, the binary is likely vulnerable and requires immediate patching via the distribution's package manager.

As a defense-in-depth measure, system administrators should implement monitoring to detect unusually long arguments passed to text processing utilities. Security Information and Event Management (SIEM) rules can flag execution events where the argument length for mawk or awk exceeds 1000 characters.

Product

Affected Versions

Fixed Version

MAWK

Thomas Dickey / Mike Brennan

<= 1.3.3-17

1.3.4-20171017

Attribute

Detail

Vulnerability Type

Stack-Based Buffer Overflow

CWE ID

CWE-787

CVSS v3.1 Score

9.8 (Critical)

EPSS Score

0.00077 (22.94%)

Exploit Status

Public PoC Available

Attack Vector

Local / Remote (Context Dependent)

Impact

Arbitrary Code Execution

CVE-2017-20229

9.80.08%

CVE-2017-20229: Stack-Based Buffer Overflow in MAWK Interpreter

Amit Schendel

Senior Security Researcher

Mar 31, 2026·5 min read·25 visits

PoC Available

Executive Summary (TL;DR)

A stack-based buffer overflow in the MAWK interpreter (<= 1.3.3-17) allows local privilege escalation or arbitrary code execution. Attackers exploit this by passing a long string exceeding internal stack limits, overwriting control data to execute a ROP chain.

MAWK versions 1.3.3-17 and prior contain a critical stack-based buffer overflow vulnerability in the main argument parsing and stack management routines. This flaw allows an attacker to achieve arbitrary code execution by supplying excessively long command-line arguments, overwriting adjacent memory to hijack control flow via a Return-Oriented Programming (ROP) chain.

Attack Flow Diagram

Vulnerability Overview

MAWK is a pattern scanning and text processing language interpreter, commonly used as the default awk implementation on numerous Linux distributions, including older versions of Debian and Ubuntu. It processes command-line arguments and script content to execute automated text-processing tasks.

CVE-2017-20229 is a stack-based buffer overflow vulnerability residing in MAWK's argument parsing and stack management components. The vulnerability affects MAWK versions 1.3.3-17 and prior, arising from inadequate boundary validation of user-supplied input.

When a user or automated script invokes the MAWK binary with excessively long arguments, the application copies this input into fixed-size stack buffers without verifying the length. This behavior constitutes an out-of-bounds write (CWE-787), permitting an attacker to overwrite adjacent memory addresses, including the saved instruction pointer.

Root Cause Analysis

The root cause of CVE-2017-20229 lies in the absence of explicit stack depth and buffer size validation within the interpreter's initialization and argument parsing routines. Prior to the October 2017 patch, MAWK allocated internal stack buffers with a presumed maximum capacity—often around 1024 bytes.

During execution, when the interpreter processes command-line arguments or inline script content, it pushes this data onto the execution stack. Because the software failed to track the cumulative size of the incoming data relative to the allocated buffer boundaries, continuous memory writing occurs beyond the allocated limits.

Once the 1024-byte boundary is exceeded, the overflowing data overwrites adjacent stack frames. This contiguous memory corruption directly overwrites the return address of the active function, granting an attacker control over the CPU's instruction pointer upon function return.

Code Analysis

While the exact C source patch diff is abstracted in the upstream changelog, the operational changes clearly indicate the structural flaw. The unpatched MAWK implementation utilized standard memory copy operations without correlating the input length against the active stack boundary.

The upstream changelog from October 17, 2017, explicitly states the remediation logic: "Add checks for stack overflow and underflow; increase stack limit to 1024." The patched implementation introduces boundary tracking variables that monitor the depth of the execution stack.

In the patched versions, before any argument data is pushed onto the stack, the application calculates the required size and compares it against the remaining stack capacity. If the input exceeds the newly strictly enforced 1024-byte limit, the application halts execution and throws a memory error rather than continuously writing out of bounds.

Exploitation Methodology

Exploitation of CVE-2017-20229 is publicly documented and demonstrated via Exploit-DB ID 42357, authored by Juan Sacco. The exploit script constructs a specialized payload designed to exploit the missing boundary checks by passing a precise string length directly to the MAWK binary.

The payload consists of approximately 1038 bytes of arbitrary "junk" padding. This specific length is calculated to completely fill the vulnerable stack buffer and align the subsequent bytes exactly over the saved return address. Following the padding, the exploit appends a carefully crafted Return-Oriented Programming (ROP) chain.

ropchain = "A"*1038 # junk padding to reach the return address
# ROP chain to execute /bin//sh
ropchain += pack('<I', 0x080e9101) # pop edx ; pop ebx ; pop esi ; pop edi ; pop ebp ; ret
# ... intermediate gadgets ...
ropchain += pack('<I', 0x080c861f) # int 0x80 (execve syscall)

The ROP chain chains together small instruction sequences (gadgets) already present in the MAWK executable. This technique circumvents Data Execution Prevention (DEP) protections. The gadgets sequentially load the necessary registers to trigger the execve system call (int 0x80), ultimately spawning an interactive /bin//sh shell with the privileges of the vulnerable process.

Impact Assessment

The exploitation of CVE-2017-20229 results in arbitrary code execution (ACE). An attacker successfully triggering the buffer overflow gains the ability to execute unconstrained operating system commands. The privilege level of the spawned shell directly matches the context in which MAWK was invoked.

If a privileged system script or a setuid binary relies on MAWK to process untrusted user input, this vulnerability functions as a Local Privilege Escalation (LPE) vector. The attacker transitions from an unprivileged user to the root user by manipulating the arguments passed to the system process.

The vulnerability carries a critical CVSS 3.1 score of 9.8. This score reflects the potential for Remote Code Execution (RCE) if MAWK is utilized within a web application backend, such as processing parameters in a legacy CGI script. In such configurations, an unauthenticated remote attacker can submit the malicious payload over the network to compromise the server.

Remediation and Mitigation

Technical Appendix

CVSS Score

9.8/ 10

CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H

EPSS Probability

0.08%

Top 77% most exploited

Affected Systems

Debian Linux (Older Releases)Ubuntu Linux (Older Releases)Systems utilizing MAWK <= 1.3.3-17 as the default awk interpreter

Affected Versions Detail

Product	Affected Versions	Fixed Version
MAWK Thomas Dickey / Mike Brennan	<= 1.3.3-17	1.3.4-20171017

Attribute	Detail
Vulnerability Type	Stack-Based Buffer Overflow
CWE ID	CWE-787
CVSS v3.1 Score	9.8 (Critical)
EPSS Score	0.00077 (22.94%)
Exploit Status	Public PoC Available
Attack Vector	Local / Remote (Context Dependent)
Impact	Arbitrary Code Execution

MITRE ATT&CK Mapping

T1203Exploitation for Client Execution

Execution

T1068Exploitation for Privilege Escalation

Privilege Escalation

CWE-787

Out-of-bounds Write

The software writes data past the end, or before the beginning, of the intended buffer.

Known Exploits & Detection

Exploit-DBLocal Buffer Overflow Exploit (Python) for MAWK 1.3.3-17 by Juan Sacco

Vulnerability Timeline

Public exploit published on Exploit-DB by Juan Sacco.

2017-07-24

Thomas Dickey released a patch adding stack overflow checks and increasing the stack limit.

2017-10-17

Official NVD publication and CVSS score assignment.

2026-03-28