// Safe direct process execution const NPX_EXEC_OPTS = { shell: false as const }; const result = execFileSync( 'npx', ['--yes', 'agentic-flow', '--agent', agent, '--task', task], { ...NPX_EXEC_OPTS, encoding: 'utf-8' } );

Exploitation of this vulnerability relies on feeding an LLM agent structured malicious input that is subsequently passed down to the MCP server's active tools.

Attack Vector Details

An attacker can place a weaponized payload inside a public repository, web page, or PDF document that they know an agent is scheduled to read. For example, the payload might read: x\"; touch /tmp/INJECTED; id > /tmp/rce.txt; echo \".

When the agent processes this text, it invokes the agentic_flow_agent tool and transmits the payload as the task argument. The resulting string is passed directly to the shell as:

npx --yes agentic-flow --agent \"coder\" --task \"x\"; touch /tmp/INJECTED; id > /tmp/rce.txt; echo \"\"

The shell splits this command sequence on the semicolon, successfully running the initial command, followed by the execution of touch /tmp/INJECTED and id > /tmp/rce.txt on the host operating system.

import { describe, it, expect } from 'vitest'; import { readFileSync, readdirSync, statSync } from 'node:fs'; import { join, resolve } from 'node:path'; const MCP_ROOT = resolve(__dirname, '../../src/mcp'); const EXEMPT_FILES = new Set<string>([ 'fastmcp/tools/hooks/pretrain.ts' ]);

Product

Affected Versions

Fixed Version

agentic-flow

ruvnet

<= 2.0.13

2.0.14

Attribute

Detail

CWE ID

CWE-78

Attack Vector

Network (Unauthenticated) / User Interaction Required (AI processing untrusted input)

CVSS Severity

8.8 (High)

Exploit Status

PoC Available / Verified

KEV Status

Not Listed

GHSA-VCV2-R9JH-99M5

GHSA-VCV2-R9JH-99M5: OS Command Injection in agentic-flow MCP Server Tools

Alon Barad

Software Engineer

Jun 19, 2026·5 min read·4 visits

Executive Summary (TL;DR)

Unsanitized parameters passed to agentic-flow MCP tools are executed directly in system shells via Node.js execSync, enabling remote attackers to run arbitrary OS commands when an AI agent processes malicious external content.

An OS command injection vulnerability (CWE-78) exists in agentic-flow versions 2.0.13 and prior. The package's Model Context Protocol (MCP) server tools directly interpolate user-controlled parameters into shell command strings executed via child_process.execSync without validation. If an AI agent processes untrusted external input and forwards it as parameters to any affected tool, an attacker can break out of the shell argument quotes and execute arbitrary OS commands on the host machine.

Attack Flow Diagram

Vulnerability Overview

The vulnerability, tracked as GHSA-vcv2-r9jh-99m5, is an OS command injection flaw (CWE-78) in the agentic-flow npm package at versions 2.0.13 and earlier.

This library exposes Model Context Protocol (MCP) servers and tools designed to facilitate direct actions by artificial intelligence (AI) agents, such as executing local system commands, interacting with databases, and editing codebases.

The attack surface is highly significant because LLM-driven agents are designed to autonomously read and parse untrusted data sources (such as public GitHub repositories, scraped web pages, or incoming emails) and pass summarized arguments to these tools.

If an attacker can manipulate the input that an AI agent parses, they can trigger command execution on the host machine hosting the MCP server. This bypasses the typical boundaries established for AI agent sandboxing and compromises the underlying operating system.

Root Cause Analysis

The root cause of this vulnerability lies in the unsafe use of Node.js's native child_process.execSync() function paired with template string interpolation.

When execSync(commandString) is called in Node.js, the runtime spawns a shell (such as /bin/sh on Unix systems or cmd.exe on Windows) to parse and run the complete string. Because the codebase directly interpolated user-controlled variables into the command string within double quotes, system shells treated metacharacters as active shell directives.

An attacker can supply payloads containing double quotes to break out of the string literal, followed by shell control characters such as semicolons, logical AND operators, or pipes. This terminates the legitimate command and causes the shell to execute arbitrary appended commands under the security context of the parent Node.js process.

Code Analysis

Below is a comparison of the vulnerable and patched implementations within the MCP server codebase.

Vulnerable Code Path

In src/mcp/standalone-stdio.ts, parameters were concatenated dynamically without sanitization:

// Unsafe shell interpolation inside agentic_flow_agent tool
let cmd = `npx --yes agentic-flow --agent \"${agent}\" --task \"${task}\"`;
const result = execSync(cmd, { encoding: 'utf-8' });

Patched Implementation

The remediated code replaces the shell execution wrapper with execFileSync and disables the shell interpreter globally:

// Safe direct process execution
const NPX_EXEC_OPTS = { shell: false as const };
 
const result = execFileSync(
  'npx',
  ['--yes', 'agentic-flow', '--agent', agent, '--task', task],
  { ...NPX_EXEC_OPTS, encoding: 'utf-8' }
);

Using execFileSync with shell: false passes arguments directly to the OS kernel via the execve system call. The operating system treats each element of the arguments array as a literal string payload, preventing shell parsing engines from executing special control characters.

Exploitation & Attack Vectors

Exploitation of this vulnerability relies on feeding an LLM agent structured malicious input that is subsequently passed down to the MCP server's active tools.

Attack Vector Details

When the agent processes this text, it invokes the agentic_flow_agent tool and transmits the payload as the task argument. The resulting string is passed directly to the shell as:

npx --yes agentic-flow --agent \"coder\" --task \"x\"; touch /tmp/INJECTED; id > /tmp/rce.txt; echo \"\"

The shell splits this command sequence on the semicolon, successfully running the initial command, followed by the execution of touch /tmp/INJECTED and id > /tmp/rce.txt on the host operating system.

Impact Assessment

This vulnerability presents a high-risk security threat, receiving a CVSS v3.1 score of 8.8.

Because MCP servers typically run locally on developers' machines or inside centralized internal deployment environments, successful exploitation yields arbitrary shell command execution with the permissions of the underlying system user. An attacker can perform actions such as exfiltrating environment variables, reading private codebases, stealing SSH and API keys, or establishing persistent reverse shells.

The impact is compounded because the attack vector bypasses traditional network defense systems, using the AI agent as an interactive proxy to perform the exploit delivery phase.

Remediation & Detection

To secure affected systems, immediate software updates and robust monitoring are required.

Security Regression Tests

The development team implemented automated regression security tests to block future occurrences of this flaw. The test suite, located in tests/security/cwe-78-mcp-execsync.test.ts, parses the codebase to verify that execSync is never imported or invoked without explicit inclusion in an exempt list:

import { describe, it, expect } from 'vitest';
import { readFileSync, readdirSync, statSync } from 'node:fs';
import { join, resolve } from 'node:path';
 
const MCP_ROOT = resolve(__dirname, '../../src/mcp');
const EXEMPT_FILES = new Set<string>([
  'fastmcp/tools/hooks/pretrain.ts'
]);

Organizations should also deploy static analysis configurations (such as ESLint rules) to restrict the use of raw shell execution sinks across all Node.js development projects.

Fix Analysis (1)

Technical Appendix

CVSS Score

8.8/ 10

CVSS:3.1/AV:N/AC:L/PR:N/UI:R/S:U/C:H/I:H/A:H

Affected Systems

agentic-flow <= 2.0.13ruflo < 3.12.4claude-flow < 3.12.4@claude-flow/cli < 3.12.4

Affected Versions Detail

Product	Affected Versions	Fixed Version
agentic-flow ruvnet	<= 2.0.13	2.0.14

Attribute	Detail
CWE ID	CWE-78
Attack Vector	Network (Unauthenticated) / User Interaction Required (AI processing untrusted input)
CVSS Severity	8.8 (High)
Exploit Status	PoC Available / Verified
KEV Status	Not Listed

MITRE ATT&CK Mapping

T1203Exploitation for Client Execution

Execution

T1059Command and Scripting Interpreter

Execution

CWE-78

OS Command Injection

Improper Neutralization of Special Elements used in an OS Command ('OS Command Injection')

More Reports

•22 minutes ago•GHSA-MQQ5-J7W8-2HGH

7.5

GHSA-MQQ5-J7W8-2HGH: Missing Authorization in Alchemy CMS API Pages Controller

A critical missing authorization vulnerability exists in the API Pages Controller of Alchemy CMS. An unauthenticated remote attacker can exploit the 'nested' action to retrieve the entire nested page tree. Furthermore, by appending the query parameter '?elements=true', the attacker can extract sensitive content from draft, unpublished, and restricted pages, bypassing all access controls.

Alon Barad

0 views•6 min read

•about 1 hour ago•GHSA-WFPW-MMFH-QQ69

4.8

GHSA-WFPW-MMFH-QQ69: Use-After-Free Vulnerability in Nokogiri XML Node-Level XInclude Processing

Nokogiri is a popular Ruby gem used for parsing XML and HTML documents. A Use-After-Free (UAF) vulnerability exists in its CRuby implementation during XInclude processing. When an application traverses an XML document and exposes nodes to Ruby before calling `do_xinclude`, the underlying C library `libxml2` can free these structures in-place. This leaves active Ruby objects holding pointers to freed memory, leading to potential segmentation faults, memory corruption, or information disclosure.

Amit Schendel

2 views•7 min read

•about 1 hour ago•GHSA-PHWJ-RPRQ-35PP

2.3

GHSA-PHWJ-RPRQ-35PP: Use-After-Free Vulnerability in Nokogiri XML Attribute Value Modification

A use-after-free (UAF) vulnerability exists in the CRuby native extension of the Nokogiri gem when updating XML attribute values. If child nodes of an XML attribute are wrapped by Ruby objects prior to setting the attribute's value, the underlying C memory structures are freed while the Ruby wrapper retains a dangling pointer. This results in memory corruption, invalid pointer dereferences, and application crashes during execution or garbage collection.

Amit Schendel

4 views•6 min read

•about 2 hours ago•GHSA-VMHF-C436-HXJ4

5.1

GHSA-VMHF-C436-HXJ4: Client-side Stored Cross-Site Scripting (XSS) in JupyterLab Extension Manager

A client-side Stored Cross-Site Scripting (XSS) vulnerability exists in the JupyterLab Extension Manager. This vulnerability allows an attacker to register a malicious package on the Python Package Index (PyPI) with a crafted metadata homepage URL using the 'javascript:' pseudo-protocol. When a JupyterLab user opens the Extension Manager and clicks the extension name, the browser executes arbitrary JavaScript code within the context of the JupyterLab origin. This can lead to the theft of active workspace documents, credentials, and API tokens. The issue affects all versions of JupyterLab prior to version 4.5.9.

Amit Schendel

2 views•5 min read

•about 2 hours ago•GHSA-JV2H-4P9V-WF5W

8.8

GHSA-JV2H-4P9V-WF5W: Arbitrary Remote Code Execution via Incomplete Environment Denylist in Ouroboros AI

An arbitrary Remote Code Execution (RCE) vulnerability exists in ouroboros-ai due to an incomplete fix for CVE-2026-47211. Ouroboros automatically loads environment configurations from local .env files located in the current working directory (CWD) of cloned repositories. Although a denylist (_UNTRUSTED_ENV_DENYLIST) was introduced in version 0.39.0 to filter out execution-routing environment variables, multiple critical configuration variables were omitted, enabling complete sandbox bypass and arbitrary system command execution.

Alon Barad

4 views•6 min read

•about 4 hours ago•CVE-2026-12151

7.5

CVE-2026-12151: Denial of Service via Uncontrolled Fragment Buffering in Undici WebSocket Client

A high-severity denial of service vulnerability in the undici WebSocket client (CVE-2026-12151) arises from uncontrolled memory consumption. Although undici validates individual fragment sizes against a cumulative payload limit, it fails to cap the total number of frames in a single message stream. This allows a rogue or compromised WebSocket server to send an infinite sequence of small or empty continuation frames, causing unbounded memory allocation and eventual heap exhaustion on the client process.

Amit Schendel

5 views•7 min read