The technical root cause of GHSA-GR75-JV2W-4656 comprises three major logical and implementation failures. First, the file-search agent middleware validates the existence of the root directory but fails to validate or sanitize search patterns (such as glob patterns and relative traversals). If an input pattern contains relative path modifiers like ../../, the middleware evaluates them relative to the root but permits the resolution of files outside that boundary.

Second, the middleware does not perform post-resolution path validation using canonicalized absolute paths. When resolving file paths, the system retrieves and reads files without verifying if the fully resolved target path is a subpath of the allowed root. If the allowed root directory contains symbolic links pointing to sensitive system files, the system dereferences and reads the target file instead of throwing an access violation. This represents a classic symbolic link vulnerability (CWE-59).

Third, the application utilizes an insecure string-prefix comparison to enforce directory boundaries. Specifically, the system validates paths by verifying whether candidate_path.startswith(allowed_root) evaluates to true. This validation strategy is insecure when the allowed_root string does not end with a directory separator. For example, if the root path is /usr/app, a candidate path of /usr/app-secrets/config.json satisfies the prefix condition despite pointing to a completely different directory. This allows attackers to access sibling directories sharing the same prefix string.

These three failures combine to create multiple escape vectors. An attacker can use directory traversal to read files relative to the workspace, use symbolic links to bypass detection when standard input validation is active, or use prefix matching flaws to access sibling application directories that might hold separate database credentials or application tokens.

# Vulnerable Path Check Implementation import os def secure_file_load(user_path, safe_directory="/usr/app"): # VULNERABILITY 1: Insecure prefix matching # If safe_directory is '/usr/app', '/usr/app-secrets' matches if not user_path.startswith(safe_directory): raise ValueError("Access Denied") # VULNERABILITY 2: Missing path canonicalization # User path can be '/usr/app/../../etc/passwd' # The startswith check succeeds, but the system accesses /etc/passwd with open(user_path, 'r') as f: return f.read()

# Patched Path Check Implementation import os from pathlib import Path def secure_file_load_patched(user_path, safe_directory="/usr/app"): # Canonicalize safe directory and candidate path safe_path = Path(safe_directory).resolve() candidate_path = Path(user_path).resolve() # Verify directory boundary using relative_to or checking parent structures try: # relative_to raises ValueError if candidate_path is not under safe_path candidate_path.relative_to(safe_path) except ValueError: raise ValueError("Access Denied: Path is outside of safe directory") with open(candidate_path, 'r') as f: return f.read()

Product

Affected Versions

Fixed Version

langchain

LangChain

< 1.3.9

1.3.9

langchain-anthropic

LangChain

< 1.4.6

1.4.6

Attribute

Detail

CWE ID

CWE-22, CWE-59

Attack Vector

Local

CVSS Score

4.7 (Moderate)

EPSS Score

N/A

Exploit Status

None / Unproven

KEV Status

Not Listed

GHSA-GR75-JV2W-4656

GHSA-GR75-JV2W-4656: Path Traversal and Sandbox Escape in LangChain File-Search Middleware and Loaders

Alon Barad

Software Engineer

Jun 16, 2026·8 min read·18 visits

Executive Summary (TL;DR)

Insecure path resolution, missing symlink checks, and a path-prefix boundary bypass in LangChain allow attackers to escape file sandboxes via directory traversal or symbolic links.

A path traversal and sandbox escape vulnerability in LangChain and LangChain-Anthropic Python packages allows unauthenticated local attackers to access files outside the restricted directory via crafted input, symbolic links, or prefix bypasses.

Attack Flow Diagram

Vulnerability Overview

GHSA-GR75-JV2W-4656 is a path traversal and sandbox escape vulnerability identified within the LangChain library ecosystem, specifically affecting the langchain and langchain-anthropic Python packages. The LangChain ecosystem provides developers with frameworks to build applications powered by Large Language Models (LLMs), including autonomous agents that interact with the physical filesystem. To support file-related operations, LangChain implements middleware, directory search loaders, and config readers that parse file paths dynamically.

The integration of file-handling capabilities within LLM-controlled environments introduces a substantial attack surface. When LLMs are permitted to call filesystem-backed tools with arguments derived from untrusted user instructions, the application relies entirely on the underlying software boundaries to enforce directory restrictions. If the software boundaries are flawed, the LLM agent can be coerced into accessing files beyond its operational sandbox.

The vulnerability occurs because LangChain's internal path resolution mechanisms do not strictly restrict resolved paths to their specified root directories. This design deficiency manifests as an improper limitation of pathnames to a restricted directory (CWE-22) and improper link resolution before file access (CWE-59). An attacker can exploit this weakness to traverse the filesystem or follow symbolic links pointing to sensitive administrative files.

By leveraging this flaw, an attacker who can input text into the LLM prompt can trigger arbitrary file reads. The attack does not require direct access to the command line of the server hosting the application, as the LLM agent acts as an execution proxy. The vulnerability is highly operationalizable in any configuration that links LLMs with local workspace search tools.

Root Cause Analysis

Code Analysis

To understand the vulnerability, consider the following implementation of the vulnerable path verification mechanism:

# Vulnerable Path Check Implementation
import os
 
def secure_file_load(user_path, safe_directory="/usr/app"):
    # VULNERABILITY 1: Insecure prefix matching
    # If safe_directory is '/usr/app', '/usr/app-secrets' matches
    if not user_path.startswith(safe_directory):
        raise ValueError("Access Denied")
 
    # VULNERABILITY 2: Missing path canonicalization
    # User path can be '/usr/app/../../etc/passwd'
    # The startswith check succeeds, but the system accesses /etc/passwd
    with open(user_path, 'r') as f:
        return f.read()

The patched version introduces strict path canonicalization using os.path.realpath or Path.resolve() to resolve all relative segments and symbolic links. It also enforces correct boundary checking by appending the directory separator or using path parent comparisons:

# Patched Path Check Implementation
import os
from pathlib import Path
 
def secure_file_load_patched(user_path, safe_directory="/usr/app"):
    # Canonicalize safe directory and candidate path
    safe_path = Path(safe_directory).resolve()
    candidate_path = Path(user_path).resolve()
 
    # Verify directory boundary using relative_to or checking parent structures
    try:
        # relative_to raises ValueError if candidate_path is not under safe_path
        candidate_path.relative_to(safe_path)
    except ValueError:
         raise ValueError("Access Denied: Path is outside of safe directory")
 
    with open(candidate_path, 'r') as f:
        return f.read()

The patch successfully remediates all three root causes. By resolving the realpath before executing the comparison, the system prevents both directory traversal sequences and symbolic link resolution attacks. Furthermore, using pathlib.Path.relative_to or ensuring a proper path separator prevents sibling directory prefix bypasses.

Architects must ensure that similar resolution bugs are not present in custom tools added to LangChain. Many developer-defined tools use raw os.path.join or custom regex filters that fail under complex Windows-specific or Unix-specific canonicalization edge cases. The use of standard library components like pathlib is strongly recommended for security-critical path parsing.

Exploitation Methodology

Exploitation of GHSA-GR75-JV2W-4656 is highly contextual and depends on the application's configuration. In a typical scenario, an LLM-powered agent is integrated with a custom tool that leverages LangChain's vulnerable file-search middleware. The agent is configured with a restricted sandbox directory, such as /home/user/workspace/.

An attacker sends a malicious prompt to the LLM agent designed to trigger the filesystem search tool. The prompt contains instruction structures or relative path strings intended to bypass application logic, such as: "Search for files matching the pattern '../../../../etc/passwd' and display their contents.". The LLM agent, interpreting this as a valid execution command, calls the underlying file-search tool with the malicious pattern.

Since the middleware does not validate the resolved path of the matched files against the allowed root directory, it executes the search and reads the contents of /etc/passwd. The output is then passed back to the LLM context and subsequently returned to the attacker. If the application environment contains symbolic links, the attacker can leverage existing links to escape the container's designated workspace without using explicit traversal sequences.

Furthermore, if the application loads configuration files dynamically from shared directories, an attacker with write access to a collaborative space can upload a modified YAML configuration file. This file can declare prompt templates that point to local files outside the permitted workspace. When the configuration loader parses the file, it resolves the unauthorized paths, leading to automatic data exposure during agent initialization.

Impact Assessment

The impact of GHSA-GR75-JV2W-4656 is rated as Moderate with a CVSS v3.1 score of 4.7. The CVSS vector is CVSS:3.1/AV:L/AC:H/PR:N/UI:N/S:U/C:H/I:N/A:N. Although the CVSS rating is Moderate due to the Local (AV:L) attack vector and High complexity (AC:H), the vulnerability poses a substantial confidentiality risk to applications deploying autonomous agents on server environments.

Successful exploitation allows unauthenticated attackers to read arbitrary files from the filesystem of the host running the LangChain application. This can result in the exposure of configuration files, environment variables, database credentials, API keys, and sensitive source code. The severity escalates if the LangChain application runs with elevated operating system privileges, enabling access to system files like /etc/shadow or sensitive cloud metadata keys.

The high complexity (AC:H) rating reflects the requirement that the target application must expose vulnerable file-search or config-loading APIs to untrusted inputs. However, in modern LLM applications where agents dynamically process arbitrary user prompts, this configuration is increasingly common, heightening the real-world likelihood of exploitation.

There is no integrity or availability impact associated with this vulnerability directly. However, the retrieval of environment variables and database keys frequently provides attackers with the initial access vectors needed to pivot to more intrusive actions, such as remote command execution or complete cloud tenant compromise.

Remediation and Mitigation

To address the vulnerability, developers must upgrade the affected packages to safe versions. Specifically, update langchain to version 1.3.9 or later, and langchain-anthropic to version 1.4.6 or later. These versions incorporate safe canonicalization and boundary verification logic for all file access routines.

If immediate upgrading is not possible, developers should implement temporary workarounds. First, restrict the operating system user running the LangChain application to minimal filesystem permissions. Ensure that the application process cannot read sensitive system directories or files outside its immediate operational directory.

Second, disable directory-level tools in LLM agents when handling untrusted user input. If file-searching capabilities are strictly required, implement a validation wrapper around the LangChain components. This wrapper must resolve paths to their absolute real paths using Path.resolve() and verify that the target directory strictly matches the prefix of the permitted workspace directory, appending a trailing path separator before executing the check.

Finally, use containerization to enforce absolute process-level isolation. Deploying the application inside a non-privileged Docker container restricts the host filesystem exposure to only the files mounted within the container volume. Even if a path traversal occurs, the attacker remains trapped in the isolated container namespace and cannot access the underlying host OS configurations.

Official Patches

LangChainLangChain Security Advisory and Upgrade Recommendations

Technical Appendix

CVSS Score

4.7/ 10

CVSS:3.1/AV:L/AC:H/PR:N/UI:N/S:U/C:H/I:N/A:N

Affected Systems

LangChain core file-search middlewareLangChain-Anthropic integration modulesAutonomous LLM agents with filesystem tools

Affected Versions Detail

Product	Affected Versions	Fixed Version
langchain LangChain	< 1.3.9	1.3.9
langchain-anthropic LangChain	< 1.4.6	1.4.6

Attribute	Detail
CWE ID	CWE-22, CWE-59
Attack Vector	Local
CVSS Score	4.7 (Moderate)
EPSS Score	N/A
Exploit Status	None / Unproven
KEV Status	Not Listed

MITRE ATT&CK Mapping

T1083File and Directory Discovery

Discovery

T1140Deobfuscation/Decoding of Files or Information

Defense Evasion

T1548Abuse Elevation Control Mechanism

Privilege Escalation

CWE-22

Improper Limitation of a Pathname to a Restricted Directory ('Path Traversal')

The software uses external input to construct a pathname that is intended to identify a directory or file that is located within a restricted directory, but the software does not properly neutralize special elements within the pathname that can cause the pathname to resolve to a location that is outside of the restricted directory.

More Reports

•about 3 hours ago•CVE-2026-67437

7.5

CVE-2026-67437: Unauthenticated Denial of Service via OAuth2 State Memory Exhaustion in OliveTin

An uncontrolled resource consumption vulnerability (CWE-400) in OliveTin allows unauthenticated remote attackers to exhaust server memory and trigger a denial of service (DoS). By repeatedly initiating the OAuth2 login flow without completing it, attackers can force the server to allocate state variables in an unbounded in-memory map. This heap-based resource exhaustion eventually causes the host operating system to terminate the OliveTin process via the Out-Of-Memory (OOM) killer.

Amit Schendel

5 views•8 min read

•about 4 hours ago•CVE-2026-67439

4.3

CVE-2026-67439: Incorrect Authorization Leading to Log Leak in OliveTin

An incorrect authorization vulnerability (CWE-863) exists in OliveTin prior to version 3000.17.0. The flaw allows authenticated users who are authorized to execute commands but restricted from viewing logs to bypass this restriction. By utilizing synchronous endpoints, attackers can directly access execution outputs containing sensitive system data, credentials, and environmental configurations.

Alon Barad

6 views•5 min read

•about 5 hours ago•CVE-2026-67438

6.6

CVE-2026-67438: OS Command Injection via Custom regex: Argument Type Bypassing Shell Safety Check in OliveTin

An OS command injection vulnerability exists in OliveTin versions >= 3000.2.0 and < 3000.17.0. The flaw stems from a validation bypass in the shell safety engine, which fails to recognize custom regular expression arguments as unsafe for actions run in shell execution mode. Furthermore, because these custom regex checks evaluate partial string matches, attackers can append arbitrary shell metacharacters to valid inputs. This allows unauthenticated or low-privilege users who are authorized to run configured actions to inject shell commands and achieve arbitrary remote code execution on the host system.

Amit Schendel

5 views•6 min read

•about 6 hours ago•CVE-2026-63118

6.9

CVE-2026-63118: DNS-Rebinding and Cross-Origin Request Execution in Model Context Protocol (MCP) Ruby SDK

A critical vulnerability (CVE-2026-63118) in the Model Context Protocol (MCP) Ruby SDK allows attackers to execute arbitrary JSON-RPC commands and exfiltrate sensitive local data from an MCP server bound to the local loopback interface. This is achieved through DNS-rebinding and cross-origin request execution due to missing validation of the HTTP Host and Origin headers in the StreamableHTTPTransport component.

Alon Barad

7 views•6 min read

•about 8 hours ago•CVE-2026-63119

6.2

CVE-2026-63119: Denial of Service via Uncontrolled Resource Consumption in Model Context Protocol Ruby SDK

CVE-2026-63119 is a high-impact denial-of-service vulnerability in the Model Context Protocol (MCP) Ruby SDK (distributed as the 'mcp' gem) before version 0.23.0. The vulnerability allows an attacker to cause resource exhaustion and process termination by streaming unbounded input to standard I/O streams.

Alon Barad

6 views•6 min read

•about 9 hours ago•CVE-2026-67430

5.3

CVE-2026-67430: Denial of Service via Unbounded Session Retention in Model Context Protocol Ruby SDK

CVE-2026-67430 is a medium-severity Denial of Service (DoS) vulnerability in the Model Context Protocol (MCP) Ruby SDK (packaged as the mcp gem) versions prior to 0.23.0. In stateful deployments using the StreamableHTTPTransport class, client session states are retained in an in-memory hash map. Because the transport implements a nil idle timeout by default, the background scavenger process is suppressed. Remote, unauthenticated attackers can flood the endpoint with initialize requests, rapidly consuming system memory and triggering an Out-of-Memory (OOM) crash.

Alon Barad

7 views•8 min read