Getting LLMs Drunk to Find Remote Linux Kernel OOB Writes (and More)

This article details an autonomous LLM-driven vulnerability research harness that discovered over 20 CVEs, including two high-severity remote, unauthenticated out-of-bounds write vulnerabilities (CVE-2026-31432 and CVE-2026-31433, CVSS 8.8) in the Linux kernel's ksmbd module, triggered by specific compound SMB requests. The research was inspired by a simpler documentation-to-code mismatch flaw, CVE-2025-32462 (CVSS 2.8), which was a local privilege escalation in sudo affecting versions prior to 1.9.17. The fixed version for the sudo vulnerability is 1.9.17, while mitigation details for the kernel vulnerabilities are not provided in the article.

Read Full Article →

TLDR: the grossly overengineered, self-orchestrating team of vulnerability-hunting agents detailed below has discovered 20+ CVEs over the past few months, including CVE-2026-31432 and CVE-2026-31433 : two remote, unauthenticated OOB writes in the Linux kernel’s ksmbd . Read on for the details of the setup that achieved this, including – yes! – getting LLMs drunk. Background # “LLMing” vulnerability research has been on my “Do Something About This” list since DARPA’s AIxCC and XBOW’s initial results . But back in 2023-24, models required a lot of harnessing to get anything useful, tool use was rudimentary, and the idea of squeezing as much code as I could into a model’s context – then triaging away the false positives – filled me with dread. The push to actually do something came in the summer of 2025. Rich Mirch reported a dead-simple, unnoticed-for-12-years local privilege escalation in sudo: CVE-2025-32462 . Contrary to the documentation, the --host flag did not just permit listing privileges on a different host – it made the hostname portion of sudo rules irrelevant . So, e.g., if a sudoers rule granted you root on somehost but not the local host, you could abuse the flag to get full root locally. This LPE was not LLM-found (AFAICT), but it did make me wonder: what if instead of getting LLMs to drive various tools, we had them hunt for (stupid simple) mismatches between documentation and the actual code? It seemed like an easier lift for (local) LLMs in terms of context size, harnessing complexity, and intelligence required. These would not be the most technically exciting findings, but their practical effects would be just as serious: impact-wise, an LPE is an LPE! By the end of 2025, I’d begun working on a harness to do just this. But, to paraphrase Mike Tyson, everyone has a plan until a new model drops. Almost as soon as my harness was done, the models got good enough to greatly simplify the scaffolding required even for context-heavy external tool use. At this point, my quest fissioned into three : Can we find the “docs ↔ code mismatch”-type vulnerabilities – the original goal, inspired by the finding above? Given the step change in capabilities, what about vulnerabilities in general? More speculatively, can we get a “move 37” out of LLMs to either a) find entirely novel bug classes, or at least b) unlock something in smaller models to enhance their hunting capabilities? Findings # The answers were roughly “yes,” “yes,” and “maybe.” Below are 30+ findings (20+ CVEs assigned as of 2026-04-29, some not yet published) discovered fully autonomously via the custom harness. I prioritized network-reachable services first, given the impending avalanche: Target Issue CVE # Linux kernel (ksmbd) Compound READ + QUERY_INFO(Security) requests can trigger a (remote, unauthenticated) out-of-bounds write in ksmbd CVE-2026-31432 , fix Linux kernel (ksmbd) Compound QUERY_DIRECTORY + QUERY_INFO(FILE_ALL_INFORMATION) requests can trigger a (remote, unauthenticated) out-of-bounds write in ksmbd CVE-2026-31433 , fix Docker Crafted Docker API requests can make AuthZ plugins see no request body, bypassing body-inspecting authorization policies CVE-2026-34040 Docker PENDING PUBLICATION PENDING ASSIGNMENT Docker PENDING PUBLICATION PENDING ASSIGNMENT OpenSSL PENDING PUBLICATION CVE-2026-34182 CUPS On network-exposed CUPS with a shared PostScript queue, unauthenticated Print-Job requests can reach arbitrary code execution over the network as lp CVE-2026-34980 CUPS An unprivileged local attacker can coerce cupsd into leaking a reusable local admin token, escalating to a rootful file (over)write CVE-2026-34990 CUPS RSS notify-recipient-uri path traversal lets a remote IPP client write RSS XML outside CacheDir/rss , including clobbering job.cache CVE-2026-34978 HAProxy Single-packet infinite-loop DoS (QUIC) CVE-2026-26080 HAProxy Single-packet DoS (QUIC) CVE-2026-26081 Caddy Large host lists make MatchHost case-sensitive, enabling host-based routing/access-control bypass CVE-2026-27588 Caddy %xx escaped-path matching skips case normalization, enabling path-based access-control bypass CVE-2026-27587 Traefik TCP readTimeout bypass in the Postgres STARTTLS handling path, allowing an unauthenticated connection-stalling denial of service CVE-2026-25949 udisks Missing authorization on LUKS header restore lets a local unprivileged user overwrite encryption metadata, causing irreversible denial of service/data loss CVE-2026-26103 udisks Missing authorization on LUKS header backup lets a local unprivileged user export sensitive encryption metadata CVE-2026-26104 systemd-machined Local privilege escalation in affected desktop-session configurations via the RegisterMachine IPC/D-Bus path CVE-2026-4105 etcd Authorization bypasses in multiple gRPC APIs let unauthorized users invoke operations such as MemberList , Alarm , Lease APIs, and compaction in affected auth-enabled clusters CVE-2026-33413 Squid Heap use-after-free in ICP handling lets...

Read Full Article → ← Back to News

Getting LLMs Drunk to Find Remote Linux Kernel OOB Writes (and More)

Related Articles

Share this article