Skip to main content
PDF Security6 min read

How a Copy-Paste Trick Exposed Redacted Federal Documents

Federal court documents with redacted text exposed through simple copy-paste extraction technique
How a Copy-Paste Trick Exposed Redacted Federal Documents

In late 2025, a federal agency released hundreds of pages from a high-profile criminal investigation. The documents were heavily redacted — black bars covered names, financial details, and other sensitive information throughout. Within hours, social media users demonstrated that those redactions were cosmetic. Anyone could select the black bars, copy the text behind them, and paste it into a text editor. Names of victims, financial records, and investigative details the agency intended to protect were fully readable.

What Happened

The released PDFs contained black rectangles drawn over sensitive text. This technique produces a visual overlay, not a redaction — the digital equivalent of holding a strip of black tape over a page while photocopying it, except the copy is interactive and the tape is transparent to any computer.

The underlying text remained in the PDF content stream, fully intact. Standard PDF readers allow users to select text behind an annotation. Text extraction tools ignore the visual overlay entirely and return the raw characters. No specialized software was required — Ctrl+C was sufficient.

Social media users demonstrated the technique in viral videos. Within hours, the unredacted content was circulating across multiple platforms. The agency pulled the documents and reissued properly redacted versions, but the exposed data — including names of victims who had been promised confidentiality — had already spread widely.

Who the Redactions Were Meant to Protect

The redactions covered the names of victims with a legal right to privacy, identities of cooperating witnesses, and details of ongoing investigations. These protections are standard in federal criminal proceedings and exist for documented safety and legal reasons.

When the redactions failed, the exposed individuals were not public figures. They were private citizens whose identities the court system had ordered protected. The exposure resulted not from a cyberattack or a deliberate leak, but from a procedural error in PDF preparation.

This Is Not a New Problem

This failure mode has been documented for over two decades. In 2019, defense attorneys in a federal case filed a court document with black-box redactions that could be bypassed the same way, revealing details of foreign contacts that prosecutors had sought to keep sealed. In 2011, a federal security agency accidentally released its internal procedures manual with bypassable redactions. The intelligence community published a guide on proper PDF redaction in 2005 specifically because the problem was already widespread across government.

Two decades of documented warnings and published technical guidance did not prevent a major federal agency from repeating the same error in late 2025.

What Proper Redaction Actually Means

A PDF is not a photograph. It is a structured data file that contains text as encoded character data, separate from how that text is rendered visually. When you draw a black rectangle over text in a PDF, you are adding a visual element on top of the text data. The text itself is untouched.

Proper redaction does three things:

  1. Removes the text from the content stream. The actual character data — the bytes that represent the letters and numbers — is deleted from the PDF. There is nothing left to copy or extract.
  2. Replaces the area with an opaque fill. A black (or colored) rectangle replaces the space where text existed, but there is no text data beneath it. Selecting and copying returns nothing.
  3. Strips associated metadata. The document's revision history, comments, form data, and other hidden structures are cleaned so the redacted text cannot be recovered from any secondary source within the file.

What Went Wrong (Overlay Only)

Drew black annotation rectangles over text. The text remained in the PDF content stream. Anyone with Ctrl+C could extract it. This is the single most common redaction mistake in existence.

What Proper Redaction Looks Like

Used a proper redaction tool that removes the underlying text data from the PDF before generating the final document. The redacted output should contain no trace of the original text — not in the content stream, not in metadata, not anywhere.

Why This Keeps Happening

The core problem is that most PDF software makes it easy to do the wrong thing and hard to do the right thing. The most widely used PDF editor in government has both drawing tools and a dedicated redaction tool. The drawing tools create visual overlays. The redaction tool removes data. They look similar on screen. The difference is invisible to the person applying them — but catastrophic to the people whose data is supposed to be protected.

Training programs exist, but the recurring nature of these incidents suggests a tooling problem more than a knowledge problem. When the default workflow produces overlays and true redaction requires extra steps, errors are predictable. Tools where the default behavior removes the underlying data — rather than offering it as an option in a submenu — would eliminate the most common failure mode.

How to Verify Redactions Are Real

Whether you are producing redacted documents or receiving them, verification takes less than a minute:

  1. Try to select text behind the black bars. If your cursor highlights characters, the redaction is fake.
  2. Copy and paste into a text editor. Select the redacted region, press Ctrl+C, paste into Notepad. If text appears, the data is still there.
  3. Use Ctrl+F to search. If you know a word that should be redacted, search for it. If it appears in results, the redaction failed.
  4. Extract all text. Use any text extraction tool on the full document. Properly redacted text will not appear anywhere in the output.

Any one of these checks, applied before publication, would have caught the failed redactions in the 2025 case.

The Practical Consequences

Redaction failures produce real privacy violations. In the 2025 case, individuals whose identities were court-ordered sealed were exposed to a mass audience. The breach did not originate from a cyberattack or an insider leak — it resulted from a PDF preparation error that has been documented and preventable for over twenty years. For any organization handling sensitive personal information, the distinction between a visual overlay and a true redaction is a baseline operational requirement.

Redact PDFs Properly — No Upload Required

PDFb2's redaction tool permanently removes text from the PDF content stream. No visual overlays. No data left behind. Processing happens entirely in your browser — the unredacted document never leaves your device.

Redact PDF Now