HTML Redact
Redact emails, phone numbers, SSNs, cards and more from the visible text of an HTML file — every tag and attribute is left intact. The structure-aware counterpart to Text Redact.
About HTML Redact
HTML Redact masks emails, phone numbers, SSNs, card numbers, and more from the visible text of an HTML file while leaving every tag and attribute untouched. It is the structure-aware counterpart to Text Redact: the page still renders and parses, but the personal data in the readable content is gone. It runs in your browser, so the document is never uploaded.
- Category
- privacy
- Input
- Accepts: text/html or text/plain.
- Output
- Outputs: text/html (multiple).
- Cost
- Free, runs in your browser
- Memory
- low
Common uses
- Sanitize a saved email's HTML before attaching it to a public bug report
- Strip customer phone numbers from an exported support-ticket page
- Redact credit card numbers in an order confirmation page kept for an archive
- Clean SSNs out of an HTML form dump without breaking the markup
- Prepare an HTML invoice for sharing by masking the recipient's personal details
- Remove emails from a scraped contact page while keeping its layout intact
Frequently asked questions
Will redacting break my HTML?
No. Only the visible text content is masked; tags, attributes, and structure are preserved so the page still renders.
What kinds of data does it catch?
Emails, phone numbers, SSNs, card-shaped numbers, and other common PII patterns in the readable text.
Does my HTML file get uploaded?
No. The redaction runs entirely in your browser tab.
How is this different from Text Redact?
Text Redact treats input as plain text. HTML Redact is structure-aware: it redacts visible content while leaving markup and attributes alone.
Does it catch personal data written in free-form prose?
It targets pattern-shaped data like emails and numbers. For free-form mentions of names and places, an LLM-based redactor is a better fit.
Keywords
- html
- redact
- pii
- privacy
- sanitize
- mask