inspect

Unicode Info

Inspect every character in text — codepoint, UTF-8 bytes, JS escape, Unicode block, category — and flag invisibles, BOMs, RTL marks, and control characters.

Loading…

About Unicode Info

Unicode Info breaks text down character by character, showing each one's codepoint, UTF-8 byte sequence, JavaScript escape, Unicode block, and category. Crucially, it flags the things you can't see — zero-width characters, BOMs, right-to-left marks, and control characters — that quietly break parsers, diffs, and logins. It runs in your browser and returns a structured JSON report.

Category: inspect
Input: Accepts: text/plain.
Output: Outputs: application/json.
Cost: Free, runs in your browser
Memory: low

Privacy: Unicode Info runs entirely on your device. Files you provide never leave your browser — no uploads, no server, no tracking. The page works offline once loaded.

Common uses

Track down an invisible zero-width space that's breaking a string comparison or a password match
Diagnose a leading byte-order mark (BOM) corrupting the first column of a CSV import
Spot a hidden RTL override character injected into a filename or commit message
Inspect emoji and combining sequences to understand why a string's length looks wrong
Verify that pasted text contains no stray control characters before storing it
Look up the exact codepoint and Unicode block of an unfamiliar glyph

Frequently asked questions

What does it report for each character?

Codepoint, UTF-8 bytes, JavaScript escape sequence, Unicode block, and general category — plus flags for invisibles, BOMs, RTL marks, and control characters.

Why would I care about invisible characters?

They cause bugs that are nearly impossible to see — failed equality checks, mangled imports, and spoofed identifiers — so surfacing them is the whole point.

What's the output format?

A JSON array with one entry per character, suitable for inspection or feeding into another tool.

Does my text get uploaded?

No. The analysis is performed entirely in your browser; the text stays on your device.

Does it handle characters outside the basic plane?

Yes — codepoints are reported per character, including astral-plane characters like many emoji.

Keywords

unicode
codepoint
character
inspect
utf-8
utf8
escape
invisible
zero-width
bom
rtl
hex
encoding

About Unicode Info

Common uses

Frequently asked questions

Keywords

Try next

Related tools