What is the correct JSON Unicode escape format?

\u followed by exactly four hex digits, e.g. \u00A9 for ©.

How do I escape emoji in JSON?

Use UTF-8 encoding (no escape needed) or surrogate pairs with \u.

Quick answer

Unicode in JSON strings uses \u followed by exactly four hexadecimal digits.

JSON Invalid Unicode Escape

Unicode in JSON strings uses \u followed by exactly four hexadecimal digits. Wrong length or non-hex characters cause an error.

Common causes

Using fewer or more than four hex digits after \u.
Using uppercase U or wrong case in hex.
Copying Unicode from a context that uses different escape format.

How to fix

Use \uXXXX where X is 0-9 or A-F (e.g. \u0020 for space).
For code points above U+FFFF, use a surrogate pair (two \u escapes).
Validate with a JSON validator to see the exact invalid escape.

JSON invalid Unicode escape errors happen when a string contains a backslash-u sequence that does not follow JSON syntax. In valid JSON, Unicode escapes must use \uXXXX, where X is a hexadecimal digit. This validator page helps developers, API consumers, and QA teams identify malformed escape sequences quickly, especially in payloads generated by apps, logs, templates, or copy-paste workflows. It is useful when debugging JSON parsing failures in web apps, backend services, integrations, and data pipelines. If your JSON includes accented characters, symbols, or non-Latin text, checking escape formatting is an important step before sending or storing the data.

How This Validator Works

This check looks for Unicode escape sequences inside JSON strings and verifies that each one matches the JSON specification. A valid escape must begin with a backslash and u, followed by exactly four hexadecimal characters: 0-9, a-f, or A-F. Examples include \u00E9 and \u20AC. If the sequence is too short, too long, missing digits, or contains non-hex characters, the JSON parser will reject it. In practice, this validator helps isolate the exact location of the malformed escape so you can correct the source text or encoding process.

Common Validation Errors

Too few hex digits: \u12 is invalid because JSON requires exactly four digits.
Too many hex digits: \u12345 is invalid because the escape must stop after four hex characters.
Non-hex characters: \u12G4 fails because G is not a hexadecimal digit.
Missing backslash: u1234 is treated as plain text, not an escape.
Broken escaping in source code: Some languages require double escaping, so the JSON text may be correct in one context and broken in another.
Invalid surrogate handling: Some Unicode characters require paired escapes, and incomplete pairs can cause downstream parsing issues.

Where This Validator Is Commonly Used

API request and response debugging when JSON payloads fail to parse.
Frontend applications that build JSON strings from user input or form data.
Backend services that serialize text from databases, queues, or external systems.
Data migration and ETL pipelines where text encoding issues can surface.
Content management systems that store structured metadata in JSON fields.
QA and test automation for validating fixtures, mocks, and sample payloads.
Developer tooling such as linters, formatters, and schema-aware editors.

Why Validation Matters

JSON is widely used because it is simple, portable, and machine-readable, but that also means parsers are strict about syntax. A single malformed Unicode escape can prevent an entire payload from loading, which may break an API call, stop a page from rendering, or cause a data import to fail. Validating early helps reduce debugging time, improves interoperability between systems, and makes text handling more reliable across languages, frameworks, and storage layers. It also helps ensure that international characters are preserved correctly instead of being corrupted or rejected.

Technical Details

JSON string escapes are defined by the JSON grammar and must follow exact syntax rules.
Unicode escape format: \u plus exactly four hexadecimal digits.
Hex digits allowed: 0-9, a-f, A-F.
Common parser behavior: invalid escapes usually trigger a syntax error before the JSON can be consumed.
Language context matters: when JSON is embedded in source code, the host language may require escaping the backslash itself.
Surrogate pairs: some code points outside the Basic Multilingual Plane may be represented as paired Unicode escapes in JSON.

Pattern	Status	Notes
\u00E9	Valid	Exactly four hex digits.
\u12	Invalid	Too few digits.
\u12G4	Invalid	Contains a non-hex character.
\u12345	Invalid	Too many digits for a single escape.

FAQ

What does JSON invalid Unicode escape mean?

It means a JSON string contains a \u escape that does not match the required format. JSON expects exactly four hexadecimal digits after \u. If the sequence is shorter, longer, or includes characters outside the hex range, the parser will treat it as invalid and reject the JSON text.

What is the correct Unicode escape format in JSON?

The correct format is \uXXXX, where each X is a hexadecimal digit. Valid examples include \u0041 for “A” and \u03A9 for “Ω”. This format is strict, so even a small typo can make the entire JSON document fail to parse.

Why does \u12 fail in JSON?

It fails because JSON requires exactly four hexadecimal digits after \u. The sequence \u12 only has two digits, so it is incomplete. Parsers usually stop at the error and report a syntax issue rather than trying to guess the intended character.

Can I use more than four digits after \u in JSON?

No, not for a single Unicode escape. JSON uses exactly four hex digits per escape. If you need to represent a character outside the Basic Multilingual Plane, JSON may use a surrogate pair made of two separate \uXXXX escapes. The parser still expects each escape to be individually valid.

Why does my JSON work in one tool but fail in another?

Different tools may handle escaping at different layers. For example, a string may be valid in a programming language literal but invalid once it is converted into raw JSON text, or vice versa. It is important to check the final JSON payload, not just the source code that generated it.

How do I fix a malformed Unicode escape?

Replace the invalid sequence with a proper \uXXXX escape or use the actual Unicode character if your system supports it. Make sure the escape contains exactly four hexadecimal digits and that any surrounding quotes, backslashes, or string delimiters are also correct.

Does this error always mean the text is corrupted?

Not always. Sometimes the text is fine, but the escaping rules were applied incorrectly during serialization, templating, or copy-paste. The issue may come from the application layer, a transport layer, or a manual edit rather than from the original content itself.

Is Unicode escaping required in JSON?

No, JSON strings can contain many Unicode characters directly if the text is encoded properly. Unicode escapes are mainly used for compatibility, readability in certain contexts, or when a character needs to be represented in a safe ASCII-only form. The key requirement is that any escape sequence used must be syntactically valid.

Related Validators & Checkers

JSON Validator
JSON Syntax Checker
JSON Escape Validator
Unicode Text Validator
String Escape Checker
API Payload Validator

FAQ

How do I escape emoji in JSON?: Use UTF-8 encoding (no escape needed) or surrogate pairs with \u.

Fix it now

Try in validator (prefill this example)

All tools · Canonical