UTF-8 text validator
Encode text as UTF-8 bytes and decode with fatal errors — detects invalid surrogate pairs for storage in JS strings.
JavaScript strings are UTF-16; this checks round-trip through UTF-8 bytes. For binary files, use a hex or base64 workflow.
- UTF-8 round-trip: OK
- Code points (approx): 13
- UTF-8 byte length: 15