Unicode Text Converter

Convert plain text (letters, sometimes numbers, sometimes punctuation) to obscure characters from Unicode. The output is fully cut-n-pastable text.

Small FAQ

What conversions does this do?

This toy only converts characters from the ASCII range. Characters are only converted on a one-to-one basis; no combining characters (eg U+20DE COMBINING ENCLOSING SQUARE), many to one (eg ligatures), or context varying (eg Braille) transformations are done.

Current true transforms:
circled, negative circled, Asian fullwidth, math bold, math bold Fraktur, math bold italic, math bold script, math double-struck, math monospace, math sans, math sans-serif bold, math sans-serif bold italic, math sans-serif italic, parenthesized, regional indicator symbols, squared, negative squared, and tagging text (invisible for hidden metadata tagging).

Psuedo transforms (made by picking and choosing from here and there in Unicode) available:
acute accents, CJK based, curvy variant 1, curvy variant 2, curvy variant 3, faux Cyrillic, Mock Ethiopian, math Fraktur, rock dots, small caps, stroked, subscript (many missing, no caps), superscript (some missing), inverted, and reversed (an incomplete alphabet, better with CAPITALS).
Capitalization preserved where available.

What makes an alphabet "psuedo"?

One or more of the letters transliterated has a different meaning or source than intended. In the non-bold version of Fraktur, for example, several letters are "black letter" but most are "mathematical fraktur". In the Faux Cyrillic and Faux Ethiopic, letters are selected merely based on superficial similarities, rather than phonetic or semantic similarities.

What is "CJK"?

CJK is a collective term for the Chinese, Japanese, and Korean languages, all of which use Chinese characters and derivatives in their writing systems.

What is "Fullwidth"?

These are "Roman" letters that are the same width as Japanese characters and are typically used when mixing English and Japanese.

What is the deal with "Tag"?

"Tags" is a Unicode block containing characters for invisibly tagging texts by language. The tag characters are deprecated in favor of markup. All printable ASCII have a tag version. Properly rendered, they have both no glyph and zero width. Note that sometimes zero width text cannot be easily copied.

What is the deal with "Regional Indicator"?

This block of characters is intended to indicate a global region, eg "France". As such some tools use short sequences of Regional Indicators to encode flags. The idea is that the same two-letter country codes used in domain names would be mapped into this block to represent that region, eg, with a flag. So U+1F1EB ("Symbol Letter F") and U+1F1F7 ("Symbol Letter R") are the way the French flag might be encoded: 🇫🇷 (results will vary with browser).

A Unicode Toy © 2009-2021 Eli the Bearded