Unicode Transformation Format (UTF)

The Unicode Consortium develops the Unicode Standard.
Its goal is to replace the existing character sets with the Unicode Standard and its Unicode Transformation Formats (UTF).
The Unicode Standard has become a success and is implemented in HTML, XML, Java, JavaScript, e-mail, ASP, PHP, etc.
The Unicode Consortium cooperates with the leading standards development organizations, like ISO, W3C, and ECMA.
The Unicode Standard covers (almost) all the characters, punctuation marks, and symbols in the world.
The default character encoding in HTML5 is UTF-8. The Unicode Standard is also supported by many operating systems and all modern browsers.

===================================================

The Unicode Character Sets

Unicode can be implemented by different character encodings.
The most commonly used encodings are UTF-8 and UTF-16:

 UTF-8:
A character in UTF-8 can be from 1 to 4 bytes long.
UTF-8 can represent any character in the Unicode Standard.
UTF-8 is backwards compatible with ASCII.
UTF-8 is the preferred encoding for e-mail and web pages.
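
The 1-to-4-byte range and the ASCII compatibility can be checked with a few lines of Python. This is only a small sketch: the sample characters and the helper name utf8_length are chosen here for illustration and are not part of the notes above.

```python
# Encode a few sample characters as UTF-8 and report how many bytes each needs.
# The sample list and the helper name are illustrative assumptions.
samples = ["A", "é", "π", "€", "😀"]

def utf8_length(ch):
    """Return the number of bytes UTF-8 uses for a single character."""
    return len(ch.encode("utf-8"))

for ch in samples:
    print(f"U+{ord(ch):04X} {ch} -> {utf8_length(ch)} byte(s)")

# Pure ASCII text yields exactly the same bytes in ASCII and in UTF-8,
# which is what "backwards compatible with ASCII" means in practice.
assert "Hello".encode("utf-8") == "Hello".encode("ascii")
```

Characters from the ASCII range take 1 byte, accented Latin and Greek letters take 2, the euro sign takes 3, and characters outside the Basic Multilingual Plane (such as most emoji) take 4.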

 UTF-16:
UTF-16 (16-bit Unicode Transformation Format) is a variable-length character encoding for Unicode
that uses one or two 16-bit code units per character and is capable of encoding the entire Unicode repertoire.
UTF-16 is used in major operating systems and environments, like Microsoft Windows, Java and .NET.
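
To see what "variable-length" means for UTF-16, the sketch below counts 16-bit code units per character. The helper name utf16_units and the sample characters are assumptions made for this illustration; the "utf-16-be" codec is used so that no byte order mark is added to the output.

```python
# Count how many 16-bit code units UTF-16 needs for one character.
# "utf-16-be" (big-endian, no byte order mark) keeps the byte count exact.
def utf16_units(ch):
    """Return the number of 16-bit code units for a single character."""
    return len(ch.encode("utf-16-be")) // 2

for ch in ["A", "é", "€", "😀"]:
    encoded = ch.encode("utf-16-be").hex().upper()
    print(f"U+{ord(ch):04X} {ch} -> {utf16_units(ch)} unit(s): {encoded}")
```

Characters up to U+FFFF fit in a single code unit, while characters above that range, such as U+1F600, are stored as a pair of code units (a surrogate pair).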

 Tip:
All HTML 4 processors support UTF-8,
and all HTML5 and XML processors support both UTF-8 and UTF-16!

If an HTML5 web page uses a different character set than UTF-8,
it should be specified in the <meta> tag, for example:

<meta charset="ISO-8859-1">

w3schools.com/charsets/ref_utf_symbols.asp

Unicode Characters (graphemica.com)

Char   Entity HTML Dec     HTML Hex     Description
ă abreve &#259; &#x103; latin small letter a with breve
â acirc &#226; &#xE2; latin small letter a with circumflex
Î Icirc &#206; &#xCE; latin capital letter I with circumflex
î icirc &#238; &#xEE; latin small letter i with circumflex
à agrave &#224; &#xE0; latin small letter a with grave = latin small letter a grave
á aacute &#225; &#xE1; latin small letter a with acute
æ aelig &#230; &#xE6; latin small letter ae = latin small ligature ae
ç ccedil &#231; &#xE7; latin small letter c with cedilla
è egrave &#232; &#xE8; latin small letter e with grave
é eacute &#233; &#xE9; latin small letter e with acute
ê ecirc &#234; &#xEA; latin small letter e with circumflex
Δ Delta &#916; &#x394; greek capital letter delta
Φ Phi &#934; &#x3A6; greek capital letter phi
φ phi &#966; &#x3C6; greek small letter phi
α alpha &#945; &#x3B1; greek small letter alpha
β beta &#946; &#x3B2; greek small letter beta
γ gamma &#947; &#x3B3; greek small letter gamma
μ mu &#956; &#x3BC; greek small letter mu
π pi &#960; &#x3C0; greek small letter pi
ϖ piv &#982; &#x3D6; greek pi symbol
ρ rho &#961; &#x3C1; greek small letter rho
ψ psi &#968; &#x3C8; greek small letter psi
ω omega &#969; &#x3C9; greek small letter omega
& amp &#38; &#x26; ampersand
< lt &#60; &#x3C; less than
> gt &#62; &#x3E; greater than
± plusmn &#177; &#xB1; plus-minus sign = plus-or-minus sign
µ micro &#181; &#xB5; micro sign
¼ frac14 &#188; &#xBC; vulgar fraction one quarter = fraction one quarter
½ frac12 &#189; &#xBD; vulgar fraction one half = fraction one half
¾ frac34 &#190; &#xBE; vulgar fraction three quarters = fraction three quarters
bull &#8226; &#x2022; bullet = black small circle
bull 02022
bullet 02022
prime &#8242; &#x2032; prime = minutes = feet
Prime &#8243; &#x2033; double prime = seconds = inches
oline &#8254; &#x203E; overline = spacing overscore
frasl &#8260; &#x2044; fraction slash
larr &#8592; &#x2190; leftwards arrow
uarr &#8593; &#x2191; upwards arrow
rarr &#8594; &#x2192; rightwards arrow
darr &#8595; &#x2193; downwards arrow
harr &#8596; &#x2194; left right arrow
crarr &#8629; &#x21B5; downwards arrow with corner leftwards = carriage return
rArr &#8658; &#x21D2; rightwards double arrow
exist &#8707; &#x2203; there exists
empty &#8709; &#x2205; empty set = null set = diameter
isin &#8712; &#x2208; element of
notin &#8713; &#x2209; not an element of
sum &#8721; &#x2211; n-ary summation
minus &#8722; &#x2212; minus sign
lowast &#8727; &#x2217; asterisk operator
radic &#8730; &#x221A; square root = radical sign
prop &#8733; &#x221D; proportional to
infin &#8734; &#x221E; infinity
sim &#8764; &#x223C; tilde operator = varies with = similar to
reversed tilde &#8765; &#x223D;
sine wave &#8767; &#x223F;
inverted lazy s &#8766; &#x223E;
homothetic &#8763; &#x223B;
proportion &#8759; &#x2237;
geometric proportion &#8762; &#x223A;
excess &#8761; &#x2239;
dot minus &#8760; &#x2238;
cong &#8773; &#x2245; approximately equal to
approxeq &#8778; &#x224A; almost equal or equal to
approx &#8776; &#x2248; almost equal to
simeq &#8771; &#x2243; asymptotically equal to
minus tilde &#8770; &#x2242;
not tilde &#8769; &#x2241;
wreath product &#8768; &#x2240;
ne &#8800; &#x2260; not equal to
equiv &#8801; &#x2261; identical to
le &#8804; &#x2264; less-than or equal to
ge &#8805; &#x2265; greater-than or equal to
sub &#8834; &#x2282; subset of
sup &#8835; &#x2283; superset of
nsub &#8836; &#x2284; not a subset of
perp &#8869; &#x22A5; up tack (orthogonal to) (perpendicular)
down tack &#8868; &#x22A4;
left tack &#8867; &#x22A3;
right tack &#8866; &#x22A2;
squared dot operator &#8865; &#x22A1;
squared times &#8864; &#x22A0;
squared minus &#8863; &#x229F;
squared plus &#8862; &#x229E;
circled dash &#8861; &#x229D;
circled equals &#8860; &#x229C;
circled asterisk operator &#8859; &#x229B;
circled ring operator &#8858; &#x229A;
circled dot operator &#8857; &#x2299;
circled division slash &#8856; &#x2298;
circled times &#8855; &#x2297; otimes
circled minus &#8854; &#x2296;
circled plus &#8853; &#x2295; oplus
square cup &#8852; &#x2294;
square cap &#8851; &#x2293;
square original of or equal to &#8850; &#x2292;
square image of or equal to &#8849; &#x2291;
square original of &#8848; &#x2290;
square image of &#8847; &#x228F;
multiset union &#8846; &#x228E;
multiset multiplication &#8845; &#x228D;
multiset &#8844; &#x228C;
hearts &#9829; &#x2665; black heart suit = valentine
bigcirc 025EF
bigstar 02605
boxminus 0229F
boxplus 0229E
boxtimes 022A0
check 02713
checkmark 02713
dagger 02020
˜ DiacriticalTilde 002DC
÷ div 000F7
÷ divide 000F7
$ dollar 00024
DoubleLongRightArrow 027F9
DoubleRightArrow 021D2
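
The decimal and hexadecimal columns in the table above follow directly from each character's code point, so a row can be rebuilt programmatically. The following is a minimal sketch using Python's standard html module; the helper name entity_row is made up for this example, and codepoint2name only knows the HTML 4 entity names, so newer names such as abreve come back empty.

```python
import html
from html.entities import codepoint2name

def entity_row(ch):
    """Return (char, entity name, decimal reference, hex reference)."""
    cp = ord(ch)
    # codepoint2name covers HTML 4 names only; "" means no name was found.
    name = codepoint2name.get(cp, "")
    return ch, name, f"&#{cp};", f"&#x{cp:X};"

for ch in "é<π•":
    print(*entity_row(ch))

# The named, decimal, and hexadecimal forms all decode to the same character.
assert html.unescape("&bull;") == html.unescape("&#8226;") == html.unescape("&#x2022;")
```

Running html.unescape on any reference in the table is also a quick way to check that a decimal and a hexadecimal entry really describe the same character.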