Question 1

Which characters require encoding?

Accepted Answer

In HTML body content: <, >, and &. In attribute values, additionally " and ' depending on the quote style. Other characters can be encoded but are not required if the document declares a UTF-8 charset.

Question 2

What is the difference between ' and '?

Accepted Answer

' is an HTML5 named reference for the apostrophe; ' is the numeric reference. Both produce the same character. ' was not valid in HTML4, so ' is safer for older documents.

Question 3

Should non-ASCII characters be encoded?

Accepted Answer

Generally no, if the document is UTF-8. Encoding them increases file size and reduces readability. Encode only when the document encoding is uncertain or when the character is reserved markup.

Question 4

Can decoded output contain script?

Accepted Answer

Decoding turns

Question 5

How do hex and decimal entities differ?

Accepted Answer

Both encode the same Unicode code point; only the syntax differs. &#38; (decimal) and &#x26; (hex) are identical to the parser. Hex is more common when copying from Unicode tables that list code points in hex; decimal appears in older code.

Question 6

What's double-encoding?

Accepted Answer

Encoding an already-encoded string. '&' becomes '&' on first pass and '&amp;' on second. Visible '&' in a rendered page is the classic symptom: data was encoded once at storage time and again at display time. Fix one encoding pass, not both.

Question 7

Are emoji handled correctly?

Accepted Answer

Yes. Emoji code points are above U+FFFF and require either decimal or hex numeric references. '&#x1F600;' decodes to the grinning face. Modern UTF-8 documents can include emoji literally without encoding; entities matter only when the document encoding is restricted.

Question 8

Why does   render as a space?

Accepted Answer

is a non-breaking space (U+00A0), visually similar to a regular space but treated differently by line-breaking. HTML collapses sequences of whitespace including regular spaces, but does not collapse non-breaking spaces, which makes them useful for forced spacing in titles.

HTML Entity Encoder

Related Tools

About This Tool

Frequently Asked Questions