Question 1

When do I need to encode HTML entities?

Accepted Answer

Any time you insert user-supplied or otherwise untrusted text into HTML. Missing output encoding is one of the most common causes of cross-site scripting (XSS). Most modern templating engines escape by default; raw `innerHTML` assignment and string concatenation do not, and that is where vulnerabilities show up.
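
As a minimal sketch of escaping at the point of insertion, here is the idea in Python using the standard library's `html.escape`. The `render_comment` helper and the sample payload are hypothetical, made up for illustration only.

```python
import html

def render_comment(user_text: str) -> str:
    """Wrap untrusted text in a paragraph, escaping it first (hypothetical helper)."""
    # html.escape replaces &, < and >, and with the default quote=True also " and '.
    return "<p>" + html.escape(user_text) + "</p>"

payload = '<script>alert("xss")</script>'
print(render_comment(payload))
# <p>&lt;script&gt;alert(&quot;xss&quot;)&lt;/script&gt;</p>
```

The browser shows the payload as text instead of running it, because no `<` survives to open a tag.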

Question 2

Why does my output have double-encoded entities like `&amp;amp;`?

Accepted Answer

Something in your pipeline encoded the same string twice. A common cause is storing already-encoded HTML in a database and then encoding it again on render. To check, decode the output once: if entities are still present and the original input was raw text, you have a double-encode bug.
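
One way to run that check, sketched in Python with the standard `html` module. The `looks_double_encoded` helper is a hypothetical heuristic, not a definitive test: it will also flag raw text that legitimately contains entity-like strings.

```python
import html

raw = "Tom & Jerry"
once = html.escape(raw)     # "Tom &amp; Jerry"
twice = html.escape(once)   # "Tom &amp;amp; Jerry"

def looks_double_encoded(stored: str) -> bool:
    """Heuristic: after one decode, does a second decode still change the text?"""
    decoded = html.unescape(stored)
    return html.unescape(decoded) != decoded

print(looks_double_encoded(once))   # False
print(looks_double_encoded(twice))  # True
```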

Question 3

Named or numeric entities — which is preferable?

Accepted Answer

Named (`&amp;`) for readability when the character has a named entity. Numeric (`&#65;` or `&#x41;`) for everything else, especially Unicode. Named entities exist only for a limited set of characters; numeric references work for any code point.
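
A small Python sketch of that preference, assuming the HTML 4.01 name table that ships in `html.entities.codepoint2name` is good enough for your purposes. The `encode_char` helper is hypothetical.

```python
import html
from html.entities import codepoint2name

def encode_char(ch: str) -> str:
    """Use a named entity when one exists, otherwise a hex numeric reference."""
    name = codepoint2name.get(ord(ch))
    return f"&{name};" if name else f"&#x{ord(ch):X};"

print(encode_char("&"))   # &amp;
print(encode_char("A"))   # &#x41;

# Decimal and hex numeric references decode to the same character.
print(html.unescape("&#65;") == html.unescape("&#x41;") == "A")  # True
```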

Question 4

Does it encode characters inside script and style tags?

Accepted Answer

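It encodes whatever you paste into the input. But you should not encode HTML entities inside `<script>` or `<style>` elements. Browsers treat their contents as raw text and do not decode entities there, so an encoded `&amp;&amp;` would reach the JavaScript or CSS parser literally.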
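
To illustrate that script and style contents are handled as raw text, here is a sketch using Python's `html.parser.HTMLParser` as a stand-in for a browser parser (an assumption: real browsers follow the HTML specification, which this parser only approximates). The `TextByTag` class is hypothetical.

```python
from html.parser import HTMLParser

class TextByTag(HTMLParser):
    """Collect the text found inside each element, keyed by tag name."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.stack = []
        self.text = {}

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        if self.stack and self.stack[-1] == tag:
            self.stack.pop()

    def handle_data(self, data):
        if self.stack:
            key = self.stack[-1]
            self.text[key] = self.text.get(key, "") + data

parser = TextByTag()
parser.feed("<p>a &amp;&amp; b</p><script>a &amp;&amp; b</script>")
parser.close()

print(parser.text["p"])       # entities decoded in normal content
print(parser.text["script"])  # entities left untouched inside script
```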

Question 5

Should I encode all non-ASCII characters?

Accepted Answer

No, not in modern web HTML. If your page declares `<meta charset="utf-8">` and the file is saved as UTF-8, non-ASCII characters render fine without encoding. Encoding them just bloats the output. Encode only the structurally significant characters (`&`, `<`, `>`, `"`, `'`) and a few edge cases, such as invisible characters (for example a non-breaking space) that you want to be visible in the source.
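
A quick Python comparison of the two approaches, using a made-up sample string. `html.escape` touches only the significant characters, while the `xmlcharrefreplace` error handler is one way to force every non-ASCII character into a numeric reference.

```python
import html

text = 'Café "Zoë" <3 & 日本語'

# Minimal: only &, <, >, " and ' are replaced; non-ASCII stays as-is.
minimal = html.escape(text)

# Everything: additionally turn each non-ASCII character into &#NNN;.
everything = minimal.encode("ascii", "xmlcharrefreplace").decode("ascii")

print(minimal)
print(everything)
print(len(minimal), len(everything))  # the second form is noticeably longer
```

Both forms decode back to the same text; the second one just costs more bytes and is harder to read in the source.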

Question 6

Why does my output have `&amp;amp;`?

Accepted Answer

Double-encoding. The input was already encoded once (by a CMS, a paste from Word, an upstream API), and you encoded it again. Decode first, then encode if needed. The telltale sign is a literal `&amp;` visible on the rendered page (`&amp;amp;` in the source): somewhere in the pipeline an extra encode step is firing.
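
A minimal Python sketch of "decode first, then encode", assuming you know whether a given upstream source delivers already-encoded text. The `encode_once` helper and its `already_encoded` flag are hypothetical.

```python
import html

def encode_once(value: str, already_encoded: bool) -> str:
    """Return HTML-safe text that has been encoded exactly one time."""
    # Decode only when the source is known to hand over encoded text.
    plain = html.unescape(value) if already_encoded else value
    return html.escape(plain)

from_cms = "Tom &amp; Jerry"   # upstream already encoded this

print(html.escape(from_cms))        # Tom &amp;amp; Jerry  (the bug)
print(encode_once(from_cms, True))  # Tom &amp; Jerry
```

Passing the flag explicitly avoids guessing: blindly decoding every input would corrupt raw text in which a user really typed `&amp;`.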

Question 7

Does this work for URL encoding?

Accepted Answer

No, URL encoding is different. URLs use percent-encoding (`%20` for space, `%3D` for `=`). HTML entities and URL encoding are not interchangeable. For URLs, use a URL encoder; for HTML body or attributes, use this tool.
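
The difference is easy to see side by side. This Python sketch uses `urllib.parse.quote` for percent-encoding and `html.escape` for entities; the `example.com` URL and the query value are placeholders.

```python
import html
from urllib.parse import quote

value = "a=b c&d"

print(quote(value, safe=""))  # a%3Db%20c%26d   (percent-encoding)
print(html.escape(value))     # a=b c&amp;d     (HTML entities)

# When a URL goes into an HTML attribute, both layers apply, in this order:
# percent-encode the parameter value, then entity-encode the whole URL.
url = "https://example.com/search?q=" + quote(value, safe="") + "&lang=en"
link = '<a href="' + html.escape(url) + '">search</a>'
print(link)
```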

Question 8

What about Unicode emoji?

Accepted Answer

Modern HTML with UTF-8 declared renders emoji directly without encoding — that's the recommended approach. If you must entity-encode (rare), use numeric entities: `&#x1F600;` for 😀. Older parsers may not handle codepoints above U+FFFF correctly; named entities don't exist for emoji.
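
If you do need the numeric form, it can be derived from the code point. A short Python sketch; the `to_numeric` helper is hypothetical. Note that an emoji built from several code points (a skin-tone variant, for instance) needs one reference per code point.

```python
import html

def to_numeric(text: str) -> str:
    """Encode every character as a hex numeric reference."""
    return "".join(f"&#x{ord(ch):X};" for ch in text)

grin = "\U0001F600"               # 😀
thumbs = "\U0001F44D\U0001F3FD"   # thumbs up + medium skin tone modifier

print(to_numeric(grin))    # &#x1F600;
print(to_numeric(thumbs))  # &#x1F44D;&#x1F3FD;

# Decoding restores the original characters.
print(html.unescape(to_numeric(thumbs)) == thumbs)  # True
```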

Question 9

Can I round-trip encode then decode safely?

Accepted Answer

Yes, with a caveat. The five HTML-significant characters round-trip cleanly. Some characters have multiple entity representations and the encoder picks one: `&apos;` and `&#39;` both decode to `'`, but the encoder may not emit the same form you started with. The displayed character is identical; the byte sequence may differ.
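
Both halves of that answer can be checked in a few lines. This sketch uses Python's `html` module as the encoder, which is an assumption: other encoders may pick a different form for the apostrophe.

```python
import html

original = '<a href="x">Tom & Jerry\'s</a>'

# Encode then decode: the text comes back unchanged.
encoded = html.escape(original)
print(html.unescape(encoded) == original)  # True

# Decode then encode: the character survives, the spelling may not.
for entity in ("&apos;", "&#39;", "&#x27;"):
    print(entity, "->", html.escape(html.unescape(entity)))
# Python's html.escape always writes the apostrophe as &#x27;
```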

HTML Entity Encoder/Decoder
