lynx/tests/lynx-dump/data/iso-8859-2a.html.exp
Kamil Dudka 5bdda90d01 Resolves: CVE-2021-38165 - implement a gating test
... based on `fmf` and `tmt`
2021-10-15 10:12:52 +02:00

194 lines
12 KiB
Plaintext
Raw Blame History

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

#[1]iso-8859-1 test [2]iso-8859-2 ALT test
iso8859-2 plus table, and cp-1252
Description Code Entity name
=================================== ============ ==============
quotation mark " --> " " --> "
ampersand & --> & & --> &
less-than sign &#60; --> < &lt; --> <
greater-than sign &#62; --> > &gt; --> >
Description Char Code Entity name
=================================== ==== ============ ==============
euro sign &128; --> €
undefined &129; --> 
single low-9 quotation mark &130; --> ‚
latin small letter f with hook &131; --> ƒ
double low-9 quotation mark &132; --> „
horizontal ellipsis &133; --> …
dagger &134; --> †
double dagger &135; --> ‡
modifier letter circumflex accent &136; --> ˆ
per mille sign &137; --> ‰
latin capital letter s with caron &138; --> Š
single left-pointing angle quote mark &139; --> ‹
latin capital ligature oe &140; --> Œ
undefined &141; --> 
latin capital letter z with caron &142; --> Ž
undefined &143; --> 
undefined &144; --> 
left single quotation mark &145; --> ‘
right single quotation mark &146; --> ’
left double quotation mark &147; --> “
right double quotation mark &148; --> ”
bullet &149; --> •
en dash &150; --> –
em dash &151; --> —
small tilde &152; --> ˜
trade mark sign &153; --> ™
latin small letter s with caron &154; --> š
single right-pointing angle quote mark &155; --> ›
latin small ligature oe &156; --> œ
undefined &157; --> 
latin small letter z with caron &158; --> ž
latin capital letter y with diaeresis &159; --> Ÿ
non-breaking space &#160; --> &nbsp; -->
capital A, ogonek Ą &#260; --> Ą &Aogon; --> Ą
breve {˘} {&#728;}-->{˘} {&breve;} -->{˘}
capital L, stroke Ł &#321; --> Ł &Lstrok; --> Ł
general currency sign ¤ &#164; --> ¤ &curren; --> ¤
capital L, caron Ľ &#317; --> Ľ &Lcaron; --> Ľ
capital S, acute accent Ś &#346; --> Ś &Sacute; --> Ś
section sign § &#167; --> § &sect; --> §
umlaut (dieresis) ¨ &#168; --> ¨ &uml; --> ¨
&die; --> ¨
capital S, caron Š &#352; --> Š &Scaron; --> Š
capital S, cedilla Ş &#350; --> Ş &Scedil; --> Ş
capital T, caron Ť &#356; --> Ť &Tcaron; --> Ť
capital Z, acute accent Ź &#377; --> Ź &Zacute; --> Ź
soft hyphen [] [&#173;]-->[] [&shy;] -->[]
capital Z, caron Ž &#381; --> Ž &Zcaron; --> Ž
capital Z, dot above Ż &#379; --> Ż &Zdot; --> Ż
degree sign ° &#176; --> ° &deg; --> °
small a, ogonek ą &#261; --> ą &aogon; --> ą
ogonek {˛} {&#731;}-->{˛} {&ogon;} -->{˛}
small l, stroke ł &#322; --> ł &lstrok; --> ł
acute accent ´ &#180; --> ´ &acute; --> ´
small l, caron ľ &#318; --> ľ &lcaron; --> ľ
small s, acute accent ś &#347; --> ś &sacute; --> ś
caron {ˇ} {&#711;}-->{ˇ} {&caron;} -->{ˇ}
cedilla ¸ &#184; --> ¸ &cedil; --> ¸
small s, caron š &#353; --> š &scaron; --> š
small s, cedilla ş &#351; --> ş &scedil; --> ş
small t, caron ť &#357; --> ť &tcaron; --> ť
small z, acute accent ź &#378; --> ź &zacute; --> ź
double acute accent {˝} {&#733;}-->{˝} {&dblac;} -->{˝}
small z, caron ž &#382; --> ž &zcaron; --> ž
small z, dot above ż &#380; --> ż &zdot; --> ż
capital R, acute accent Ŕ &#340; --> Ŕ &Racute; --> Ŕ
capital A, acute accent Á &#193; --> Á &Aacute; --> Á
capital A, circumflex accent  &#194; -->  &Acirc; --> Â
capital A, breve Ă &#258; --> Ă &Abreve; --> Ă
capital A, dieresis or umlaut mark Ä &#196; --> Ä &Auml; --> Ä
capital L, acute accent Ĺ &#313; --> Ĺ &Lacute; --> Ĺ
capital C, acute accent Ć &#262; --> Ć &Cacute; --> Ć
capital C, cedilla Ç &#199; --> Ç &Ccedil; --> Ç
capital C, caron Č &#268; --> Č &Ccaron; --> Č
capital E, acute accent É &#201; --> É &Eacute; --> É
capital E, ogonek Ę &#280; --> Ę &Eogon; --> Ę
capital E, dieresis or umlaut mark Ë &#203; --> Ë &Euml; --> Ë
capital E, caron Ě &#282; --> Ě &Ecaron; --> Ě
capital I, acute accent Í &#205; --> Í &Iacute; --> Í
capital I, circumflex accent Î &#206; --> Î &Icirc; --> Î
capital D, caron Ď &#270; --> Ď &Dcaron; --> Ď
capital D, stroke Đ &#272; --> Đ &Dstrok; --> Đ
capital Eth, Icelandic N/A &#208; --> Ð &ETH; --> Ð
capital N, acute accent Ń &#323; --> Ń &Nacute; --> Ń
capital N, caron Ň &#327; --> Ň &Ncaron; --> Ň
capital O, acute accent Ó &#211; --> Ó &Oacute; --> Ó
capital O, circumflex accent Ô &#212; --> Ô &Ocirc; --> Ô
capital O, double acute accent Ő &#368; --> Ű &Odblac; --> Ő
capital O, dieresis or umlaut mark Ö &#214; --> Ö &Ouml; --> Ö
multiply sign × &#215; --> × &times; --> ×
capital R, caron Ř &#344; --> Ř &Rcaron; --> Ř
capital U, ring Ů &#366; --> Ů &Uring; --> Ů
capital U, acute accent Ú &#218; --> Ú &Uacute; --> Ú
capital U, double acute accent Ű &#368; --> Ű &Udblac; --> Ű
capital U, dieresis or umlaut mark Ü &#220; --> Ü &Uuml; --> Ü
capital Y, acute accent Ý &#221; --> Ý &Yacute; --> Ý
capital T, cedilla Ţ &#354; --> Ţ &Tcedil; --> Ţ
small sharp s, German (sz ligature) ß &#223; --> ß &szlig; --> ß
small r, acute accent ŕ &#341; --> ŕ &racute; --> ŕ
small a, acute accent á &#225; --> á &aacute; --> á
small a, circumflex accent â &#226; --> â &acirc; --> â
small a, breve ă &#259; --> ă &abreve; --> ă
small a, dieresis or umlaut mark ä &#228; --> ä &auml; --> ä
small l, acute accent ĺ &#314; --> ĺ &lacute; --> ĺ
small c, acute accent ć &#263; --> ć &cacute; --> ć
small c, cedilla ç &#231; --> ç &ccedil; --> ç
small c, caron č &#269; --> č &ccaron; --> č
small e, acute accent é &#233; --> é &eacute; --> é
small e, ogonek ę &#281; --> ę &eogon; --> ę
small e, dieresis or umlaut mark ë &#235; --> ë &euml; --> ë
small e, caron ě &#283; --> ě &ecaron; --> ě
small i, acute accent í &#237; --> í &iacute; --> í
small i, circumflex accent î &#238; --> î &icirc; --> î
small d, caron ď &#271; --> ď &dcaron; --> ď
small d, stroke đ &#273; --> đ &dstrok; --> đ
small eth, Icelandic N/A &#240; --> ð &eth; --> ð
small n, acute accent ń &#324; --> ń &nacute; --> ń
small n, caron ň &#328; --> ň &ncaron; --> ň
small o, acute accent ó &#243; --> ó &oacute; --> ó
small o, circumflex accent ô &#244; --> ô &ocirc; --> ô
small o, double acute accent ő &#369; --> ű &odblac; --> ő
small o, dieresis or umlaut mark ö &#246; --> ö &ouml; --> ö
division sign ÷ &#247; --> ÷ &divide; --> ÷
small r, caron ř &#345; --> ř &rcaron; --> ř
small u, ring ů &#367; --> ů &uring; --> ů
small u, acute accent ú &#250; --> ú &uacute; --> ú
small u, double acute accent ű &#369; --> ű &udblac; --> ű
small u, dieresis or umlaut mark ü &#252; --> ü &uuml; --> ü
small y, acute accent ý &#253; --> ý &yacute; --> ý
small t, cedilla ţ &#355; --> ţ &tcedil; --> ţ
dot above {˙} {&#729;}-->{˙} {&dot;} -->{˙}
Some other characters of interest Char Code Entity name
=================================== ==== ============ ==============
capital AE diphthong (ligature) N/A &#198; --> Æ &AElig; --> Æ
small ae diphthong (ligature) N/A &#230; --> æ &aelig; --> æ
capital OE ligature N/A {&#338;}-->{Œ} {&OElig;} -->{Œ}
small oe ligature N/A {&#339;}-->{œ} {&oelig;} -->{œ}
copyright N/A &#169; --> © &copy; --> ©
registered trademark N/A &#174; --> ® &reg; --> ®
trademark sign N/A &#8482;--> ™ &trade; --> ™
em space N/A [&#8195;]->[ ] [&emsp;] -->[ ]
en space N/A [&#8194;]->[ ] [&ensp;] -->[ ]
1/3-em space N/A [&#8196;]->[] [&emsp13;] -->[]
1/4-em space N/A [&#8197;]->[] [&emsp14;] -->[]
thin space N/A [&#8201;]->[ ] [&thinsp;]-->[ ]
hair space N/A [&#8202;]->[] [&hairsp;]-->[]
em dash N/A [&#8212;]->[—] [&mdash;] -->[—]
en dash N/A [&#8211;]->[] [&ndash;] -->[]
__________________________________________________________________
Characters not found in ISO-8859-2 have "N/A" in the Char column. Some
characters for which I could not find entity names in either [3]RFC
2070 or the [4]ISOlat1, ISOlat2, ISOnum, ISOpub and ISOtech sets (the
ones included by Peter Flynn's [5]HTML Pro DTD) are shown enclosed in
{braces}.
There also is a variation of this table which tests [6]ISO-8859-2
characters and entities in ALT attributes.
See Martin Ramsch's original [7]ISO-8859-1 Table for related info and
links, and for some notes on entity names. This file is mostly just an
adaptation of his table to the ISO-8859-2 character set.
__________________________________________________________________
kweide@tezcat.com 1997-03-09
References
1.
2.
3. http://www.internic.net/rfc/rfc2070.txt
4. ftp://www.ucc.ie/pub/sgml/
5. http://www.ucc.ie/doc/www/html/dtds/htmlpro.html
6.
7. http://www.uni-passau.de/~ramsch/iso8859-1.html