Windows-1252

Windows-1252 or CP-1252 (code page 1252) is a single-byte character encoding of the Latin alphabet, used by default in the legacy components of Microsoft Windows for English and many European languages including Spanish, French, and German.

Windows-1252
MIME / IANA: windows-1252[1]
Alias(es): cp1252 (code page 1252)
Language(s): Basically all supported by ISO/IEC 8859-1, e.g. English, Danish (except for a rare exceptional letter), Irish, Italian, Norwegian, Portuguese, Spanish, Swedish, German, Finnish, Icelandic, French, Faroese, Luxembourgish, Albanian, Estonian, Swahili, Tswana, Catalan, Basque, Occitan, Rotokas, Romansh, Dutch (except the IJ/ij character, substituted by IJ/ÿ), and Slovene (except the č character, substituted by ç).
Created by: Microsoft
Standard: WHATWG Encoding Standard
Classification: extended ASCII, Windows-125x
Extends: ISO 8859-1 (excluding C1 controls)
Transforms / Encodes: ISO 8859-15

It is the most-used single-byte character encoding in the world, at least on websites. As of September 2022, 0.3% of all websites declared use of Windows-1252,[2][3] while 1.3%[2] declared ISO 8859-1 (though only 5 of the top 1000 websites[4]). Because HTML5 standards treat the two as the same encoding,[5] about 1.6% of websites effectively use Windows-1252. Pages declared as US-ASCII would also count as this character set. An unknown (but probably large) subset of other pages use only the ASCII portion of UTF-8, or only the codes matching Windows-1252 from their declared character set, and could also be counted.

Depending on the country, use can be much higher than the global average; for example, website use for Germany (including ISO-8859-1) is 4.0%.[6][7]

Details

This character encoding is a superset of ISO 8859-1 in terms of printable characters, but differs from the IANA's ISO-8859-1 by using displayable characters rather than control characters in the 80 to 9F (hex) range. Notable additional characters include curly quotation marks and all the printable characters that are in ISO 8859-15 (at different places than ISO 8859-15). It is known to Windows by the code page number 1252, and by the IANA-approved name "windows-1252".
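The relationship to ISO 8859-1 can be checked mechanically. The following is a minimal sketch in Python, assuming the standard-library codecs `latin-1` and `cp1252` as stand-ins for IANA's ISO-8859-1 and Windows-1252; the helper name `differing_bytes` is hypothetical and not part of any library.

```python
def differing_bytes():
    """Return the byte values that decode differently in cp1252 and latin-1."""
    diffs = []
    for b in range(256):
        raw = bytes([b])
        latin1 = raw.decode("latin-1")  # ISO-8859-1 maps every byte to U+00xx
        try:
            cp = raw.decode("cp1252")
        except UnicodeDecodeError:
            cp = None  # position left undefined by Python's cp1252 codec
        if cp != latin1:
            diffs.append(b)
    return diffs


if __name__ == "__main__":
    diffs = differing_bytes()
    print(f"{len(diffs)} differing bytes: {diffs[0]:02X} to {diffs[-1]:02X}")
```

Under these assumptions the two codecs disagree only in the 80 to 9F (hex) range, which matches the description above.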

It is very common to mislabel Windows-1252 text with the charset label ISO-8859-1. A common result was that all the quotes and apostrophes (produced by "smart quotes" in word-processing software) were replaced with question marks or boxes on non-Windows operating systems, making text difficult to read. Most modern web browsers and e-mail clients treat the media type charset ISO-8859-1 as Windows-1252 to accommodate such mislabeling. This is now standard behavior in the HTML5 specification, which requires that documents advertised as ISO-8859-1 actually be parsed with the Windows-1252 encoding.[5]
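The effect of the mislabeling can be reproduced in a few lines. This is an illustrative sketch in Python, assuming the standard-library codec names `cp1252` and `latin-1`; the sample sentence is invented for the example.

```python
# Text with "smart quotes", an en dash, and a curly apostrophe,
# saved as Windows-1252 by a hypothetical word processor.
text = "\u201cHello\u201d \u2013 it\u2019s fine"
raw = text.encode("cp1252")

# Decoded strictly as ISO-8859-1, the punctuation becomes invisible
# C1 control characters (U+0093, U+0094, U+0096, U+0092).
as_latin1 = raw.decode("latin-1")

# Decoded as Windows-1252, as HTML5 requires for the ISO-8859-1 label,
# the original punctuation is recovered.
as_cp1252 = raw.decode("cp1252")

if __name__ == "__main__":
    print(raw)
    print(ascii(as_latin1))
    print(as_cp1252)
```

A renderer that has no glyph for the C1 control characters is what produces the question marks or boxes described above.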

Historically, the phrase "ANSI Code Page" was used in Windows to refer to non-DOS encodings; the intention was that most of these would be ANSI standards such as ISO-8859-1. Even though Windows-1252 was the first and by far most popular code page named so in Microsoft Windows parlance, the code page has never been an ANSI standard. Microsoft explains, "The term ANSI as used to signify Windows code pages is a historical reference, but is nowadays a misnomer that continues to persist in the Windows community."[8]

In LaTeX packages, CP-1252 is referred to as "ansinew".

IBM uses code page 1252 (CCSID 1252 and euro sign extended CCSID 5348) for Windows-1252.[9][10][11]

It is called "WE8MSWIN1252" by Oracle.[12]

Codepage layout

The following table shows Windows-1252. Characters that differ from ISO-8859-1 have their Unicode code point number (hexadecimal) on the line below the character, based on the Unicode.org mapping of Windows-1252 with "best fit". Unused positions are shown as --. For the upper half (80 to FF), the decimal Alt code is Alt+0 followed by the decimal byte value, for example Alt+0128 for the euro sign.

Windows-1252 (CP1252)[13][14][15][16][17]
   0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
0_ NUL  SOH  STX  ETX  EOT  ENQ  ACK  BEL  BS   HT   LF   VT   FF   CR   SO   SI
1_ DLE  DC1  DC2  DC3  DC4  NAK  SYN  ETB  CAN  EM   SUB  ESC  FS   GS   RS   US
2_ SP   !    "    #    $    %    &    '    (    )    *    +    ,    -    .    /
3_ 0    1    2    3    4    5    6    7    8    9    :    ;    <    =    >    ?
4_ @    A    B    C    D    E    F    G    H    I    J    K    L    M    N    O
5_ P    Q    R    S    T    U    V    W    X    Y    Z    [    \    ]    ^    _
6_ `    a    b    c    d    e    f    g    h    i    j    k    l    m    n    o
7_ p    q    r    s    t    u    v    w    x    y    z    {    |    }    ~    DEL
8_ €    --   ‚    ƒ    „    …    †    ‡    ˆ    ‰    Š    ‹    Œ    --   Ž    --
   20AC      201A 0192 201E 2026 2020 2021 02C6 2030 0160 2039 0152      017D
9_ --   ‘    ’    “    ”    •    –    —    ˜    ™    š    ›    œ    --   ž    Ÿ
        2018 2019 201C 201D 2022 2013 2014 02DC 2122 0161 203A 0153      017E 0178
A_ NBSP ¡    ¢    £    ¤    ¥    ¦    §    ¨    ©    ª    «    ¬    SHY  ®    ¯
B_ °    ±    ²    ³    ´    µ    ¶    ·    ¸    ¹    º    »    ¼    ½    ¾    ¿
C_ À    Á    Â    Ã    Ä    Å    Æ    Ç    È    É    Ê    Ë    Ì    Í    Î    Ï
D_ Ð    Ñ    Ò    Ó    Ô    Õ    Ö    ×    Ø    Ù    Ú    Û    Ü    Ý    Þ    ß
E_ à    á    â    ã    ä    å    æ    ç    è    é    ê    ë    ì    í    î    ï
F_ ð    ñ    ò    ó    ô    õ    ö    ÷    ø    ù    ú    û    ü    ý    þ    ÿ

According to the information on Microsoft's and the Unicode Consortium's websites, positions 81, 8D, 8F, 90, and 9D are unused; however, the Windows API function MultiByteToWideChar maps these to the corresponding C1 control codes. The "best fit" mapping documents this behavior, too.[13]
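The two treatments of the unused positions can be contrasted in code. The sketch below is an assumption-laden illustration in Python, not a reimplementation of MultiByteToWideChar: Python's own `cp1252` codec rejects the five unused bytes in strict mode, and the hypothetical helper `decode_best_fit` instead passes them through as the C1 control code with the same number, as the "best fit" mapping describes.

```python
# Positions that Microsoft and Unicode list as unused in Windows-1252.
UNUSED = frozenset({0x81, 0x8D, 0x8F, 0x90, 0x9D})


def decode_best_fit(data: bytes) -> str:
    """Decode Windows-1252, mapping unused bytes to the matching C1 control."""
    chars = []
    for b in data:
        if b in UNUSED:
            chars.append(chr(b))  # e.g. 0x81 -> U+0081
        else:
            chars.append(bytes([b]).decode("cp1252"))
    return "".join(chars)


if __name__ == "__main__":
    sample = b"\x80 \x81 \x9d"
    print(ascii(decode_best_fit(sample)))
    try:
        sample.decode("cp1252")
    except UnicodeDecodeError as exc:
        print("strict cp1252 codec:", exc.reason)
```

Which behavior is appropriate depends on the consumer: rejecting the bytes surfaces corrupt input early, while passing them through keeps the conversion lossless for round trips.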

History

  • The first version of code page 1252, used in Microsoft Windows 1.0, did not have positions D7 and F7 defined. All the characters in the range 80–9F were undefined too.
  • In the second version, used in Microsoft Windows 2.0, positions D7, F7, 91, and 92 had been defined.
  • The third version, used since Microsoft Windows 3.1, had all the present-day positions defined, except the euro sign and the Z with caron character pair.
  • The final version listed above debuted in Microsoft Windows 98 and was ported to older versions of Windows with the euro symbol update.

OS/2 extensions

The OS/2 operating system supports an encoding by the name of Code page 1004 (CCSID 1004) or "Windows Extended".[18][19] This mostly matches code page 1252, with the exception of certain C0 control characters being replaced by diacritic characters.

Code page 1004 (differing rows only)[20][21][22][23]
   0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
0_ NUL  SOH  STX  ETX  ˉ    ˘    ˙    BEL  ˚    HT   ˝    ˛    ˇ    CR   SO   SI
                       02C9 02D8 02D9      02DA      02DD 02DB 02C7
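The table above is small enough to express as an override map on top of Windows-1252. The following Python sketch assumes that no ready-made codec for code page 1004 is available and builds one by hand; the names `CP1004_OVERRIDES` and `decode_cp1004` are hypothetical, and the seven code points are taken directly from the differing row shown above.

```python
# C0 positions that code page 1004 replaces with spacing diacritics.
CP1004_OVERRIDES = {
    0x04: "\u02c9",  # MODIFIER LETTER MACRON
    0x05: "\u02d8",  # BREVE
    0x06: "\u02d9",  # DOT ABOVE
    0x08: "\u02da",  # RING ABOVE
    0x0A: "\u02dd",  # DOUBLE ACUTE ACCENT
    0x0B: "\u02db",  # OGONEK
    0x0C: "\u02c7",  # CARON
}


def decode_cp1004(data: bytes) -> str:
    """Decode bytes as code page 1004, falling back to cp1252 elsewhere.

    Bytes that Python's cp1252 codec leaves undefined (81, 8D, 8F, 90, 9D)
    still raise UnicodeDecodeError in this sketch.
    """
    return "".join(
        CP1004_OVERRIDES.get(b) or bytes([b]).decode("cp1252") for b in data
    )


if __name__ == "__main__":
    print(decode_cp1004(b"\x04 \x0c \x80"))
```

Note that line feed (0A) and form feed (0C) are among the replaced positions, so text decoded this way cannot rely on those bytes as control characters.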

MSDOS extensions [rare]

There is a rarely used, but useful, graphics-extended variant of code page 1252 in which codes 0x00 to 0x1F allow for box drawing, as used in applications such as MS-DOS Edit and CodeView. One of the applications to use this code page was an Intel Corporation Install/Recovery disk image utility from mid to late 1995. These programs were written for Intel's P6 User Test Program machines (US example[24]). It was used exclusively in Intel's then EMEA region (Europe, Middle East & Africa). In time, the programs were changed to use code page 850.

Graphics Extended Code Page 1252
0 1 2 3 4 5 6 7 8 9 A B C D E F
0_
1_

Palm OS variant

This variant of Windows-1252 is used by Palm OS 3.5. Python gives it the palmos label.[25] Differences from Windows-1252 have their Unicode code point on the line below the character. Positions for which the table gives no character are shown as --.

Palm OS code page (differing rows only)[26]
   0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
8_ €    --   ‚    ƒ    „    …    †    ‡    ˆ    ‰    Š    ‹    Œ    ♦    ♣    ♥
                                                                    2666 2663 2665
9_ ♠    ‘    ’    “    ”    •    –    —    ˜    ™    š    ›    œ    --   --   Ÿ
   2660
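Because Python ships this variant under the `palmos` label, the difference from Windows-1252 can be observed directly. This is a brief sketch assuming a CPython installation that includes the `palmos` and `cp1252` codecs; the variable names are illustrative only.

```python
# The four positions where the Palm OS table places card suits.
suits = b"\x8d\x8e\x8f\x90"

palm = suits.decode("palmos")

# In Windows-1252 only 8E is assigned (Z with caron); the other three
# positions are unused, so "replace" substitutes U+FFFD for them.
win = suits.decode("cp1252", errors="replace")

if __name__ == "__main__":
    print(palm)
    print(ascii(win))
```

Text containing these bytes therefore needs to be decoded with the Palm OS label rather than the generic Windows-1252 one to keep the suit symbols.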

References

  1. Character Sets, Internet Assigned Numbers Authority (IANA), 2018-12-12
  2. "Historical trends in the usage statistics of character encodings for websites, September 2022". w3techs.com. Retrieved 2022-09-19.
  3. "Frequently Asked Questions". w3techs.com.
  4. "Usage Survey of Character Encodings broken down by Ranking". w3techs.com. Retrieved 2022-09-29.
  5. "Encoding". WHATWG. 27 January 2015. sec. 5.2 Names and labels. Archived from the original on 4 February 2015. Retrieved 4 February 2015.
  6. "Distribution of Character Encodings among websites that use .de". w3techs.com. Retrieved 2022-10-23.
  7. "Distribution of Character Encodings among websites that use German". w3techs.com. Retrieved 2022-10-23.
  8. Wissink, Cathy (5 April 2002). "Unicode and Windows XP" (PDF). Microsoft. p. 1. Archived (PDF) from the original on 4 February 2015. Retrieved 4 February 2015.
  9. "Code page 1252 information document". Archived from the original on 2016-03-03.
  10. "CCSID 1252 information document". Archived from the original on 2016-03-26.
  11. "CCSID 5348 information document". Archived from the original on 2014-11-29.
  12. "Database Client Installation Guide". Oracle. Retrieved 2021-02-14.
  13. "Unicode mappings of Windows-1252 with 'Best Fit'". Unicode. Archived from the original on 4 February 2015. Retrieved 4 February 2015.
  14. Code Page CPGID 01252 (pdf) (PDF), IBM
  15. Code Page CPGID 01252 (txt), IBM
  16. International Components for Unicode (ICU), ibm-1252_P100-2000.ucm, 2002-12-03
  17. International Components for Unicode (ICU), ibm-5348_P100-1997.ucm, 2002-12-03
  18. "Code page 1004 information document". Archived from the original on 2015-06-25.
  19. "CCSID 1004 information document". Archived from the original on 2016-03-26.
  20. "Code Page 01004" (PDF). IBM. Archived from the original (PDF) on 2015-07-08. (version based on Windows 3.1 version of Windows-1252)
  21. Code Page CPGID 01004 (pdf) (PDF), IBM
  22. Code Page CPGID 01004 (txt), IBM
  23. Borgendale, Ken (2001). "Codepage 1004 - Windows Extended". OS/2 codepages by number. Archived from the original on 2018-05-13. Retrieved 2018-05-13. (version based on current version of Windows-1252)
  24. Storaasli, Olaf (1996). "Performance of the NASA equation solvers on computational mechanics applications" (PDF). Performance of NASA Equation Solvers on Computational Mechanics Applications. NASA. doi:10.2514/6.1996-1505. S2CID 15711051. Archived from the original (PDF) on 2019-05-03.
  25. "codecs — Codec registry and base classes (§ Text Encodings)". The Python Standard Library—Python 3.9.4 Documentation. Python Software Foundation.
  26. Mullender, Sjoerd (9 December 2021). "Python Character Mapping Codec for Palm OS 3.5". CPython source tree. Python Software Foundation.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.