目录

名称

Encode::Byte - 单字节编码

概要

use Encode qw/encode decode/; 
$greek = encode("iso-8859-7", $utf8);  # loads Encode::Byte implicitly
$utf8  = decode("iso-8859-7", $greek); # ditto

摘要

此模块实现了各种单字节编码。对于大多数情况,它使用 \x80-\xff(上半部分)来映射非 ASCII 字符。支持的编码如下。

Canonical      Alias		                      Description
--------------------------------------------------------------------
# ISO 8859 series
(iso-8859-1	is in built-in)
iso-8859-2	latin2					     [ISO]
iso-8859-3	latin3					     [ISO]
iso-8859-4	latin4					     [ISO]
iso-8859-5						     [ISO]
iso-8859-6						     [ISO]
iso-8859-7						     [ISO]
iso-8859-8						     [ISO]
iso-8859-9	latin5					     [ISO]
iso-8859-10	latin6					     [ISO]
iso-8859-11
(iso-8859-12 is nonexistent)
iso-8859-13   latin7					     [ISO]
iso-8859-14	latin8					     [ISO]
iso-8859-15	latin9					     [ISO]
iso-8859-16	latin10					     [ISO]

# Cyrillic
koi8-f					
koi8-r        cp878					 [RFC1489]
koi8-u						 [RFC2319]

# Vietnamese
viscii

# all cp* are also available as ibm-*, ms-*, and windows-*
# also see L<http://msdn.microsoft.com/en-us/library/aa752010%28VS.85%29.aspx>

cp424  
cp437  
cp737  
cp775  
cp850  
cp852  
cp855  
cp856  
cp857  
cp860  
cp861  
cp862  
cp863  
cp864  
cp865  
cp866  
cp869  
cp874  
cp1006  
cp1250	WinLatin2
cp1251	WinCyrillic
cp1252	WinLatin1
cp1253	WinGreek
cp1254	WinTurkish
cp1255	WinHebrew
cp1256	WinArabic
cp1257	WinBaltic
cp1258	WinVietnamese

# Macintosh
# Also see L<http://developer.apple.com/technotes/tn/tn1150.html>
MacArabic  
MacCentralEurRoman  
MacCroatian  
MacCyrillic  
MacFarsi  
MacGreek  
MacHebrew  
MacIcelandic  
MacRoman  
MacRomanian  
MacRumanian  
MacSami  
MacThai  
MacTurkish  
MacUkrainian  

# More vendor encodings
AdobeStandardEncoding
nextstep
hp-roman8

描述

要详细了解如何使用此模块,请参阅 Encode

另请参见

Encode