Flashnux

GNU/Linux man pages

Livre :
Expressions régulières,
Syntaxe et mise en oeuvre :

ISBN : 978-2-7460-9712-4
EAN : 9782746097124
(Editions ENI)

GNU/Linux

CentOS 4.8

i386

perlcn(1)


PERLCN

PERLCN

NAME
DESCRIPTION
SEE ALSO
AUTHORS

NAME

perlcn − ¼òÌåÖÐÎÄ Perl Ö¸ÄÏ

DESCRIPTION

»¶ÓÀ´µ½ Perl µÄÌìµØ!

´Ó 5.8.0 °æ¿ªÊ¼, Perl ¾ß±¸ÁËÍêÉÆµÄ Unicode (ͳһÂë) Ö§Ô®, Ò²Á¬´øÖ§Ô®ÁËÐí¶àÀ¶¡ÓïϵÒÔÍâµÄ±àÂ뷽ʽ; CJK (ÖÐÈÕº«) ±ãÊÇÆäÖеÄÒ»²¿·Ý. Unicode Êǹú¼ÊÐԵıê×¼, ÊÔͼº- ¸ÇÊÀ½çÉÏËùÓеÄ×Ö·ü: Î÷·½ÊÀ½ç, ¶«·½ÊÀ½ç, ÒÔ¼°Á½Õß¼äµÄÒ»ÇÐ (Ï£À°ÎÄ, ÐðÀüÑÇÎÄ, ÑÇÀ²®ÎÄ, Ï£²®À´ÎÄ, Ó¡¶ÈÎÄ, Ó¡µØ°²ÎÄ, µÈµÈ). ËüÒ²ÈÝÄÉÁ˶àÖÖ×÷ҵϵͳÓëƽ̨ (Èç PC ¼°Âó½ðËþ).

Perl ±¾ÉíÒÔ Unicode ½øÐвÙ×÷. Õâ±íʾ Perl ÄÚ²¿µÄ×Ö·ü´®Êý¾Ý¿ÉÓà Unicode ±íʾ; Perl µÄº¯Ê½ÓëËã·ü (ÀýÈçÕý¹æ±íʾʽ±È¶Ô) Ò²ÄÜ¶Ô Unicode ½øÐвÙ×÷. ÔÚÊäÈë¼°Êä³öʱ, ΪÁË´¦ÀíÒÔ Unicode ֮ǰµÄ±àÂ뷽ʽ´æ·ÅµÄÊý¾Ý, Perl ÌṩÁË Encode Õâ¸öÄ£¿é, ¿ÉÒÔÈÃÄãÇáÒ׵ضÁÈ¡¼°Ð´Èë¾ÉÓеıàÂëÊý¾Ý.

Encode ÑÓÉìÄ£¿éÖ§Ô®ÏÂÁмòÌåÖÐÎĵıàÂ뷽ʽ (’gb2312’ ±íʾ ’euc−cn’):

    euc-cn      Unix ÑÓÉì×Ö·ü¼¯, Ò²¾ÍÊÇË׳ƵĹú±êÂë
    gb2312-raw  δ¾´¦ÀíµÄ (µÍ±ÈÌØ) GB2312 ×Ö·ü±í
    gb12345     δ¾´¦ÀíµÄÖйúÓ÷±ÌåÖÐÎıàÂë
    iso-ir-165  GB2312 + GB6345 + GB8565 + ÐÂÔö×Ö·ü
    cp936       ×ÖÂëÒ³ 936, Ò²¿ÉÒÔÓà ’GBK’ (À©³ä¹ú±êÂë) Ö¸Ã÷
    hz          7 ±ÈÌØÒݳöʽ GB2312 ±àÂë

¾ÙÀýÀ´Ëµ, ½« EUC-CN ±àÂëµÄµµ°¸×ª³É Unicode, ìóÐè¼üÈëÏÂÁÐÖ¸Áî:

    perl -Mencoding=euc-cn,STDOUT,utf8 -pe1 < file.euc-cn > file.utf8

Perl Ò²ÄÚ¸½ÁË "piconv", Ò»Ö§Í&ecirc;È«ÒÔ Perl д³ÉµÄ×Ö·&uuml;ת»»¹¤¾ß³ÌÐò, Ó÷¨È&ccedil;ÏÂ:

    piconv -f euc-cn -t utf8 < file.euc-cn > file.utf8
    piconv -f utf8 -t euc-cn < file.utf8 > file.euc-cn

ÁíÍ&acirc;, À&uuml;Óà encoding Ä£¿&eacute;, Äã¿ÉÒÔÇáÒ×д³öÒÔ×Ö·&uuml;Ϊµ¥Î»µÄ³ÌÐòÂ&euml;, È&ccedil;ÏÂË&ugrave;ʾ:

    #!/usr/bin/env perl
    # Æ&ocirc;¶¯ euc-cn ×Ö´®½&acirc;Îö; ±&ecirc;×¼Êä³öÈ&euml;¼°±&ecirc;×¼´íÎó¶¼É&egrave;Ϊ euc-cn ±&agrave;Â&euml;
    use encoding ’euc-cn’, STDIN => ’euc-cn’, STDOUT => ’euc-cn’;
    print length("ÂæÍÕ");            #  2 (Ë«ÒýºÅ±íʾ×Ö·&uuml;)
    print length(’ÂæÍÕ’);            #  4 (µ¥ÒýºÅ±íʾ×Ö½Ú)
    print index("×»×»½Ì»å", "»×»½"); # -1 (²»°üº¬´Ë×Ó×Ö·&uuml;´®)
    print index(’×»×»½Ì»å’, ’»×»½’); #  1 (´ÓµÚ¶þ¸ö×Ö½Ú¿ªÊ¼)

ÔÚ×&icirc;ºóÒ»ÁÐÀý×ÓÀ&iuml;, "×»" µÄµÚ¶þ¸ö×Ö½ÚÓ&euml; "×»" µÄµÚÒ»¸ö×Ö½Ú½áºÏ³É EUC-CN Â&euml;µÄ "»×"; "×»" µÄµÚ¶þ¸ö×Ö½ÚÔòÓ&euml; "½Ì" µÄµÚÒ»¸ö×Ö½Ú½áºÏ³É "»½". Õ&acirc;½&acirc;¾öÁËÒÔÇ° EUC-CN Â&euml;±È¶Ô´¦ÀíÉϳ£¼&uuml;µÄÎÊÌ&acirc;.

¶&icirc;Í&acirc;µÄÖÐÎı&agrave;Â&euml;

È&ccedil;¹&uuml;Ð&egrave;Òª¸ü¶&agrave;µÄÖÐÎı&agrave;Â&euml;, ¿ÉÒÔ´Ó CPAN (<http://www.cpan.org/>) ÏÂÔØ Encode::HanExtra Ä£¿&eacute;. ËüÄ¿Ç°ÌṩÏÂÁб&agrave;Â&euml;·½Ê½:

    gb18030     À©³ä¹ýµÄ¹ú±&ecirc;Â&euml;, °üº¬·±ÌåÖÐÎÄ

ÁíÍ&acirc;, Encode::HanConvert Ä£¿&eacute;ÔòÌṩÁ˼ò·±×ª»»ÓõÄÁ½ÖÖ±&agrave;Â&euml;:

    big5-simp   Big5 ·±ÌåÖÐÎÄÓ&euml; Unicode ¼òÌåÖÐÎÄ»¥×ª
    gbk-trad    GBK ¼òÌåÖÐÎÄÓ&euml; Unicode ·±ÌåÖÐÎÄ»¥×ª

È&ocirc;Ï&euml;ÔÚ GBK Ó&euml; Big5 Ö®¼ä»¥×ª, Ç&euml;²Î¿¼¸ÃÄ£¿&eacute;ÄÚ¸½µÄ b2g.pl Ó&euml; g2b.pl Á½Ö§³ÌÐò, »òÔÚ³ÌÐòÄÚʹÓÃÏÂÁÐд·¨:

    use Encode::HanConvert;
    $euc_cn = big5_to_gb($big5); # ´Ó Big5 תΪ GBK
    $big5 = gb_to_big5($euc_cn); # ´Ó GBK תΪ Big5

½øÒ»²½µÄÐÅÏ¢

Ç&euml;²Î¿¼ Perl ÄÚ¸½µÄ´óÁ¿ËµÃ÷Îļþ (²»ÐÒÈ«ÊÇÓÃÓ¢ÎÄдµÄ), À´Ñ§Ï°¸ü¶&agrave;¹ØÓÚ Perl µÄ֪ʶ, ÒÔ¼° Unicode µÄʹÓ÷½Ê½. ²»¹ý, Í&acirc;²¿µÄ×ÊÔ´Ï&agrave;µ±·á¸»:

Ìṩ Perl ×ÊÔ´µÄÍøÖ·
<http://www.perl.com/>

Perl µÄÊ×Ò³ (ÓÉÅ·À³Àñ¹«Ë¾Î¬»¤)

<http://www.cpan.org/>

Perl ×ۺϵä²ØÍø (Comprehensive Perl Archive Network)

<http://lists.perl.org/>

Perl ÓʵÝÂÛ̳һÀÀ

ѧϰ Perl µÄÍøÖ·
<http://www.oreilly.com.cn/html/perl.html>

¼òÌåÖÐÎÄ°æµÄÅ·À³Àñ Perl Ê&eacute;½å

Perl ʹÓÃÕß¼¯»á
<http://www.pm.org/groups/asia.shtml#China>

Öйú Perl Íƹã×&eacute;Ò»ÀÀ

Unicode Ï&agrave;¹ØÍøÖ·
<http://www.unicode.org/>

Unicode ѧÊõѧ»á (Unicode ±&ecirc;×¼µÄÖƶ¨Õß)

<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html>

Unix/Linux É쵀 UTF−8 ¼° Unicode ´ð¿ÍÎÊ

SEE ALSO

Encode, Encode::CN, encoding, perluniintro, perlunicode

AUTHORS

Jarkko Hietaniemi <jhi@iki.fi>

Autrijus Tang (ÌÆ×Úºº) <autrijus@autrijus.org>



perlcn(1)