Packages not in Groups

perl-Unicode-Map8: Unicode-Map8 - Mapping table between 8-bit chars and Unicode

Name:perl-Unicode-Map8 Vendor:gaas@cpan.org
Version:0.12 License:Artistic
Release:1 URL:http://www.cpan.org
Summary
The *Unicode::Map8* class implement efficient mapping tables between 8-bit character sets and 16 bit character sets like Unicode. The tables are efficient both in terms of space allocated and translation speed. The 16-bit strings is assumed to use network byte order. The following methods are available: =over 4 =item $m = Unicode::Map8->new( [$charset] ) The object constructor creates new instances of the Unicode::Map8 class. I takes an optional argument that specify then name of a 8-bit character set to initialize mappings from. The argument can also be a the name of a mapping file. If the charset/file can not be located, then the constructor returns *undef*. If you omit the argument, then an empty mapping table is constructed. You must then add mapping pairs to it using the addpair() method described below. =item $m->addpair( $u8, $u16 ); Adds a new mapping pair to the mapping object. It takes two arguments. The first is the code value in the 8-bit character set and the second is the corresponding code value in the 16-bit character set. The same codes can be used multiple times (but using the same pair has no effect). The first definition for a code is the one that is used. Consider the following example: $m->addpair(0x20, 0x0020); $m->addpair(0x20, 0x00A0); $m->addpair(0xA0, 0x00A0); It means that the character 0x20 and 0xA0 in the 8-bit charset maps to themselves in the 16-bit set, but in the 16-bit character set 0x0A0 maps to 0x20. =item $m->default_to8( $u8 ) Set the code of the default character to use when mapping from 16-bit to 8-bit strings. If there is no mapping pair defined for a character then this default is substituted by to8() and recode8(). =item $m->default_to16( $u16 ) Set the code of the default character to use when mapping from 8-bit to 16-bit strings. If there is no mapping pair defined for a character then this default is used by to16(), tou() and recode8(). =item $m->nostrict; All undefined mappings are replaced with the identity mapping. Undefined character are normally just removed (or replaced with the default if defined) when converting between character sets. =item $m->to8( $ustr ); Converts a 16-bit character string to the corresponding string in the 8-bit character set. =item $m->to16( $str ); Converts a 8-bit character string to the corresponding string in the 16-bit character set. =item $m->tou( $str ); Same an to16() but return a Unicode::String object instead of a plain UCS2 string. =item $m->recode8($m2, $str); Map the string $str from one 8-bit character set ($m) to another one ($m2). Since we assume we know the mappings towards the common 16-bit encoding we can use this to convert between any of the 8-bit character sets. =item $m->to_char16( $u8 ) Maps a single 8-bit character code to an 16-bit code. If the 8-bit character is unmapped then the constant NOCHAR is returned. The default is not used and the callback method is not invoked. =item $m->to_char8( $u16 ) Maps a single 16-bit character code to an 8-bit code. If the 16-bit character is unmapped then the constant NOCHAR is returned. The default is not used and the callback method is not invoked. =back The following callback methods are available. You can override these methods by creating a subclass of Unicode::Map8. =over 4 =item $m->unmapped_to8 When mapping to 8-bit character string and there is no mapping defined (and no default either), then this method is called as the last resort. It is called with a single integer argument which is the code of the unmapped 16-bit character. It is expected to return a string that will be incorporated in the 8-bit string. The default version of this method always returns an empty string. Example: package MyMapper; @ISA=qw(Unicode::Map8); sub unmapped_to8 { my($self, $code) = @_; require Unicode::CharName; "<" . Unicode::CharName::uname($code) . ">"; } =item $m->unmapped_to16 Likewise when mapping to 16-bit character string and no mapping is defined then this method is called. It should return a 16-bit string with the bytes in network byte order. The default version of this method always returns an empty string. =back

Arch: i386

Download:perl-Unicode-Map8-0.12-1.i386.rpm
Build Date:Thu Sep 14 17:06:59 2006
Packager:Arix International <cpan2rpm{%}arix{*}com>
Size:648 KiB

Changelog

* Thu Sep 14 21:00:00 2006 cmc@vosill.math.hmc.edu
- Initial build.

Listing created by RepoView-0.5.2