Thanks Jeff.

Cheers,
Parag

On Mon, Jan 4, 2010 at 7:41 AM, Jeff Peng <jeffp...@netzero.net> wrote:

> Parag Kalra:
>
>> Hmmm -
>> http://search.cpan.org/~dankogai/Encode-2.39/lib/Encode/Guess.pm
>>
>> It says right at the bottom that the method below won't work to guess
>> the encoding. :(
>
> Encode::Guess may work, but not reliably. Because the code ranges of
> some encodings overlap (for example, gb2312 and gbk), you can't determine
> the encoding of a short string just by guessing. For large texts it may
> work correctly.
>
> Here is another way to guess (not mine) that you can refer to:
>
> use Encode;
> use LWP::Simple qw(get);
> use strict;
>
> my $str = get "http://www.sina.com.cn";
>
> # A CHECK value of 1 (Encode::FB_CROAK) makes decode die on malformed input.
> eval {my $str2 = $str; Encode::decode("gbk", $str2, 1)};
> print "not gbk: $@\n" if $@;
>
> eval {my $str2 = $str; Encode::decode("utf8", $str2, 1)};
> print "not utf8: $@\n" if $@;
>
> eval {my $str2 = $str; Encode::decode("big5", $str2, 1)};
> print "not big5: $@\n" if $@;
>
> HTH.
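
For reference, here is a minimal sketch of the trial-decoding idea quoted above, wrapped in a helper so it can be tried without fetching a page. The helper name `plausible_encodings` and the sample strings are assumptions made for illustration; they are not part of Jeff's code or of the Encode module. It only shows which candidates decode without error, which is not the same as identifying the real encoding.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper (not part of Encode): return the candidate
# encodings under which $bytes decodes without error.
sub plausible_encodings {
    my ($bytes, @candidates) = @_;
    my @ok;
    for my $enc (@candidates) {
        # Work on a copy: with a CHECK value set, decode may modify its input.
        my $copy = $bytes;
        # FB_CROAK makes decode die on malformed input; eval traps that.
        my $decoded = eval { decode($enc, $copy, Encode::FB_CROAK) };
        push @ok, $enc if defined $decoded;
    }
    return @ok;
}

# A short, plain-ASCII string is valid in every candidate, so trial
# decoding cannot tell them apart.
my @fits = plausible_encodings("hello", qw(UTF-8 gbk big5));
print "'hello' decodes cleanly as: @fits\n";

# A lone 0xFF byte is never valid UTF-8, so that candidate is ruled out.
my @none = plausible_encodings("\xFF", qw(UTF-8));
print "0xFF as UTF-8: ", (@none ? "accepted" : "rejected"), "\n";
```

The first call illustrates the caveat about short strings: several encodings accept the same bytes, so a clean decode only rules candidates out and never confirms one.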