Hi, James

I'm Japanese.

> Any other suggestions or ways to correctly sort Japanese Words?
Though I'm not well versed in ordering Japanese words,
my friend told me the standards named "JIS X 4061:1996".

That's the standards using for Japanese dictionary(not python) or
Japanese book index.
The ja.wikipedia has an overview of that if you can read Japanese.
http://ja.wikipedia.org/wiki/日本語文字列照合順番

Also, you can buy the specification of the standards, but it seems
Japanese only.
http://www.webstore.jsa.or.jp/webstore/Com/FlowControl.jsp?lang=en&bunsyoId=JIS+X+4061%3A1996&dantaiCd=JIS&status=1&pageNo=0

I found the implementation with Perl. Maybe, there is no Python implementation.
http://search.cpan.org/~sadahiro/Lingua-JA-Sort-JIS-0.05/JIS.pod

I hope this helps.

thanks,
Tetsuya


On Thu, Jan 6, 2011 at 7:31 PM, James Hancock <jlhanc...@gmail.com> wrote:
> Wow, I have no idea what you just said... but, I think I agree.
>
> If I am right, what you are saying is, because the same kanji has multiple
> readings and the words are sorted by reading and not the kanji itself, that
> dictionary is going to be impossibly difficult to make.
>
> quick example, skip if you want to
> ------------------------------------------------->
> 大学 is read "daigaku" and means college
> 大き is read "Ōki" and means big.
>
>
> The 大 is read "dai" in one and "O" in the other. Giving the character a
> priority would not make it sort the right way because the order desired is
> by the reading, and the reading is all about the context of the characters.
>
> giving the character a value would make it be:
>
> 大き (Oki)
> 大学 (Daigaku
> 最高 (Saiko - means the best, because Django is the best!)
>
> When really, alphabetically it is:
>
> 大き (Oki)
> 最高 (Saiko)
> 大学 (Daigaku)
>
> <----------------------------------
> End of Example:
>
>
> Man, the Japanese where not thinking about programming when they made their
> language.
>
> Any other suggestions or ways to correctly sort Japanese Words?
>
> Cheers,
> James Hancock
>
> On Thu, Jan 6, 2011 at 5:38 PM, Masklinn <maskl...@masklinn.net> wrote:
>>
>> On 2011-01-06, at 07:23 , Sam Walters wrote:
>> > Hi,
>> > Personally I would map the priority of every character in a dict and
>> > pass this to sorted
>>
>> Given Japanese is not an alphabetical language and mixes syllabic and
>> logographic scripts (the logographic system having a few thousand
>> graphemes), I doubt this kind of trivial ideas is going to work correctly
>> (it only works correctly for simple alphabetical engines, even diacritics
>> are going to cause an explosion in the number of cases)
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "Django users" group.
>> To post to this group, send email to django-us...@googlegroups.com.
>> To unsubscribe from this group, send email to
>> django-users+unsubscr...@googlegroups.com.
>> For more options, visit this group at
>> http://groups.google.com/group/django-users?hl=en.
>>
>
> --
> You received this message because you are subscribed to the Google Groups
> "Django users" group.
> To post to this group, send email to django-us...@googlegroups.com.
> To unsubscribe from this group, send email to
> django-users+unsubscr...@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/django-users?hl=en.
>

-- 
You received this message because you are subscribed to the Google Groups 
"Django users" group.
To post to this group, send email to django-us...@googlegroups.com.
To unsubscribe from this group, send email to 
django-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/django-users?hl=en.

Reply via email to