Hi, James I'm Japanese.
> Any other suggestions or ways to correctly sort Japanese Words? Though I'm not well versed in ordering Japanese words, my friend told me the standards named "JIS X 4061:1996". That's the standards using for Japanese dictionary(not python) or Japanese book index. The ja.wikipedia has an overview of that if you can read Japanese. http://ja.wikipedia.org/wiki/日本語文字列照合順番 Also, you can buy the specification of the standards, but it seems Japanese only. http://www.webstore.jsa.or.jp/webstore/Com/FlowControl.jsp?lang=en&bunsyoId=JIS+X+4061%3A1996&dantaiCd=JIS&status=1&pageNo=0 I found the implementation with Perl. Maybe, there is no Python implementation. http://search.cpan.org/~sadahiro/Lingua-JA-Sort-JIS-0.05/JIS.pod I hope this helps. thanks, Tetsuya On Thu, Jan 6, 2011 at 7:31 PM, James Hancock <jlhanc...@gmail.com> wrote: > Wow, I have no idea what you just said... but, I think I agree. > > If I am right, what you are saying is, because the same kanji has multiple > readings and the words are sorted by reading and not the kanji itself, that > dictionary is going to be impossibly difficult to make. > > quick example, skip if you want to > -------------------------------------------------> > 大学 is read "daigaku" and means college > 大き is read "Ōki" and means big. > > > The 大 is read "dai" in one and "O" in the other. Giving the character a > priority would not make it sort the right way because the order desired is > by the reading, and the reading is all about the context of the characters. > > giving the character a value would make it be: > > 大き (Oki) > 大学 (Daigaku > 最高 (Saiko - means the best, because Django is the best!) > > When really, alphabetically it is: > > 大き (Oki) > 最高 (Saiko) > 大学 (Daigaku) > > <---------------------------------- > End of Example: > > > Man, the Japanese where not thinking about programming when they made their > language. > > Any other suggestions or ways to correctly sort Japanese Words? > > Cheers, > James Hancock > > On Thu, Jan 6, 2011 at 5:38 PM, Masklinn <maskl...@masklinn.net> wrote: >> >> On 2011-01-06, at 07:23 , Sam Walters wrote: >> > Hi, >> > Personally I would map the priority of every character in a dict and >> > pass this to sorted >> >> Given Japanese is not an alphabetical language and mixes syllabic and >> logographic scripts (the logographic system having a few thousand >> graphemes), I doubt this kind of trivial ideas is going to work correctly >> (it only works correctly for simple alphabetical engines, even diacritics >> are going to cause an explosion in the number of cases) >> >> -- >> You received this message because you are subscribed to the Google Groups >> "Django users" group. >> To post to this group, send email to django-us...@googlegroups.com. >> To unsubscribe from this group, send email to >> django-users+unsubscr...@googlegroups.com. >> For more options, visit this group at >> http://groups.google.com/group/django-users?hl=en. >> > > -- > You received this message because you are subscribed to the Google Groups > "Django users" group. > To post to this group, send email to django-us...@googlegroups.com. > To unsubscribe from this group, send email to > django-users+unsubscr...@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/django-users?hl=en. > -- You received this message because you are subscribed to the Google Groups "Django users" group. To post to this group, send email to django-us...@googlegroups.com. To unsubscribe from this group, send email to django-users+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/django-users?hl=en.