Hi, From: [EMAIL PROTECTED] (Craig Small) Subject: Status of new search engine Date: Tue, 17 Dec 2002 19:30:54 +1100
> Hello, > After that last request I decided it was time to get serious about the > searching and have now got it going, sort of. Thank you for your efforts. > - This one is using UTF-8, it should mean the charsets that were not > supported before are now, including 2byte ones. For a Japanese speaker like me, this IS everything. I am really looking forward this will be available. I tested the new page. 1. I searched "kubota", which is my name, in "any" languages. There are 150 results. The second one is Japanese page, http://www.debian.org/devel/website/translation_coordinators.ja.html However, the result-showing page displays the title and so on brokenly. I found the search page http://search.debian.org/new/search.cgi have the following line: <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> which is apparently wrong. The page should be utf-8. Of course, the result items must be converted into UTF-8. (Since the search result page has to display various languages in one page, the encoding must be UTF-8.) 2. I searched "Linux" in Language: "Any". It results in 112950 hits. However, in any other languages such as "English", "German", and "Japanese", the result was zero. 3. I searched a Japanese word which means "security" in Language: "Any". The result was zero. I checked the URL of the result page. It was: http://search.debian.org/new/search.cgi?q=%A5%BB%A5%AD%A5%E5%A5%EA%A5%C6%A5%A3&ps=10&o=0&m=and&lang= The variable "q" seems to be the word in EUC-JP encoding. (I used a text browser "w3m" in Debian Sid, ja_JP.eucJP locale). On the other hand, when I used Microsoft Windows and Internet Explorer, the URL was: http://search.debian.org/new/search.cgi?q=%26%2312475%3B%26%2312461%3B%26%2312517%3B%26%2312522%3B%26%2312486%3B%26%2312451%3B&ps=10&o=0&m=and&lang= I don't know the value of the variable "q". Thus, it might be difficult to handle various international input from webforms. However, I believe it is possible because Google does it well. --- Tomohiro KUBOTA <[EMAIL PROTECTED]> http://www.debian.or.jp/~kubota/