Christian Perrier wrote:
> Your review should be sent as an answer to this mail.

> -Description: WWW search system for an intranet or small internet
> +Description: web search and indexation system - binaries

Annoyingly, "indexation" exists only as a technical term in accounting.
And the indexing has to happen first.  So:

 Description: web indexing and searching system - binaries

> The ht://Dig system is a complete World Wide Web indexing and searching
                                    ^^^^^^^^^^
Take out the misleading WW part.  But that means it's repeating "web
search and indexing system" in both synopsis and long description
("system" in particular occurs far too often); one or the other should
perhaps say "web search engine".

> system for a small domain or intranet. This system is not meant to
> + replace the need for powerful internet-wide search systems.

Come to think of it, we wouldn't want to "replace the need" for Google
(we would satisfy the need by replacing Google).  Say:

 The ht://Dig system is a complete web search engine for a small domain
 or intranet. It is not meant to replace the major Internet-wide search
 engines; instead it is meant [...]

(It's strangely hard to phrase this without giving an example!)

> + The ht://Dig search engine can span several web servers at a site.
>
> We're in the 21st century, now..:-)

Even further, today.  But I'd merge this into the first Feature:

  - intranet searching, spanning multiple local web servers;

> -  * Keywords can be added to HTML documents
> +  - HTML documents keywords support;

That doesn't quite work, and it's a bit obscure anyway.  It's talking
about <META NAME="htdig-keywords" CONTENT="foo"> and other specially
added tags.  I was thinking:

  - keyword tagging of HTML documents;

-- 
JBR  with qualifications in linguistics, experience as a Debian
     sysadmin, and probably no clue about this particular package

--- ../htdig.old/debian/templates 2007-12-18 08:53:28.000000000 +0000
+++ debian/templates 2008-01-01 16:05:32.000000000 +0000
@@ -1,22 +1,22 @@
 Template: htdig/generate-databases
 Type: boolean
 Default: true
-_Description: Generate endings database now?
+_Description: Generate ht://Dig endings database now?
  The ht://Dig search engine requires an endings database which has to be
  generated before the first start. Generating the database will take a
  short while. It can be done either now or later by calling the
- '/usr/sbin/htdigconfig' script. Should the database be generated now
- rather than later on?
+ '/usr/sbin/htdigconfig' script.
 
 Template: htdig/run-rundig
 Type: boolean
 Default: false
-_Description: Execute the rundig script daily?
- rundig is a program that obtains on-line content and creates a searchable
- database for the ht://Dig search engine. It downloads and indexes HTML
- from the web, as defined in /etc/htdig/htdig.conf. Say Yes if you wish to
- run this program daily. Say No if you don't plan to use htdig in this
- way (perhaps htdig is being used by a third party, for instance KDE, to
- perform local indexing). If in doubt, say No. It is the safest choice
- unless you know what you're doing.
+_Description: Schedule a daily execution of the 'rundig' script?
+ On-line content must be indexed by the 'rundig' script before
+ ht://Dig can be used to search data. That script indexes the
+ web content defined in /etc/htdig/htdig.conf.
+ .
+ If you choose this option, a daily run of the script will be scheduled.
+ You may choose to not use this option if ht://Dig is used another
+ way (for instance by KDE, to
+ perform local indexing). If in doubt, do not choose this option.
--- ../htdig.old/debian/control 2007-12-18 08:53:28.000000000 +0000
+++ debian/control 2008-01-01 16:14:58.000000000 +0000
@@ -10,55 +10,35 @@
 Depends: ${shlibs:Depends}, perl, lockfile-progs, debconf (>= 1.2.9) | debconf-2.0
 Suggests: htdig-doc, wwwoffle | httpd
 Conflicts: libmifluz0
-Description: WWW search system for an intranet or small internet
- The ht://Dig system is a complete World Wide Web indexing and searching
- system for a small domain or intranet. This system is not meant to
- replace the need for powerful internet-wide search systems like Lycos,
- Google, or Yahoo!. Instead it is meant to cover the search needs of a
- single company, campus, or even a particular subsection of a website.
- .
- As opposed to some WAIS-based or web-server based search engines,
- ht://Dig can span several web servers at a site. The type of these
- different web servers doesn't matter as long as they understand the
- HTTP 1.0 protocol.
+Description: web indexing and searching system - binaries
+ The ht://Dig system is a complete web search engine for a small domain
+ or intranet. It is not meant to replace the major Internet-wide search
+ engines; instead it is meant to cover the search needs of a single
+ company, campus, or even a particular subsection of a website.
  .
  Features:
-  * Intranet searching
-  * It is free
-  * Robot exclusion is supported
-  * Boolean expression searching
-  * Configurable search results
-  * Fuzzy searching (different algorithms supported)
-  * Searching of HTML and text files
-  * Keywords can be added to HTML documents
-  * Email notification of expired documents
-  * A Protected server can be indexed
-  * Searches on subsections of the database
-  * Full source code included
-  * The depth of the search can be limited
-  * Full support for the ISO-Latin-1 character set
- .
- Please note that ht://Dig is a resource-hog, with respect to processor usage,
- when indexing.
- .
- Disk space requirements:
- .
-  13.000 documents indexed: 150MB disk space with a 'wordlist database'
-                            93MB disk space without a 'wordlist'
- .
- Multiplying the number of documents to index by 12.000 comes pretty close
- to the real disk space used.
+  - intranet searching, spanning multiple local web servers;
+  - robot exclusion;
+  - boolean expression searches;
+  - configurable search results;
+  - fuzzy searching (various algorithms supported);
+  - indexing of HTML and text files;
+  - keyword tagging of HTML documents;
+  - email notification of expired documents;
+  - indexing of protected servers;
+  - searches on subsections of the database;
+  - limitation of search depth;
+  - ISO-Latin-1 character set support.
 
 Package: htdig-doc
 Architecture: all
 Depends: perl5
 Suggests: htdig
 Section: doc
-Description: Documentation for the htdig package
- The ht://Dig system is a complete World Wide Web indexing and searching
- system for a small domain or intranet. This system is not meant to
- replace the need for powerful internet-wide search systems like Lycos,
- Google, or Yahoo!. Instead it is meant to cover the search needs of a
- single company, campus, or even a particular subsection of a website.
+Description: web indexing and searching system - documentation
+ The ht://Dig system is a complete web search engine for a small domain
+ or intranet. It is not meant to replace the major Internet-wide search
+ engines; instead it is meant to cover the search needs of a single
+ company, campus, or even a particular subsection of a website.
  .
- This package contains HTML documentation and examples.
+ This package provides the HTML documentation and examples.

Template: htdig/generate-databases
Type: boolean
Default: true
_Description: Generate ht://Dig endings database now?
 The ht://Dig search engine requires an endings database which has to be
 generated before the first start. Generating the database will take a
 short while. It can be done either now or later by calling the
 '/usr/sbin/htdigconfig' script.

Template: htdig/run-rundig
Type: boolean
Default: false
_Description: Schedule a daily execution of the 'rundig' script?
 On-line content must be indexed by the 'rundig' script before
 ht://Dig can be used to search data. That script indexes the
 web content defined in /etc/htdig/htdig.conf.
 .
 If you choose this option, a daily run of the script will be scheduled.
 You may choose to not use this option if ht://Dig is used another
 way (for instance by KDE, to
 perform local indexing). If in doubt, do not choose this option.

Source: htdig
Section: web
Priority: optional
Maintainer: Debian QA Group <[EMAIL PROTECTED]>
Build-Depends: debhelper (>= 5), zlib1g-dev, flex, bison
Standards-Version: 3.7.2

Package: htdig
Architecture: any
Depends: ${shlibs:Depends}, perl, lockfile-progs, debconf (>= 1.2.9) | debconf-2.0
Suggests: htdig-doc, wwwoffle | httpd
Conflicts: libmifluz0
Description: web indexing and searching system - binaries
 The ht://Dig system is a complete web search engine for a small domain
 or intranet. It is not meant to replace the major Internet-wide search
 engines; instead it is meant to cover the search needs of a single
 company, campus, or even a particular subsection of a website.
 .
 Features:
  - intranet searching, spanning multiple local web servers;
  - robot exclusion;
  - boolean expression searches;
  - configurable search results;
  - fuzzy searching (various algorithms supported);
  - indexing of HTML and text files;
  - keyword tagging of HTML documents;
  - email notification of expired documents;
  - indexing of protected servers;
  - searches on subsections of the database;
  - limitation of search depth;
  - ISO-Latin-1 character set support.

Package: htdig-doc
Architecture: all
Depends: perl5
Suggests: htdig
Section: doc
Description: web indexing and searching system - documentation
 The ht://Dig system is a complete web search engine for a small domain
 or intranet. It is not meant to replace the major Internet-wide search
 engines; instead it is meant to cover the search needs of a single
 company, campus, or even a particular subsection of a website.
 .
 This package provides the HTML documentation and examples.