Hi,

I would like to include information about packages residing in the new queue in the Blends tasks pages. Because I decided to use UDD as the main source of information for these pages, I would like to inject information about the new queue into UDD first. I was pointed to
http://ftp-master.debian.org/new.822, which provides a good amount of the information I need.

Before I start, I would like to hear your opinion about creating the following two tables and about the means of harvesting the data for the corresponding fields. Perhaps I'm just lacking some knowledge which might simplify things even further:

-- Sources
CREATE TABLE new_sources (
    source text,
    version text,
    maintainer text,
    maintainer_name text,
    maintainer_email text,
    bin text,            -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Binary:"
    changed_by text,     -- Uploader?
    architecture text,
    homepage text,       -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Homepage:"
    vcs_type text,       -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-*:"
    vcs_url text,        -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-*:"
    vcs_browser text,    -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-Browser:"
    distribution text,
    closes int,          -- WNPP bug #
    license text,        -- trying to parse http://ftp-master.debian.org/new/<bin1>_<version>.html#binary-<bin1>-copyright field "License:"
    last_modified timestamp,
    queue text,
    PRIMARY KEY (source, version, distribution)
);

-- Packages
CREATE TABLE new_packages (
    package text,
    version text,
    architecture text,
    maintainer text,
    description text,    -- by parsing http://ftp-master.debian.org/new/<bin>_<version>.html#control field "Description:"
    source text,
    source_version text,
    depends text,
    recommends text,
    suggests text,
    enhances text,
    pre_depends text,
    breaks text,
    replaces text,
    provides text,
    conflicts text,
    installed_size integer,
    homepage text,
    section text,
    long_description text,
    license text,        -- trying to parse http://ftp-master.debian.org/new/<package>_<version>.html#binary-<package>-copyright field "License:"
    distribution text,
    PRIMARY KEY (package, version, architecture, distribution)
);

If there is any better method to obtain the fields above than
parsing HTML pages, I would be really happy if you could enlighten me. Any more comments?

Kind regards

      Andreas.

-- 
http://fam-tille.de
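
For the fields that new.822 itself carries, the harvesting step could be sketched roughly as below. This is only a minimal illustration, not an existing UDD importer: it assumes the file uses the usual RFC822-style layout (blank-line separated stanzas of "Field: value" lines, with indented continuation lines). The field names Source, Version, Maintainer, Changed-By, Distribution and Queue are assumptions about what the file contains, the helper names parse_stanzas and to_source_row are made up for this example, and the sample data is invented.

```python
from email.utils import parseaddr


def parse_stanzas(text):
    """Split RFC822-style text into a list of {field: value} dicts."""
    stanzas, current, last_key = [], {}, None
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current stanza.
            if current:
                stanzas.append(current)
            current, last_key = {}, None
        elif line[0] in " \t" and last_key is not None:
            # Continuation line: append to the previous field's value.
            current[last_key] += "\n" + line.strip()
        elif ":" in line:
            key, _, value = line.partition(":")
            last_key = key.strip()
            current[last_key] = value.strip()
    if current:
        stanzas.append(current)
    return stanzas


def to_source_row(stanza):
    """Map one stanza onto (a subset of) the proposed new_sources columns.

    The field names used here are assumptions about new.822.
    """
    name, email = parseaddr(stanza.get("Maintainer", ""))
    return {
        "source": stanza.get("Source"),
        "version": stanza.get("Version"),
        "maintainer": stanza.get("Maintainer"),
        "maintainer_name": name,
        "maintainer_email": email,
        "changed_by": stanza.get("Changed-By"),
        "distribution": stanza.get("Distribution"),
        "queue": stanza.get("Queue"),
    }


# Invented sample input, only to show the shape of the data.
SAMPLE = """\
Source: examplepkg
Version: 1.0-1
Maintainer: Jane Doe <jane@example.org>
Changed-By: Jane Doe <jane@example.org>
Distribution: unstable
Queue: new

Source: otherpkg
Version: 2.3-1
Maintainer: Example Team <team@example.org>
Distribution: experimental
Queue: new
"""

rows = [to_source_row(s) for s in parse_stanzas(SAMPLE)]
for row in rows:
    print(row["source"], row["version"], row["maintainer_email"])
```

Columns such as homepage, vcs_* and license are left out on purpose, since they would still have to come from the per-package HTML pages or some other source.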