strategy for multiple packages in fewer files (was Re: [file name extension])

Darren Duncan Mon, 07 Jan 2008 17:56:44 -0800

At 8:41 AM -0800 1/7/08, Paul Hodges wrote:

A small tangent that might be relevant -- what's the current convention
for, say, putting several related "packages" in the same file?

I do that frequently in my Perl modules, and so do modules like DBI. I believe that it is a good idea to do some package grouping into common files (though not necessarily having all packages in one file).

Moreover, I believe that the common practice of unconditionally having a separate file for each package is very much a design smell and a bad idea.

My suggested convention, which applies to both Perl 6 and Perl 5, is to group together in one file any packages that conceptually provide a single API which is used as a whole, and to put into different files any packages that conceptually provide separate APIs. Likewise, keep functionality that is always used in separate files from optional extensions. "Optional" here also covers packages that users are highly likely to substitute with alternatives; those are good to keep separate too.

For example, with DBI, you have one main file (DBI.pm) that declares a package for creating a DBMS connection object, and a package for the DBMS handle role, and a package for the statement handle role. Then for each DBI driver, you have 1 file that contains multiple packages, 1 each for implementing a DBMS handle and statement handle, et al. Various utilities used by DBI or DBD modules, such as SQL::Statement, are a separate file again, as while DBI may use it internally, it isn't part of the API of DBI.

For another example, if you had your own implementation of some collection data type, e.g. a graph structure where you would have one package representing a node object and another one representing the whole graph, and users would interact with both directly, then those 2 packages should be stored in the same file.

In fact, this also illustrates circular dependencies between objects (a graph is made of nodes, a node is useless outside a graph), and in cases like these, definitely they should be together.
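As a rough illustration of the graph case, here is a minimal Perl 5 sketch with both packages in one file. The names My::Graph and My::Graph::Node, the file path, and all the methods are made up for this example; it is written as a single runnable script, so the caller's code sits at the bottom under "package main" rather than in a separate program.

```perl
use strict;
use warnings;

# --- hypothetical file: lib/My/Graph.pm (both packages in one file) ---

package My::Graph::Node;
use Scalar::Util qw(weaken refaddr);

sub new {
    my ($class, %args) = @_;
    my $self = bless { graph => $args{graph}, name => $args{name}, edges => [] }, $class;
    weaken $self->{graph};    # the graph owns its nodes, so avoid a reference cycle
    return $self;
}

sub name  { return $_[0]{name} }
sub graph { return $_[0]{graph} }

# Record a directed edge to another node of the same graph.
sub link_to {
    my ($self, $other) = @_;
    die "nodes belong to different graphs\n"
        unless refaddr($self->{graph}) == refaddr($other->{graph});
    push @{ $self->{edges} }, $other->name;
    return $self;
}

sub neighbours { return @{ $_[0]{edges} } }

package My::Graph;

sub new { return bless { nodes => {} }, shift }

# Nodes are only ever created through their graph.
sub add_node {
    my ($self, $name) = @_;
    return $self->{nodes}{$name} = My::Graph::Node->new(graph => $self, name => $name);
}

sub node       { return $_[0]{nodes}{ $_[1] } }
sub node_names { return sort keys %{ $_[0]{nodes} } }

# --- caller's code; with a real module this would begin with "use My::Graph;" ---

package main;

my $graph  = My::Graph->new;
my $first  = $graph->add_node('first');
my $second = $graph->add_node('second');
$first->link_to($second);

print join(', ', $graph->node_names), "\n";    # prints "first, second"
print join(', ', $first->neighbours), "\n";    # prints "second"
```

Since each package refers to the other, splitting them into two files would force either file to load the other before it could be used, which is the kind of bookkeeping that grouping avoids.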

One rule of thumb is to look at your ideal documentation strategy. If you would naturally explain a component of your project such that multiple object types are conceptually joined at the hip, such that you need to understand all of the objects in order to understand one of them (see the graph example), then it makes sense to put the parts of the component in one file, and you can then document them collectively rather than forcing separate documentation for each one, with the documents constantly referring back and forth to each other.

My Muldis DB project (with separate Perl 6 and Perl 5 versions) version 0.6.0 has 13 packages grouped into 3 files:

1. Interface.pm (no deps) has Muldis::DB::Interface, ::Machine, ::Process, ::Var, ::FuncBinding, ::ProcBinding


2. Validator.pm (dep on Intf) has Muldis::DB::Validator

3. Example.pm (dep on Intf) has Muldis::DB::Engine::Example, ::Machine, ::Process, ::Var, ::FuncBinding, ::ProcBinding

Version 0.7.0 (unreleased) has at least 13 packages grouped into this 1 additional file:

1. Value.pm (no deps, used by Example) has Muldis::DB::Engine::Example::Value, ::Universal, ::Scalar, ::Bool, ::Int, ::Rat, ::Blob, ::Text, ::QuasiTuple, ::Tuple, ::QuasiRelation, ::Relation, ::Cat_Order.

2. Further expected files would be like Operator.pm, Storage.pm, Runtime.pm, Compiler.pm; these would depend on Value.pm and be used by Example.pm, and each could potentially have multiple packages, though just 1 each is also possible.

3. Value|Operator.pm (optional user-caused 'require' at runtime) just has the Core data types and operators; for language extensions, they would probably add extra files per extension, such as ::Value::Temporal.pm and ::Operator::Temporal.pm et al.
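To show what an optional, user-caused 'require' at runtime can look like in Perl 5, here is a small sketch. The helper name load_optional and the extension name My::Hypothetical::Temporal are invented for the example and are not part of Muldis DB; List::Util stands in for an extension that happens to be installed.

```perl
use strict;
use warnings;

# Try to load a package by name at runtime.
# Returns 1 if its file was found and compiled, 0 otherwise.
sub load_optional {
    my ($package) = @_;
    (my $file = "$package.pm") =~ s{::}{/}g;    # Foo::Bar -> Foo/Bar.pm
    return eval { require $file; 1 } ? 1 : 0;
}

for my $extension ('List::Util', 'My::Hypothetical::Temporal') {
    printf "%s: %s\n", $extension,
        load_optional($extension) ? 'loaded' : 'not available';
}
```

Only the files that a given program actually asks for get compiled, which is the resource-efficiency argument for keeping optional extensions out of the core file.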

I think that this approach strikes a good balance serving ease of use, ease of maintenance, and resource efficiency.

In p5, I might write a great Foo.pm that loads Foo::Loader.pm and
Foo::Parser.pm and Foo::Object.pm; I'd usually drop them into separate
files and have one load the rest, so I could just use Foo; and get on
with my code, but that does add some maintenance that could be skipped
if loading the one file automatically grabbed all the relevant parts.
Plusses and minuses there.... If the Foo::Widget and Foo::Gadget are only
of use with something in Foo proper, maybe it would be reasonable
to toss them into the same file, though.

<snip>

As for your example, I would probably keep object/loader/parser as 3 files, especially if the loader/parser is likely to be substituted by users for alternatives that use the same object, or there are situations where object would be used but the others wouldn't. Or combine them if they are very simple. But really, you are in a better position to make this call yourself, having more information on your circumstances.
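Here is a minimal sketch of the substitution case, to show where the seam falls. Everything in it is hypothetical: the "key=value" input format, the Foo->load method and its 'parser' option, and the user-supplied My::ColonParser. In real use each marked section would be its own file; they are shown as one script only so that the example runs as-is.

```perl
use strict;
use warnings;

# --- would be Foo/Object.pm: used by every parser ---
package Foo::Object;
sub new { my ($class, %fields) = @_; return bless {%fields}, $class }
sub get { return $_[0]{ $_[1] } }

# --- would be Foo/Parser.pm: the default, replaceable parser ---
package Foo::Parser;
sub new { return bless {}, shift }
sub parse {
    my ($self, $text) = @_;
    my %fields;
    for my $line (split /\n/, $text) {
        my ($key, $value) = split /=/, $line, 2;
        $fields{$key} = $value if defined $value;
    }
    return Foo::Object->new(%fields);
}

# --- would be Foo.pm: ties the parts together ---
package Foo;
sub load {
    my ($class, $text, %options) = @_;
    my $parser = $options{parser} || Foo::Parser->new;    # caller may substitute
    return $parser->parse($text);
}

# --- a user's own alternative parser for "key:value" input ---
package My::ColonParser;
sub new { return bless {}, shift }
sub parse {
    my ($self, $text) = @_;
    my %fields;
    for my $line (split /\n/, $text) {
        my ($key, $value) = split /:/, $line, 2;
        $fields{$key} = $value if defined $value;
    }
    return Foo::Object->new(%fields);
}

package main;

my $default = Foo->load("colour=red\nsize=9");
my $custom  = Foo->load("colour:blue", parser => My::ColonParser->new);
print $default->get('colour'), "\n";    # prints "red"
print $custom->get('colour'), "\n";     # prints "blue"
```

A user who supplies My::ColonParser never needs Foo::Parser compiled at all, which is why a part that is likely to be replaced is a good candidate for its own file.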


-- Darren Duncan

P.S. Does anyone think that the main part of this email may provide a starting point for a general best practices tutorial item or Perl.com article?
