Hi Simon,

On May 14, 2:55 am, Simon J Mudd <sjm...@pobox.com> wrote:
> Hi John,
>
> While you obviously disagree with me, thanks for taking the time to
> reply. I may be looking at the problem the wrong way which is why
> I'm trying to figure out if that's the case and why.


Fair enough.


> john.bollin...@stjude.org (jcbollinger) writes:
> > On May 13, 8:07 am, jcbollinger <john.bollin...@stjude.org> wrote:

> > On the other hand, extlookup() is easy to set up and use, and it is
> > powerful enough for most needs.  If you need a more powerful data
> > lookup function then you would be lucky to find a canned one that
> > meets all your needs.
>
> Perhaps but I don't see it as being that bad, and it seems to be
> the closest thing to a solution I could find at the moment. My current
> recipes (perhaps incorrectly) are heavily customised in a way
> which often is not just "host specific" or "domain" specific. Hence
> the need to override the default settings. Other settings though
> may need this thus requiring this to be more dynamic.


Let's not cast things in terms of "correctness," except insomuch as
whether they reliably produce the desired effect on clients.

Depending on what you're trying to do with Puppet, however, it is
certainly true that some approaches to structuring your manifests will
make them easier to write and maintain than will others.  Personally,
when writing specializations into my manifests, I find it useful to
keep in mind the question of *why* a particular host or group of hosts
is different from others.  I have yet to run into an answer that
didn't fall into one of these categories, or a combination of them:

1) It has a functional role that some other systems do not
2) Its network environment requires differences from some other
systems.
3) It's just special

It has worked well for me so far to model roles via (unparameterized)
classes, assigned to nodes via their node declarations.  That leaves
only one level between "common" and "node-specific" where I might need
to customize data.  (That intermediate level could in principal be
parameterized by network domain, but in my case it is by subnet.)
Most of my nodes do not require per-node customization, so I don't end
up with many data files for extlookup.


> > Moreover, I disagree with several of the opinions and conclusions in
> > your post:
>
> > 1) You write: "The extlookup() functionality only allows [...
> > specifying implicitly ...] where to look for this value."  That is
> > false.  Extlookup does provide for configuration of a standard set of
> > CSV files to search (which can be parameterized by nodes' facts), but
> > the function also has an optional parameter to specify a specific file
> > to be searched first on any given invocation.
>
> Perhaps coming from a database background, I'd like to mirror what
> seems more _natural_ and having values spread around over potentially
> a large number of files seems non-intuitive.


If extlookup use would indeed require you to maintain a large number
of separate files then that might be a good reason to find something
better, but in all likelihood you can avoid that, or else structure it
in a sane way.  Consider also:

When you work with a database, do you normally focus on how it maps
data to the host filesystem?

Given the diverse data parameterization you described, if you created
a database for your configuration data, would you really organize
everything into a single table?  (And what would be its key?)

If you wanted to allow customization of your data with varying scopes
(e.g. supporting both per-node and per-network customization of the
same parameters) then how would you design the DB?


> > 2) You would prefer looking up data via a compound key (config_item,
> > lookup_value) rather than via a simple key (config_item).  

> > [...]  And even a DB performs multi-key queries
> > slower than single-key searches.
>
> Not if they are part of the primary key. That's part of the point.


Well that's a possible answer for you then.  Extlookup performs
lookups based on a single key, and nothing prevents you from using
keys that allow you to flatten your data.  For example, you can
structure your data so that instead of extlookup("interface"), you can
extlookup("interface-$subnet-$is_master").  In other words, the
distinction in your examples between "config_item" and
"lookup_value(s)" is artificial and unnecessary.  There is no reason
why you could not combine them all into a single field in the CSV.


> > 3) You argue that your suggested formulation of extlookup would be
> > "clearer as the configuration is more explicit then the current
> > extlookup() definition."  I think you're missing the point.  It would
> > indeed be clearer from which file the data would come, but the
> > objective of extlookup is to separate the *definition* of the data as
> > much as possible from the *use* of the data.  And I like that.  I
> > prefer that my manifests _not_ specify a bunch of details to every
> > extlookup() call, because that would obscure the details that are
> > important at the manifest level.
>
> ok. This perhaps is a different point of view we have. extlookup()
> looks like a function. As such it's nice if the function given the same
> input provides the same output. Otherwise confusion is easy.


Personally, I don't find it at all surprising when the output of a
function depends on factors other than its arguments, as long as the
dependencies are well documented.  That's common in the programming
and database worlds.  YMMV, of course.

There are things you could do with extlookup() that would contribute
to confusion, such as overriding the lookup precedence variable below
global scope.  You could also write put 'exec { "cd /$mydir; rm -f
*":  }' into one of your manifests.  Avoid these things (even if
you're *sure* $mydir will always be defined).


> As written it's clear if you do extlookup('dns_server') that you are
> getting a "dns server" result. In many of the examples provided
> the expected use case is: check first for the host, then the domain,
> then ....., but that's not specific, _and_ may be important.


When you say it's not "specific," do you mean that you can't tell from
the extlookup() call itself?  The typical use of extlookup() involves
defining the lookup precedence once, globally.  For the exceptional
case where you need to override the lookup order, you specify the
first-resort lookup file explicitly in the function call.  If you do
that, then you always know how the lookup will proceed.  To the extent
that your concern may be premised on the notion that you'll override
the lookup precedence via its controlling variable, I suggest you
construe your point as a good reason to not do that.


> > 4) You seem to want to use an arbitrary-length list of values as your
> > lookup key, but I don't see how that can work while retaining the
> > apparent objective of reducing the number of data files that must be
> > associated with your configuration and retaining the use of CSV.
>
> The .csv files are currently lookup_key,lookup_value. AFAIK there is
> no limit on either parameter's size. I'm only suggesting a third
> grouping/column item_type, and that is unlikely to change the width
> of the .csv files much. My suggestion is the structure would be:
>
> dns_server,host1.example.com,ns1.example.com
> dns_server,host2.example.com,ns1.example.com
> dns_server,all_domains,ns1.maindomain.com


Ah.  I thought you were suggesting a variable number of additional
columns between lookup_key and lookup_value.  Nevertheless, and
contrary to my own (4), I just offered a way to approach the problem
that works with extlookup() as currently written.  That is, put all
key information into the lookup_key field, delimited by some character
other than a comma.  Not only will that allow you to flatten the file
to an arbitrary degree, it will also allow (force) you to explicitly
include all the key parts in your extlookup() calls.


Cheers,

John

-- 
You received this message because you are subscribed to the Google Groups 
"Puppet Users" group.
To post to this group, send email to puppet-users@googlegroups.com.
To unsubscribe from this group, send email to 
puppet-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/puppet-users?hl=en.

Reply via email to