Re: [OT] large db question about no joins

Martin P. Hellwig Thu, 16 Apr 2009 16:15:49 -0700

Daniel Fetchinson wrote:

[off but interesting topic]

<cut>

What would be the corresponding database layout that would scale and I
could get the total number of legs in the zoo or total number of
animals in the zoo without join(s)?

Cheers,
Daniel

[/off but interesting topic]

That all comes down to the keywords, efficiency, robustness andperformance and you can only pick one of them. So which two can yousacrifice? The good news is that it is only theoretical, if you haveyour requirements (i.e. the query with the right results has to returnwithin an acceptable time for the user not to get frustrated) who caresif it is not as fast as it theoretically could be? Especially if thismeans you can sacrifice the theoretically performance for easierdeployment and maintenance.

To get back to the layout question, if I need to have a count of thelegs at any given time, I would like to see the business process thatrequires this operation. Using the business process I could probablynarrow down the scope quite a bit like, it is not necessary to have aprecise count at time *now* but it is good enough to have an exact countwhich is no more then 4 seconds old but the query does need to returnwithin 50 ms.

One solution could be to build a central data warehouse where all infois stored in. Then have satellite db's that does ETL syncing with thecentral one. The layout of the satellites contain an optimised tableversion for the queries you want to throw at it.


If you need to scale it, that is a question of adding another satellite.

Efficiency is right out of the window because you store in essence justmultiple copies of the same data just in another order. However it isrobust (by having multiple copies) and has a predictable performance figure.


hth
--
mph
--
http://mail.python.org/mailman/listinfo/python-list

Re: [OT] large db question about no joins

Reply via email to