At 6:09 PM -0400 6/24/05, Arshavir Grigorian wrote:
Hello list,

I coded a caching system using BerkeleyDB::Hash as the backend. It was working fine until the database file became fairly large (850M). At some point the performance degraded, and the web server processes accessing the database started hanging. Someone suggested locking issues as the cause of the hangups, but the db hung even when accessed from a single script with no other processes touching it.
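For reference, a cache like the one described is typically built by tying a hash directly to the file. A minimal sketch (the path is hypothetical) might look like:

```perl
use strict;
use warnings;
use BerkeleyDB;

# Tie a Perl hash to an on-disk BDB hash file (path is hypothetical).
# With no shared environment, concurrent httpd writers get no locking at all.
tie my %cache, 'BerkeleyDB::Hash',
    -Filename => '/var/cache/app/cache.db',
    -Flags    => DB_CREATE
    or die "cannot open cache.db: $BerkeleyDB::Error";

$cache{some_key} = 'some_value';   # each store goes straight to the file
```

Used this way from many Apache children at once, nothing coordinates the writers, which is exactly the situation that invites hangs and corruption.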

Having used some pretty large (though not quite 850 MB) BDB files, I can tell you my experience: unless you are using the fully transactional model and have lots of disk space to throw at it, I'd now recommend against using BDB for anything that is updated from an httpd process.

The reason has to do with corruption. Even when using the Concurrent Data Store (CDS) model, I found that I was spending a huge amount of time writing code to detect all the possible ways in which DB files can become corrupt when accessed directly by httpd.
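For completeness, CDS only provides its single-writer/multiple-reader locking when every process opens the same shared environment. A sketch of that setup (the environment directory is an assumption):

```perl
use strict;
use warnings;
use BerkeleyDB;

# Concurrent Data Store: locking works only if ALL processes open the
# SAME environment directory (the path here is hypothetical).
my $env = BerkeleyDB::Env->new(
    -Home  => '/var/cache/bdb',
    -Flags => DB_CREATE | DB_INIT_CDB | DB_INIT_MPOOL,
) or die "cannot open environment: $BerkeleyDB::Error";

tie my %cache, 'BerkeleyDB::Hash',
    -Filename => 'cache.db',
    -Env      => $env,
    -Flags    => DB_CREATE
    or die "cannot open cache.db: $BerkeleyDB::Error";
```

Even then, an httpd child killed mid-write can leave stale locks or a half-written page behind, which is the failure mode described above.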

After using BDB for several years, I recently re-coded everything to use replicated MySQL databases. Not only is the Perl code much smaller now, it actually runs more quickly, thanks to better indexing and the ability to push calculations into SQL.
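The replacement amounts to routing every read and write through the MySQL daemon via DBI. A minimal sketch, where the DSN, credentials, and table schema are all hypothetical:

```perl
use strict;
use warnings;
use DBI;

# All access goes through the mysqld daemon; no data file is touched
# directly by httpd. DSN, credentials, and schema are placeholders.
my $dbh = DBI->connect(
    'dbi:mysql:database=cache;host=localhost',
    'cache_user', 'secret',
    { RaiseError => 1, AutoCommit => 1 },
);

sub cache_get {
    my ($key) = @_;
    my ($value) = $dbh->selectrow_array(
        'SELECT v FROM cache WHERE k = ?', undef, $key);
    return $value;
}

sub cache_set {
    my ($key, $value) = @_;
    $dbh->do('REPLACE INTO cache (k, v) VALUES (?, ?)',
             undef, $key, $value);
}
```

A dropped httpd child here costs at most one aborted statement; the daemon's own files stay consistent.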

I have no doubt that BDB works very well for some things, but in my opinion an httpd process--with many concurrent threads that can be dropped unexpectedly at pretty much any time--is not one of them. (MySQL actually offers BDB as one of its optional storage engines, but there the corruption problem is avoided by having a single, central DB daemon do all of the reads and writes to the files.)

--
Dan Wilga                                         [EMAIL PROTECTED]
Web Administrator                             http://www.mtholyoke.edu
Mount Holyoke College                                Tel: 413-538-3027
South Hadley, MA  01075            "Who left the cake out in the rain?"
