Re: [Ocfs2-users] Tracking down hangs

Andrew Robert Nicols Fri, 04 Jun 2010 07:23:34 -0700

Hi Sunil,

Thanks for the reply.

On Thu, Jun 03, 2010 at 02:18:53PM -0700, Sunil Mushran wrote:
> If scanlocks is clean, means it is not a dlm issue.

If the hang is only short, could it be that we're just missing the relevant
busy locks by running scanlocks too late?

> Have you tried mounting with data=writeback? With drbd,
> a 1G write becomes a 2G write. With ordered mode, a journal
> checkpoint, which is done when relinquishing a write lock, will
> wait on the data flush. That could be the cause for the slowdown.

I've remounted with data=writeback on the nfs server and under normal load,
we're still seeing hangs fairly frequently. I'm having real difficulty in
tracking down the cause of the issues.

I've moved away from catting the same file on each server to reading a
different file on each server. This has reduced the frequency of the issue
slightly, but not altogether.

> Does drbd have any way to see how active it is at that time? If
> so, monitor that.

We've checked out the drbd link and it appears untaxed when we see these
glitches.

Thanks in advance,

Andrew Nicols

-- 
Systems Developer

e: andrew.nic...@luns.net.uk
im: a.nic...@jabber.lancs.ac.uk
t: +44 (0)1524 5 10147

Lancaster University Network Services is a limited company registered in
England and Wales. Registered number: 04311892. Registered office:
University House, Lancaster University, Lancaster, LA1 4YW

signature.asc
Description: Digital signature

_______________________________________________
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users

Re: [Ocfs2-users] Tracking down hangs

Reply via email to