[perf-discuss] prstat LAT - how to interpret

Glen Gunselman Mon, 04 Dec 2006 12:11:57 -0800

We have an overloaded, CPU-bound server (a V490 with one CPU board).
Here is a sample of prstat -mL output taken during a period of high load
(summary line: Total: 278 processes, 1710 lwps, load averages: 20.72, 13.21, 6.74):
 
   PID USERNAME USR SYS TRP TFL DFL LCK SLP LAT VCX ICX SCL SIG PROCESS/LWPID
  5617 cognos8   53 0.5 0.0 0.0 0.0 2.1 0.0  45  1K 200  3K   0 BIBusTKServe/18
  5617 cognos8   51 0.5 0.0 0.0 0.0 3.6 0.0  45  1K 274  3K   0 BIBusTKServe/17
  6084 cognos8   43 0.6 0.0 0.0 0.0 1.9 0.0  54  2K 222  5K   0 BIBusTKServe/20
  6084 cognos8   43 0.6 0.0 0.0 0.0 1.1 0.0  55  1K 244  4K   0 BIBusTKServe/15
  6084 cognos8   39 0.6 0.0 0.0 0.0 1.8 0.0  59  2K 212  4K   0 BIBusTKServe/22
  5617 cognos8   39 0.4 0.0 0.0 0.0 1.4 0.0  59  1K 223  3K   0 BIBusTKServe/22
  6084 cognos8   35 0.4 0.0 0.0 0.0 1.1 0.0  64  1K 262  2K   0 BIBusTKServe/19
  5617 cognos8   34 0.4 0.0 0.0 0.0 2.2 0.0  64  1K 465  2K   0 BIBusTKServe/23
 29514 oracle    28 1.2 0.1 0.0 0.0 0.0 8.6  62 217 990 899   0 oracle/1
 29948 root     2.4 0.4 0.0 0.0 0.0 0.0  77  20 109 561 961   0 cfagent/1
  5610 oracle   1.5 0.5 0.0 0.0 0.0 0.0  98 0.1   3   8 871   0 oracle/1
   942 oracle   1.2 0.6 0.0 0.0 0.0 0.0  98 0.0  15  50 506   0 oracle/1
  9378 root     0.4 1.1 0.1 0.0 0.0 0.0  98 0.9  40   9 994   0 prstat/1
  1475 oracle   1.1 0.2 0.4 0.0 0.0 0.0  98 0.2 111  55 945   0 emagent/3047304
 11646 oracle   0.8 0.0 0.0 0.0 0.0 0.0  91 8.7   1  45  80   0 java/56
 11479 oracle   0.6 0.1 0.0 0.0 0.0 0.0  98 1.0   4   4 615   0 oracle/1
 10520 oracle   0.6 0.0 0.0 0.0 0.0 0.0  98 1.4   5   0  45   5 nmccollector/1
   835 sysnav   0.1 0.2 0.1 0.0 0.0 0.0  57  42  19 240 471   0 bb-local.sh/1
  7375 oracle   0.2 0.0 0.0 0.0 0.0 0.0 100 0.0   9   3 192   0 oracle/1
 11712 oracle   0.2 0.0 0.0 0.0 0.0 0.0 100 0.0   8   2 178   0 oracle/1
 11815 oracle   0.2 0.0 0.0 0.0 0.0 100 0.0 0.2   1   3  18   0 java/37
   576 root     0.1 0.1 0.0 0.0 0.0 0.0 100 0.1 331   1  1K   0 nscd/11
 17855 oracle   0.1 0.0 0.0 0.0 0.0 100 0.0 0.1   5   0   5   0 java/2
 11805 oracle   0.1 0.1 0.0 0.0 0.0 0.0  96 3.8   4   7  62   2 perl/1
 11649 oracle   0.1 0.0 0.0 0.0 0.0 0.0 100 0.0   9   0 118   0 oracle/1
 11780 oracle   0.0 0.1 0.0 0.0 0.0 0.0  92 8.3  52   0 354  47 webcached/1
     1 root     0.0 0.1 0.0 0.0 0.0 0.0 100 0.2  13   0 361  14 init/1
  4987 cognos8  0.0 0.1 0.0 0.0 0.0 0.0  57  43 338   4 232   0 java/5
  4972 cognos8  0.1 0.0 0.0 0.0 0.0 0.0  91 8.5  68   0  77   0 cogbootstrap/3
 17855 oracle   0.0 0.1 0.0 0.0 0.0 0.0  51  49 312   2 209   0 java/5
 
Looking at the LAT column, how do I compute the CPU resources
needed to reduce LAT to more "normal levels"?
 
Page 24 of Solaris Performance and Tools includes the following
statement referring to LAT:
 
"This is an extremely useful metric--we can use it to estimate the
potential speedup for a thread if more CPU resources are added ..."
 
I have been unable to find any information on how to turn LAT into CPU
resources.  I'm reluctant to use (USR + SYS + LAT) / 100, where USR + SYS
is 370.5 for the top 9 threads and LAT is 507 for the same 9 threads.
This seems way too simple.
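
To make the arithmetic in question concrete, here is a minimal Python sketch of that naive estimate. The values are the USR, SYS and LAT percentages of the top 9 rows of the sample above; the names TOP_THREADS and naive_cpu_demand are made up for illustration. This is only the back-of-the-envelope sum being asked about, not a validated sizing method. Summing the rows as printed gives roughly 370.2 for USR + SYS, slightly different from the 370.5 quoted.

```python
# (USR, SYS, LAT) percentages for the top 9 rows of the prstat -mL sample.
# Hypothetical names; the numbers are copied from the listing as printed.
TOP_THREADS = [
    (53, 0.5, 45),  # BIBusTKServe/18
    (51, 0.5, 45),  # BIBusTKServe/17
    (43, 0.6, 54),  # BIBusTKServe/20
    (43, 0.6, 55),  # BIBusTKServe/15
    (39, 0.6, 59),  # BIBusTKServe/22 (pid 6084)
    (39, 0.4, 59),  # BIBusTKServe/22 (pid 5617)
    (35, 0.4, 64),  # BIBusTKServe/19
    (34, 0.4, 64),  # BIBusTKServe/23
    (28, 1.2, 62),  # oracle/1
]


def naive_cpu_demand(threads):
    """Return (on_cpu_pct, lat_pct, cpus) for the naive estimate.

    on_cpu_pct is the sum of USR + SYS, lat_pct is the sum of LAT,
    and cpus is (USR + SYS + LAT) / 100, i.e. the number of CPUs these
    threads would occupy if all run-queue wait became CPU time.
    """
    on_cpu_pct = sum(usr + sys_pct for usr, sys_pct, _ in threads)
    lat_pct = sum(lat for _, _, lat in threads)
    cpus = (on_cpu_pct + lat_pct) / 100.0
    return on_cpu_pct, lat_pct, cpus


if __name__ == "__main__":
    on_cpu_pct, lat_pct, cpus = naive_cpu_demand(TOP_THREADS)
    print(f"USR+SYS = {on_cpu_pct:.1f}%  LAT = {lat_pct}%  estimate = {cpus:.2f} CPUs")
```

The sketch assumes every percent of LAT would turn into on-CPU time if enough processors were available, which is exactly the simplification in doubt here.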
 
Thanks
gleng
 
Glen Gunselman
Systems Software Specialist
TCS
Emporia State University

_______________________________________________
perf-discuss mailing list
perf-discuss@opensolaris.org
