[EMAIL PROTECTED] wrote:
> Eric,
> 
> If you are interested in tying DTLB misses to
> specific regions of user source code, please
> consider using 'collect' and 'analyzer'.  The
> latest builds of these tools for Opteron can
> be found here:
> 
>   /set/mars/dist/intel-S2/bin
> 

Can it find specific region in the kernel text section? As my workload
is quite simple: a typical run of Iperf that does nothing but read()
from a socket for network throughput test. Also the TLB misses mainly
happen in kernel pages. :(

- yxn

> An example of using collect:
> 
>   collect -p on -h dtlbm,hi a.out
>   analyzer test.1.er
>   er_print -pc test.1.er
> ...
> Excl.     Incl.     Excl. DTLB  Incl. DTLB   Name
> User CPU  User CPU  Misses      Misses
>   sec.      sec.
> 10.337    10.337    1801261     1801261      <Total>
>  0.440     0.440     100070      100070      test_loop + 0x000000A8, line 286 
> in "thrashfpleak.c"
>  0.430     0.430      50035       50035      test_loop + 0x00000080, line 277 
> in "thrashfpleak.c"
>  0.420     0.420      70049       70049      test_loop + 0x00000076, line 275 
> in "thrashfpleak.c"
> ...
> 
> Yukon
> 
> 
> 
> Yu Xiangning wrote On 10/31/06 22:11,:
>> Hi folks,
>>
>> While investigating a performance problem in my prototype workspace, I
>> found that the dtlb-miss rate is about 16% higher than that of the
>> baseline, so I suspect that the high tlb miss could be the issue.
>>
>> So, how can I get the statistics on the PC addresses when the tlb miss
>> happens? Can DTrace or some other facilities help here? Thank you in
>> advance!
>>
>> - yxn
>>
>> The attached in the sample trapstat output: (ignored the itlb column)
>>
>> Mine:
>> cpu m size| dtlb-miss %tim dtsb-miss %tim |%tim
>> ----------+-------------------------------+----
>>   0 u   8k|      7399  0.3         0  0.0 | 0.3
>>   0 u  64k|         0  0.0         0  0.0 | 0.0
>>   0 u 512k|         0  0.0         0  0.0 | 0.0
>>   0 u   4m|         0  0.0         0  0.0 | 0.0
>> - - - - - + - - - - - - - - - - - - - - - + - -
>>   0 k   8k|    397423 14.5       237  0.0 |14.5
>>   0 k  64k|         0  0.0         0  0.0 | 0.0
>>   0 k 512k|         0  0.0         0  0.0 | 0.0
>>   0 k   4m|         0  0.0         0  0.0 | 0.0
>> ==========+===============================+====
>>       ttl |    404822 14.7       237  0.0 |14.8
>>
>> Baseline:
>> cpu m size| dtlb-miss %tim dtsb-miss %tim |%tim
>>   0 u   8k|      8667  0.3         9  0.0 | 0.3
>>   0 u  64k|         0  0.0         0  0.0 | 0.0
>>   0 u 512k|         0  0.0         0  0.0 | 0.0
>>   0 u   4m|         0  0.0         0  0.0 | 0.0
>> - - - - - + - - - - - - - - - - - - - - - + - -
>>   0 k   8k|    342333 12.5       233  0.0 |12.6
>>   0 k  64k|         0  0.0         0  0.0 | 0.0
>>   0 k 512k|         0  0.0         0  0.0 | 0.0
>>   0 k   4m|        21  0.0         0  0.0 | 0.0
>> ==========+===============================+====
>>       ttl |    351021 12.8       242  0.0 |12.9
>>
>>
>> _______________________________________________
>> perf-discuss mailing list
>> perf-discuss@opensolaris.org
> 
_______________________________________________
perf-discuss mailing list
perf-discuss@opensolaris.org

Reply via email to