[users at i-scream] libstatgrab on AIX: data mismatch between saidar and topas

Anderson Carlos Trindade anderson.trindade at optimode.com.br
Tue May 24 15:40:19 BST 2016


Hi Jens,

Thank you by your explanations and sorry my questions. I’m really starting with AIX and I will study a little bit more about this partitioning stuff.

Thanks,

Anderson




> Em 24 de mai de 2016, à(s) 10:10, Jens Rehsack <rehsack at gmail.com> escreveu:
> 
> Hi Anderson,
> 
> No, this is not correct. The LPAR technology allows (and prefers) dedicated Resources per partition, for shared resources WPAR's are recommended. I never tried, but assume that shared resources for LPAR mean minimum and maximum of resources can be reserved on demand.
> 
> The difference you see has nothing to do with physical machine view vs. partition / logical view.
> 
> Von meinem iPhone gesendet
> 
>> Am 24. Mäi. 2016 um 14:51 schrieb Anderson Carlos Trindade <anderson.trindade at optimode.com.br>:
>> 
>> Hi Jens,
>> 
>> So, the point is:
>> 
>>   - libstatgrab is reporting the physical CPU usage. If libstatgrab shows something around 65% of Idle, it means that 65% of all physical resources are Idle.
>> 
>>   - on the other hand, the sample code is reporting the LPAR usage. If the sample code shows something around 20% of Idle, it means that LPAR has just 20% of the CPU dedicated to LPAR is available to LPAR usage
>> 
>> Is this understanding correct?
>> 
>> Considering I have one application running inside a LPAR and this application is consuming almost all CPU dedicated to LPAR (around 80%) but the physical host is using just 35% of CPU, If libstatgrab returns the physical usage, I can’t see from the libstatgrab perspective that the LPAR is almost 100% of CPU usage. Is that correct?
>> 
>> 
>> 
>> 
>>> Em 24 de mai de 2016, à(s) 06:11, Jens Rehsack <rehsack at gmail.com> escreveu:
>>> 
>>> Hi Anderson,
>>> 
>>> the example is very explicit about the measurement - it normalizes the values when lparstats.type.b.shared_enabled - libstatgrab doesn't.
>>> libstatgrab reports the physical cpu measure - which can lead to misinterpretion for shared resources (which is up to our knowledge always the case when physical resources are shared on a best effort way). So we decided against that (similar for zones (Solaris), Jails (BSD) and Containers (Linux)) until we find a tuit to analyze all available technologies and a reasonable way to deal with them.
>>> 
>>> Thanks for remind me :)
>>> 
>>> Best regards,
>>> Jens
>>> 
>>>> Am 23.05.2016 um 18:54 schrieb Anderson Carlos Trindade <anderson.trindade at optimode.com.br>:
>>>> 
>>>> Hi Jens,
>>>> 
>>>> Thank you for reply!
>>>> 
>>>> As far as I know, topas seems to be an AIX utility (https://www.ibm.com/support/knowledgecenter/#!/ssw_aix_71/com.ibm.aix.cmds5/topas.htm), but I can’t tel you where the data displayed by topas is coming from.
>>>> 
>>>> But let’s forget topas for a moment.
>>>> 
>>>> I got a sample code from IBM site (https://www.ibm.com/support/knowledgecenter/#!/ssw_aix_53/com.ibm.aix.prftools/doc/prftools/prftools07.htm%23wq407), which uses perfstat to report cpu usage statistics.
>>>> 
>>>> Then, I compiled this sample code and run in parallel to saidar, each one in a separated SSH session. While saidar is reporting around 80% of Idle time and 10% of user time, the sample code above (based on perfstat) is reporting something around 35% of idle time and 60% of user mode usage. I recorded a screenshot and I can share with you If you prefer.
>>>> 
>>>> Considering that saidar and the sample code above are getting data from the same source, why are these statistics so different?
>>>> 
>>>> 
>>>> 
>>>> 
>>>> 
>>>> 
>>>> 
>>>>> Em 23 de mai de 2016, à(s) 13:02, Jens Rehsack <rehsack at gmail.com> escreveu:
>>>>> 
>>>>> Hi,
>>>>> 
>>>>>> Am 23.05.2016 um 17:02 schrieb Anderson Carlos Trindade <anderson.trindade at optimode.com.br>:
>>>>>> 
>>>>>> Hello List,
>>>>>> 
>>>>>> I’m trying to understanding some differences between data reported by AIX utility topas and saidar.
>>>>>> 
>>>>>> On a given moment (almost in the same second), saidar report the following CPU usage:
>>>>>> 
>>>>>> CPU Idle: 88,28%
>>>>>> CPU system: 4,72%
>>>>>> CPU User: 7,00%
>>>>>> 
>>>>>> but topas report the following usage:
>>>>>> 
>>>>>> %Idle 35,8%
>>>>>> %Kern 3,5%
>>>>>> %User: 60,5%
>>>>>> %Wait 0,2%
>>>>>> 
>>>>>> It seems that both utilities are using different sources of data, since the usage reported is very different.
>>>>>> Please, could you help me to understand where these differences are coming from?
>>>>> 
>>>>> Well, I don't know where topas is fetching it's data from - and where your topas comes from (AIX Linux Tools? 3rd party repo?) ....
>>>>> 
>>>>> As you can see here https://github.com/i-scream/libstatgrab/blob/master/src/libstatgrab/cpu_stats.c#L162, libstatgrab is using perfstat - the IBM recommendation and the same source used by nmon.
>>>>> See https://www.ibm.com/support/knowledgecenter/ssw_aix_53/com.ibm.aix.prftools/doc/prftools/prftools07.htm%23wq407 for more details about libperfstat.
>>>>> 
>>>>> The reason for enhancing libstatgrab by a former customer was the poor data quality of GNU tools on Unices (HP-UX, AIX, Solaris).
>>>>> When I'm in doubt, I trust libstatgrab more than all GNU tools together >:-)
>>>>> 
>>>>>> my apologies in advance, because I'm very new on AIX world
>>>>>> 
>>>>>> This is a LPAR with 4 CPU’s
>>>>> 
>>>>> Best regards
>>>>> --
>>>>> Jens Rehsack - rehsack at gmail.com
>>> 
>>> --
>>> Jens Rehsack - rehsack at gmail.com
>> 




More information about the users mailing list