[OMPI users] MPI Tool Information Interface (MPI_T), details on collective communication

2025-04-23 Thread Anna R.


Hi, 

I'm currently working on a project that aims to perform detailed 
measurements of internal MPI communication, particularly based on the PML 
layer. For example, I would like to measure internal point-to-point 
communication within collective operations.

I’ve seen that using the monitoring module (enabled via --mca 
pml_monitoring_enable 2), Open MPI already provides summaries of the 
underlying point-to-point operations after program execution. This is very 
helpful.

My goal is to access this information *dynamically* through the MPI_T 
interface and trace the data for different MPI functions at runtime.

However, I’m running into a limitation:

   - 
   
   The coll_monitoring_messages_count PVAR only shows counts for collective 
   operations.
   - 
   
   The pml_monitoring_messages_count PVAR only seems to capture activity 
   from explicit point-to-point operations (like MPI_Send/MPI_Recv), but 
   not from collectives such as MPI_Bcast.
   
My question is:
Is it currently possible to observe the internal point-to-point operations 
*triggered 
by collectives* (as visible in the monitoring summary) through MPI_T 
performance variables?

I’d greatly appreciate any insights or recommendations.

Best regards
Anna

To unsubscribe from this group and stop receiving emails from it, send an email 
to users+unsubscr...@lists.open-mpi.org.


[OMPI users] MPI Tool Interface (MPI_T), details on collective communication

2025-04-23 Thread Anna-Lena Roth


Hi, 

I'm currently working on a project that aims to perform detailed 
measurements of internal MPI communication, particularly based on the PML 
layer. For example, I would like to measure internal point-to-point 
communication within collective operations.

I’ve seen that using the monitoring module (enabled via --mca 
pml_monitoring_enable 2), Open MPI already provides summaries of the 
underlying point-to-point operations after program execution. This is very 
helpful.

My goal is to access this information *dynamically* through the MPI_T 
interface and trace the data for different MPI functions at runtime.

However, I’m running into a limitation:

   - 
   
   The coll_monitoring_messages_count PVAR only shows counts for collective 
   operations.
   - 
   
   The pml_monitoring_messages_count PVAR only seems to capture activity 
   from explicit point-to-point operations (like MPI_Send/MPI_Recv), but 
   not from collectives such as MPI_Bcast.
   
My question is:
Is it currently possible to observe the internal point-to-point operations 
*triggered 
by collectives* (as visible in the monitoring summary) through MPI_T 
performance variables?

I’d greatly appreciate any insights or recommendations.

Best regards
Anna

To unsubscribe from this group and stop receiving emails from it, send an email 
to users+unsubscr...@lists.open-mpi.org.


Re: [OMPI users] MPI Tool Information Interface (MPI_T), details on collective communication

2025-04-23 Thread 'George Bosilca' via Open MPI users
Anna,

The monitoring PML tracks all activity on the PML but might choose to only
expose that one that the user can be interested in, aka its own messages,
and hide the rest of the traffic. This is easy in OMPI because all internal
messages are generated using negative tags (which are not allowed for users
use). Basically, setting pml_monitoring_enable to 2 enabled filtered
monitoring, and all internal traffic is counted differently, but not
exposed through the normal PVAR API. If you set pml_monitoring_enable to 1,
then all traffic will be counted and exposed through the PVAR, but you will
not be able to differentiate between internal and user generated traffic.
Look in the function mca_common_monitoring_record_pml for more info.

Adding PVARs for the internal traffic should be fairly easy, just copy the
code that works on the pml_data and pml_count arrays and make them work
on filtered_pml_data and filtered_pml_count arrays.

  George.


On Wed, Apr 23, 2025 at 4:10 AM Anna R.  wrote:

> Hi,
>
> I'm currently working on a project that aims to perform detailed
> measurements of internal MPI communication, particularly based on the PML
> layer. For example, I would like to measure internal point-to-point
> communication within collective operations.
>
> I’ve seen that using the monitoring module (enabled via --mca
> pml_monitoring_enable 2), Open MPI already provides summaries of the
> underlying point-to-point operations after program execution. This is very
> helpful.
>
> My goal is to access this information *dynamically* through the MPI_T
> interface and trace the data for different MPI functions at runtime.
>
> However, I’m running into a limitation:
>
>-
>
>The coll_monitoring_messages_count PVAR only shows counts for
>collective operations.
>-
>
>The pml_monitoring_messages_count PVAR only seems to capture activity
>from explicit point-to-point operations (like MPI_Send/MPI_Recv), but
>not from collectives such as MPI_Bcast.
>
> My question is:
> Is it currently possible to observe the internal point-to-point operations 
> *triggered
> by collectives* (as visible in the monitoring summary) through MPI_T
> performance variables?
>
> I’d greatly appreciate any insights or recommendations.
>
> Best regards
> Anna
>
> To unsubscribe from this group and stop receiving emails from it, send an
> email to users+unsubscr...@lists.open-mpi.org.
>

To unsubscribe from this group and stop receiving emails from it, send an email 
to users+unsubscr...@lists.open-mpi.org.