===== VTune =====

VTune is a performance profiler from Intel and is available on the HPC cluster.

General information from Intel: https://software.intel.com/en-us/get-started-with-vtune-linux-os

The VTune commands share the prefix amplxe-. The most important are:

  * amplxe-cl: launches the VTune command-line interface.
  * amplxe-gui: launches VTune with a graphical interface.

Profiling steps:

  * Data collection (collect)
  * Analysis of the results

==== Collect ====

To collect profiling data, run the executable through amplxe-cl, specifying one of the actions listed at: https://software.intel.com/en-us/vtune-amplifier-help-amplxe-cl-actions

The action that collects performance data is **collect**. It requires an option specifying the type of analysis, for example:

  * **hotspots**: Identify your most time-consuming source code.
  * **concurrency**: Analyze how your application is using available logical CPU cores, discover where parallelism is incurring synchronization overhead, and identify potential candidates for parallelization.
  * **hpc-performance**: Identify opportunities to optimize CPU, memory, and FPU utilization for compute-intensive or throughput applications.
  * **memory-access**: Measure a set of metrics to identify memory access related issues (for example, issues specific to NUMA architectures).
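The analysis types listed above can be checked before a collection is launched. The following is a minimal sketch, not part of VTune: build_collect_cmd is a hypothetical helper that only accepts the four analysis types named on this page and prints the resulting amplxe-cl command without running it. The result directory and executable names are placeholders.

```shell
#!/bin/sh
# Sketch: validate the analysis type and print the amplxe-cl collect command.
# build_collect_cmd is a hypothetical helper, not a VTune command.

build_collect_cmd() {
    if [ "$#" -lt 3 ]; then
        echo "usage: build_collect_cmd <analysis> <result-dir> <executable> [args...]" >&2
        return 2
    fi

    analysis=$1    # one of the analysis types listed on this page
    result_dir=$2  # directory where the collected data is stored
    shift 2        # what remains is the executable and its arguments

    case "$analysis" in
        hotspots|concurrency|hpc-performance|memory-access) ;;
        *)
            echo "unknown analysis type: $analysis" >&2
            return 1
            ;;
    esac

    # Dry run: the command is only printed, not executed.
    echo "amplxe-cl -collect $analysis -result-dir $result_dir $*"
}

# Placeholder names, for illustration only.
build_collect_cmd hpc-performance myresults bin/myexe
```

Printing the command first makes it easy to paste it into an interactive job once it looks right.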
Command example:

  amplxe-cl -collect hotspots -result-dir home/test/myApplication

==== Result analysis ====

The collected data can be analyzed off-line in a graphical way with amplxe-gui:

  amplxe-gui

==== HPC (UNIPR) ====

Connect to gui.hpc.unipr.it and start an interactive job, e.g.:

  srun -N1 -n68 -p knl --account=
  > module load intel # (includes VTune too)
  > amplxe-cl -collect hotspots -result-dir home/test/myApplication
  > exit

On the University HPC cluster it is preferable to launch amplxe-gui from gui.hpc.unipr.it:

  module load intel
  amplxe-gui

==== MARCONI (CINECA) ====

On Marconi you can submit an interactive session in which to perform both the collection and the data analysis (amplxe-gui does not work on the login machine), for example:

  srun -N1 -n68 -t 00:30:00 -A -p knl --pty bash
  > module load intel vtune
  > amplxe-cl -collect hpc-performance -result-dir myresults bin/myexe
  > amplxe-gui myresults
  > exit

==== Debug Info ====

To include debugging information in the results (for example the source code), you need to compile with the option -gdwarf-2.
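The debug-info requirement adds a compile step before the collection. The sketch below prints both steps in order and executes nothing. Everything except -gdwarf-2 and the amplxe-cl options is an assumption made for illustration: the compiler name taken from CC (defaulting to cc), the hypothetical helper profile_steps, and the file names myexe.c, bin/myexe and myresults.

```shell
#!/bin/sh
# Sketch: print the compile step (with debug info) and the collect step.
# profile_steps is a hypothetical helper; CC and all file names are assumptions.

CC=${CC:-cc}   # compiler command; this page does not say which one to use

profile_steps() {
    src=$1         # source file to compile
    exe=$2         # executable to produce and profile
    result_dir=$3  # directory for the VTune results

    # Step 1: compile with -gdwarf-2 so VTune can map results to source lines.
    echo "$CC -gdwarf-2 -o $exe $src"

    # Step 2: collect hotspots data on the freshly built executable.
    echo "amplxe-cl -collect hotspots -result-dir $result_dir $exe"
}

profile_steps myexe.c bin/myexe myresults
```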