Regardless of what application type you are going to profile, the workflow is always the same and consists of:
- Configuring a profiling session
- Running the session and getting snapshots
- Analyzing the collected snapshots
Step 1. Configuring a profiling session
Before starting a profiling session, you must select a profiling configuration:
The profiling configuration specifies which profiling type will be used and whether child processes of a main profiled process (a profiling target) must be profiled.
The profiling target is always an executable* which is specified in the currently selected run configuration .
Currently, you are able to profile the following run configuration types:
If you select a run configuration of some other type, this will disable the profiling controls on the toolbar.
To select a profiling configuration
To create a profiling configuration
In the toolbar, in the list of profiling configurations, select Edit Configurations….
In the opened Profiling Configurations window, click to add a new profiling configuration.
Specify configuration Name and other profiling options:
- Profiling type
Detailed timing information about function calls.
Accurate time measurement, number of calls is not measured.
Recommended for most cases. Ideal when you look for performance issues in your app for the first time.
Accurate calls number measurement, time measurement may be inaccurate due to profiling overhead.
When sampling data is not enough. For example, for analyzing algorithm complexity (when info about number of calls is more valuable than call time values).
Each line of code is measured, call time values are inaccurate due to huge profiler overhead.
For advanced use cases only. For example, when you already know what function causes issues and want to analyze each line of it.
Detailed temporal data about application threads and events.
Collects temporal data about thread states, application events, and other multi-threading data. Based on Event Tracing for Windows (ETW).
If you want to create a configuration for profiling Mono or Unity applications, select Timeline (Mono) or Timeline (Unity) correspondingly.
Recommended for most cases. Especially, for analyzing multithreaded apps. Use it, for example, to determine the cause of UI freezes, excessive garbage collections, uneven workload distribution, insufficient I/O, and so on.
Control profiling manually
Enables you to control profiling manually via the profiler controls. For example, to take a snapshot, you will have to click the Get Snapshot button.
Collect profiling data from start*
Control profiling via API
Enables you to control profiling directly from the code of the profiled application. For example, to take a snapshot in the exact point of your code. Learn more about using the API in the dotTrace documentation.
Profile child processes
If selected, dotTrace will profile not only the main app process but the processes it runs as well.
This option defines how dotTrace must calculate calls time. For more details, refer to https://www.jetbrains.com/help/profiler/Profiling_Guidelines__Setting_the_Right_Meter_Kind.html.
Typically, this is a choice between whether dotTrace should calculate time when a thread is not working or should not. More specifically, there are four options:
Real time (performance counter)
Recommended. dotTrace calculates the overall real time passed between method entry and exit. This time does not depend on app threads states. Time is calculated using system performance counter.
Real time (CPU instruction)
dotTrace calculates the overall real time passed between method entry and exit. This time does not depend on app threads states. Time is calculated using the CPU register.
dotTrace calculates only the time when a certain thread is running. The time when the thread is waiting or sleeping is not included in calculation.
Thread cycle time
dotTrace calculates only the time when a certain thread is running. The time when the thread is waiting or sleeping is not included in calculation. Time is calculated using the CPU register.
The table next in this section will help you to choose a time measurement method better suitable for your case.
Clear this option if you want dotTrace to turn off JIT-inlining and get call stacks that closely resemble the structure of application's source code.
If selected, dotTrace will take into account time spent in profiler itself by taking more time samples.
Merge threads with equal names
If selected, dotTrace will group the performance information for all threads with the same name under a single thread node in Call Tree and Threads Tree views.
Collect native allocations
(Timeline) If selected, dotTrace will collect information about all memory allocations the profiled application does in the native (unmanaged) heap. To be able to select this option, you must have Windows Performance Toolkit installed on your computer.
Collect debug output
(Timeline) If selected, dotTrace will collect information about all messages the profiled application sends to the debug output.
Collect TPL events
(Timeline) If selected, profiling performance may be affected but dotTrace will collect Task Parallel Library (TPL) data. If cleared, there will be no
Tasknodes in Call Tree and
asynccall nodes will lack their await and continuations parts. Learn more about how dotTrace works with asynchronous code in https://www.jetbrains.com/help/profiler/Analyzing_Async_Calls.xml.
Clear this option in case your application does not use multitasking or you do not need this info.
- Profiling type
Step 2. Running the session and getting snapshots
To start a session and get snapshots
On the toolbar, select the run configuration you want to profile.
Either selector click the corresponding button on the toolbar.
Once the profiling is started, you will see the Performance Profiler tool window opened on the Profiling tab with the profiling controller inside.
If you disabled Collect profiling data from start in the profiling configuration, click Start Recording to start recording profiling data.
Basically, now all you need is reproduce the performance issue you are looking for or, if you just evaluate the overall app performance, go through the desired usage scenarios in your application.
Click Get Snapshot. The collected snapshot will be added to the list of snapshots inside the Performance Profiler window. To start collecting profiling data again, click Start Recording one more time.
After you collect a snapshot(s), you can finish the profiling session. Normally, you do it either by closing the profiled application or by detaching the profiler via the Detach button ( Kill forcibly terminates the profiled app and the session, so, use it only in emergency cases).
Step 3. Analyzing collected snapshots
For more details on how to analyze performance snapshots, refer to Analyzing Profiling Results.
To analyze a snapshot
On the All Snapshots tab of the Performance Profiler tool window, select the snapshot you want to analyze.
Analyze the collected data using one of the views:
Call Tree: a "classic" call tree that shows you all method calls in all threads. Each top-level node represents a top-level function which was executed by a certain thread. Use this view to quickly get down to actual application activity. Learn more
Top Methods: the best place to start when analyzing application performance. It is a simple plain list of methods with the highest execution time. Note that you can reduce the system functions "noise" by excluding them from the list using the toggle: if enabled, method's execution time is calculated as a sum of method's own time and the time of all child system methods (down to the next user method in the stack). Learn more
Once the suspicious method is found, press F4 or select Jump to Source from the context menu. Rider will navigate you right to the method's source code.