I am using callgrind to profile a multi-threaded Linux app, and mostly it works great. I start it with instrumentation off (--instr-atstart=no) and, once setup is done, turn it on with callgrind_control -i on. However, when I change certain configuration settings to profile a different part of the app, it runs extremely slowly even before I turn instrumentation on. Code that takes a few seconds in normal operation takes over an hour under callgrind, with instrumentation still turned off. Any ideas as to why that might be, and how to go about debugging or resolving the slowness?
Callgrind is a tool built on Valgrind. Valgrind is essentially a dynamic binary translator (libVEX, which is part of Valgrind): it decodes every instruction of your program and JIT-compiles it into a stream of instructions for the same CPU.
As far as I know, Valgrind has no way to enable this translation for an already running process, so dynamic translation is active the whole time, from the start of the program. It can't be turned off either.
Tools are built on Valgrind by adding instrumentation code. The "none" tool (Nulgrind, --tool=none) is the tool that adds no instrumentation. But every tool runs on the Valgrind core, so dynamic translation is active all the time. Turning instrumentation on and off in Callgrind only turns the additional instrumentation on and off; the translation overhead remains.
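Since only the extra instrumentation is switchable, the same toggle can also be issued from inside the program instead of with callgrind_control. Here is a minimal sketch of that, assuming the Valgrind development headers may or may not be installed. CALLGRIND_START_INSTRUMENTATION and CALLGRIND_STOP_INSTRUMENTATION are Callgrind's client-request macros from valgrind/callgrind.h; profiled_work and run_with_instrumentation are hypothetical names standing in for your own code.

```cpp
#include <cstdint>

// Use the real client-request macros when the Valgrind headers are available;
// otherwise define no-op stand-ins so the program still builds.
#if defined(__has_include)
#  if __has_include(<valgrind/callgrind.h>)
#    include <valgrind/callgrind.h>
#    define HAVE_CALLGRIND_H 1
#  endif
#endif
#ifndef HAVE_CALLGRIND_H
#  define CALLGRIND_START_INSTRUMENTATION do {} while (0)
#  define CALLGRIND_STOP_INSTRUMENTATION  do {} while (0)
#endif

// Hypothetical stand-in for the part of the app being profiled.
std::uint64_t profiled_work(std::uint64_t n) {
    std::uint64_t sum = 0;
    for (std::uint64_t i = 1; i <= n; ++i) {
        sum += i * i;
    }
    return sum;
}

// Hypothetical wrapper: instrumentation is active only around profiled_work.
std::uint64_t run_with_instrumentation(std::uint64_t n) {
    CALLGRIND_START_INSTRUMENTATION;  // in-process counterpart of callgrind_control -i on
    std::uint64_t result = profiled_work(n);
    CALLGRIND_STOP_INSTRUMENTATION;   // in-process counterpart of callgrind_control -i off
    return result;
}
```

Outside Valgrind the macros do nothing, so the same binary can run normally. Note that this only narrows what Callgrind records. The code outside the marked region is still translated by the Valgrind core, so it will not by itself make the slow setup phase faster.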
The virtual CPU implemented by Valgrind has limitations; there is an (incomplete) list at http://valgrind.org/docs/manual/manual-core.html#manual-core.limits Most of the limitations concern floating-point operations, which may be emulated incorrectly.
Does your configuration change involve floating-point operations, or any of the other listed limitations?
You should also know that "Valgrind serialises execution so that only one thread is running at a time" (from the same manual page, manual-core.html).
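One pattern that can interact badly with this serialisation is a thread that busy-waits on a flag: while it spins, the thread that would set the flag may not be running at all. Whether the slow configuration in your app does anything like this is purely an assumption on my part. The sketch below only contrasts the two waiting styles, and handoff_spinning and handoff_blocking are hypothetical names.

```cpp
#include <atomic>
#include <condition_variable>
#include <mutex>
#include <thread>

// Busy-wait handoff: the waiter keeps executing instructions until the flag flips.
// When only one thread runs at a time, those instructions compete with the producer.
int handoff_spinning() {
    std::atomic<bool> ready{false};
    int value = 0;
    std::thread producer([&] {
        value = 42;
        ready.store(true, std::memory_order_release);
    });
    while (!ready.load(std::memory_order_acquire)) {
        std::this_thread::yield();  // give the other thread a chance to run
    }
    producer.join();
    return value;
}

// Blocking handoff: the waiter sleeps in the kernel until it is notified,
// so it does not consume CPU time while the producer works.
int handoff_blocking() {
    std::mutex m;
    std::condition_variable cv;
    bool ready = false;
    int value = 0;
    std::thread producer([&] {
        {
            std::lock_guard<std::mutex> lock(m);
            value = 42;
            ready = true;
        }
        cv.notify_one();
    });
    {
        std::unique_lock<std::mutex> lock(m);
        cv.wait(lock, [&] { return ready; });
    }
    producer.join();
    return value;
}
```

Both functions return the same value when run natively; the difference is only in how the waiting thread spends its time. If your code has spin loops of the first kind, it may also be worth trying Valgrind's --fair-sched=yes option, which changes how the next thread to run is chosen.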