C++ program crashes with EXIT CODE: 9 (SIGKILL)
@ I answer my own question so that some one can get helps later.
The exception was caused by OutOfMemory.
The process allocates too much memory putting pressures on OS. The OS has a hit man, oom-killer, that kills such processes for the sake of system stability. The oom-killer uses bullets called SIGKILL.
However, since SIGKILL is invisible (it cannot be caught and handled by the application), for some newbies including me, it is not always easy to figure out the true reason for the crash.
One good news is that when the hit man kills your process, it always logs its action at /var/log/messages.
Depending on your OS configuration, oom-killer might not log any message at all. In such a case, you can configure it as well. Search for rsyslog configuration in google.
Finding which process was killed by Linux OOM killer
Related videos on Youtube

syko
Updated on December 01, 2020Comments
-
syko about 2 years
My application program crashes with EXIT CODE: 9 (SIGKILL)
I never run any command such as 'kill -9 (pid)' or 'pkill (process name)' that can kill the running process.
Where should I start for debugging in this case?
I tried to dump the stack trace when the program crashes, but I found that the SIGKILL cannot be caught for error handling.
The program uses MPI and runs in cluster environments. It dies after around 1 hour of its run.
Is there any COMMON causes that can incur SIGKILL exception?
(It's running on linux; cent os 7)
-
stefaanv about 6 years@syko: it is a possibilty as the OOM sends a SIGKILL see this answer. This answer suggests that you can check logs to see if this is the case.
-
Some programmer dudeRun the program in a debugger to catch the signal.
-
zwol
strace yourprogram
from a shell prompt. This will produce a tremendous volume of output; ignore all but the last 50 or so lines. If you have no idea what the output means, post those last 50 lines here, unedited. (They won't fit into the comments. Use the "edit" link under the tags to edit the text of your question.)
-
Mikhail over 1 yearI've got this error on Jenkins, and it turned the build was indeed killed due to OOM. Thanks!