[af4bfd0] | 1 | memlog - A Memory-Allocation Logging Tool |
---|
| 2 | |
---|
| 3 | This tool attempts to help you answer the question: |
---|
| 4 | Why is my application using so much memory? |
---|
| 5 | |
---|
| 6 | ** LINKING ** |
---|
| 7 | |
---|
| 8 | How to use it depends on how your application is linked: |
---|
| 9 | |
---|
| 10 | For dynamically-linked applications, you can: |
---|
| 11 | |
---|
| 12 | 1. Use LD_PRELOAD: Set LD_PRELOAD=/path/to/memlog/libmemlog.so when you run |
---|
| 13 | your application. |
---|
| 14 | |
---|
| 15 | 2. Link directly: Add the following to your linker flags: |
---|
| 16 | -L/path/to/memlog -Wl,-rpath,/path/to/memlog -lmemlog |
---|
| 17 | |
---|
| 18 | For statically-linked applications, add the following to your linker flags: |
---|
| 19 | |
---|
| 20 | -Wl,--wrap,malloc,--wrap,free,--wrap,realloc,--wrap,calloc,--wrap,memalign \ |
---|
| 21 | /path/to/memlog/memlog_s.o -lpthread -ldl |
---|
| 22 | |
---|
| 23 | ** RUNNING ** |
---|
| 24 | |
---|
| 25 | When your application runs, you should find in your current directory files |
---|
| 26 | named 'HOST.PID.memlog', one for each process. These contain the raw tracing |
---|
| 27 | information, and are only somewhat human readable. You can create a ps/pdf |
---|
| 28 | file detailing the memory allocated when each process reached its peak memory |
---|
| 29 | use by running: |
---|
| 30 | |
---|
[4598848] | 31 | /path/to/memlog/memlog_analyze /path/to/HOST.PID.memlog |
---|
[af4bfd0] | 32 | |
---|
| 33 | this will generate files named HOST.PID.memlog.dot, HOST.PID.memlog.ps and |
---|
| 34 | HOST.PID.memlog.pdf. You'll probably find the pdf file most convenient for |
---|
[24aa734] | 35 | viewing. HOST.PID.memlog.txt is also generated, providing the same information |
---|
| 36 | in textual form. |
---|
[af4bfd0] | 37 | |
---|
[0109b01] | 38 | If you pass the --leaks option to memlog_analyze, it will provide data on |
---|
| 39 | allocations active at the end of the program (leaks) instead of those active |
---|
| 40 | when the peak memory usage is first reached. |
---|
| 41 | |
---|
[5df7203] | 42 | You might have many runs of the same application (or output from many ranks of |
---|
| 43 | an MPI job), and you'd like to pick the one for analysis with the highest |
---|
| 44 | memory usage. If you provide a glob pattern to memlog_analyze it will do this |
---|
| 45 | for you. Make sure you quote the glob pattern so that your shell does not |
---|
| 46 | expand it. |
---|
| 47 | |
---|
| 48 | /path/to/memlog/memlog_analyze "/path/to/*.memlog" |
---|
| 49 | |
---|
[22f928f] | 50 | When running under common batch systems, the files are named |
---|
| 51 | JOB_ID.HOST.PID.memlog, and when running under the BG/Q CNK, the process's rank |
---|
| 52 | is used instead of the node-local PID. |
---|
| 53 | |
---|
[af4bfd0] | 54 | Note that te peak memory usage is determined by monitoring the processes's |
---|
| 55 | maximum resident set size, not just the total allocated heap memory. |
---|
| 56 | |
---|
[192a260] | 57 | memlog_analyze takes, as a second optional parameter, the name of the output |
---|
| 58 | directory (the current directory is the default). If the directory does not |
---|
| 59 | exist, it will be created. |
---|
| 60 | |
---|
[4598848] | 61 | memlog_analyze depends on dot (from the graphviz package) and ps2pdf (from the |
---|
[af4bfd0] | 62 | ghostscript package), plus various tools from the binutils package. |
---|
| 63 | |
---|
| 64 | ** RELATED WORK ** |
---|
| 65 | |
---|
| 66 | Why was memlog created? There are several other tools that can support this use |
---|
| 67 | case, but none of them would work in our environment properly. They were |
---|
| 68 | either too slow, not runnable under the BG/Q CNK, not thread safe, did not |
---|
| 69 | properly support big-endian PPC64, supported only either static or dynamic |
---|
| 70 | linking, did not collect full backtraces, or just did not produce |
---|
| 71 | sufficiently-informative peak-usage output. |
---|
| 72 | |
---|
| 73 | That having been said, some other tools that might interest you: |
---|
| 74 | Valgrind Massif - http://valgrind.org/docs/manual/ms-manual.html |
---|
| 75 | Google Performance Tools - http://google-perftools.googlecode.com/svn/trunk/doc/heapprofile.html |
---|
| 76 | memtrail - https://github.com/jrfonseca/memtrail |
---|
| 77 | LeakTracer - http://www.andreasen.org/LeakTracer/ |
---|
| 78 | glibc mtrace - http://www.gnu.org/s/hello/manual/libc/Allocation-Debugging.html |
---|
| 79 | Heaptrack - http://milianw.de/blog/heaptrack-a-heap-memory-profiler-for-linux |
---|
| 80 | MemProf - http://www.secretlabs.de/projects/memprof/ |
---|
| 81 | |
---|
| 82 | The dot/pdf output produced by memlog was definitely inspired by that produced |
---|
| 83 | by Google's pprof tool in the aforementioned package. |
---|
| 84 | |
---|