How can I profile Python code line-by-line?

Question

I've been using cProfile to profile my code, and it's been working great. I also use gprof2dot.py to visualize the results (makes it a little clearer).

However, cProfile (and most other Python profilers I've seen so far) seem to only profile at the function-call level. This causes confusion when certain functions are called from different places - I have no idea if call #1 or call #2 is taking up the majority of the time. This gets even worse when the function in question is six levels deep, called from seven other places.

How do I get a line-by-line profiling?

Instead of this:

function #12, total time: 2.0s

I'd like to see something like this:

function #12 (called from somefile.py:102) 0.5s
function #12 (called from main.py:12) 1.5s

cProfile does show how much of the total time "transfers" to the parent, but again this connection is lost when you have a bunch of layers and interconnected calls.

Ideally, I'd love to have a GUI that would parse through the data, then show me my source file with a total time given to each line. Something like this:

main.py:

a = 1 # 0.0s
result = func(a) # 0.4s
c = 1000 # 0.0s
result = func(c) # 5.0s

Then I'd be able to click on the second "func(c)" call to see what's taking up time in that call, separate from the "func(a)" call. Does that make sense?

My guess is that you would be interested in `pstats.print_callers`. An example is [here](http://www.doughellmann.com/PyMOTW/profile/). — Muhammad Alkarouri, Oct 13 '10 at 20:18
Muhammad, that's definitely helpful! At least it fixes one problem: separating function calls depending on origin. I think Joe Kington's answer is closer to my goal, but print_callers() definitely gets me halfway there. Thanks! — rocketmonkeys, Oct 14 '10 at 15:25

score 138 · Accepted Answer · edited Nov 25 '20 at 19:24

138

I believe that's what Robert Kern's line_profiler is intended for. From the link:

File: pystone.py
Function: Proc2 at line 149
Total time: 0.606656 s

Line #      Hits         Time  Per Hit   % Time  Line Contents
==============================================================
   149                                           @profile
   150                                           def Proc2(IntParIO):
   151     50000        82003      1.6     13.5      IntLoc = IntParIO + 10
   152     50000        63162      1.3     10.4      while 1:
   153     50000        69065      1.4     11.4          if Char1Glob == 'A':
   154     50000        66354      1.3     10.9              IntLoc = IntLoc - 1
   155     50000        67263      1.3     11.1              IntParIO = IntLoc - IntGlob
   156     50000        65494      1.3     10.8              EnumLoc = Ident1
   157     50000        68001      1.4     11.2          if EnumLoc == Ident1:
   158     50000        63739      1.3     10.5              break
   159     50000        61575      1.2     10.1      return IntParIO

edited Nov 25 '20 at 19:24

Nico Schlömer

46,467
24
178
218

answered Oct 13 '10 at 20:19

Joe Kington

258,645
67
583
455

2

Joe, this is *exactly* what I was looking for. I can just use a decorator, attach a LineProfiler() object to a few functions, and it'll spit out a line-by-line profile of the function. I really wish there was a graphical way to see the results, but this is a great start! Thanks! – rocketmonkeys Oct 14 '10 at 15:40
1

As a followup: I've used this a few times, and I've even made a django decorator @profiler to automatically wrap the view in this line-by-line profiler and spit out the results. It's been perfect! This is truly what's needed when I'm profiling a view. It can't show me recursively what's taking time, but I can at least narrow it down to a single line. That's often just what I need. Thanks again! – rocketmonkeys Nov 30 '10 at 15:15
11

Does line_profiler work with Python 3? I couldn't get any information on that. – user1251007 Jul 23 '12 at 15:02
2

@Rocketmonkeys would you be willing to share your django decorator? – meshy Oct 16 '12 at 11:59
3

line_profiler does not show hits and time for me. Can anyone tell me why? And how to solve? – I159 Jan 06 '13 at 12:03
same problem as with L159 :( any help? – Hemant Mar 21 '13 at 11:24
7

Here's the decorator I wrote: https://gist.github.com/kylegibson/6583590. If you're running nosetests, be sure to use the -s option so stdout is printed immediately. – Kyle Gibson Sep 16 '13 at 17:14
6

how does the python script that produces this output look like? `import line_profiler;` and then ? – Zhubarb Mar 11 '14 at 12:19
@profile is undefined for jupyter notebook 5 with python 3.5.3 – Aseem Jun 24 '17 at 07:38
20

can anyone show how to actually use this library? The readme teaches how to install, and answers various FAQs, but doesn't mention how to use it after a pip install.. – cryanbhu Jul 25 '18 at 03:28
2

@user1251007: The [Changes](https://github.com/rkern/line_profiler#id11) section on the project's homepage seems to indicate that it has worked with Python 3 since its own first non-beta 1.0 release. – martineau Nov 27 '18 at 13:03
1

How do i use this thing – Batman Apr 03 '20 at 22:33
**Step1:**, `pip install line_profiler` **Step2:** In your script over your function you want to profile, add the `@profile` decorator **Step3:** Run this command to generate the .lprof file: `kernprof -l ` **Step4:** Run this command to see pretty results using the generated .lprof file: `python -m line_profiler ` – Mithun Kinarullathil May 21 '22 at 10:45

score 63 · Answer 2 · edited Sep 03 '19 at 14:43

You could also use pprofile(pypi). If you want to profile the entire execution, it does not require source code modification. You can also profile a subset of a larger program in two ways:

toggle profiling when reaching a specific point in the code, such as:

import pprofile
profiler = pprofile.Profile()
with profiler:
    some_code
# Process profile content: generate a cachegrind file and send it to user.

# You can also write the result to the console:
profiler.print_stats()

# Or to a file:
profiler.dump_stats("/tmp/profiler_stats.txt")

toggle profiling asynchronously from call stack (requires a way to trigger this code in considered application, for example a signal handler or an available worker thread) by using statistical profiling:

import pprofile
profiler = pprofile.StatisticalProfile()
statistical_profiler_thread = pprofile.StatisticalThread(
    profiler=profiler,
)
with statistical_profiler_thread:
    sleep(n)
# Likewise, process profile content

Code annotation output format is much like line profiler:

$ pprofile --threads 0 demo/threads.py
Command line: ['demo/threads.py']
Total duration: 1.00573s
File: demo/threads.py
File duration: 1.00168s (99.60%)
Line #|      Hits|         Time| Time per hit|      %|Source code
------+----------+-------------+-------------+-------+-----------
     1|         2|  3.21865e-05|  1.60933e-05|  0.00%|import threading
     2|         1|  5.96046e-06|  5.96046e-06|  0.00%|import time
     3|         0|            0|            0|  0.00%|
     4|         2|   1.5974e-05|  7.98702e-06|  0.00%|def func():
     5|         1|      1.00111|      1.00111| 99.54%|  time.sleep(1)
     6|         0|            0|            0|  0.00%|
     7|         2|  2.00272e-05|  1.00136e-05|  0.00%|def func2():
     8|         1|  1.69277e-05|  1.69277e-05|  0.00%|  pass
     9|         0|            0|            0|  0.00%|
    10|         1|  1.81198e-05|  1.81198e-05|  0.00%|t1 = threading.Thread(target=func)
(call)|         1|  0.000610828|  0.000610828|  0.06%|# /usr/lib/python2.7/threading.py:436 __init__
    11|         1|  1.52588e-05|  1.52588e-05|  0.00%|t2 = threading.Thread(target=func)
(call)|         1|  0.000438929|  0.000438929|  0.04%|# /usr/lib/python2.7/threading.py:436 __init__
    12|         1|  4.79221e-05|  4.79221e-05|  0.00%|t1.start()
(call)|         1|  0.000843048|  0.000843048|  0.08%|# /usr/lib/python2.7/threading.py:485 start
    13|         1|  6.48499e-05|  6.48499e-05|  0.01%|t2.start()
(call)|         1|   0.00115609|   0.00115609|  0.11%|# /usr/lib/python2.7/threading.py:485 start
    14|         1|  0.000205994|  0.000205994|  0.02%|(func(), func2())
(call)|         1|      1.00112|      1.00112| 99.54%|# demo/threads.py:4 func
(call)|         1|  3.09944e-05|  3.09944e-05|  0.00%|# demo/threads.py:7 func2
    15|         1|  7.62939e-05|  7.62939e-05|  0.01%|t1.join()
(call)|         1|  0.000423908|  0.000423908|  0.04%|# /usr/lib/python2.7/threading.py:653 join
    16|         1|  5.26905e-05|  5.26905e-05|  0.01%|t2.join()
(call)|         1|  0.000320196|  0.000320196|  0.03%|# /usr/lib/python2.7/threading.py:653 join

Note that because pprofile does not rely on code modification it can profile top-level module statements, allowing to profile program startup time (how long it takes to import modules, initialise globals, ...).

It can generate cachegrind-formatted output, so you can use kcachegrind to browse large results easily.

Disclosure: I am pprofile author.

+1 Thank you for your contribution. It looks well-done. I have a little different perspective - measuring inclusive time taken by statements and functions is one objective. Finding out what can be done to make the code faster is a different objective. The difference becomes painfully obvious as the code gets large - like 10^6 lines of code. The code can be wasting large percents of time. The way I find it is by taking a small number of very detailed samples, and examining them with a human eye - not summarizing. The problem is exposed by the fraction of time it wastes. — Mike Dunlavey, Feb 02 '15 at 14:18
You are right, I didn't mention pprofile usage when one wants to profile a smaller subset. I edited my post to add examples of this. — vpelletier, Feb 03 '15 at 06:54
This is exactly what I was looking for: non-intrusive and extensive. — egpbos, Oct 05 '16 at 06:30
Nice tool, but it runs several times slower than the original code. — rominf, Nov 19 '18 at 10:13
In deterministic mode it does have a significant overhead - the flip side of portability. On slower code, I recommend using the statistic mode, which has a ridiculously small overhead - at the expense of trace imprecision and readability. But it can be a first step too: identify hot-spot in statistic mode, produce a smaller case triggering the identified hot-spot, and use deterministic profiling to get all the details. — vpelletier, Nov 02 '20 at 10:31
I don't want to see other modules' work time. I just want to see how long does each line in my script take. PProfile creates a lot of unnecessary *for me* info. — Mehmet Burak Sayıcı, Mar 30 '21 at 14:23
@MehmetBurakSayıcı : Have you tried the documented inclusion and exclusion options ? (--exclude-syspath, --exclude and --include) — vpelletier, Apr 01 '21 at 22:38

Pe Dro · Answer 3 · 2020-08-01T15:07:23.510

Just to improve @Joe Kington 's above-mentioned answer.

For Python 3.x, use line_profiler:

Installation:

pip install line_profiler

Usage:

Suppose you have the program main.py and within it, functions fun_a() and fun_b() that you want to profile with respect to time; you will need to use the decorator @profile just before the function definitions. For e.g.,

@profile
def fun_a():
    #do something

@profile
def fun_b():
    #do something more

if __name__ == '__main__':
    fun_a()
    fun_b()

The program can be profiled by executing the shell command:

$ kernprof -l -v main.py

The arguments can be fetched using $ kernprof -h

Usage: kernprof [-s setupfile] [-o output_file_path] scriptfile [arg] ...

Options:
  --version             show program's version number and exit
  -h, --help            show this help message and exit
  -l, --line-by-line    Use the line-by-line profiler from the line_profiler
                        module instead of Profile. Implies --builtin.
  -b, --builtin         Put 'profile' in the builtins. Use 'profile.enable()'
                        and 'profile.disable()' in your code to turn it on and
                        off, or '@profile' to decorate a single function, or
                        'with profile:' to profile a single section of code.
  -o OUTFILE, --outfile=OUTFILE
                        Save stats to <outfile>
  -s SETUP, --setup=SETUP
                        Code to execute before the code to profile
  -v, --view            View the results of the profile in addition to saving
                        it.

The results will be printed on the console as:

Total time: 17.6699 s
File: main.py
Function: fun_a at line 5

Line #      Hits         Time  Per Hit   % Time  Line Contents
==============================================================
    5                                           @profile
    6                                           def fun_a():
...

EDIT: The results from the profilers can be parsed using the TAMPPA package. Using it, we can get line-by-line desired plots as plot

The instructions are accurate but the graph is misleading, because `line_profiler` does not profile memory usage (`memory_profiler` does, but it often fails). I'd recommend using (my) Scalene profiler instead, if you're on Mac OS X or Linux: `pip install -U scalene`, https://github.com/emeryberger/scalene -- it simultaneously does line-level profiling of CPU time and memory (and more!). — EmeryBerger, Aug 16 '20 at 13:54
Hello @emeryberger, the plot shown is done by a new package: TAMPPA. although its subject to issues. I know there are many ways. Thank you for sharing one. I would recommend submitting a detailed answer here :) Have you submitted an issue for 'memory_profiler' ? — Pe Dro, Aug 16 '20 at 15:17
I can get Scalene profiler to work, could you provide an exmaple? — MBV, Jan 29 '22 at 17:58

Sateesh · Answer 4 · 2020-05-14T15:55:04.687

You can take help of line_profiler package for this

1. 1st install the package:

    pip install line_profiler

2. Use magic command to load the package to your python/notebook environment

    %load_ext line_profiler

3. If you want to profile the codes for a function then
do as follows:

    %lprun -f demo_func demo_func(arg1, arg2)

you will get a nice formatted output with all the details if you follow these steps :)

Line #      Hits      Time    Per Hit   % Time  Line Contents
 1                                           def demo_func(a,b):
 2         1        248.0    248.0     64.8      print(a+b)
 3         1         40.0     40.0     10.4      print(a)
 4         1         94.0     94.0     24.5      print(a*b)
 5         1          1.0      1.0      0.3      return a/b

score 1 · Answer 5 · answered Apr 28 '15 at 23:12

1

PyVmMonitor has a live-view which can help you there (you can connect to a running program and get statistics from it).

See: http://www.pyvmmonitor.com/

answered Apr 28 '15 at 23:12

Fabio Zadrozny

24,256
4
63
75

How can I profile Python code line-by-line?

5 Answers5

Installation:

Usage:

Linked

Related