Benchmarking, profiling and optimizing (II)

Profiling

Scalene

Scalene is a sampling profiler. In addition to timings, it can also give insight into:

CPU time spent in Python (interpreted), native (compiled) and system function calls
Memory usage and copy volume
GPU utilization
Memory leak detection

Moreover, profiling with Scalene adds little overhead. The downside is that, because it is a sampling profiler, the results vary somewhat from run to run.
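To illustrate why sampling is cheap but not perfectly reproducible, here is a toy sampler in plain Python. It is only a sketch of the general idea and not how Scalene is implemented. The names sample_thread, busy and profile_once are made up for this example, and it relies on sys._current_frames(), which is specific to CPython.

```python
import collections
import sys
import threading


def sample_thread(thread_id, counts, stop_event, interval=0.001):
    """Record which function `thread_id` is executing, roughly once per `interval` seconds."""
    while True:
        # Look up the frame currently being executed by the profiled thread
        frame = sys._current_frames().get(thread_id)
        if frame is not None:
            counts[frame.f_code.co_name] += 1
        # wait() returns True once stop_event is set, which ends the loop
        if stop_event.wait(interval):
            break


def busy(n):
    """A CPU-bound toy workload (hypothetical, for illustration only)."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def profile_once(n=2_000_000):
    """Run busy(n) while a background thread samples the calling thread."""
    counts = collections.Counter()
    stop_event = threading.Event()
    sampler = threading.Thread(
        target=sample_thread,
        args=(threading.get_ident(), counts, stop_event),
    )
    sampler.start()
    busy(n)
    stop_event.set()
    sampler.join()
    return counts


if __name__ == "__main__":
    # The sample counts usually differ from run to run
    for run in range(3):
        print(run, dict(profile_once()))
```

The workload itself is never instrumented, which keeps the overhead low. The cost is that the counts depend on when the samples happen to be taken, so repeated runs rarely agree exactly.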

Scalene can be used as a CLI tool, through IPython magic commands, or through a web interface as an interactive widget. Here are some examples of profiling walk.py with Scalene.

CLI tool

$ scalene --cli walk.py

IPython magic

The IPython magic makes it possible to profile a specific function. For example, to profile just walk:

In [1]: %load_ext scalene

In [2]: %run walk.py

In [3]: %scrun --cli walk(n)

This gives the following output:

SCRUN MAGIC
                              /home/ashwinmo/Sources/enccs/hpda-python/content/example/walk.py: % of time = 100.00% (1.933s) out of 1.933s.
       ╷       ╷       ╷       ╷       ╷
       │Time   │–––––– │–––––– │–––––– │
  Line │Python │native │system │GPU    │/home/ashwinmo/Sources/enccs/hpda-python/content/example/walk.py
╺━━━━━━┿━━━━━━━┿━━━━━━━┿━━━━━━━┿━━━━━━━┿━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╸
│       │       │       │       │"""A 1-D random walk.
│       │       │       │       │
│       │       │       │       │See also:
│       │       │       │       │- https://lectures.scientific-python.org/intro/numpy/auto_examples/plot_randomwalk.html
│       │       │       │       │
│       │       │       │       │"""
│       │       │       │       │import numpy as np
│       │       │       │       │
│       │       │       │       │
│    6% │       │       │       │def step():
│       │       │       │       │    import random
│    7% │   64% │  13%  │       │    return 1.0 if random.random() > 0.5 else -1.0
│       │       │       │       │
│       │       │       │       │
│       │       │       │       │def walk(n: int, dx: float = 1.0):
│       │       │       │       │    """The for-loop version.
│       │       │       │       │
│       │       │       │       │    Parameters
│       │       │       │       │    ----------
│       │       │       │       │    n: int
│       │       │       │       │        Number of time steps
│       │       │       │       │
│       │       │       │       │    dx: float
│       │       │       │       │        Step size. Default step size is unity.
│       │       │       │       │
│       │       │       │       │    """
│       │       │       │       │    xs = np.zeros(n)
│       │       │       │       │
│       │       │       │       │    for i in range(n - 1):
│       │       │       │       │        x_new = xs[i] + dx * step()
│    7% │       │       │       │        xs[i + 1] = x_new
│       │       │       │       │
│       │       │       │       │    return xs
│       │       │       │       │
│       │       │       │       │
│       │       │       │       │def walk_vec(n: int, dx: float = 1.0):
│       │       │       │       │    """The vectorized version of :func:`walk` using numpy functions."""
│       │       │       │       │    import random
│       │       │       │       │    steps = np.array(random.sample([1, -1], k=n, counts=[10 * n, 10 * n]))
│       │       │       │       │
│       │       │       │       │    # steps = np.random.choice([1, -1], size=n)
│       │       │       │       │
│       │       │       │       │    dx_steps = dx * steps
│       │       │       │       │
│       │       │       │       │    # set initial condition to zero
│       │       │       │       │    dx_steps[0] = 0
│       │       │       │       │    # use cumulative sum to replicate time evolution of position x
│       │       │       │       │    xs = np.cumsum(dx_steps)
│       │       │       │       │
│       │       │       │       │    return xs
│       │       │       │       │
│       │       │       │       │
│       │       │       │       │if __name__ == "__main__":
│       │       │       │       │    n = 1_000_000
│       │       │       │       │    _ = walk(n)
│       │       │       │       │    _ = walk_vec(n)
│       │       │       │       │
       │       │       │       │       │
╶──────┼───────┼───────┼───────┼───────┼─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╴
       │       │       │       │       │function summary for /home/ashwinmo/Sources/enccs/hpda-python/content/example/walk.py
│   14% │   69% │   9%  │       │step
│    7% │       │       │       │walk
       ╵       ╵       ╵       ╵

If you run the magic command in Jupyter, you can use %scrun walk(n) instead, and it should give an output similar to the web interface below.
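In the function summary above, step accounts for most of the sampled time, and walk calls it once per time step. One possible response to such a profile is to draw all the steps in a single call instead. The sketch below shows that idea using only the standard library rather than NumPy. Here walk_loop mirrors walk from walk.py with a list in place of the array, and walk_batched is a hypothetical name that does not appear in walk.py. Timings depend on the machine, so none are quoted here.

```python
import itertools
import random
import time


def step():
    return 1.0 if random.random() > 0.5 else -1.0


def walk_loop(n, dx=1.0):
    """Same structure as walk() in walk.py, with a list instead of an array."""
    xs = [0.0] * n
    for i in range(n - 1):
        xs[i + 1] = xs[i] + dx * step()
    return xs


def walk_batched(n, dx=1.0):
    """Draw all n - 1 steps in one call, then take a running sum."""
    if n <= 0:
        return []
    steps = random.choices((dx, -dx), k=n - 1)
    # initial=0.0 puts the starting position first, giving n positions in total
    return list(itertools.accumulate(steps, initial=0.0))


if __name__ == "__main__":
    n = 1_000_000
    for func in (walk_loop, walk_batched):
        start = time.perf_counter()
        func(n)
        print(f"{func.__name__}: {time.perf_counter() - start:.3f} s")
```

Profiling both variants with Scalene in the same way as above is a reasonable way to check whether the change actually helps.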

Web interface

Running

$ scalene walk.py

opens up the following web app: