Why Gemfury? Push, build, and install  RubyGems npm packages Python packages Maven artifacts PHP packages Go Modules Debian packages RPM packages NuGet packages

Repository URL to install this package:

Details    
dask / docs / source / dashboard.rst
Size: Mime:
Dashboard Diagnostics
=====================

.. meta::
   :description: The interactive Dask dashboard provides numerous diagnostic plots for live monitoring of your Dask computation. It includes information about task runtimes, communication, statistical profiling, load balancing, memory use, and much more.

.. raw:: html

    <iframe width="560"
            height="315"
            src="https://www.youtube.com/embed/N_GqzcuGLCY"
            frameborder="0"
            allow="autoplay; encrypted-media"
            style="margin: 0 auto 20px auto; display: block;"
            allowfullscreen>
    </iframe>

Profiling parallel code can be challenging, but the interactive dashboard provided with
Dask's :doc:`distributed scheduler <scheduling>` makes this easier with live monitoring
of your Dask computations. The dashboard is built with `Bokeh <https://docs.bokeh.org>`_
and will start up automatically, returning a link to the dashboard whenever the scheduler is created.

Locally, this is when you create a :class:`Client <distributed.client.Client>` and connect the scheduler:

.. code-block:: python

   from dask.distributed import Client
   client = Client()  # start distributed scheduler locally.

In a Jupyter Notebook or JupyterLab session displaying the client object will show the dashboard address:

.. figure:: images/dashboard_link.png
    :alt: Client html repr displaying the dashboard link in a JupyterLab session.

You can also query the address from ``client.dashboard_link`` (or for older versions of distributed, ``client.scheduler_info()['services']``).

By default, when starting a scheduler on your local machine the dashboard will be served at ``http://localhost:8787/status``. You can type this address into your browser to access the dashboard, but may be directed 
elsewhere if port 8787 is taken. You can also configure the address using the ``dashboard_address``
parameter (see :class:`LocalCluster <distributed.deploy.local.LocalCluster>`).

There are numerous diagnostic plots available. In this guide you'll learn about some
of the most commonly used plots shown on the entry point for the dashboard:

- :ref:`dashboard.memory`: Cluster memory and Memory per worker
- :ref:`dashboard.proc-cpu-occ`:  Tasks being processed by each worker/ CPU Utilization per worker/ Expected runtime for all tasks currently on a worker.
- :ref:`dashboard.task-stream`: Individual task across threads.
- :ref:`dashboard.progress`: Progress of a set of tasks.

.. figure:: images/dashboard_status.png
    :alt: Main dashboard with five panes arranged into two columns. In the left column there are three bar charts. The top two show total bytes stored and bytes per worker. The bottom has three tabs to toggle between task processing, CPU utilization, and occupancy. In the right column, there are two bar charts with corresponding colors showing task activity over time, referred to as task stream and progress.

.. _dashboard.memory: 

Bytes Stored and Bytes per Worker
---------------------------------
These two plots show a summary of the overall memory usage on the cluster (Bytes Stored),
as well as the individual usage on each worker (Bytes per Worker). The colors on these plots 
indicate the following.  
     
.. raw:: html

    <table>
        <tr>
            <td>
                <div role="img" aria-label="blue square" style="color:rgba(0, 0, 255, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td>Memory under target (default 60% of memory available) </td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="orange square" style="color:rgba(255, 165, 0, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td> Memory is close to the spilling to disk target (default 70% of memory available)</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label=" grey square" style="color:rgba(128, 128, 128, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td>Memory spilled to disk</td>
        </tr>
    </table>

.. figure:: images/dashboard_memory.png
    :alt: Two bar charts on memory usage. The top chart shows the total cluster memory in a single bar with mostly under target memory in blue and a small part of spilled to disk in grey. The bottom chart displays the memory usage per worker, with a separate bar for each of the 16 workers. The first four bars are orange as their worker's memory are close to the spilling to disk target, with the first worker standing out with a portion in grey that correspond to the amount spilled to disk. The remaining workers are all under target showing blue bars.  

The different levels of transparency on these plot is related to the type of memory 
(Managed, Unmanaged and Unmanaged recent), and you can find a detailed explanation of them in the
`Worker Memory management documentation <https://distributed.dask.org/en/latest/worker.html#memory-management>`_


.. _dashboard.proc-cpu-occ:

Task Processing/CPU Utilization/Occupancy
-----------------------------------------

**Task Processing** 

The *Processing* tab in the figure shows the number of tasks being processed by each worker with the blue bar. The scheduler will
try to ensure that the workers are processing the same number of tasks. If one of the bars is completely white it means that 
worker has no tasks and its waiting for them. This usually happens when the computations are close to finished (nothing 
to worry about), but it can also mean that the distribution of the task across workers is not optimized. 

There are three different colors that can appear in this plot:

.. raw:: html

    <table>
        <tr>
            <td>
                <div role="img" aria-label="blue square" style="color:rgba(0, 0, 255, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td>Processing tasks.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="green square" style="color:rgba(0, 128, 0, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td>Saturated: It has enough work to stay busy.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="red square" style="color:rgba(255, 0, 0, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td>Idle: Does not have enough work to stay busy.</td>
        </tr>
    </table>

.. figure:: images/dashboard_task_processing.png
    :alt: Task Processing bar chart, showing a relatively even number of tasks on each worker.

In this plot on the dashboard we have two extra tabs with the following information:

**CPU Utilization**

The *CPU* tab shows the cpu usage per-worker as reported by ``psutils`` metrics. 

**Occupancy**

The *Occupancy* tab shows the occupancy, in time, per worker. The total occupancy for a worker is the total expected runtime
for all tasks currently on a worker. For example, an occupancy of 10s means an occupancy of 10s means that the worker 
estimates it will take 10s to execute all the tasks it has currently been assigned.

.. _dashboard.task-stream:

Task Stream
-----------

The task stream is a view of all the tasks across worker-threads. Each row represents a thread and each rectangle represents 
an individual tasks. The color for each rectangle corresponds to the task-prefix of the task being performed and it matches the color 
of the *Progress* plot (see Progress section). This means that all the individual tasks part of the `inc` task-prefix for example, will have 
the same randomly assigned color from the viridis color map. 

There are certain colors that are reserved for a specific kinds of tasks:

.. raw:: html

    <table>
        <tr>
            <td>
                <div role="img" aria-label="light red square" style="color:rgba(255, 0, 0, 0.4); font-size: 25px ">&#9632;</div>
            </td>
            <td>Transferring data between workers tasks.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="light orange square" style="color: rgba(255,165,0, 0.4); font-size: 25px ">&#9632;</div>
            </td>
            <td>Reading from or writing to disk.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="light grey square" style="color:rgba(128,128,128, 0.4); font-size: 25px ">&#9632;</div>
            </td>
            <td>Serializing/deserializing data.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="black square" style="color:rgba(0, 0, 0, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td>Erred tasks.</td>
        </tr>
    </table>


In some scenarios the dashboard will have white spaces between each rectangle, this means that during that time the worker-thread
is idle. Having too much white and red is an indication of not optimal use of resources.

.. figure:: images/dashboard_taskstream_healthy.png
    :alt: An example of a healthy Task Stream, with little to no red or white space. The stacked bar chart, with one bar per worker-thread, has different shades of blue and green for different tasks, with some red.

.. figure:: images/dashboard_task_stream_unhealthy.png
    :alt:  An example of an unhealthy Task Stream, the bar chart shows a lot of white space and red rectangles, and also some orange.


.. _dashboard.progress:

Progress
--------

The progress bars plot shows the progress of each individual task-prefix. The color of the of each bar matches the color of the 
individual tasks on the task stream that correspond to the same task-prefix. Each horizontal bar has three different components:

.. raw:: html

    <table>
        <tr>
            <td>
                <div role="img" aria-label="light grey square" style="color:rgba(128,128,128, 0.4); font-size: 25px ">&#9632;</div>
            </td>
            <td>Tasks that are ready to run.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="teal square" style="color:rgba(30,151,138, 1); font-size: 25px ">&#9632;</div>
            </td>
            <td> Tasks that have been completed and are in memory.</td>
        </tr>
        <tr>
            <td>
                <div role="img" aria-label="light teal square" style="color:rgba(30,151,138, 0.6); font-size: 25px ">&#9632;</div>
            </td>
            <td>Tasks that have been completed, been in memory and have been released.</td>
        </tr>
    </table>

.. figure:: images/dashboard_progress.png
    :alt: Progress bar chart with one bar for each task-prefix matching with the names "add", "double", "inc", and "sum". The "double", "inc" and "add" bars have a progress of approximately one third of the total tasks, displayed in their individual color with different transparency levels. The "double" and "inc" bars have a grey background, and the "sum" bar is empty.
 

Dask JupyterLab Extension
--------------------------

The `JupyterLab Dask extension <https://github.com/dask/dask-labextension#dask-jupyterlab-extension>`__  
allows you to embed Dask's dashboard plots directly into JupyterLab panes. 

Once the JupyterLab Dask extension is installed you can choose any of the individual plots available and 
integrated as a pane in your JupyterLab session. For example, in the figure below we selected the *Task Stream*, 
*Progress*, *Workers Memory*, and *Graph* plots. 

.. figure:: images/dashboard_jupyterlab.png
    :alt: Dask JupyterLab extension showing an arrangement of four panes selected from a display of plot options. The panes displayed are the Task stream, Bytes per worker, Progress and the Task Graph.