The Basics of TOAD

Have fun! 🎉

# Prerequisites
import matplotlib.pyplot as plt
import xarray as xr

# import TOAD
from toad import TOAD
import toad

plt.rcParams["figure.dpi"] = 300
plt.rcParams["figure.figsize"] = (12, 6)

Test data

Quick look at PISM simulation data of ice-sheet melt.

xr.load_dataset("test_data/garbe_2020_antarctica.nc").thk.sel(
    GMST=[0, 2, 4, 6, 8, 10], method="nearest"
).plot(col="GMST", cmap="RdBu_r")
<xarray.plot.facetgrid.FacetGrid at 0x3186e95b0>
../_images/fb42c4394d579fd8ca510135e56cc36ec0dea84acbf3d3f7fc8becc23ea76ce1.png

Init the TOAD object from an .nc file

TOAD works on spatio-temporal data. If your time dimension is not called “time”, you need to tell TOAD, as done below:

td = TOAD("test_data/garbe_2020_antarctica.nc", time_dim="GMST")
td.data = td.data.drop_vars(td.shift_vars)  # drop any previously computed shift variables

Step 1: Compute shifts

from toad.shifts import ASDETECT

# define variable and method
td.compute_shifts(var="thk", method=ASDETECT(timescale=(0.5, 3.5)))

# ~12 seconds
INFO: New shifts variable thk_dts: min/mean/max=-1.000/-0.227/1.000 using 13901 grid cells. Skipped 61.5% grid cells: 0 NaN, 22199 constant.

Quick illustration of ASDETECT output

fig, axs = plt.subplots(2, sharex=True)
idx, idy = 100, 90
td.data.thk.isel(x=idx, y=idy).plot(ax=axs[0])
td.data.thk_dts.isel(x=idx, y=idy).plot(ax=axs[1])
axs[0].set_ylabel("Variable (ice-thickness)")
axs[1].set_ylabel("Detection Time Series")
axs[1].set_ylim(-1.25, 1.25)
axs[0].set_title("")
axs[1].set_title("")
axs[0].set_xlabel("")
fig.tight_layout()
../_images/553307baf6479f4293f72f35a406873b4a77c090e1377472a751cfa7d726be54.png
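ASDETECT scans the series for abrupt gradient changes and returns a detection time series in [-1, 1], with peaks marking candidate shifts, as the lower panel above shows. A rough conceptual sketch of gradient-based detection (NOT the actual ASDETECT algorithm, which scans a range of window lengths; here a single fixed window and a simple max-abs normalization are assumed):

```python
# Conceptual sketch of gradient-based shift detection (NOT the real ASDETECT,
# which scans multiple window lengths; single window + max-abs normalization
# assumed here for illustration only).
def detection_series(ts, window=2):
    n = len(ts)
    grads = [0.0] * n
    for i in range(n):
        lo, hi = max(0, i - window), min(n - 1, i + window)
        grads[i] = (ts[hi] - ts[lo]) / (hi - lo)  # local gradient
    peak = max(abs(g) for g in grads) or 1.0
    return [g / peak for g in grads]  # normalized to [-1, 1]

# A flat series with one abrupt jump: the detection series peaks at the jump.
series = [0.0] * 10 + [5.0] * 10
dts = detection_series(series)
```

The normalized output is flat away from the jump and saturates at 1.0 around it, which is the qualitative shape visible in the detection panel above.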

Step 2: Compute Clusters

  • Now we need a clustering method: TOAD accepts any clustering method from the scikit-learn library.

  • We recommend using HDBSCAN which has intuitive hyperparameters and supports clusters of different densities. Tip: For large datasets fast_hdbscan can be significantly faster.

  • You can also write your own clustering algorithm, see clustering_methods.ipynb for details.

from sklearn.cluster import HDBSCAN

td.data = td.data.drop_vars(td.cluster_vars)  # drop any previously computed cluster variables

td.compute_clusters(
    var="thk",  # reference your base variable, TOAD will find computed shifts for this variable
    method=HDBSCAN(min_cluster_size=15),
    time_weight=2.0,
)
INFO: New cluster variable thk_dts_cluster: Identified 79 clusters in 16,636 pts; Left 30.0% as noise (4,987 pts).
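The time_weight parameter controls how strongly the time coordinate counts relative to the spatial coordinates during clustering. Conceptually (a hypothetical feature construction; TOAD's internal scaling may differ), each detected shift becomes a point like (x, y, t · time_weight), so a larger weight pushes events apart along the time axis and makes clusters separate more readily in time:

```python
import math

# Hypothetical illustration of time weighting (TOAD's internal feature
# scaling may differ): two shift events at the same location but different
# times grow farther apart in feature space as time_weight increases.
def feature_distance(p, q, time_weight=1.0):
    (x1, y1, t1), (x2, y2, t2) = p, q
    return math.sqrt((x1 - x2) ** 2 + (y1 - y2) ** 2
                     + (time_weight * (t1 - t2)) ** 2)

a, b = (0.0, 0.0, 1.0), (0.0, 0.0, 3.0)
d1 = feature_distance(a, b, time_weight=1.0)  # 2.0
d2 = feature_distance(a, b, time_weight=2.0)  # 4.0
```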

Calling the TOAD object gives you an overview of current computed variables:

For the “base variable” thk, we now have “shifts variable” thk_dts and “cluster variable” thk_dts_cluster.

td

TOAD Object

Variable Hierarchy:

base var thk (1 shifts, 1 clusterings)

Hint: to access the xr.dataset call td.data

<xarray.Dataset> Size: 152MB
Dimensions:          (GMST: 350, y: 190, x: 190)
Coordinates:
  * GMST             (GMST) float64 3kB 0.0301 0.0701 0.1101 ... 13.95 13.99
  * y                (y) float64 2kB -3.032e+06 -3e+06 ... 2.984e+06 3.016e+06
  * x                (x) float64 2kB -3.032e+06 -3e+06 ... 2.984e+06 3.016e+06
Data variables:
    thk              (GMST, y, x) float32 51MB 0.0 0.0 0.0 0.0 ... 0.0 0.0 0.0
    thk_dts          (GMST, y, x) float32 51MB nan nan nan nan ... nan nan nan
    thk_dts_cluster  (GMST, y, x) int32 51MB -1 -1 -1 -1 -1 ... -1 -1 -1 -1 -1
Attributes: (12/13)
    CDI:            Climate Data Interface version 1.9.6 (http://mpimet.mpg.d...
    proj4:          +lon_0=0.0 +ellps=WGS84 +datum=WGS84 +lat_ts=-71.0 +proj=...
    CDO:            Climate Data Operators version 1.9.6 (http://mpimet.mpg.d...
    source:         PISM (development v1.0-535-gb3de48787 committed by Julius...
    institution:    PIK / Potsdam Institute for Climate Impact Research
    author:         Julius Garbe (julius.garbe@pik-potsdam.de)
    ...             ...
    title:          garbe_2020_antarctica
    Conventions:    CF-1.9
    projection:     Polar Stereographic South (71S,0E)
    ice_density:    910. kg m-3
    NCO:            netCDF Operators version 4.7.8 (Homepage = http://nco.sf....
    Modifications:  Modified by Jakob Harteg (jakob.harteg@pik-potsdam.de) Se...
# You can find all method parameters used for the computation in the attributes, e.g.:
td.data.thk_dts.attrs
{'standard_name': 'land_ice_thickness',
 'long_name': 'land ice thickness',
 'units': 'm',
 'pism_intent': 'model_state',
 'time_dim': 'GMST',
 'method_name': 'ASDETECT',
 'toad_version': '0.3',
 'base_variable': 'thk',
 'variable_type': 'shift',
 'method_ignore_nan_warnings': 'False',
 'method_lmax': '87',
 'method_lmin': '12',
 'method_segmentation': 'two_sided',
 'method_timescale': '(0.5, 3.5)'}
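Note that the attribute values come back as strings (a netCDF limitation), so a parameter like method_timescale is stored as '(0.5, 3.5)'. If you need the Python values again, the standard-library ast.literal_eval recovers tuples and numbers safely (a generic pattern, not a TOAD helper):

```python
import ast

# netCDF attributes are strings; recover Python values with ast.literal_eval.
# (Generic pattern, not a TOAD API; values copied from the attrs shown above.)
attrs = {"method_timescale": "(0.5, 3.5)", "method_lmin": "12", "method_lmax": "87"}
timescale = ast.literal_eval(attrs["method_timescale"])  # (0.5, 3.5)
lmin, lmax = int(attrs["method_lmin"]), int(attrs["method_lmax"])
```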

Let’s plot the result:

TOAD supplies three plotting functions:

  • td.plot.overview()

  • td.plot.cluster_map()

  • td.plot.timeseries()

td.plot.overview();
../_images/6a3279422bada6ab60bb66c35cdf07bc65d58623e2bfc421a65666f682698a92.png
td.plot.overview(mode="aggregated");
../_images/d845bda51f68f415b0d0d476ed9e9e09b7f849328f93820e4402699a10da9275.png
td.plot.timeseries(cluster_ids=[1, 5]);
../_images/3ad5835cfd83110d7972494632c88385ca6ba373ebf8aed6a88c21530714bd89.png

Customize the map with the map_style param.

fig, ax = td.plot.cluster_map(
    cluster_ids=[1, 5],
    include_all_clusters=False,
    map_style=toad.MapStyle(projection="south_pole", grid_lines=False),
)
ax.set_axis_off()
../_images/2ac17978fb69f5d7069e8aa8c97ea30709f1a2a65e134123f13d1406eeb78387.png

Save your results

Save the dataset with the new additions using the td.save() function. This automatically adds compression.

# Use original path name with suffix
# td.save(suffix="_my_computed_toad_data")

# Use a different path name
# td.save(path="my_computed_toad_data.nc")
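td.save() adds the compression for you. If you instead save manually with xarray, the equivalent is the standard netCDF encoding options (zlib compression, level 1-9); the helper below just builds the per-variable encoding dict:

```python
# Build a per-variable netCDF encoding dict enabling zlib compression
# (standard xarray/netCDF4 encoding options; td.save() handles this for you).
def compression_encoding(var_names, complevel=4):
    return {name: {"zlib": True, "complevel": complevel} for name in var_names}

encoding = compression_encoding(["thk", "thk_dts", "thk_dts_cluster"])
# Manual alternative to td.save():
# td.data.to_netcdf("my_computed_toad_data.nc", encoding=encoding)
```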

Export cluster masks

mask = td.get_cluster_mask_spatial("thk", cluster_id=1)
mask
<xarray.DataArray 'thk_dts_cluster' (y: 190, x: 190)> Size: 36kB
array([[False, False, False, ..., False, False, False],
       [False, False, False, ..., False, False, False],
       [False, False, False, ..., False, False, False],
       ...,
       [False, False, False, ..., False, False, False],
       [False, False, False, ..., False, False, False],
       [False, False, False, ..., False, False, False]], shape=(190, 190))
Coordinates:
  * y        (y) float64 2kB -3.032e+06 -3e+06 ... 2.984e+06 3.016e+06
  * x        (x) float64 2kB -3.032e+06 -3e+06 ... 2.984e+06 3.016e+06
mask.plot()  # type: ignore
<matplotlib.collections.QuadMesh at 0x17c2d7860>
../_images/75636a7961fd7a6a5ca506d0844683dcbb8a7653c4b191700355b70695d63ea6.png
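The mask is a regular boolean xarray DataArray, so it composes with standard xarray operations, e.g. td.data.thk.where(mask).mean(dim=("x", "y")) for a cluster-mean time series (.where() is standard xarray). The selection logic reduces to masked averaging; a tiny pure-Python illustration:

```python
# Masked averaging on a tiny 3x3 grid: only cells where the mask is True
# contribute (with xarray this is field.where(mask).mean()).
field = [[1.0, 2.0, 3.0],
         [4.0, 5.0, 6.0],
         [7.0, 8.0, 9.0]]
mask = [[False, True,  False],
        [True,  True,  False],
        [False, False, False]]
selected = [v for frow, mrow in zip(field, mask)
            for v, m in zip(frow, mrow) if m]
cluster_mean = sum(selected) / len(selected)  # (2 + 4 + 5) / 3
```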

Access Timeseries in Clusters

# Get time series from clusters
timeseries = td.get_cluster_timeseries("thk", cluster_id=2)

# get mean of timeseries
mean = td.get_cluster_timeseries("thk", cluster_id=2, aggregation="mean")

# E.g., to plot:
plt.figure(figsize=(10, 2))
for ts in timeseries:
    ts.plot(color="k", alpha=0.12, lw=1)
mean.plot(lw=3, color="r")  # type: ignore
plt.title("Cluster 2")
Text(0.5, 1.0, 'Cluster 2')
../_images/142f8d511489059c024d409d5d5c69dab92bbaa958b21cd96af24cd80155c715.png

Stats

Calling td.stats(var) exposes more statistics-related helper functions via its time and space submodules.

Time stats

# get start time for cluster 5
td.stats(var="thk").time.start(cluster_id=5)
2.1501
# get all stats in dictionary
td.stats("thk").time.all_stats(5)
{'duration': 0.3600000000000003,
 'duration_timesteps': 9,
 'end': 2.5101000000000004,
 'end_timestep': 62,
 'iqr_50': (2.2701000000000002, 2.3501000000000003),
 'iqr_68': (2.2301, 2.3501000000000003),
 'iqr_90': (2.1901, 2.3501000000000003),
 'mean': 2.314544444444445,
 'median': 2.3101000000000003,
 'membership_peak': 2.2701000000000002,
 'membership_peak_density': 0.005318559556786704,
 'start': 2.1501,
 'start_timestep': 53,
 'std': 0.11066577420434438,
 'steepest_gradient': 2.3101000000000003,
 'steepest_gradient_timestep': 57}
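The entries are internally consistent: duration is end - start, and duration_timesteps is end_timestep - start_timestep (values copied from the output above):

```python
# Sanity check on the time stats above: durations follow from start/end.
start, end = 2.1501, 2.5101000000000004
start_step, end_step = 53, 62
duration = end - start                   # ~0.36, matches 'duration'
duration_steps = end_step - start_step   # 9, matches 'duration_timesteps'
```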

Space stats

# Get mean x,y position of cluster 5
td.stats(var="thk").space.mean(cluster_id=5)
(-282661.12266112264, -1269139.293139293)
td.stats("thk").space.all_stats(5)
{'central_point_for_labeling': (-248000.0, -1336000.0),
 'footprint_cumulative_area': 481,
 'footprint_mean': (-282661.12266112264, -1269139.293139293),
 'footprint_median': (-280000.0, -1272000.0),
 'footprint_std': (229008.10144179303, 214851.6466720021),
 'mean': (-282661.12266112264, -1269139.293139293),
 'median': (-280000.0, -1272000.0),
 'std': (229008.101441793, 214851.64667200207)}

Cluster scoring

A few scores are already available that aim to quantify the quality of a cluster.

score_heaviside() evaluates how closely the spatially aggregated cluster time series resembles a perfect Heaviside function. A score of 1 indicates a perfect step function, while 0 indicates a linear trend.

score_consistency() measures internal coherence using hierarchical clustering. Higher scores indicate more internally consistent clusters, which is useful for assessing cluster quality.

score_spatial_autocorrelation() measures average pairwise similarity (R²) between time series within a cluster. Higher scores mean more similar behavior within the cluster. This provides a fast computation of spatial coherence.

score_nonlinearity() measures deviation from linearity using RMSE and can be normalized against unclustered data. Higher scores indicate more nonlinear behavior, which is good for detecting complex temporal patterns.

td.stats("thk").general.score_heaviside(5, aggregation="mean")
0.41722873312015085
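The score of ~0.42 above says the aggregated cluster time series is partly step-like. One simple way to see why a step scores high and a ramp scores low (a hypothetical proxy, NOT TOAD's actual score_heaviside implementation) is to measure the fraction of the series' total range covered by its single largest jump:

```python
# Hypothetical proxy for step-likeness (NOT TOAD's score_heaviside): the
# fraction of the total range covered by the single largest jump.
# A perfect step gives 1.0; a long linear ramp approaches 0.
def step_likeness(ts):
    rng = max(ts) - min(ts)
    if rng == 0:
        return 0.0
    largest_jump = max(abs(ts[i + 1] - ts[i]) for i in range(len(ts) - 1))
    return largest_jump / rng

step = [0.0] * 10 + [1.0] * 10        # perfect step
ramp = [i / 99 for i in range(100)]   # linear ramp
```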

Example 2

Let’s try some different data: mean summer near-surface air temperature.

td2 = TOAD("test_data/global_mean_summer_tas.nc")
td2.data = td2.data.drop_vars(td2.shift_vars)

# Compute shifts
td2.compute_shifts(
    "tas",
    method=ASDETECT(),
)
INFO: New shifts variable tas_dts: min/mean/max=-0.875/0.014/0.979 using 139968 grid cells. Skipped 0.0% grid cells: 0 NaN, 0 constant.
# Compute clusters
td2.compute_clusters(
    "tas",
    method=HDBSCAN(min_cluster_size=20),
)
INFO: New cluster variable tas_dts_cluster: Identified 18 clusters in 13,291 pts; Left 2.5% as noise (329 pts).
# Plot clusters
# Tip: map_style can also be set as a dict, instead of importing the MapStyle class
td2.plot.cluster_map("tas", cluster_ids=range(10), map_style={"projection": "global"});
../_images/612b153807cce8f3f62e6a155a1c5029690a1099e443676eaa23c5a770f5b072.png