Detecting transitions and quantifying differences in two SST datasets using spatial permutation entropy

Gancio, Juan; Tirabassi, Giulio; Masoller, Cristina; Barreiro, Marcelo

doi:10.5194/esd-17-533-2026

Articles | Volume 17, issue 3

https://doi.org/10.5194/esd-17-533-2026

Articles | Volume 17, issue 3

Research article

12 May 2026

Research article |

| 12 May 2026

Detecting transitions and quantifying differences in two SST datasets using spatial permutation entropy

Juan Gancio, Giulio Tirabassi, Cristina Masoller, and Marcelo Barreiro

Abstract

Weather prediction systems rely on the vast amounts of data continuously generated by Earth modeling and monitoring systems, and efficient data analysis techniques are needed to track changes and compare datasets. Here we show that a nonlinear quantifier, the spatial permutation entropy (SPE), is useful to characterize spatio-temporal complex data, allowing detailed analysis at different scales. Specifically, we use SPE to analyze ERA5 and NOAA OI v2 sea surface temperature (SST) anomalies in two key regions, Niño 3.4 and Gulf Stream. We perform a quantitative comparison of these two SST products and find that SPE detects differences at short spatial scales (<1°). We also identify several transitions, including a transition that occurs in 2007 when ERA5 changed its sea–surface boundary condition to OSTIA, in 2013 when OSTIA updated the background error covariances, and in 2021 when NOAA SST changed satellite, from MeteOp-A to MeteOp-C. The robustness and statistical significance of the detected transitions are tested using surrogate data. We demonstrate that, using standard distance and cross-correlation analyses, the transitions are not detected with the same level of statistical significance and robustness as when using ordinal analysis.

Download & links

How to cite.

Received: 02 Oct 2025 – Discussion started: 15 Oct 2025 – Revised: 23 Apr 2026 – Accepted: 28 Apr 2026 – Published: 12 May 2026

1 Introduction

Due to the large amount of data generated by Earth modeling and monitoring systems, much effort is currently being devoted to developing new, efficient climate data analysis techniques (Dijkstra et al., 2019; Messori et al., 2017; Boers et al., 2019; Gupta et al., 2021; Díaz et al., 2023; Krouma et al., 2024; Allen et al., 2025). Ordinal analysis (Bandt and Pompe, 2002) is a popular symbolic method of time-series analysis that has been applied to geophysical data. For example, ordinal analysis was used to study time series of surface air temperature anomalies in a regular grid over the earth's surface (reanalysis data from the National Center for Environmental Prediction/National Center for Atmospheric Research NCEP/NCAR) and uncovered long-range tele-connections across multiple time scales (Barreiro et al., 2011; Deza et al., 2013). The ordinal method is based on estimating the probabilities of symbols, known as ordinal patterns (OPs), defined in terms of the temporal order of the relative values of L data points. As an example, for L=3, triplets of consecutive data values such that $x_{t} < x_{t + 1} < x_{t + 2}$ are encoded in the symbol “012” where the digits represent the rank of the corresponding value within the triplet. The symbols' probabilities are estimated from their frequencies of occurrence within the time series and their Shannon entropy, known as permutation entropy (PE), is a quantifier of nonlinear temporal correlations. PE is low when some OPs are much more probable than others, and maximum when all possible OPs are equally probable (Bandt and Pompe, 2002). Ordinal analysis is computationally very efficient and robust to the presence of artifacts and noise. The use of time-lagged (non-consecutive) data points adds versatility to the method, since it allows to select different temporal scales for the analysis. For example, for analyzing a climatic time series with monthly resolution, the L=3 OPs can be defined by considering data values in three consecutive months (e.g. January, February, March; February, March, April; etc), in three consecutive years, or equally spaced over a period of time (for example, a year) (Deza et al., 2013). An important limitation of the ordinal methodology is that the symbolic coding rule does not take into account the actual values of the data points, but their relative values, and therefore, ordinal analysis gives partial information, complementary to that obtained by using standard time series analysis techniques.

Ordinal analysis was originally proposed for time series analysis and adapted for the analysis of gridded two-dimensional spatial data (Ribeiro et al., 2012), by defining the OPs in terms of the relative values of L grid points. Spatial ordinal analysis is a versatile tool because one can choose different “shapes” and/or different spatial orientations for the symbols. For example, for symbols defined in terms of the data values of L=4 grid points, one can consider squares of 2×2 grid points, a line (horizontal or vertical) of 4 grid points, an “L” composed by 3+1 grid points, etc. Furthermore, the use of spatially lagged grid points allows tuning the spatial scale of the analysis. This spatial lag is an important parameter for the application of this analysis tool in climate science, as the dynamics of our climate involves complex processes and interactions that act at different spatial scales. Ordinal analysis has also been expanded to include more information from the analyzed signals, such as the variance of the data points that define an OP (Fadlallah et al., 2013), their amplitude (Azami and Escudero, 2016), or the dispersion of data points that define different OPs (Politi, 2017). However, ordinal analysis is one of many symbolic techniques available, and in addition to permutation entropy other entropy quantifiers for time series can also provide valuable information (Prado et al., 2020; Falasca et al., 2020; Ikuyajolu et al., 2021; Novi et al., 2024; Paluš et al., 2024).

The spatial permutation entropy (SPE), which is Shannon's entropy estimated from the probabilities of spatial ordinal patterns, has been used to analyze images, art works and textures (Sigaki et al., 2018, 2019; Tirabassi and Masoller, 2023; Tirabassi et al., 2023; Muñoz-Guillermo, 2023; Tarozo et al., 2025). It has also been used to analyze complex spatio-temporal data such as EEG recordings (Boaretto et al., 2023; Gancio et al., 2024) and cardiac synthetic data (Schlemmer et al., 2015, 2018). However, to our knowledge, SPE has not yet been tested on climate data.

Since SPE can be calculated from the relative values of a climate variable at a given time in a particular geographic region, it yields information about nonlinear spatial correlations of that climate variable, in that region, at that time. In contrast, the “temporal” PE of the variable at a particular grid point is calculated from the analysis the variable's time series at that grid point, and therefore, it yields information about nonlinear temporal correlations of that variable, at that grid point.

Our goal is to demonstrate that SPE is a reliable and versatile tool, and specifically, is able to capture subtle differences between datasets and also, changes within the same dataset. We focus on a key variable, sea surface temperature (SST) anomalies, and compare two SST products, ERA5 and NOAA Optimal Interpolation version 2 (NOAA OI v2), in two key regions, the equatorial Pacific and the the Gulf Stream. We show that SPE identifies differences in the datasets in short spatial scales, which can be more or less pronounced over different periods of time. We interpret our findings in terms of changes in the methodologies and data used to construct the SST products.

2 Data

We consider only monthly SST anomalies with respect to the seasonal cycle in the Niño 3.4 region (170–120° W, 5° N–5° S), and in the western north Atlantic (32.5–42.5° N, 67.5–45° W), a box centered on the Gulf Stream (Dong and Kelly, 2004; Parfitt and Czaja, 2016; Vries et al., 2019; Comunian et al., 2026). Both regions are highligthed in Fig. 1a. These regions were chosen not only because of their importance for the global climate, but also because they display different spatio-temporal SST dynamics. SST in the Niño 3.4 region is governed by tropical dynamics, and in particular the SST dynamics results from ocean–atmosphere interactions leading to variability mainly on interannual time scales. On the other hand, the Gulf Stream dynamics, as one of the most intense western boundary currents, is governed by internal ocean dynamics and the extratropical winds across the basin, resulting in SST variability on several time scales, from fast changes due to atmospheric-driven heat fluxes to decadal shifts in spatial structure.

We analyze NOAA Optimal Interpolation version 2 (NOAA OI v2) (Reynolds et al., 2007; Huang et al., 2021), and ERA5 global reanalysis (Hersbach et al., 2020). Both datasets have spatial resolution of 0.25°×0.25°. ERA5 starts in January 1940, while NOAA OI v2 starts in September 1981; both extend to June 2025 (therefore, the NOAA time series have 526 datapoints each, while the ERA5 time series have 1026 datapoints each).

NOAA SST includes observations from ships, drifting and moored buoys, and the Advanced Very High Resolution Radiometer (AVHRR) (Huang et al., 2021) retrieved from NOAA series and MetOp-A/-B satellites by US Navy before November 2021. After this date, NOAA SST switched to the Advanced Clear Sky Processor for Ocean (ACSPO) (Huang et al., 2023; Jonasson et al., 2020) satellite SSTs retrieved from AVHRR and the Visible Infrared Imager Radiometer Suite (VIIRS) (Huang et al., 2023).

ERA5 SST is the combination of HadISST2 (Titchner and Rayner, 2014) up to August 2007 and OSTIA (Donlon et al., 2012) from September 2007 onwards (Hirahara et al., 2016). HadISST2 assimilates in-situ observations as well as two radiometers: AVHRR and the Along Track Scanning Radiometer (ATSR).

OSTIA was originally constructed at a resolution of 0.05° and includes in situ data from various sources, as well as derived from several satellite products including AVHRR and VIIRS. It is worth noting that the higher resolution of OSTIA allows it to better resolve the tropical instability waves and sub-mesoscale eddies in the midlatitudes (Hirahara et al., 2016).

Within the regions of interest (see Fig. 1a), both datasets employ a similar grid (with 40×200 grid points for the Niño3.4, and 40×90 for the Gulf Stream region), the only difference being a small offset of 0.005° both in latitude and longitude.

3 Analysis tools

3.1 Ordinal patterns and spatial permutation entropy

Ordinal analysis is a symbolic data analysis technique proposed by Bandt and Pompe (2002) that has been extensively applied in a wide variety of different scientific fields (Leyva et al., 2022). The application of ordinal analysis to a climatological spatio-temporal dataset is schematically illustrated in Fig. 2. Ordinal analysis takes an ordered series of N data values, $x = {x_{1}, x_{2}, \dots, x_{i}, \dots, x_{N}}$ , and translates it into a sequence of symbols, $s = {s_{1}, s_{2}, \dots, s_{i}, \dots, \dots s_{n}}$ . For example, considering a grid point in the geographical region shown in Fig. 2a, x can be the time series of SST anomalies at this grid point, as schematically shown in Fig. 2b. On the other hand, the ordered series x can be the sequence of values of SST anomalies at time t, along a row (or a line), from right to left (from top to bottom), in the region shown in Fig. 2a, as schematically shown in Fig. 2e and f.

https://esd.copernicus.org/articles/17/533/2026/esd-17-533-2026-f01

Figure 1Panel (a) highlights the regions of interest: Niño 3.4 (in green), and the Gulf Stream (in orange). Panels (b) and (c) show the SST anomaly in the Niño 3.4 region, and panels (d) and (e), in the Gulf Stream region, calculated from ERA5 (b, d) and NOAA OI v2 (c, e) datasets. In panels (b)–(e), the thick lines represent the spatial mean of the anomalies, while the shading indicates the spatial standard deviation.

https://esd.copernicus.org/articles/17/533/2026/esd-17-533-2026-f02

Figure 2Illustration of the procedure used to define ordinal patterns (OPs) and to calculate the permutation entropy (PE) of patterns of length L. (a) Location of a grid point inside the Gulf Stream region. (b) The temporal evolution of NOAA SST anomaly at this grid point (sequence x) is represented by a sequence of “temporal” OPs (sequence s) that are defined by the relative ordering of L=3 consecutive (δ=1) data values. (c) The six possible orderings represent the six possible OPs. (d) The OPs' probabilities are estimated by counting the number of times each OP occurs in the sequence s, and the permutation entropy (PE), H(L,δ), is calculated using Eq. (1). (e) Snapshot of NOAA SST anomalies in the Gulf Stream region in July 2002. A row of this snapshot defines a sequence of consecutive data values, x, along the WE direction, from which spatial OPs are defined using the same rule as in (b). From the probabilities of the OPs defined over all the rows in the snapshot, the value of the entropy, for July 2002, $H_{WE}^{L, δ}$ , is calculated using Eq. (2). Repeating this procedure for the columns (along the NS direction) gives $H_{NS}^{L, δ}$ (Eq. 3). (f) Detail of how the spatial lag, δ, is used to define spatial OPs: The three consecutive values (δ=1) marked in green in the NS direction, $[1.1, - 1.4, 4.1]$ , are represented by pattern “102”, while the three values marked in orange, lagged δ=2 in the WE direction, $[3.2, 4.4, 1.3]$ , are represented by pattern “120”.

The ordinal symbolic transformation requires defining only two parameters: the symbol length, L, and the lag, δ. These parameters are used to associate, to a vector v_i whose components are L data points lagged by δ, a symbol s_i that is known as ordinal pattern (OP),

\begin{array}{l} v_{i} (L, δ) = [x_{i}, x_{i + δ}, x_{i + 2 δ}, \dots, x_{i + (L - 1) δ}] \\ \to s_{i} (L, δ) = [π (i), π (i + δ), \dots, π (i + (L - 1) δ)] . \end{array}

Here π(⋅) is the permutation index that sorts the components of v_i in ascending order: $x_{π (i)} \leq x_{π (i + δ)} \leq \dots \leq x_{π (i + (L - 1) δ)}$ . The number of possible permutations, that is, of possible patterns, grows as L!. The six possible patterns of length L=3 are shown in Fig. 2c and examples are shown in Fig. 2f: the vector $[1.1, - 1.4, 4.1]$ is represented by pattern “102” while $[3.2, 4.4, 1.3]$ is represented by “120”, etc. We remark that the ordinal transformation takes into account relative values, not absolute ones.

Applying this transformation to every v_i(L,δ) with $i = 1, \dots, n = N - (L - 1) δ$ , gives a sequence of n ordinal patterns, $s (L, δ) = {s_{1}, s_{2}, \dots, s_{i}, \dots, \dots s_{n}}$ . Labeling the possible patterns (symbols) from k=1 to k=L! (for L=3, k=1 corresponds to pattern “012”, k=2 to pattern “021”, etc., as illustrated in Fig. 2c), we can count the number of times each pattern appears in s. Let n^k be the number of times the kth pattern appears. Then, the probability of this pattern is $p^{k} (L, δ) = n^{k} / n$ . Figure 2d displays an example of six probabilities of patterns of length L=3. The permutation entropy (PE) is then defined as Shannon's entropy of the probabilities (Bandt and Pompe, 2002):

\begin{matrix} (1) & H (L, δ) = - \frac{1}{\log (L!)} \sum_{k = 1}^{L!} p^{k} (L, δ) \log (p^{k} (L, δ)) . \end{matrix}

The coefficient $1 / \log (L!)$ normalizes H(L,δ) between 0–1 and enables the comparison between values obtained from ordinal pattens of different lengths. A small entropy value is obtained where one pattern predominates (this occurs when the sequence $x = {x_{1}, x_{2}, \dots, x_{i}, \dots, x_{N}}$ is periodic, or when it has a strong trend), whereas a high entropy value is obtained when all symbols are almost equally probable, which normally occurs when the sequence x is fully stochastic.

To estimate the patterns' probabilities with good statistics, the number of symbols, $n = N - (L - 1) δ$ , needs to be much larger than the number of possible patterns, L!. In practical terms, this limits the values of L to the range from 3 to 6. Here, unless explicitly stated, we use L=4 and use different values of δ to tune the spatial scale of the analysis.

3.2 Symbol orientations in 2D spatio-temporal gridded data

The SST anomalies in the two regions of interest are represented as the time evolution of 2-dimensional N×M gridded datasets, X_i,j(t) with $i \in {1, \dots, N}, j \in {1, \dots, M}, t \in {1, \dots, T}$ , where the index i corresponds to different latitudes, the index j to different longitudes, and T is the number of time steps in the series. Given L and δ, at each time step, t, we apply the ordinal transformation using two spatial orientations: in the North–South (NS) direction, the ordered series x is defined from the values, at time t, of the columns of the grid, and in the West–East (WE) direction, from the rows of the grid, see Fig. 2e and f. In each column we can define $N - (L - 1) δ$ OPs, while in each row, we can define $M - (L - 1) δ$ OPs. Defining OPs over all the columns the total number of OPs defined along the NS direction is $M (N - (L - 1) δ)$ , and defining OPs over all the rows, the total number of OPs defined along the WE direction is $N (M - (L - 1) δ)$ . An important advantage of this methodology for the analysis of climatological data is its flexibility, because the OP orientation is not limited to NS and WE directions, but other orientations can be selected for the analysis.

We also remark that L and δ allow tuning the spatial scale of the analysis. In the original application of permutation entropy to time series analysis (Bandt and Pompe, 2002), we can interpret L and δ as parameters that allow a nonlinear embedding of a time series in a L! dimensional space, because at each time, an ordinal pattern can be defined, and the number of possible patterns is L!. In the spatial approach, the parameters L and δ have a similar role: they allow to embed a set of L gridded data points in a L! dimensional space.

From the probabilities of the OPs defined at time t along the NS and WE directions, $p_{WE}^{k} (t)$ and $p_{NS}^{k} (t)$ respectively, we compute two spatial permutation entropies at time t:

\begin{array}{l} (2) & H_{WE}^{L, δ} (t) = - \frac{1}{\log (L!)} \sum_{k = 1}^{L!} p_{WE}^{k} (t) \log (p_{WE}^{k} (t)), \\ (3) & H_{NS}^{L, δ} (t) = - \frac{1}{\log (L!)} \sum_{k = 1}^{L!} p_{NS}^{k} (t) \log (p_{NS}^{k} (t)) . \end{array}

Since SST anomalies vary over time, the ordinal probabilities vary over time and thus, the entropies vary over time. $H_{NS}^{L, δ} (t)$ and $H_{WE}^{L, δ} (t)$ will be close to 1 when there is no spatial order in the data (all OPs are equally probable), and will be <1 when there are spatial structures, such as gradients in the NS or in the WE direction, that make some OPs more or less probable.

In this work, we consider symbols defined by L=4 grid points and a spatial lag up to δ=8, as these are the largest values that allow us, given the size of the two regions studied, to estimate with good statistics the probabilities of the L!=24 possible patterns. Table 1 shows the number of symbols (defined by L=4 grid points in the geographical region analyzed) when a spatial lag of δ=1, 2, 4, and 8 is used. We see that the lowest number (in the Gulf stream with δ=8) is ≫24. We tested the robustness of our results with respect to L, and found similar results with L=3 and L=5 (see Figs. B3–B7 of Appendix B).

Table 1Number of symbols (ordinal patterns), for L=4, defined in the two regions analyzed, for each orientation, and for each spatial lag (δ) considered.

Download Print Version | Download XLSX

3.3 Entropy calculated from the distribution of data values

In this work we also compare the permutation entropy with the conventional way of calculating Shannon entropy of a spatio-temporal field, X_i,j(t), from the distribution of its data values,

\begin{matrix} (4) & H_{hist} (t) = - \frac{1}{\log m} \sum_{k = 1}^{m} p^{k} (t) \log (p^{k} (t)) . \end{matrix}

Here p^k(t) is the probability that X_i,j(t) is in the k state and m is the number of possible states. p^k(t) is estimated from histograms of m bins of equal size, that represent the possible states. H_hist(t)=1 when p^k(t)=1 and $p^{j} (t) = 0 \forall j \neq k$ (only one bin is occupied), and H_hist(t)=0 if $p^{k} (t) = 1 / m \forall k$ (the distribution of values is uniform). To compare with SPE values calculated with ordinal patterns of length L, we select the number of bins equal to the number of possible ordinal patterns, m=L!. To calculate H_hist(t) all the data points of X_i,j(t) in the regions of interest are used. As explained in Sect. 2, ERA5 and NOAA have 40×200 grid points in the Niño region and 40×90 in the Gulf Stream region.

3.4 Distance and cross-correlations measures used to compare SST-ERA5 and SST-NOAA

In this work we demonstrate that spatial permutation entropy can detect differences in ERA5 and NOAA SST products, and an important question is whether such differences can also be detected using standard correlation or distance measures. Therefore, we also compare SST anomaly values using the Average Absolute Difference (AAD), the Pearson's spatial cross-correlation coefficient (r), and the Spatial Mutual Information (SMI), which is a non-linear cross-correlation measure (Celik, 2016; Kumar and Bhandari, 2022).

The Average Absolute Difference is defined as:

\begin{matrix} (5) & AAD (t) = {〈|X_{i, j} (t) - Y_{i, j} (t)|〉}_{i, j}, \end{matrix}

where X_i,j(t) and Y_i,j(t) represent the two gridded datasets, ERA5 and NOAA, and ${〈\cdot〉}_{i, j}$ represents the spatial average in the analyzed region, at time t.

Pearson's spatial cross-correlation coefficient is defined as:

\begin{matrix} (6) & r (t) = \frac{σ_{X, Y} (t)}{σ_{X} (t) σ_{Y} (t)}, \end{matrix}

where σ_X(t), σ_Y(t) and σ_X,Y(t) are the spatial standard deviations and the spatial covariance of X_i,j(t) and Y_i,j(t) at time t. To calculate ADD and r, all the data points of X_i,j(t) and Y_i,j(t) in the regions of interest are used. As explained in Sect. 2, ERA5 and NOAA both have 40×200 grid points in the Niño region and 40×90 in the Gulf Stream region.

The spatial mutual information (SMI) of X_i,j(t) and Y_i,j(t) at time t is defined as:

\begin{matrix} (7) & SMI (t) = H_{X} (t) + H_{Y} (t) - H_{X, Y} (t), \end{matrix}

where $H_{X} (t) = - \sum_{k = 1}^{m_{X}} p_{X}^{k} \log (p_{X}^{k})$ and $H_{Y} (t) = - \sum_{l = 1}^{m_{Y}} p_{Y}^{l} \log (p_{Y}^{l})$ are the Shannon entropies of X_i,j(t) and Y_i,j(t), and H_X,Y(t) denote their joint entropy,

\begin{matrix} (8) & H_{X, Y} (t) = - \sum_{k = 1}^{m_{X}} \sum_{l = 1}^{m_{Y}} p^{k, l} (t) \log (p^{k, l} (t)), \end{matrix}

where m_X and m_Y are the numbers of possible states of X and Y respectively, and p^k,l(t) denotes the joint probability that X_i,j(t) is in state k and Y_i,j(t) in state l.

We calculate SMI using two approaches: the conventional one, in which the distributions of values of X_i,j(t) and Y_i,j(t) are divided in an equal number of bins, $m_{X} = m_{Y} = m$ , of equal size, that represent the possible states, and the ordinal approach, in which the possible states are the possible ordinal patterns, defined over Lδ-lagged gridded points, aligned along the NS or WE directions. Then, in the ordinal approach, $m_{X} = m_{Y} = L!$ . We refer to the mutual information calculated in these ways as SMI_hist(t), ${SMI}_{NS}^{L, δ} (t)$ and ${SMI}_{WE}^{L, δ} (t)$ .

Since for L=4 OPs there are $m = L! = 24$ possible patterns, which are the possible “states”, to calculate SMI_hist the probabilities $p_{X}^{k} (t)$ and $p_{Y}^{l} (t)$ were estimated from histograms with m=24 bins of equal size, and the joint probability, p^k,l(t), was estimated from 2D histograms with 24×24 bins of equal size. In this way, the SMI values were obtained from probabilities defined over the same number of possible states. To calculate all the histograms, all the data points of in the regions of interest were used. To test the robustness of the results, we also calculated ${SMI}_{NS}^{L, δ} (t)$ and ${SMI}_{WE}^{L, δ} (t)$ using ordinal patterns of length L=3 and compared with SMI_hist(t) using histograms with 6 bins and joint histograms with 6×6 bins, and obtained very similar results (shown in Fig. B7 of Appendix B).

4 Results

4.1 Detecting transitions with spatial permutation entropy analysis

Figure 3 displays the temporal evolution of the spatial permutation entropy, calculated from the probabilities of L=4 OPs defined from the values of SST anomalies in neighboring grid point (i.e. the spatial lag is δ=1). Panels (a) and (d) display $H_{NS}^{L = 4, δ = 1}$ for the two datasets analyzed, ERA5 in blue and NOAA OI v2 in red, in the two regions analyzed: panel (a) corresponds to the Niño 3.4 region, and panel (c), to the Gulf Stream region. Results are presented from the beginning of the datasets (1940 for ERA5 and 1981 for NOAA OI v2). Panels (b) and (e), instead, display $H_{WE}^{L = 4, δ = 1}$ , also in the two regions and for the two datasets under analysis. For comparison, panels (c) and (f) show the entropy calculated from the distribution of data values, H_hist. As explained in Sect. 3.3, to estimate the distribution we use histograms with 24 equal bins.

https://esd.copernicus.org/articles/17/533/2026/esd-17-533-2026-f03

Figure 3Entropies of (a–c) Niño 3.4 region; (d–f) Gulf Stream region. Panels (a), (b), (d), and (e) show the spatial permutation entropy calculated with spatial lag δ=1, while panels (c) and (f) show the usual entropy obtained from the distributions of the SST anomaly values. Blue lines correspond to the entropy from the ERA5 dataset, red lines to the entropy from the NOAA OI v2 dataset, and the black arrows indicate the change points detected by the PELT algorithm in the ERA5 dataset.

Detecting transitions and quantifying differences in two SST datasets using spatial permutation entropy

3.1 Ordinal patterns and spatial permutation entropy

3.2 Symbol orientations in 2D spatio-temporal gridded data

3.3 Entropy calculated from the distribution of data values

3.4 Distance and cross-correlations measures used to compare SST-ERA5 and SST-NOAA

4.1 Detecting transitions with spatial permutation entropy analysis

4.2 Comparison between ERA5 and NOAA datasets

A1 Implementation

A2 Summary of detected points