the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Detecting transitions and quantifying differences in two SST datasets using spatial permutation entropy
Giulio Tirabassi
Cristina Masoller
Marcelo Barreiro
Weather prediction systems rely on the vast amounts of data continuously generated by Earth modeling and monitoring systems, and efficient data analysis techniques are needed to track changes and compare datasets. Here we show that a nonlinear quantifier, the spatial permutation entropy (SPE), is useful to characterize spatio-temporal complex data, allowing detailed analysis at different scales. Specifically, we use SPE to analyze ERA5 and NOAA OI v2 sea surface temperature (SST) anomalies in two key regions, Niño 3.4 and Gulf Stream. We perform a quantitative comparison of these two SST products and find that SPE detects differences at short spatial scales (<1°). We also identify several transitions, including a transition that occurs in 2007 when ERA5 changed its sea–surface boundary condition to OSTIA, in 2013 when OSTIA updated the background error covariances, and in 2021 when NOAA SST changed satellite, from MeteOp-A to MeteOp-C. The robustness and statistical significance of the detected transitions are tested using surrogate data. We demonstrate that, using standard distance and cross-correlation analyses, the transitions are not detected with the same level of statistical significance and robustness as when using ordinal analysis.
- Article
(19788 KB) - Full-text XML
- BibTeX
- EndNote
Due to the large amount of data generated by Earth modeling and monitoring systems, much effort is currently being devoted to developing new, efficient climate data analysis techniques (Dijkstra et al., 2019; Messori et al., 2017; Boers et al., 2019; Gupta et al., 2021; Díaz et al., 2023; Krouma et al., 2024; Allen et al., 2025). Ordinal analysis (Bandt and Pompe, 2002) is a popular symbolic method of time-series analysis that has been applied to geophysical data. For example, ordinal analysis was used to study time series of surface air temperature anomalies in a regular grid over the earth's surface (reanalysis data from the National Center for Environmental Prediction/National Center for Atmospheric Research NCEP/NCAR) and uncovered long-range tele-connections across multiple time scales (Barreiro et al., 2011; Deza et al., 2013). The ordinal method is based on estimating the probabilities of symbols, known as ordinal patterns (OPs), defined in terms of the temporal order of the relative values of L data points. As an example, for L=3, triplets of consecutive data values such that are encoded in the symbol “012” where the digits represent the rank of the corresponding value within the triplet. The symbols' probabilities are estimated from their frequencies of occurrence within the time series and their Shannon entropy, known as permutation entropy (PE), is a quantifier of nonlinear temporal correlations. PE is low when some OPs are much more probable than others, and maximum when all possible OPs are equally probable (Bandt and Pompe, 2002). Ordinal analysis is computationally very efficient and robust to the presence of artifacts and noise. The use of time-lagged (non-consecutive) data points adds versatility to the method, since it allows to select different temporal scales for the analysis. For example, for analyzing a climatic time series with monthly resolution, the L=3 OPs can be defined by considering data values in three consecutive months (e.g. January, February, March; February, March, April; etc), in three consecutive years, or equally spaced over a period of time (for example, a year) (Deza et al., 2013). An important limitation of the ordinal methodology is that the symbolic coding rule does not take into account the actual values of the data points, but their relative values, and therefore, ordinal analysis gives partial information, complementary to that obtained by using standard time series analysis techniques.
Ordinal analysis was originally proposed for time series analysis and adapted for the analysis of gridded two-dimensional spatial data (Ribeiro et al., 2012), by defining the OPs in terms of the relative values of L grid points. Spatial ordinal analysis is a versatile tool because one can choose different “shapes” and/or different spatial orientations for the symbols. For example, for symbols defined in terms of the data values of L=4 grid points, one can consider squares of 2×2 grid points, a line (horizontal or vertical) of 4 grid points, an “L” composed by 3+1 grid points, etc. Furthermore, the use of spatially lagged grid points allows tuning the spatial scale of the analysis. This spatial lag is an important parameter for the application of this analysis tool in climate science, as the dynamics of our climate involves complex processes and interactions that act at different spatial scales. Ordinal analysis has also been expanded to include more information from the analyzed signals, such as the variance of the data points that define an OP (Fadlallah et al., 2013), their amplitude (Azami and Escudero, 2016), or the dispersion of data points that define different OPs (Politi, 2017). However, ordinal analysis is one of many symbolic techniques available, and in addition to permutation entropy other entropy quantifiers for time series can also provide valuable information (Prado et al., 2020; Falasca et al., 2020; Ikuyajolu et al., 2021; Novi et al., 2024; Paluš et al., 2024).
The spatial permutation entropy (SPE), which is Shannon's entropy estimated from the probabilities of spatial ordinal patterns, has been used to analyze images, art works and textures (Sigaki et al., 2018, 2019; Tirabassi and Masoller, 2023; Tirabassi et al., 2023; Muñoz-Guillermo, 2023; Tarozo et al., 2025). It has also been used to analyze complex spatio-temporal data such as EEG recordings (Boaretto et al., 2023; Gancio et al., 2024) and cardiac synthetic data (Schlemmer et al., 2015, 2018). However, to our knowledge, SPE has not yet been tested on climate data.
Since SPE can be calculated from the relative values of a climate variable at a given time in a particular geographic region, it yields information about nonlinear spatial correlations of that climate variable, in that region, at that time. In contrast, the “temporal” PE of the variable at a particular grid point is calculated from the analysis the variable's time series at that grid point, and therefore, it yields information about nonlinear temporal correlations of that variable, at that grid point.
Our goal is to demonstrate that SPE is a reliable and versatile tool, and specifically, is able to capture subtle differences between datasets and also, changes within the same dataset. We focus on a key variable, sea surface temperature (SST) anomalies, and compare two SST products, ERA5 and NOAA Optimal Interpolation version 2 (NOAA OI v2), in two key regions, the equatorial Pacific and the the Gulf Stream. We show that SPE identifies differences in the datasets in short spatial scales, which can be more or less pronounced over different periods of time. We interpret our findings in terms of changes in the methodologies and data used to construct the SST products.
We consider only monthly SST anomalies with respect to the seasonal cycle in the Niño 3.4 region (170–120° W, 5° N–5° S), and in the western north Atlantic (32.5–42.5° N, 67.5–45° W), a box centered on the Gulf Stream (Dong and Kelly, 2004; Parfitt and Czaja, 2016; Vries et al., 2019; Comunian et al., 2026). Both regions are highligthed in Fig. 1a. These regions were chosen not only because of their importance for the global climate, but also because they display different spatio-temporal SST dynamics. SST in the Niño 3.4 region is governed by tropical dynamics, and in particular the SST dynamics results from ocean–atmosphere interactions leading to variability mainly on interannual time scales. On the other hand, the Gulf Stream dynamics, as one of the most intense western boundary currents, is governed by internal ocean dynamics and the extratropical winds across the basin, resulting in SST variability on several time scales, from fast changes due to atmospheric-driven heat fluxes to decadal shifts in spatial structure.
We analyze NOAA Optimal Interpolation version 2 (NOAA OI v2) (Reynolds et al., 2007; Huang et al., 2021), and ERA5 global reanalysis (Hersbach et al., 2020). Both datasets have spatial resolution of 0.25°×0.25°. ERA5 starts in January 1940, while NOAA OI v2 starts in September 1981; both extend to June 2025 (therefore, the NOAA time series have 526 datapoints each, while the ERA5 time series have 1026 datapoints each).
NOAA SST includes observations from ships, drifting and moored buoys, and the Advanced Very High Resolution Radiometer (AVHRR) (Huang et al., 2021) retrieved from NOAA series and MetOp-A/-B satellites by US Navy before November 2021. After this date, NOAA SST switched to the Advanced Clear Sky Processor for Ocean (ACSPO) (Huang et al., 2023; Jonasson et al., 2020) satellite SSTs retrieved from AVHRR and the Visible Infrared Imager Radiometer Suite (VIIRS) (Huang et al., 2023).
ERA5 SST is the combination of HadISST2 (Titchner and Rayner, 2014) up to August 2007 and OSTIA (Donlon et al., 2012) from September 2007 onwards (Hirahara et al., 2016). HadISST2 assimilates in-situ observations as well as two radiometers: AVHRR and the Along Track Scanning Radiometer (ATSR).
OSTIA was originally constructed at a resolution of 0.05° and includes in situ data from various sources, as well as derived from several satellite products including AVHRR and VIIRS. It is worth noting that the higher resolution of OSTIA allows it to better resolve the tropical instability waves and sub-mesoscale eddies in the midlatitudes (Hirahara et al., 2016).
Within the regions of interest (see Fig. 1a), both datasets employ a similar grid (with 40×200 grid points for the Niño3.4, and 40×90 for the Gulf Stream region), the only difference being a small offset of 0.005° both in latitude and longitude.
3.1 Ordinal patterns and spatial permutation entropy
Ordinal analysis is a symbolic data analysis technique proposed by Bandt and Pompe (2002) that has been extensively applied in a wide variety of different scientific fields (Leyva et al., 2022). The application of ordinal analysis to a climatological spatio-temporal dataset is schematically illustrated in Fig. 2. Ordinal analysis takes an ordered series of N data values, , and translates it into a sequence of symbols, . For example, considering a grid point in the geographical region shown in Fig. 2a, x can be the time series of SST anomalies at this grid point, as schematically shown in Fig. 2b. On the other hand, the ordered series x can be the sequence of values of SST anomalies at time t, along a row (or a line), from right to left (from top to bottom), in the region shown in Fig. 2a, as schematically shown in Fig. 2e and f.
Figure 1Panel (a) highlights the regions of interest: Niño 3.4 (in green), and the Gulf Stream (in orange). Panels (b) and (c) show the SST anomaly in the Niño 3.4 region, and panels (d) and (e), in the Gulf Stream region, calculated from ERA5 (b, d) and NOAA OI v2 (c, e) datasets. In panels (b)–(e), the thick lines represent the spatial mean of the anomalies, while the shading indicates the spatial standard deviation.
Figure 2Illustration of the procedure used to define ordinal patterns (OPs) and to calculate the permutation entropy (PE) of patterns of length L. (a) Location of a grid point inside the Gulf Stream region. (b) The temporal evolution of NOAA SST anomaly at this grid point (sequence x) is represented by a sequence of “temporal” OPs (sequence s) that are defined by the relative ordering of L=3 consecutive (δ=1) data values. (c) The six possible orderings represent the six possible OPs. (d) The OPs' probabilities are estimated by counting the number of times each OP occurs in the sequence s, and the permutation entropy (PE), H(L,δ), is calculated using Eq. (1). (e) Snapshot of NOAA SST anomalies in the Gulf Stream region in July 2002. A row of this snapshot defines a sequence of consecutive data values, x, along the WE direction, from which spatial OPs are defined using the same rule as in (b). From the probabilities of the OPs defined over all the rows in the snapshot, the value of the entropy, for July 2002, , is calculated using Eq. (2). Repeating this procedure for the columns (along the NS direction) gives (Eq. 3). (f) Detail of how the spatial lag, δ, is used to define spatial OPs: The three consecutive values (δ=1) marked in green in the NS direction, , are represented by pattern “102”, while the three values marked in orange, lagged δ=2 in the WE direction, , are represented by pattern “120”.
The ordinal symbolic transformation requires defining only two parameters: the symbol length, L, and the lag, δ. These parameters are used to associate, to a vector vi whose components are L data points lagged by δ, a symbol si that is known as ordinal pattern (OP),
Here π(⋅) is the permutation index that sorts the components of vi in ascending order: . The number of possible permutations, that is, of possible patterns, grows as L!. The six possible patterns of length L=3 are shown in Fig. 2c and examples are shown in Fig. 2f: the vector is represented by pattern “102” while is represented by “120”, etc. We remark that the ordinal transformation takes into account relative values, not absolute ones.
Applying this transformation to every vi(L,δ) with , gives a sequence of n ordinal patterns, . Labeling the possible patterns (symbols) from k=1 to k=L! (for L=3, k=1 corresponds to pattern “012”, k=2 to pattern “021”, etc., as illustrated in Fig. 2c), we can count the number of times each pattern appears in s. Let nk be the number of times the kth pattern appears. Then, the probability of this pattern is . Figure 2d displays an example of six probabilities of patterns of length L=3. The permutation entropy (PE) is then defined as Shannon's entropy of the probabilities (Bandt and Pompe, 2002):
The coefficient normalizes H(L,δ) between 0–1 and enables the comparison between values obtained from ordinal pattens of different lengths. A small entropy value is obtained where one pattern predominates (this occurs when the sequence is periodic, or when it has a strong trend), whereas a high entropy value is obtained when all symbols are almost equally probable, which normally occurs when the sequence x is fully stochastic.
To estimate the patterns' probabilities with good statistics, the number of symbols, , needs to be much larger than the number of possible patterns, L!. In practical terms, this limits the values of L to the range from 3 to 6. Here, unless explicitly stated, we use L=4 and use different values of δ to tune the spatial scale of the analysis.
3.2 Symbol orientations in 2D spatio-temporal gridded data
The SST anomalies in the two regions of interest are represented as the time evolution of 2-dimensional N×M gridded datasets, Xi,j(t) with , where the index i corresponds to different latitudes, the index j to different longitudes, and T is the number of time steps in the series. Given L and δ, at each time step, t, we apply the ordinal transformation using two spatial orientations: in the North–South (NS) direction, the ordered series x is defined from the values, at time t, of the columns of the grid, and in the West–East (WE) direction, from the rows of the grid, see Fig. 2e and f. In each column we can define OPs, while in each row, we can define OPs. Defining OPs over all the columns the total number of OPs defined along the NS direction is , and defining OPs over all the rows, the total number of OPs defined along the WE direction is . An important advantage of this methodology for the analysis of climatological data is its flexibility, because the OP orientation is not limited to NS and WE directions, but other orientations can be selected for the analysis.
We also remark that L and δ allow tuning the spatial scale of the analysis. In the original application of permutation entropy to time series analysis (Bandt and Pompe, 2002), we can interpret L and δ as parameters that allow a nonlinear embedding of a time series in a L! dimensional space, because at each time, an ordinal pattern can be defined, and the number of possible patterns is L!. In the spatial approach, the parameters L and δ have a similar role: they allow to embed a set of L gridded data points in a L! dimensional space.
From the probabilities of the OPs defined at time t along the NS and WE directions, and respectively, we compute two spatial permutation entropies at time t:
Since SST anomalies vary over time, the ordinal probabilities vary over time and thus, the entropies vary over time. and will be close to 1 when there is no spatial order in the data (all OPs are equally probable), and will be <1 when there are spatial structures, such as gradients in the NS or in the WE direction, that make some OPs more or less probable.
In this work, we consider symbols defined by L=4 grid points and a spatial lag up to δ=8, as these are the largest values that allow us, given the size of the two regions studied, to estimate with good statistics the probabilities of the L!=24 possible patterns. Table 1 shows the number of symbols (defined by L=4 grid points in the geographical region analyzed) when a spatial lag of δ=1, 2, 4, and 8 is used. We see that the lowest number (in the Gulf stream with δ=8) is ≫24. We tested the robustness of our results with respect to L, and found similar results with L=3 and L=5 (see Figs. B3–B7 of Appendix B).
3.3 Entropy calculated from the distribution of data values
In this work we also compare the permutation entropy with the conventional way of calculating Shannon entropy of a spatio-temporal field, Xi,j(t), from the distribution of its data values,
Here pk(t) is the probability that Xi,j(t) is in the k state and m is the number of possible states. pk(t) is estimated from histograms of m bins of equal size, that represent the possible states. Hhist(t)=1 when pk(t)=1 and (only one bin is occupied), and Hhist(t)=0 if (the distribution of values is uniform). To compare with SPE values calculated with ordinal patterns of length L, we select the number of bins equal to the number of possible ordinal patterns, m=L!. To calculate Hhist(t) all the data points of Xi,j(t) in the regions of interest are used. As explained in Sect. 2, ERA5 and NOAA have 40×200 grid points in the Niño region and 40×90 in the Gulf Stream region.
3.4 Distance and cross-correlations measures used to compare SST-ERA5 and SST-NOAA
In this work we demonstrate that spatial permutation entropy can detect differences in ERA5 and NOAA SST products, and an important question is whether such differences can also be detected using standard correlation or distance measures. Therefore, we also compare SST anomaly values using the Average Absolute Difference (AAD), the Pearson's spatial cross-correlation coefficient (r), and the Spatial Mutual Information (SMI), which is a non-linear cross-correlation measure (Celik, 2016; Kumar and Bhandari, 2022).
The Average Absolute Difference is defined as:
where Xi,j(t) and Yi,j(t) represent the two gridded datasets, ERA5 and NOAA, and represents the spatial average in the analyzed region, at time t.
Pearson's spatial cross-correlation coefficient is defined as:
where σX(t), σY(t) and σX,Y(t) are the spatial standard deviations and the spatial covariance of Xi,j(t) and Yi,j(t) at time t. To calculate ADD and r, all the data points of Xi,j(t) and Yi,j(t) in the regions of interest are used. As explained in Sect. 2, ERA5 and NOAA both have 40×200 grid points in the Niño region and 40×90 in the Gulf Stream region.
The spatial mutual information (SMI) of Xi,j(t) and Yi,j(t) at time t is defined as:
where and are the Shannon entropies of Xi,j(t) and Yi,j(t), and HX,Y(t) denote their joint entropy,
where mX and mY are the numbers of possible states of X and Y respectively, and pk,l(t) denotes the joint probability that Xi,j(t) is in state k and Yi,j(t) in state l.
We calculate SMI using two approaches: the conventional one, in which the distributions of values of Xi,j(t) and Yi,j(t) are divided in an equal number of bins, , of equal size, that represent the possible states, and the ordinal approach, in which the possible states are the possible ordinal patterns, defined over Lδ-lagged gridded points, aligned along the NS or WE directions. Then, in the ordinal approach, . We refer to the mutual information calculated in these ways as SMIhist(t), and .
Since for L=4 OPs there are possible patterns, which are the possible “states”, to calculate SMIhist the probabilities and were estimated from histograms with m=24 bins of equal size, and the joint probability, pk,l(t), was estimated from 2D histograms with 24×24 bins of equal size. In this way, the SMI values were obtained from probabilities defined over the same number of possible states. To calculate all the histograms, all the data points of in the regions of interest were used. To test the robustness of the results, we also calculated and using ordinal patterns of length L=3 and compared with SMIhist(t) using histograms with 6 bins and joint histograms with 6×6 bins, and obtained very similar results (shown in Fig. B7 of Appendix B).
4.1 Detecting transitions with spatial permutation entropy analysis
Figure 3 displays the temporal evolution of the spatial permutation entropy, calculated from the probabilities of L=4 OPs defined from the values of SST anomalies in neighboring grid point (i.e. the spatial lag is δ=1). Panels (a) and (d) display for the two datasets analyzed, ERA5 in blue and NOAA OI v2 in red, in the two regions analyzed: panel (a) corresponds to the Niño 3.4 region, and panel (c), to the Gulf Stream region. Results are presented from the beginning of the datasets (1940 for ERA5 and 1981 for NOAA OI v2). Panels (b) and (e), instead, display , also in the two regions and for the two datasets under analysis. For comparison, panels (c) and (f) show the entropy calculated from the distribution of data values, Hhist. As explained in Sect. 3.3, to estimate the distribution we use histograms with 24 equal bins.
Figure 3Entropies of (a–c) Niño 3.4 region; (d–f) Gulf Stream region. Panels (a), (b), (d), and (e) show the spatial permutation entropy calculated with spatial lag δ=1, while panels (c) and (f) show the usual entropy obtained from the distributions of the SST anomaly values. Blue lines correspond to the entropy from the ERA5 dataset, red lines to the entropy from the NOAA OI v2 dataset, and the black arrows indicate the change points detected by the PELT algorithm in the ERA5 dataset.
In the Niño 3.4 region, panel (a) shows that the evolution of for ERA5 and NOAA OI v2 is quite similar. However, panel (b) shows differences in the evolution of which persist until 2007. After 2007 the differences reduce until 2022, when the two values of diverge again. The years when these transitions occur coincide with the switch of the sea-surface boundary condition of ERA5, from HadISST2 to OSTIA, in 2007 (Hersbach et al., 2020), and with the inclusion of MeteOp-C satellite data in NOAA's dataset in November 2021 (Jonasson et al., 2020).
Although many transitions can be identified by visual inspection, to objectively identify change points in the temporal evolution of the entropies (and in all the quantifiers used), we employed a well-known unsupervised change point detection (CPD) algorithm, the Pruned Exact Linear Time (PELT) (Killick et al., 2012). We tested the significance of the detected change points by PELT using a surrogate analysis, and their robustness against variations of the penalty parameter (see Appendix A for details).
The arrows in Fig. 3 indicate the change points detected by the PELT algorithm. We note that no change point is detected in panel (a), but a change point is detected in panel (b), in 2007. In the Gulf Stream region, panel (d) shows differences between the values of of ERA5 and NOAA OI v2, which become relatively small after 2013, a fact that could be due to the update of the background error covariances in OSTIA in January 2013 (Good et al., 2020; Roberts-Jones et al., 2016). Panel (e) also shows considerable differences between the values of HWE, from 2021 onward, and which could be due to the inclusion of MetOp-C AVHRR data in NOAA OI v2 (Huang et al., 2023). While change points in 2007 and 2013 are returned by the PELT algorithm, the one in 2021, although significant in the NOAA signal, does not pass the robustness test (See Table. A1 in Appendix A). For the same level of statistical significance with respect to surrogates and robustness with respect to variations of the penalty parameter, no change points are detected for Hhist (panels c and f).
Summarizing, most changes that can be identified by visual inspection are consistent with changes returned by the PELT algorithm when used to analyze (ERA5), (NOAA OI v2), (ERA5) and (NOAA OI v2) in Niño 3.4 and Gulf Stream regions. The PELT change points are:
-
A first transition occurs in ERA5 in 2007, which we interpreted as due to the change of the sea-surface boundary conditions: the inclusion of OSTIA in 2007. This transition is associated to change points that are identified in and in the Gulf Stream region, and in in the Niño 3.4 region, but not in in the Niño 3.4 region. We remark that ENSO is a large-scale phenomenon characterized by large zonal temperature changes, and in the Niño 3.4 region, the spatial permutation entropy best captures differences along the direction of the largest gradients.
-
A second transition occurs in ERA5 in 2013, which we interpreted as due to the update of OSTIA. This transition is associated to change points that are identified in the Gulf stream region, both in and , but not in the Niño 3.4 region.
Figure 4 displays the entropies as in Fig. 3, but now the ordinal patterns are defined with non-neighboring grid points. Specifically, the grid points are spaced by a lag δ=8 that corresponds to 2°. Now we see that the temporal evolution of the entropies of ERA5 and NOAA OI v2 is consistent, in the two regions and for the two orientations. Discrepancies are small: not only the fluctuations are highly correlated, but also, the trends are similar, but no change points are detected at this scale. For the Nino 3.4 region, although (panel a) does not present a significant trend, in panel (b) we see that from the two datasets display a negative trend. This trend is interpreted as due to the inhomogeneous long-term SST variations over the equatorial Pacific, since over the years, SST has warmed in the west and cooled in the east (Xie et al., 2010; Wills et al., 2022). This represents an intensifying westward large-scale gradient over the Niño 3.4 region that can make the patterns that encode monotonic increasing or decreasing, such as “0123” and “3210”, more prevalent than those that encode an oscillation (See Fig. B15 of Appendix B), thus decreasing the entropy. However, low SPE values do not always imply that the “0123” and “3210” patterns dominate, and a careful inspection of the patterns' probabilities is needed in order to be able to confidently draw conclusions.
Figure 4Entropies calculated with δ=8 in (a, b) Niño 3.4 region; (c, d) Gulf Stream region. A better agreement between NOAA OI v2 and ERA5 signals is observed with δ=8 than with δ=1 (Fig. 3). Dashed blue lines indicate linear fits of ERA5 entropy signals; which linear coefficients are: (a) (p value=0.18), (b) (), (c) (p value=0.0016), (d) (). The negative trend in (b) can be due to the strengthening westward of large-scale gradient in the Niño 3.4 region, while the positive trends in (c) and (d) reveal the decrease of spatial structures.
In the Gulf Stream region, which is also heating due to global warming (Seidov et al., 2017; Todd and Ren, 2023), HWE and HNS in ERA5 and in NOAA present a positive trend, which reveals a decrease of spatial structure, as it means that the different symbols become equally probable. In this case, since the northwest part is warming faster than the rest of this area (Bulgin et al., 2020), it decreases the climatological gradient of the SST across the Gulf stream, thus homogenizing the entire region, and the resulting symbols resemble more closely those of random fluctuations (See Fig. B15 of Appendix B).
To check the robustness of our results, we also performed the analysis after removing the linear temporal trend in each grid point, and obtained similar results, shown in Figs. B9–B12 of Appendix B. In these figures we also include the analysis of the raw SST data – including the seasonal cycle.
Figure 5 shows the same entropy signals as Figs. 3 and 4 for the Niño 3.4 region, but in the period 1981–2025, and also shows the mean SST anomaly of this region (black line, right vertical axis) computed from the NOAA dataset.Table 2 displays the Pearson cross-correlation coefficients between the entropies computed from NOAA and ERA5, and the corresponding SST anomaly. In this Table we observe a good agreement between NOAA and ERA5 in Niño 3.4, in terms of the sign and magnitude of the cross-correlation coefficient, for all the entropies. In panel (b) of Fig. 5, we can see that from the NOAA dataset is positively correlated with the SST anomaly (the Pearson cross-correlation coefficient is r=0.42), most clearly between 2007–2022 (r=0.70). The values are the largest during El Niño years, capturing the decrease in SST gradients along the longitudinal direction. For ERA5, this correlation is strongest from 2007 onward (r=0.57). In contrast, (panel a) is negatively correlated with the SST anomaly ( for NOAA, and for ERA5), thus during some El Niño events (such as 1997–1998 and 2015–2016 years) the entropy decreases, reflecting the north-south gradients, one in the Northern Hemisphere and one in the Southern Hemisphere, that occur as the equatorial zone is warmer than the north-south edges of the region. This is also observed for some La Niña events, such as the one in 1988–1989 or in 1998–1999.
Table 2Pearson correlation coefficients obtained between the mean SST anomaly and the entropy signals, for L=4 and different lags, and for both ocean regions.
Figure 5Analysis of the Niño 3.4 region, in the time interval in which ERA5 and NOAA OI v2 datasets are available (1981–2025). Panels (a)–(d) display the temporal evolution of the SST anomaly (black line, right vertical axis) computed from the NOAA dataset (a similar curve would be obtained from ERA5, as shown in Fig .1), and the temporal evolution of the spatial permutation entropy (red and blue, left vertical axis) calculated with ordinal patterns with NS (a, c) and WE (b, d) orientation. In (a), (b) the spatial lag used to define the ordinal patterns is δ=1, in (c), (d) δ=8. The Pearson correlation coefficients between the mean SST anomaly and the entropy signals are shown in Table 2. In panel (a), where the OPs are aligned along the NS direction, the entropy signals are negatively correlated with the SST anomaly, while in panel (b), where the OPs aligned along the WE direction, the entropy signals and the SST anomaly are positively correlated.
For the same analysis, but with δ=8, sudden drops of (Fig. 5d) can be observed, and some correlate with El Niño events, such as the ones in 1982–1983 or 1997–1998, but also at the end of a strong La Niña in 2008. In these cases (see Fig. B1), a WE gradient is formed due to the strong warming of the eastern Pacific, which dominates at this long spatial scale, and in consequence, pattern “0123” prevails (thus, the entropy drops). In contrast, for δ=1, the distribution of ordinal probabilities presents larger contributions of two patterns, “0123” and “3210”, so the entropy is higher. Some drops are also seen in the temporal evolution of (Fig. 5c), although at this scale cannot capture the NS gradients appropriately. In fact, for δ=8 the ordinal patterns span 6° (more than half of the region length in this direction), because the data points are separated by and thus, L=4 data points that define a pattern cover a distance of ). As El Niño develops in a waveguide between 5° S–5° N, the SST anomalies evolve particularly in that area, and the NS gradients are highly concentrated near the equator. Therefore, they can be seen with δ=1, but if δ=8 is used, the OPs can include the SST variations north and south of the equator (SST decreases north and south of the equator), mixing both gradients. This does not occur in the longitudinal WE direction, as the WE gradients have a larger spatial scale. To gain further insight, we inspected the temporal evolution of the ordinal probabilities, and found that their variation is consistent with the interpretation of the gradients that are captured when using using different δs. Videos showing how the probabilities of NS and WE OPs change over time can be found in the Video Supplement.
We also performed this analysis in the Gulf Stream region. The results, displayed in Fig. B8 and Table 2 reveal that there is no consistent correlation between the entropy signals and the SST anomalies in the two datasets (i.e. the cross-correlation coefficients obtained from NOAA and ERA5 differ both in magnitude and sign). We speculate that this is due to the fact that SST anomalies, even when aggregated at the basin scale, do not present a smooth behavior (as in the Niño 3.4 region).
4.2 Comparison between ERA5 and NOAA datasets
To analyze similarities and differences between ERA5 and NOAA, in this section we only consider the period when both datasets are available (1981–2024). To visualize the effect of the spatial lag between the grid points, Fig. 6 displays the entropies (as done in Figs. 3 and 4), for grid points that are consecutive (i.e. they are separated by 0.25°, that is, δ=1), that are separated by 0.5° (δ=2), by 1° (δ=4), and by 2° (δ=8). Therefore, in this figure, the left and right columns display the same entropies as in Figs. 3 and 4.
Figure 6Effect of the lag between grid points. Panels (a)–(h) correspond to the Niño 3.4 region, and (i)–(p) to the Gulf Stream region. We can see that as δ increases, the entropy differences gradually decrease, revealing that the SPE analysis captures the differences between ERA5 and NOAA datasets at small spatial scales.
We observe that as δ increases the behavior of and for the two datasets converge, which indicates that the differences found between ERA5 and NOAA occur mainly at short spatial scales, while at long scales, linear trends in the temporal evolution of the entropies are consistently identified in both, ERA5 and NOAA. We also notice that, as δ increases, so do the entropies, suggesting that the gradients become less pronounced as the spatial scale increases.
To perform a quantitative comparison between the datasets, we begin by analyzing their SMI, reported in Fig. 7. In panels (a)–(h) we report SMIL,δ estimated from ordinal probabilities, in particular δ=1 for panels (a)–(d) and δ=8 for panels (e)–(h), while panels (i) and (j) display SMIhist. estimated from the probabilities of values of SST anomalies.
When SMI is calculated from the ordinal probabilities, in the two regions and for the two orientations, we can see that it increases with time, which reveals that the amount of information shared by the datasets. In particular, the probability that the same symbols occur in the same locations and at the same time in both datasets increases with time (see Fig. B16 of Appendix B). PELT detects a transition in 2007 in all the panels.
For the Niño 3.4 region, we observe that the PELT analysis now also detects a transition at the small scale (δ=1) in (Fig. 7c), which was not detected in the analysis of the signal (Fig. 3). We note that El Niño events have an impact on , as we observe that the coherence between the datasets decreases during such events, as we can observe, for example, during the El Niño events of 1997, 2010, and 2015. We highlight that this effect is observed at all scales and in the current version of the products since it is observed during the 2023 warm ENSO event. Regarding SMIhist. (Fig. 7i), although we observe some increase in the average value in 2007 and 2015, no robust and significant change points are detected in this signal.
Figure 7Spatial mutual information, Eq. (7), between SST anomalies of ERA5 and NOAA datasets. The left column corresponds to the Niño 3.4 region, while the right column, to the Gulf Stream region. In the first three rows, panels (a)–(h), SMI is calculated from the probabilities of ordinal patterns with δ=1 (a–d) and δ=8 (e–h); the OPs are constructed with WE orientation in (a), (b), (e), and (f), and with NS orientation in (c), (d), (g), and (h). In the bottom row, panels (i) and (j), SMI is calculated from the probabilities of the data values, using the same number of bins, 24, as the number of possible OPs. In all cases SMI increases with time, revealing, as expected, that the discrepancies between ERA5 and NOAA datasets diminish; however, note the difference in the vertical scales in (a–d) and (e–h): The higher SMI values when ordinal patterns are defined with a spatial lag δ=8 relative to δ=1 reveal that the agreement between the two data sets is better at long spatial scales than at short ones. The arrows indicate change points detected by the PELT algorithm.
For the Gulf Stream region (Fig. 7b, d, f, h, and j), all the SMI values reported present oscillations around a relatively constant value before 2007, which appears to increase consistently from this year on. Additionally, on the large scale, we detected two other transitions: one at end of 2015/beginning 2016 (the same observed in SMINS for δ=1 in the Niño 3.4 region), and another one in 2021. Both transitions correlate with changes in the satellites experienced by NOAA SST those years (Huang et al., 2023). We also note that 2016 corresponds to the start date of version 2.1 of the NOAA OI product, which also includes the Argo observational data (Huang et al., 2021), and several changes in the conventional and radiance observations assimilated by ERA5 also occurred that year (Hersbach et al., 2020).
Regarding SMIhist., we observe that the annual cycle has some impact on this signal, but not on SMINS, nor SMIWE.
Finally, to demonstrate the added value of using symbolic ordinal analysis, in Fig. 8 we present the results obtained with two well-known measures of linear relationship between two datasets: the average absolute difference, AAD, and the spatial Pearson's correlation coefficient, r. They provide complementary information because when AAD and r are both high, the data sets differ in values but their spatial distributions are consistent, whereas when both are low, there is agreement between the values, but not in the spatial distributions. In Fig. 8 we see that both measures show continuous improvement of the agreement between the two datasets in the two regions; however, there are oscillations and no clear transitions are observed. In the Gulf Stream region, AAD and r show a strong coupling with the annual cycle, with disagreement (high AAD and low r) peaking during northern winters. A possible explanation could be due to the high cloud coverage in this region during winters, which could difficult infrared measurements, leading to larger differences between ERA5 and NOAA datasets. On the other hand, in the Niño 3.4 region ENSO events affect the agreement between the datasets, but their effect is captured differently by the two measures. For example, AAD peaks during El Niño in 1988, 1997 and during La Niña of 2011, but these events do not affect r, and vice-versa, La Niña in 1996 and 1999, and El Niño in 2004, affect r but not AAD.
Figure 8Average absolute difference, AAD (a, b), and spatial Pearson's correlation coefficient, r (c, d), between ERA5 and NOAA datasets in (a), (c) Niño 3.4 region, and (b), (d) Gulf Stream region.
For the same levels of statistical significance (with respect to surrogate data) and robustness (with respect to variations of the penalty parameter) used for the analysis of SMI signals, no robust and significant change points are detected in the analysis of AAD and r signals.
We have shown that a nonlinear quantifier, the spatial permutation entropy (SPE), is useful and flexible for analyzing the spatiotemporal dynamics of SST anomalies. SPE is computed from the probabilities of symbols, known as ordinal patterns (OPs), defined by four SST anomaly values in grid points that are geographically oriented north-south (NS) or west-east (WE), and that are separated by a spatial lag, δ: δ=1 corresponds to neighboring grid points separated by 0.25°; δ=8, to grid points separated by 2°.
We used the temporal variations of SPE calculated with NS or WE OPs, and respectively, to compare ERA5 and NOAA OI v2 SST anomalies in two key regions: Niño 3.4 and the Gulf Stream. By tuning the spatial lag δ, we were able to tune the spatial scale at which the datasets were compared.
A main conclusion of our study is that, in both regions, we found differences in the temporal evolution of and computed at short spatial scales, that is, using OPs made of neighboring points (Fig. 3). Differences gradually disappear when δ increases (Fig. 6), and when δ is large enough, the temporal variations of HNS and HWE in the two datasets are remarkably similar (Fig. 4). The short-scale differences found here between ERA5 and NOAA datasets by using SPE analysis add on to previously reported discrepancies (Yao et al., 2021; Dai, 2023). Importantly, our analysis also reveals clear improvements in the similarity of the two datasets in recent years, following significant advances in Earth observation systems due to the introduction of new satellite observations and new data processing methodologies.
Regarding these recent advances, they generated subtle changes in both datasets, and some of them were detected by our analysis technique. Specifically, the temporal variations of HNS and HWE allowed us to identify four transitions that occurred in 2007 when ERA5 changed its sea–surface boundary condition to OSTIA, in 2013 when OSTIA updated the background error covariances, in 2016 when several changes occurred both in NOAA (as the inclusion of MeteOp-B and Argo data) and ERA5, and in 2021 when NOAA SST changed satellite observations from MeteOp-A/-B to MeteOp-C. While these transitions can be observed by simply inspecting the temporal evolution of SPE, to corroborate these findings we applied an unsupervised change detection algorithm. Of course, not all the improvements in observation systems or data processing methodologies will modify the datasets in a way that can be detected by our analysis technique, therefore, methods that return complementary information should also be employed. Since the information lost by the SPE concerns the absolute values of the data, such methods should likely focus on that. Spatial Fourier analysis, or cross-correlation and mutual information analysis (Figs. 7 and 8), are all valid choices that can be used for a more complete dataset comparison.
We also report different temporal trends of and in the Niño 3.4 and Gulf Stream regions (Fig. 4) that are consistent in the two datasets. We interpret the different trends in terms of different responses to greenhouse gas forcing in the Niño 3.4 and Gulf Stream regions (uneven warming and cooling respectively).
In our study we calculated SPE from symbols in the NS and WE orientations only, which are particularly fit for the equatorial dynamics of the Niño 3.4 region. This region was selected because it is the one used to define ENSO events and captures the ocean-atmosphere interaction phenomena that characterize them. In contrast, for the Gulf Stream region the area analyzed was selected as a compromise to consider the Gulf Stream geographically (as the current moves away from the coast and flows northeast, it becomes the Gulf Stream extension or North Atlantic Drift) while also being the area with the strongest and relatively zonal currents. Even though this choice is a good approximation, given that the mean current's trajectory is not purely zonal, further work may consider other regions and orientations such as along current and across current. This would allow refining our results and better capturing latitudinal shifts. Additionally, we could integrate in the definition of the OPs not only the SST anomalies spatial variation (as done here) but also the temporal variation. This can be achieved by computing OPs in time, or by building the OPs in a way they extend both in space and time. The introduction of temporal OPs will allow us to integrate information from different time scales, since varying the value of δ allows us to study daily, intra-seasonal or interannual variations in SST anomalies. Moreover, the discrepancies, trends, and transitions found in this work can be confirmed and/or others can be uncovered, if different generalizations of ordinal analysis are used (for example, those that “weight” the symbols to include additional information from the signal Fadlallah et al., 2013).
Furthermore, it would be interesting to compare the results obtained with SPE and its temporal counterpart to those derived from linear analysis techniques, such as Fourier analysis. Since SPE relies on the relative ordering of data values, it constitutes a nonlinear approach, which we expect can yield results that complement those obtained by linear methods.
Finally, the increase in the spatial resolution of SST products seen in latest years will allow to investigate the role of the mesoscale and sub-mesoscale dynamics on bio-geo-chemical processes such as the importance of eddies and fronts on the distribution of nutrients, phytoplankton growth and carbon uptake. Thus, it is crucial that different SST products are able to characterize the small scale SST anomalies variability adequately. Our analysis reveals differences in SST datasets at these scales that could hinder their use. Additionally, with the emergence of AI climate emulators, ordinal analysis can be useful to compare AI models with traditional, physics-based ones.
Overall, our results highlight the versatility and robustness of SPE for analyzing the spatiotemporal dynamics of SST anomalies. Both the size and the orientation of the ordinal patterns play important roles in capturing the small scale differences between the datasets, and the long-term evolution in the analyzed regions. This evolution at the large scale is consistent with the consequences of global warming. Furthermore, SPE allows the identification of change points just by visual inspection, which could be traced back to specific changes in methodology and data inputs in the two SST products. Since not every modification of the SST products has consequences that can be detected by our analysis technique, we propose using ordinal analysis in conjunction with other linear or nonlinear methodologies, to obtain a more complete characterization of the dynamics.
Table A1Summary of the significant change points detected by the PELT algorithm. An asterisk next to the R value indicates that the change point is discussed in the main text.
Figure A1Robustness of the change points detected in the signal of HWE from the ERA5 dataset with δ=1 in the Niño 3.4 region. Panel (a) shows the original signal detrended, while panel (b) shows its iAAFT surrogate (Schreiber and Schmitz, 1996). Panels (c) and (d) display the corresponding bar plots showing the persistence of each detected change point as the penalty parameter is increased. Dashed line corresponds to of the signal, while the continuous line marks the 99.5th percentile of P* from the surrogates.
A1 Implementation
To formally detect transitions in the quantities introduced in Sect. 3.1, we applied a CPD algorithm known as Pruned Exact Linear Time (PELT) (Killick et al., 2012) implemented in the Python package ruptures (Truong et al., 2020), with a Gaussian kernel cost function. PELT has been used to analyze geophysical time series such as temperature (Khapalova et al., 2018), vegetation (Wang and Fan, 2021), and stream flow (Rocha and de Souza Filho, 2020). Given a penalty parameter, P, PELT returns a number of change points. The value of P needs to be carefully selected because it determines the number of change points found: if it is too low, too many change points are detected, while if it is too high, no change point is detected. To deal with this problem, we propose a methodology similar to changepoints for a range of penalties – CROPS (Haynes et al., 2017). As in CROPS, we evaluate a large range of penalty parameters. However, CROPS ultimately relies on manually choosing the optimal penalty value (usually graphically) for which perturbations of its value do not alter significantly the number of change points detected. Our approach is intended to avoid this kind of subjectivity in the implementation, providing clear and quantifiable criterion for the selection of the penalty value, while at the same time selecting the most significant change points, as an attempt to minimize false detections. The procedure followed to select P is based on two steps: the first consists of the analysis of surrogate time series where the null hypothesis (NH) is that there are no change points, and the second step is the analysis of the robustness of the change points passing the surrogate test, with respect to variations in the penalty parameter. However, we note that NH can sometimes fail and surrogate signals still present change points that are detected by PELT at high values of P, and therefore the selected significance threshold may be too high, which can result in genuine change points not being detected, even if they are visually evident. Therefore, while PELT is a helpful method for change point detection, it should be further refined in future work.
CPD analysis typically assumes that the signals are piecewise stationary (Truong et al., 2020), and that within each segment the distribution follows an independent and identically distributed sequence of random variables (Garreau and Arlot, 2018), therefore linear trends have to be removed before using the PELT algorithm. This was the case for the time series of the spatial permutation entropy (Eq. 1), and of the linear measures (AAD and r), where a simple detrend was sufficient as pre-processing. In contrast, for the time series of the spatial mutual information (Eq. 7), sections with and without a linear trend are observed in the signal; hence we performed the following sequence of steps: (1) The time series were first divided in segments where the linear trend is constant. To do this, we combined the PELT algorithm with a statistical test of the linear trend: we run PELT without detrending the input, and test if there is a significant (p value<0.01) linear trend using a Wald test. If two continuous sections fail the test (i.e. none of them has a linear trend) or they have different results of the test (i.e. one has a significant linear trend while the other has not), the change point is considered as a true, genuine change-point. Otherwise, the change-point was disregarded as an artifact due to the linear trend present. (2) After performing this procedure for every change-point discovered by the first iteration of PELT, we detrend the signal in each segment delimited by the genuine change-points, and (3) Run the PELT again in each of these segments to detect change points in them. In these signals, no change points were detected within these detrended segments, but all reported ones correspond to transitions between in the linear trend/no trend, with significance assigned by the Wald test (Step 1), so there was no need to compute the surrogates of these signals.
For the other signals, the ones with a single trend (SPE, AAD, and r), in order to select an appropriate value of P, the following procedure was performed: (1) Each signal was detrended. (2) 10 iterated Amplitude Adjusted Fourier Transform (iAAFT) surrogates (Schreiber and Schmitz, 1996) were generated for each signal. (3) Each surrogate was analyzed in a range of . (4) For each change point detected (between 400–600 total in the 10 surrogated signals), we recorded the maximum P for which it is detected (from now on we note this value by P*). (5) The 99.5th percentile of this distribution of P* is selected as the value of P for which we analyze the original detrended signal.
In the previous step we have identified significant change points by comparing changes in the signal with changes in the surrogates generated from the signal. Now we test if these change points are robust in the sense that they can be detected for penalty values considerably higher than the median of change points (including not significant). For this, we first calculate the maximum penalty parameter for which each change point is detected, P*, and then calculate the relative difference between P*(t) of the significant change point that occurs at time t and the median of P* for all the change points detected in the signal ():
Some change points have a relatively low robustness score, and they are significant in the sense that the surrogate time series did not display almost any change points. To avoid this effect, in the main text we have only shown change points with a robustness score R≥19, which discards the first two quartiles of the distribution of R values, but in Table A1 we include all the significant change points and their corresponding R values.
Now, in Fig. A1c, we show an example of such analysis performed in the signal of HWE from the ERA5 dataset with δ=1 in the Niño 3.4 region (blue signal in Fig. 3a). Here we show the change points detected as a function of the penalty parameter, P, in cyan. In black continuous line, we mark the 99.5th percentile of the distribution of maximum P for which each change point is detected, P*, in the surrogated signals. In black dashed line, we display the median of P* from the signal: . In red we have highlighted the P* of the two significant change points (those which can be detected for P larger than the 95th percentile of P* from the surrogated signals), but only one (in 2007) is robust enough (the relative distance, R, between P* and is larger than 19) to be considered a true change point.
Panel (b) of Fig. A1 showcases an example of an iteratively adjusted amplitude adjusted Fourier transform (IAAFT) surrogate (Schreiber and Schmitz, 1996) obtained from the signal from panel (a), and panel (d) show the results of the Pelt algorithm for different penalty parameters.
Figure A2 displays the same analysis, but now for the Gulf Stream region. In the case of this signal, the surrogates produce change points that are quite robust (see Fig. A2d) which increases the significance threshold (black continuous line) considerably. However, two significant change points are detected, which are also robust.
A2 Summary of detected points
Table A1 summarizes the significant change points returned by the PELT algorithm. Here we can see how the 2007 transition is widely detected by different ordinal measures (H and SMI), in both regions and mainly on the small scale (δ=1). However, the other transitions, specifically the ones in 2013 and 2021, are only detected in the Gulf Stream region. Additionally, it is clear that the majority of the transitions are only detected by SMI, computed from ordinal pattern probabilities.
The signals of SMI shown in Fig. 7, especially those obtained from ordinal patterns (panels a–h), present sections with and without a linear trend, most notably pre- and post-2007. All the change points found on the small scale (Fig. 7a–d) correspond to a transition between the signal having a significant nonzero linear trend and a nonsignificant linear trend (or vice versa). On the other hand, the 2007 change point at the large scale (Fig. 7e–h) always present a linear trend before and after this year, but these change point persist after detrending the signal. Regarding the other change points detected at the large scale in the Gulf Stream region, the one in 2016 in SMIWE and the one in 2021 in SMINS correspond again to transitions trend/no-trend, while the one in 2021 in SMIWE corresponds to a jump: no significant linear trend before or after.
For SMIhist. in the Gulf Stream region region (Fig. 7j, the algorithm detects two change points, but one in 2007 and the other one in 2021. The first one corresponds to a change point with linear trend before and after, which survives detrending, and the second one corresponds to a trend/no-trend transition.
In addition to the suitability of the chosen surrogates mentioned before (change points still occur in the surrogates, which increases the significance threshold and prevents the detections of true change points), we highlight that the last step in our CPD analysis, the robustness check of the significant change points, allowed us to distinguish between what look like spurious detections and evident ones (such as 2007). But some of the discarded change points (as in 2013 or 2021 in SMIWE and SMINS for δ=1 in the Gulf region), which are visually evident and significant, do not present large robustness levels. In fact, although the relative robustness (R) is computed from each signal, the threshold for significant robustness () is computed from the robustness distribution obtained by considering all the robustness values from all the change points detected in all the time series (entropies, mutual information, and linear measures). Therefore, the robustness values of a single time series affect the robustness threshold by affecting the median of the distribution. For example, a time series with many robust change points (high P) would increase the median of P, making R small and the robustness of its own change points less significant; or a time series with a single very robust change points (high R) increases the overall median of R () potentially making non-significant robust change points of another time series. However, the change points that pass the double check of significance and significant robustness are simultaneously robust to other change points in the signals and to all other significant change points.
While ordinal analysis allows us to detect several changes that could not be detected by the distance or cross-correlation measures considered, it has the drawback that ordinal analysis typically has computational costs and hyper-parameters (the length of the symbol and the lag) which need to be carefully selected. However, ordinal analysis has the important advantage of offering a large degree of flexibility, by allowing to tune the shape of the pattern, and the spatial scale of the analysis.
Figure B1Snapshot of the Niño 3.4 region SST anomalies during different El Niño/La Niña events from NOAA (left column) and ERA5 (right column) datasets.
Figure B2Snapshot of the Gulf Stream region during a large meandering in January 2010 (Zeng and He, 2016) from NOAA (left column) and ERA5 (right column) datasets.
Figure B5Same as Fig. 4 but for L=3. Fittings of the ERA5 data have a linear coefficient of: (a) (), (b) (), (c) (), (d) ().
Figure B6Same as Fig. 4 but for L=5. Fittings of the ERA5 data have a linear coefficient of: (a) (p value=0.087), (b) (), (c) (), (d) ().
Figure B9Analysis of the Niño 3.4 region, with L=4 and a small spatial lag (δ=1) of the raw data (top row), of the anomalies (after removing the seasonal cycle, as in the manuscript, middle row) and of the detrended anomalies (after removing the linear trend and the seasonal cycle, bottom row). The columns display results for the two OP orientations (left: NS; right: WE).
Figure B12Same as Fig. B11 but for δ=8. The variation of the entropies in the top row of Figs. B9 and B10 is consistent with the fact that in the equatorial Pacific, the seasonal cycle is significant in the WE direction and therefore manifests itself on both short and long scales. In the Gulf Stream, Figs. B11 and B12, the seasonal cycle is more significant in the NS direction because the current is nearly zonal. Therefore, it can be seen in NS when using δ=1 (panel a in Fig. B11), but not in the WE direction (panel b in Fig. B11). However, when δ=8, the distances are large enough for the seasonal cycle to also manifest itself also in the WE direction (panel b in Fig. B12).
Figure B13Spatial Mutual Information (SMIL,δ) as a function of the spatial lag, δ, in the two regions considered, and for the two orientations of the ordinal patterns (WE and NS). The analysis was done for the first 120 months (top row) and for the last 120 months (bottom row) of the time interval in which both, ERA5 and NOAA OI v2 datasets are available (1981–2025). The symbols show the mean value and the error bars show the standard deviation. Here we show analysis for ordinal patterns of length L=3, but similar results were obtained with L=4 (see Fig. B14). Note the different vertical scales. Here we see that the agreement between the datasets improves with δ, and, for the same δ, the similarity between the two datasets in the two periods is quite different, being more similar in the second period (note the different vertical scale). In general, the two datasets are more similar in the Gulf than in the Pacific, which can be due to the fact that the North Atlantic is the area with the highest concentration of in-situ measurements globally. In the Gulf Stream region, the similarity of the two datasets for the NS orientation saturates for δ=5, that is, when the NS ordinal patterns cover 4°, meaning the submesoscale (0.1–10°), an intermediate scale between typical ocean turbulence and eddies. In contrast, in the case of the Niño 3.4, there does not appear to be a scale at which the similarity of the datasets stops increasing.
Figure B14Same as Fig. B13 but for L=4. Similar results are obtained, only that saturates for δ=6 in the Niño 3.4 region.
Figure B15Ordinal probabilities in the Niño 3.4 region (top row) and in the Gulf Stream region (bottom row) calculated from NOAA for January 2015) with δ=1 (a, b, e, f) and with δ=8 (c, d, g, h). The value of the spatial permutation entropy is indicated in each panel.
Figure B16Evolution of the probability that, in a grid point the spatial ordinal pattern (in the NS or WE direction) is the same in the two datasets. Panels (a) and (b) correspond to the Niño 3.4 region, while panels (c) and (d) to the Gulf Stream region. In panels (a) and (c) the ordinal patterns are constructed with δ=1, while in panels (b) and (d) with δ=8.
NOAA OI v2 data was obtained from the KNMI Climate Explorer (https://climexp.knmi.nl/select.cgi?id=someone@somewhere&field=sstoiv2_monthly_mean, last access: 5 May 2026). ERA5 data was obtained from the Copernicus Climate Data Store (https://doi.org/10.24381/cds.f17050d7, Hersbach et al., 2023). The code used for the analysis and the generation of the figures is available at https://github.com/juangancio/climate-spatial-analysis, last access: 5 May 2026, and archived at https://doi.org/10.5281/zenodo.17250157 (Gancio, 2025).
The video supplements are archived at https://doi.org/10.5281/zenodo.19051869 (Gancio, 2026).
JG: conceptualization, formal analysis, methodology, software, writing – original draft, writing – review and editing. GT: conceptualization, software, supervision, writing – original draft, writing – review and editing. CM: conceptualization, supervision, writing – original draft, writing – review and editing. MB: supervision, writing – original draft, writing – review and editing.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
JG acknowledges from AGAUR (FI scholarship no. 2023 FI-1 00034), CM acknowledges support from Ministerio de Ciencia e Innovación (Project no. PID2024-160573NB-I00), and GT acknowledges support from the Serra Húnter Programme (Generalitat de Catalunya).
This research has been supported by the Agència de Gestió d'Ajuts Universitaris i de Recerca (grant nos. 021 SGR 00606 and 2023 FI-1 00034) and the Ministerio de Ciencia e Innovación (grant no. PID2024-160573NB-I00).
This paper was edited by Gabriele Messori and reviewed by Martin Bonte, D. Monselesan, and one anonymous referee.
Allen, A., Markou, S., Tebbutt, W., Requeima, J., Bruinsma, W. P., Andersson, T. R., Herzog, M., Lane, N. D., Chantry, M., Hosking, J. S., and Turner, R. E.: End-to-end data-driven weather prediction, Nature, 641, 1172–1179, https://doi.org/10.1038/s41586-025-08897-0, 2025. a
Azami, H. and Escudero, J.: Amplitude-aware permutation entropy: illustration in spike detection and signal segmentation, Comput. Meth. Prog. Bio., 128, 40–51, https://doi.org/10.1016/j.cmpb.2016.02.008, 2016. a
Bandt, C. and Pompe, B.: Permutation entropy: a natural complexity measure for time series, Phys. Rev. Lett., 88, 174102, https://doi.org/10.1103/PhysRevLett.88.174102, 2002. a, b, c, d, e
Barreiro, M., Marti, A. C., and Masoller, C.: Inferring long memory processes in the climate network via ordinal pattern analysis, Chaos: An Interdisciplinary Journal of Nonlinear Science, 21, 013101, https://doi.org/10.1063/1.3545273, 2011. a
Boaretto, B. R., Budzinski, R. C., Rossi, K. L., Masoller, C., and Macau, E. E.: Spatial permutation entropy distinguishes resting brain states, Chaos Soliton. Fract., 171, 113453, https://doi.org/10.1016/j.chaos.2023.113453, 2023. a
Boers, N., Goswami, B., Rheinwalt, A., Bookhagen, B., Hoskins, B., and Kurths, J.: Complex networks reveal global pattern of extreme-rainfall teleconnections, Nature, 566, 373, https://doi.org/10.1038/s41586-018-0872-x, 2019. a
Bulgin, C. E., Merchant, C. J., and Ferreira, D.: Tendencies, variability and persistence of sea surface temperature anomalies, Sci. Rep.-UK, 10, 7986, https://doi.org/10.1038/s41598-020-64785-9, 2020. a
Celik, T.: Spatial mutual information and PageRank-based contrast enhancement and quality-aware relative contrast measure, IEEE T. Image Process., 25, 4719–4728, https://doi.org/10.1109/TIP.2016.2599103, 2016. a
Comunian, A., Giudici, M., and Panzeri, A.: A Bayesian approach to map oceanic structures, with an application to the Gulf Stream, J. Marine Syst., 253, 104175, https://doi.org/10.1016/j.jmarsys.2025.104175, 2026. a
Dai, A.: The diurnal cycle from observations and ERA5 in surface pressure, temperature, humidity, and winds, Clim. Dynam., 61, 2965–2990, https://doi.org/10.1007/s00382-023-06721-x, 2023. a
Deza, J. I., Barreiro, M., and Masoller, C.: Inferring interdependencies in climate networks constructed at inter-annual, intra-season and longer time scales, Eur. Phys. J. Special Topics, 222, 511–523, https://doi.org/10.1140/epjst/e2013-01856-5, 2013. a, b
Díaz, N., Barreiro, M., and Rubido, N.: Data driven models of the Madden-Julian Oscillation: understanding its evolution and ENSO modulation, npj Climate and Atmospheric Science, 6, 203, https://doi.org/10.1038/s41612-023-00527-8, 2023. a
Dijkstra, H. A., Hernandez-Garcia, E., Masoller, C., and Barreiro, M.: Networks in Climate, Cambridge University Press, https://doi.org/10.1017/9781316275757, 2019. a
Dong, S. and Kelly, K. A.: Heat budget in the Gulf Stream region: the importance of heat storage and advection, J. Phys. Oceanogr., 34, 1214–1231, https://doi.org/10.1175/1520-0485(2004)034<1214:HBITGS>2.0.CO;2, 2004. a
Donlon, C. J., Martin, M., Stark, J., Roberts-Jones, J., Fiedler, E., and Wimmer, W.: The Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) system, Remote Sens. Environ., 116, 140–158, https://doi.org/10.1016/j.rse.2010.10.017, 2012. a
Fadlallah, B., Chen, B., Keil, A., and Príncipe, J.: Weighted-permutation entropy: a complexity measure for time series incorporating amplitude information, Phys. Rev. E, 87, 022911, https://doi.org/10.1103/PhysRevE.87.022911, 2013. a, b
Falasca, F., Crétat, J., Braconnot, P., and Bracco, A.: Spatiotemporal complexity and time-dependent networks in sea surface temperature from mid-to late Holocene, Eur. Phys. J. Plus, 135, 392, https://doi.org/10.1140/epjp/s13360-020-00403-x, 2020. a
Gancio, J.: juangancio/climate-spatial-analysis: Supporting code for EDS submission: “Detecting transitions and quantifying differences in two SST datasets using spatial permutation entropy” (v2.0.0), Zenodo [code], https://doi.org/10.5281/zenodo.17250157, 2025. a
Gancio, J.: Supplemental videos for ESD article “Detecting transitions and quantifying differences in two SST datasets using spatial permutation entropy”, Zenodo [video], https://doi.org/10.5281/zenodo.19051869, 2026. a
Gancio, J., Masoller, C., and Tirabassi, G.: Permutation entropy analysis of EEG signals for distinguishing eyes-open and eyes-closed brain states: comparison of different approaches, Chaos, 34, 043130, https://doi.org/10.1063/5.0200029, 2024. a
Garreau, D. and Arlot, S.: Consistent change-point detection with kernels, Electron. J. Stat., 12, 4440–4486, https://doi.org/10.1214/18-EJS1513, 2018. a
Good, S., Fiedler, E., Mao, C., Martin, M. J., Maycock, A., Reid, R., Roberts-Jones, J., Searle, T., Waters, J., While, J., and Worsfold, M.: The current configuration of the OSTIA system for operational production of foundation sea surface temperature and ice concentration analyses, Remote Sens.-Basel, 12, 720, https://doi.org/10.3390/rs12040720, 2020. a
Gupta, S., Boers, N., Pappenberger, F., and Kurths, J.: Complex network approach for detecting tropical cyclones, Clim. Dynam., 57, 3355–3364, https://doi.org/10.1007/s00382-021-05871-0, 2021. a
Haynes, K., Eckley, I. A., and Fearnhead, P.: Computationally efficient changepoint detection for a range of penalties, J. Comput. Graph. Stat., 26, 134–143, https://doi.org/10.1080/10618600.2015.1116445, 2017. a
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J. N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803, 2020. a, b, c
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 monthly averaged data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.f17050d7, 2023. a
Hirahara, S., Balmaseda, M. A., Boisseson, E., and Hersbach, H.: Sea Surface Temperature and Sea Ice Concentration for ERA5, ERA Report Series, 26, 1–25, https://www.ecmwf.int/en/elibrary/79624-sea-surface-temperature-and-sea-ice-concentration-era5 (last access: 5 May 2026), 2016. a, b
Huang, B., Liu, C., Banzon, V., Freeman, E., Graham, G., Hankins, B., Smith, T., and Zhang, H.-M.: Improvements of the daily optimum interpolation sea surface temperature (DOISST) version 2.1, J. Climate, 34, 2923–2939, https://doi.org/10.1175/JCLI-D-20-0166.1, 2021. a, b, c
Huang, B., Yin, X., Carton, J. A., Chen, L., Graham, G., Liu, C., Smith, T., and Zhang, H.-M.: Understanding differences in sea surface temperature intercomparisons, J. Atmos. Ocean. Tech., 40, 455–473, https://doi.org/10.1175/JTECH-D-22-0081.1, 2023. a, b, c, d
Ikuyajolu, O. J., Falasca, F., and Bracco, A.: Information entropy as quantifier of potential predictability in the tropical Indo-Pacific basin, Frontiers in Climate, 3, 675840, https://doi.org/10.3389/fclim.2021.675840, 2021. a
Jonasson, O., Gladkova, I., Ignatov, A., and Kihai, Y.: Progress with development of global gridded super-collated SST products from low Earth orbiting satellites (L3S-LEO) at NOAA, in: Ocean Sensing and Monitoring XII, Vol. 11420, SPIE, https://doi.org/10.1117/12.2551819, 5–22, 2020. a, b
Khapalova, E. A., Jandhyala, V. K., Fotopoulos, S. B., and Overland, J. E.: Assessing change-points in surface air temperature over Alaska, Frontiers in Environmental Science, 6, 121, https://doi.org/10.3389/fenvs.2018.00121, 2018. a
Killick, R., Fearnhead, P., and Eckley, I. A.: Optimal detection of changepoints with a linear computational cost, J. Am. Stat. Assoc., 107, 1590–1598, https://doi.org/10.1080/01621459.2012.737745, 2012. a, b
Krouma, M., Specq, D., Magnusson, L., Ardilouze, C., Batté, L., and Yiou, P.: Improving subseasonal forecast of precipitation in Europe by combining a stochastic weather generator with dynamical models, Q. J. Roy. Meteor. Soc., https://doi.org/10.1002/qj.4733, 2024. a
Kumar, R. and Bhandari, A. K.: Spatial mutual information based detail preserving magnetic resonance image enhancement, Comput. Biol. Med., 146, 105644, https://doi.org/10.1016/j.compbiomed.2022.105644, 2022. a
Leyva, I., Martínez, J. H., Masoller, C., Rosso, O. A., and Zanin, M.: 20 years of ordinal patterns: Perspectives and challenges, Europhysics Letters, 138, 31001, https://doi.org/10.1209/0295-5075/ac6a72, 2022. a
Messori, G., Caballero, R., and Faranda, D.: A dynamical systems approach to studying midlatitude weather extremes, Geophys. Res. Lett., 44, 3346–3354, https://doi.org/10.1002/2017GL072879, 2017. a
Muñoz-Guillermo, M.: Multiscale two-dimensional permutation entropy to analyze encrypted images, Chaos: An Interdisciplinary Journal of Nonlinear Science, 33, 013112, https://doi.org/10.1063/5.0130538, 2023. a
Novi, L., Bracco, A., Ito, T., and Takano, Y.: Evolution of oxygen and stratification and their relationship in the North Pacific Ocean in CMIP6 Earth system models, Biogeosciences, 21, 3985–4005, https://doi.org/10.5194/bg-21-3985-2024, 2024. a
Paluš, M., Chvosteková, M., and Manshour, P.: Causes of extreme events revealed by Rényi information transfer, Science Advances, 10, eadn1721, https://doi.org/10.1126/sciadv.adn1721, 2024. a
Parfitt, R. and Czaja, A.: On the contribution of synoptic transients to the mean atmospheric state in the Gulf Stream region, Q. J. Roy. Meteor. Soc., 142, 1554–1561, https://doi.org/10.1002/qj.2689, 2016. a
Politi, A.: Quantifying the dynamical complexity of chaotic time series, Phys. Rev. Lett., 118, 144101, https://doi.org/10.1103/PhysRevLett.118.144101, 2017. a
Prado, T., Corso, G., dos Santos Lima, G., Budzinski, R., Boaretto, B., Ferrari, F., Macau, E., and Lopes, S.: Maximum entropy principle in recurrence plot analysis on stochastic and chaotic systems, Chaos: An Interdisciplinary Journal of Nonlinear Science, 30, https://doi.org/10.1063/1.5125921, 2020. a
Reynolds, R. W., Smith, T. M., Liu, C., Chelton, D. B., Casey, K. S., and Schlax, M. G.: Daily high-resolution-blended analyses for sea surface temperature, J. Climate, 20, 5473–5496, https://doi.org/10.1175/2007JCLI1824.1, 2007. a
Ribeiro, H. V., Zunino, L., Lenzi, E. K., Santoro, P. A., and Mendes, R. S.: Complexity-entropy causality plane as a complexity measure for two-dimensional patterns, PLoS One, 7, 1–9, https://doi.org/10.1371/journal.pone.0040689, 2012. a
Roberts-Jones, J., Bovis, K., Martin, M. J., and McLaren, A.: Estimating background error covariance parameters and assessing their impact in the OSTIA system, Remote Sens. Environ., 176, 117–138, https://doi.org/10.1016/j.rse.2015.12.006, 2016. a
Rocha, R. V. and de Souza Filho, F. d. A.: Mapping abrupt streamflow shift in an abrupt climate shift through multiple change point methodologies: Brazil case study, Hydrolog. Sci. J., 65, 2783–2796, https://doi.org/10.1080/02626667.2020.1843657, 2020. a
Schlemmer, A., Berg, S., Shajahan, T., Luther, S., and Parlitz, U.: Quantifying spatiotemporal complexity of cardiac dynamics using ordinal patterns, in: 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, https://doi.org/10.1109/EMBC.2015.7319283, 4049–4052, 2015. a
Schlemmer, A., Berg, S., Lilienkamp, T., Luther, S., and Parlitz, U.: Spatiotemporal permutation entropy as a measure for complexity of cardiac arrhythmia, AIP Conf. Proc., 6, 39, https://doi.org/10.3389/fphy.2018.00039, 2018. a
Schreiber, T. and Schmitz, A.: Improved surrogate data for nonlinearity tests, Phys. Rev. Lett., 77, 635–638, https://doi.org/10.1103/PhysRevLett.77.635, 1996. a, b, c
Seidov, D., Mishonov, A., Reagan, J., and Parsons, R.: Multidecadal variability and climate shift in the North Atlantic Ocean, Geophys. Res. Lett., 44, 4985–4993, https://doi.org/10.1002/2017GL073644, 2017. a
Sigaki, H. Y. D., Perc, M., and Ribeiro, H. V.: History of art paintings through the lens of entropy and complexity, PNAS, 115, E8585–E8594, https://doi.org/10.1073/pnas.1800083115, 2018. a
Sigaki, H. Y. D., de Souza, R. F., de Souza, R. T., Zola, R. S., and Ribeiro, H. V.: Estimating physical properties from liquid crystal textures via machine learning and complexity-entropy methods, Phys. Rev. E, 99, 013311, https://doi.org/10.1103/PhysRevE.99.013311, 2019. a
Tarozo, M. M., Pessa, A. A. B., Zunino, L., Rosso, O. A., Perc, M., and Ribeiro, H. V.: Two-by-two ordinal patterns in art paintings, PNAS Nexus, 4, pgaf092, https://doi.org/10.1093/pnasnexus/pgaf092, 2025. a
Tirabassi, G. and Masoller, C.: Entropy-based early detection of critical transitions in spatial vegetation fields, P. Natl. Acad. Sci. USA, 120, e2215667120, https://doi.org/10.1073/pnas.2215667120, 2023. a
Tirabassi, G., Duque-Gijon, M., Tiana-Alsina, J., and Masoller, C.: Permutation entropy-based characterization of speckle patterns generated by semiconductor laser light, APL Photonics, 8, 126112, https://doi.org/10.1063/5.0169445, 2023. a
Titchner, H. A. and Rayner, N. A.: The Met Office Hadley Centre sea ice and sea surface temperature data set, version 2: 1. Sea ice concentrations, J. Geophys. Res.-Atmos., 119, 2864–2889, https://doi.org/10.1002/2013JD020316, 2014. a
Todd, R. E. and Ren, A. S.: Warming and lateral shift of the Gulf Stream from in situ observations since 2001, Nat. Clim. Change, 13, 1348–1352, https://doi.org/10.1038/s41558-023-01835-w, 2023. a
Truong, C., Oudre, L., and Vayatis, N.: Selective review of offline change point detection methods, Signal Process., 167, 107299, https://doi.org/10.1016/j.sigpro.2019.107299, 2020. a, b
Vries, H. D., Scher, S., Haarsma, R., Drijfhout, S., and Delden, A. V.: How Gulf-Stream SST-fronts influence Atlantic winter storms: results from a downscaling experiment with HARMONIE to the role of modified latent heat fluxes and low-level baroclinicity, Clim. Dynam., 52, 5899–5909, https://doi.org/10.1007/s00382-018-4486-7, 2019. a
Wang, S. and Fan, F.: Analysis of the response of long-term vegetation dynamics to climate variability using the Pruned Exact Linear Time (PELT) Method and Disturbance Lag Model (DLM) based on remote sensing data: a case study in Guangdong Province (China), Remote Sens.-Basel, 13, 1873, https://doi.org/10.3390/rs13101873, 2021. a
Wills, R. C. J., Dong, Y., Proistosecu, C., Armour, K. C., and Battisti, D. S.: Systematic climate model biases in the large-scale patterns of recent sea-surface temperature and sea-level pressure change, Geophys. Res. Lett., 49, e2022GL100011, https://doi.org/10.1029/2022GL100011, 2022. a
Xie, S.-P., Deser, C., Vecchi, G. A., Ma, J., Teng, H., and Wittenberg, A. T.: Global warming pattern formation: sea surface temperature and rainfall, J. Climate, 23, 966–986, https://doi.org/10.1175/2009JCLI3329.1, 2010. a
Yao, L., Lu, J., Xia, X., Jing, W., and Liu, Y.: Evaluation of the ERA5 sea surface temperature around the Pacific and the Atlantic, IEEE Access, 9, 12067–12073, https://doi.org/10.1109/ACCESS.2021.3051642, 2021. a
Zeng, X. and He, R.: Gulf Stream variability and a triggering mechanism of its large meander in the South Atlantic Bight, J. Geophys. Res.-Oceans, 121, 8021–8038, https://doi.org/10.1002/2016JC012077, 2016. a