Recent trends in the frequency and duration of global floods

Najibi, Nasser; Devineni, Naresh

doi:https://doi.org/10.5194/esd-9-757-2018

Articles | Volume 9, issue 2

© Author(s) 2018. This work is distributed under
the Creative Commons Attribution 4.0 License.

Special issue:

Hydro-climate dynamics, analytics and predictability

Research article | 08 Jun 2018

Final revised paper (published on 08 Jun 2018)
Preprint (discussion started on 04 Jul 2017)

Interactive discussion

Status: closed

AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment

RC1: 'Review of Recent Trends in Frequency and Duration of Global Floods by Nasser Najibi and Naresh Devineni', Anonymous Referee #1, 17 Aug 2017
- AC1: 'Authors' Responses to RC#1 and RC#2', Nasser Najibi, 06 Oct 2017
RC2: 'Review of Najibi and Devineni (2017)', Anonymous Referee #2, 22 Aug 2017

Peer-review completion

AR: Author's response | RR: Referee report | ED: Editor decision

ED: Reconsider after major revisions (30 Oct 2017) by Julia Hall

Dear Authors,
Thank you for your reply to the referees’ comments in the interactive discussion stage of the manuscript.

You have already indicated in your reply to the referees that you have already incorporated some of the comments. Apart from these points, I think it would be important that you add the elaborations/clarifications that you provided to the referees into your manuscript, as these sections of the manuscript clearly seem to be in need of further explanation. This would strengthen both the focus and the clarity of the manuscript.

After taking into consideration the manuscript, the referee comments, your responses to the referees and my own editorial comments (for details please see below), the manuscript would benefit from a thorough revision to clarify certain aspects and to further strengthen the document, particularly with regard to the validity of data and methods used.

I look forward to receiving the revised version of your manuscript that should incorporate the reviewers’ comments and the editorial remarks below.

With best wishes,
Julia Hall

General Comments:
Overall, it is essential that all the comments of the reviewers are incorporated into the manuscript. In particular, if points were in need of further elaboration (as pointed out by one of the reviewers), please add the explanations directly into the manuscript (with similar detail as in the response to the reviewer question).

I) The core of the manuscript is the analysis of the Dartmouth flood Observatory global data set. However, as also pointed out by one of the reviewers, the data might not have the required quality for the trend analysis performed in the manuscript.
Therefore, when the data is introduced, the authors need to present the uncertainties and possible data quality issues associated with the dataset in detail. That is, the quality assessment needs to be done a priori, or at the latest accompany the data analysis procedure itself, so that the reader knows of the limitations and possible influences on the results and the outcomes are not misleading. Performing the analysis and only later discussing possible issues with the data quality in the discussion (as currently done in the manuscript) is thus not appropriate. Additionally, the authors should discuss in detail why the data quality is sufficient to perform the analysis on the data (similar to their response to the reviewer). This is of particular importance with regard to the analysis of the flood frequency, as, for example, reporting bias has a strong impact on the results.

II) All methods need to be moved into the methods section and not appear in the results section. Additionally, the GLM framework is an important part of the paper and should be further elaborated. This means not presenting it at the end of the paper but making it a fundamental part of the analysis, which also requires the methodological details to be moved to the methods section.

III) In parts of the analysis, the characteristics of the flood duration distribution (moments) are analysed for a change. However, each year has a different sample size. This results in the possibility that the changes observed could be artificially caused by the different sample sizes. This is of particular importance for the southern mid-latitudes, where only 59 floods are recorded for the entire period, which means that on average there are about 2 floods per year. However, the maximum for one year in this region is 7 floods, which will leave many years without a flood and in turn not really allow for a meaningful analysis of the changes in any of the analysed variables.
Therefore, the authors need to show that the changes in sample size do not impact the results, or find a way to correct for the changes in the sample size.
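
The sample-size concern in comment III can be illustrated with a small simulation. The sketch below is not taken from the manuscript or the review. It draws hypothetical flood durations from a single, fixed exponential distribution (an assumed stand-in for the real duration distribution, with an assumed mean of 10 days) and compares the average sample skewness obtained with few and with many events per year. All function names are illustrative.

```python
import random
import statistics

def sample_skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (0.0 if there is no spread)."""
    mean = statistics.fmean(values)
    m2 = statistics.fmean([(v - mean) ** 2 for v in values])
    m3 = statistics.fmean([(v - mean) ** 3 for v in values])
    return m3 / m2 ** 1.5 if m2 > 0 else 0.0

def mean_skewness(sample_size, replications=2000, mean_duration=10.0, seed=1):
    """Average sample skewness over many simulated 'years' of a given size."""
    rng = random.Random(seed)
    skews = []
    for _ in range(replications):
        # Same underlying distribution in every replication; only n changes.
        durations = [rng.expovariate(1.0 / mean_duration) for _ in range(sample_size)]
        skews.append(sample_skewness(durations))
    return statistics.fmean(skews)

skew_few_events = mean_skewness(sample_size=5)     # e.g. a sparsely sampled belt
skew_many_events = mean_skewness(sample_size=200)  # e.g. a densely sampled belt
print(f"n=5: {skew_few_events:.2f}   n=200: {skew_many_events:.2f}")
```

Because the distribution is identical in both cases, any difference between the two averages comes from the sample size alone, which is the kind of artefact the comment asks the authors to rule out or correct for.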

IV) Both trend analysis and change point analysis are performed. Generally, the presence of a change point influences the interpretation of the obtained statistically significant trends (Villarini et al., 2009). Therefore, after the detection of a change point, the time series before and after the change point should also be analysed separately to put the overall trend into perspective.
Additionally, the data seems to not have a monotonic change but rather an oscillatory behaviour (e.g. Fig 2). Instead of ignoring this characteristic and simply reporting the results of trend analysis, the authors need to at least discuss this or use some method that can account for such characteristics in the time series.
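
As a rough illustration of why the segments on either side of a change point should be analysed separately, the sketch below computes the Sen slope (the median of all pairwise slopes) for a hypothetical series of annual flood counts that contains only a step change. The data and the change-point position are invented for illustration and are assumed to come from a separate change-point test.

```python
import statistics

def sen_slope(series):
    """Median of all pairwise slopes (x[j] - x[i]) / (j - i) for i < j."""
    slopes = [
        (series[j] - series[i]) / (j - i)
        for i in range(len(series))
        for j in range(i + 1, len(series))
    ]
    return statistics.median(slopes)

# Hypothetical annual flood counts: a step change after year 10, no trend otherwise.
counts = [2] * 10 + [6] * 10
change_point = 10  # assumed output of a separate change-point analysis

slope_full = sen_slope(counts)
slope_before = sen_slope(counts[:change_point])
slope_after = sen_slope(counts[change_point:])
print(slope_full, slope_before, slope_after)
```

In this constructed case the full series yields a positive slope although neither segment has any trend, so the overall "trend" is entirely a product of the step.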

V) As already pointed out by reviewer 2 (point 4): The MK test might not be the best for zero-inflated count data. How is a zero count being handled in the analysis (i.e. as a zero value or as missing data)? This choice influences the output of the MK test. Please consider using trend tests that are meaningful for the data used, or explicitly justify why the MK test is considered appropriate. See also Table 5, where one obtains a statistically significant trend using the MK test but the Sen slope is zero. What is the physical meaning of this?
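
The combination of a significant MK result with a zero Sen slope can be reproduced on synthetic data. The sketch below is an illustration under stated assumptions, not the authors' implementation: it uses a standard Mann-Kendall statistic with tie correction and a normal approximation, and a hypothetical 31-year series of yearly counts in which 24 years are zero and the seven non-zero years cluster at the end.

```python
import math
import statistics
from collections import Counter

def mann_kendall(series):
    """Return (S, Z, two-sided p) for the Mann-Kendall test with tie correction."""
    n = len(series)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = series[j] - series[i]
            s += (diff > 0) - (diff < 0)
    # Groups of tied values reduce the variance of S.
    tie_term = sum(t * (t - 1) * (2 * t + 5) for t in Counter(series).values())
    variance = (n * (n - 1) * (2 * n + 5) - tie_term) / 18
    if s > 0:
        z = (s - 1) / math.sqrt(variance)
    elif s < 0:
        z = (s + 1) / math.sqrt(variance)
    else:
        z = 0.0
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return s, z, p_value

def sen_slope(series):
    """Median of all pairwise slopes (x[j] - x[i]) / (j - i) for i < j."""
    slopes = [
        (series[j] - series[i]) / (j - i)
        for i in range(len(series))
        for j in range(i + 1, len(series))
    ]
    return statistics.median(slopes)

# Hypothetical zero-inflated yearly counts (31 years, zeros treated as values).
counts = [0] * 31
for year_index in (20, 23, 25, 26, 28, 29, 30):
    counts[year_index] = 1

s, z, p_value = mann_kendall(counts)
slope = sen_slope(counts)
print(f"S={s}, Z={z:.2f}, p={p_value:.4f}, Sen slope={slope}")
```

Here more than half of all pairwise differences are ties, so the median slope is exactly zero even though the rank-based test rejects the no-trend hypothesis. Treating the zero years as missing data instead would change both results, which is why the handling of zeros needs to be stated.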

VI) The data is aggregated using latitudinal belts. This reasoning needs to be further elaborated, as the current reasoning is not convincing. When moving from west to east in each of these latitudinal belts, the temperature and precipitation characteristics change considerably, for example due to increasing continentality. Therefore, ‘binning’ floods from, for example, the west coast of Europe with floods in central Russia or even northern China is not intuitive and requires a better explanation.

VII) I agree with the reviewer that, given the uncertainties with the data quality (homogeneity), the results should be corroborated with another global dataset such as the GRDC. There has been a previous study that showed agreement for the DFO in Europe and the US; however, this does not mean that this is the case for regions in which the data is more uncertain. This fact needs to be mentioned/discussed.

VIII) I would suggest that section 4 include a discussion of the results in the context of already existing (partly smaller scale) studies on flood frequency/duration, as section 4.1 should be moved into the data section.

IX) Spatial aggregation (globally or into latitudinal belts) of a GPH destroys crucial information needed to characterise atmospheric circulations, namely spatial contrasts. Therefore, any trends calculated over time series resulting from these aggregated fields do not provide physically consistent information to characterise changes in atmospheric circulations. In fact, the weather is not determined by the value of the GPH per se but rather by spatial contrasts (related to spatial pressure gradients). Therefore, the analysis has no physical meaning with regard to flood frequency or duration. Similarly, a spatial aggregation of PWC (globally or into latitudinal belts) and the associated trend does not provide any information whether it will rain or not as it will depend where the PW content is located. Therefore, the analysis of these variables does not provide any additional insights.
Consequently, there is no physical value in using these aggregated GPH and PWC as predictors in the GLM. It is important that the authors reconsider the variables that they use for the GLM.
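
The point about spatial contrasts can be made concrete with a toy example. The sketch below uses two hypothetical geopotential height fields (three grid points each, values in metres chosen purely for illustration) that share the same spatial mean but have very different west-east contrasts.

```python
import statistics

# Hypothetical 500 hPa geopotential heights (m) at three grid points, west to east.
flat_field = [5600.0, 5600.0, 5600.0]
sloped_field = [5500.0, 5600.0, 5700.0]

def spatial_mean(field):
    """Aggregate a field to a single number, as a belt average would."""
    return statistics.fmean(field)

def west_east_contrast(field):
    """Height difference between the easternmost and westernmost grid point."""
    return field[-1] - field[0]

mean_flat = spatial_mean(flat_field)
mean_sloped = spatial_mean(sloped_field)
contrast_flat = west_east_contrast(flat_field)
contrast_sloped = west_east_contrast(sloped_field)
print(mean_flat, mean_sloped, contrast_flat, contrast_sloped)
```

A time series built from the aggregated value cannot distinguish these two situations, although only the second one implies a pressure gradient.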

X) When rewriting, please pay attention to spelling, grammar and sentence structure.

Specific Comments:

Throughout the document the phrase ‘at all spatial scales’ is used several times. However, it is not clear what this expression means. Do you want to refer to ‘both at global and in the latitudinal belts’? If so, the expression could be misunderstood. The same potential for a misunderstanding applies to ‘latitudinal scales’. Please replace these phrases with something less ambiguous.

Tables 9 and 10 contain colour, which will not be possible in the final publication. Please remove the colours and adjust the tables so that they are readable/understandable without them.

L 35 ‘This research…’ Which one? Please specify.
L39-40. ‘Understanding these trends can help…’ How can the understanding of what aspect of the trends help? Please specify.

L65: ‘…most susceptible…’ In your analysis only the percentages of the three classes of flood duration are given and 4 countries are shown as an example. This has nothing to do with susceptibility or ‘vulnerability’ as mentioned in other parts of the manuscript. A clear definition of ‘susceptibility’/‘vulnerability’ as used in this paper needs to be provided.

L68: ‘We consider…’ is this a hypothesis? Please be more specific.

L70: ‘Understanding the temporal trends (regime like behaviour)…’ The meaning is not clear. Additionally, the understanding … will help to understand better… In the current manuscript, the trends are not ‘understood’, but rather characterised. (See also comment above P2L39-40)

L 71-74: ‘this will ultimately lead to…’ How? I think this sentence is out of scope of the current research.

L 92: The data might be ‘globally consistent’ (i.e. same methods) but what about temporal consistency? Please elaborate.
L101: please provide more detail on how the ‘unusually large’ are being quantitatively determined.

L112 ‘as suggested by Env (2016)’? Is this an incomplete reference?

L113 ‘…more consistent…’ compared to what? Please specify.

L115 ‘four countries that have a high flood frequency’ It is not clear why these countries were selected, as from Fig 7 there seem to be other countries, such as Indonesia or Vietnam, that have a higher number of floods than Thailand. Please elaborate.

L 190: the maximum number of floods is not occurring around 2005. It is only the LOESS that peaks in this period for most of the analysed regions. I recommend a more careful interpretation of the results.

L213: please provide a ‘physical’ meaning of MAD.

L 259: ‘The magnitude…’ This can easily be confused with flood magnitude. Please use ‘The length…’ or similar.

L 308: Please specify whether it is possible to have more than one flood of different length per year. If so the statement ‘to have at least one flood per year’ should be changed to ‘to have on average one flood per year’. Additionally, please elaborate why it is considered of importance to have the 31 floods per country.

L315: The analysis is interesting, but please provide some explanation/interpretation of the results.

L321, L323, L325: Please add % after each number.

L 442: Please replace the term ‘attributed’ as no attribution within a formal attribution framework has been performed.

L 446: Please elaborate why no formal autocorrelation testing was performed.

L394-395: The quotation of Wang and Zhou is not correct, since the statement cannot be substantiated from their analysis. Moreover, the statement is physically inconsistent.

L 396-398: This is not clear. Please rewrite.

L 453-462: I am not sure that this section deserves a separate section in the discussion of the results. I suggest rather incorporating it into a general discussion section.

L 491-494: This was not really shown in the paper. Please focus only on what has been found in the current manuscript.

P 29: For Figure 7a) I suggest using a ‘ternary plot’ (also called triangle plot, simplex plot, Gibbs triangle or de Finetti diagram) instead for ease of interpretation.

References:
Villarini et al 2009, On the stationarity of annual flood peaks in the continental United States during the 20th century, WATER RESOURCES RESEARCH, VOL. 45, W08417, doi:10.1029/2008WR007645.


AR by Nasser Najibi on behalf of the Authors (05 Jan 2018) Manuscript

ED: Referee Nomination & Report Request started (08 Jan 2018) by Julia Hall

RR by Anonymous Referee #3 (05 Feb 2018)

RR by Anonymous Referee #2 (16 Feb 2018)

Suggestions for revision or reasons for rejection

Review of Najibi and Devineni (2018) for ESD

I have re-reviewed the paper by Najibi and Devineni (2018) for ESD. My original major concern was the use of the DFO dataset as a single source of information on flood frequency and duration on which the conclusions of the paper are founded. I acknowledge that the authors have answered most of my original comments to a satisfactory degree, and indeed made many positive changes to the manuscript. This includes addressing issues around assumptions for statistical testing and including more information about the limitations of the DFO dataset.

However, after reading the revised manuscript the same concerns about the sole use of the DFO dataset still remain. Here is the author response to this concern: “Regarding the DFO dataset as a single source of information employed here, we should emphasize that to our knowledge, this is the first analysis of “global flood events” that focuses exclusively on the variability of “flood duration” derived from DFO dataset over the last three decades. The database has recorded flood inundation events using satellite sources and media verification since 1985, and currently has over 4200 entries with approximate location of the center of the area flooded, the dates and duration of flooding and notes as to societal impacts. This is the only global data set of this kind and we believe that an analysis of the trends provides value. Much of the prior studies either focused on rainfall-based datasets or model-based river flow data. In this regard, our study adds a new dimension to the flood literature (especially the understanding of the floods that last for longer time) at a global scale”.

My view is that being the first study to analyse an event-based dataset at the global scale is not the only criterion for scientific publication. The conclusions made are profound and could be misleading if they are not supported by other event-based and/or physical flood datasets. For example, there are five key conclusions outlined on Pg. 16 in the revised manuscript: “The frequency of flood events has increased”; “there is a statistically significant trend in the moments of flood duration at the global scale”; “The yearly number of moderate and long duration flood occurrences increased”; “there was no monotonic trend observed in the frequencies of short duration floods”, and “the increase in frequency of long duration floods during recent years can be related to the persistent patterns in the low-frequency climate induces”. There is a chance that these conclusions could be an artefact of the known changing quality and/or changing sources of information ingested into the DFO dataset since 1985, not to mention the relatively short period of record used for a trend analysis (without trying to place shorter term patterns into their longer term context).

Of particular concern is whether the ‘flood duration’ variable in the DFO dataset relates to physical flood inundation extent/duration of inundation, or is just an artefact of news reporting. According to the DFO website (http://floodobservatory.colorado.edu/Archives/ArchiveNotes.html), flood duration is established from the reported flood start and end dates, and from this website: “Ocassionally there is no specific beginning date mentioned in news reports, only a month; in that case the DFO date will be the middle of that month. Ending dates are often harder to determine - sometimes the news will note when the floods start to recede. We make an estimate based on a qualitative judgement concerning the flood event”. To what degree of certainty does the flood duration variable used in the DFO, and hence a key aspect of the paper, relate to physical flood duration? Is this qualitative uncertainty mentioned on the DFO website only for a small number of floods in some lower income countries, or is the use of such qualitative estimates of flood duration common within the DFO dataset as a whole?
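
To make the effect of the mid-month rule tangible, the sketch below compares the duration of one hypothetical flood computed from its actual start date with the duration obtained when only the month is known. The dates are invented, the duration convention (end date minus start date) is an assumption, and "the middle of that month" is assumed here to mean the 15th.

```python
from datetime import date

def duration_days(start, end):
    """Duration as end minus start in days (an assumed convention)."""
    return (end - start).days

def mid_month(year, month):
    """Start date used when only the month is reported (assumed to be the 15th)."""
    return date(year, month, 15)

# Hypothetical flood: actually began on 3 June and ended on 25 June 2010.
true_start = date(2010, 6, 3)
end = date(2010, 6, 25)

true_duration = duration_days(true_start, end)
recorded_duration = duration_days(mid_month(2010, 6), end)
print(true_duration, recorded_duration)
```

In this constructed case the recorded duration is less than half the actual one, and the error could equally go the other way for a flood that began late in the month.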

I acknowledge that the authors corroborate flood frequency from DFO with the EM-DAT dataset in their response to my original review, but why did this not feature in the main analysis in the revised manuscript, or at least in the supplementary information? Are both datasets using similar sources of news reporting? Does the temporal pattern of flood frequency match physical flood frequency as observed with river flow gauges over the same time period, notwithstanding the known issue of low density gauges in many regions?

Overall, I do not feel confident accepting these conclusions based only on the DFO dataset. This is unfortunate as I think the authors have performed a nice analysis with very interesting and extremely worthwhile hypotheses, but at this time, in my opinion, the underlying dataset by itself does not provide a strong enough foundation for the application of these hypotheses and hence conclusions.

A possible suggestion for moving this study forward would be to perform a validation of reported flood frequency and duration based on in-situ observations, at least for regions with available overlapping data (as suggested in my original review). I understand this will be more straightforward for flood frequency than for flood duration (as flood inundation is not directly measured at a flow gauging station), but the only other option would be for a consistent satellite product to be available across the full 1985-2015 study period, which I do not believe exists(?). Nonetheless, some effort towards increased corroboration and/or validation even for a range of case study regions/basins would provide a path towards supporting the conclusions. I hope the authors understand my concern and can find the time to provide more validation.

Other comments

Pg 3, Line 6: There have been several authors (e.g. Merz et al., 2012) who highlight the lack of scientific rigour in attribution studies in hydrology. The terms “attribute/attributed” should be avoided if a rigorous formal attribution framework is not applied.

Pg 4, Line 2 and 3: Need to mention that ‘flood duration’ within DFO is simply calculated from the flood beginning and end date, and need to expand on to what degree the start and end date is from news reporting and/or from satellite images, and over which time periods (i.e. MODIS only started in 1999)?

Pg 7, Lines 26-29 & pg 8 Lines 1-7: This is methodological detail about trend tests and would be best placed in section 2.

Pg 10, Lines 2-3: Delete/rephrase sentence on “found an abrupt shift” as no longer assessing change points.

Pg 11, Line 24: Should “30 and 40%” not be “20 and 30%” if my interpretation of Fig. 7 is correct?

References

Merz, B., Vorogushyn, S., Uhlemann, S., Delgado, J., and Hundecha, Y.: HESS Opinions "More efforts and scientific rigour are needed to attribute trends in flood time series", Hydrol. Earth Syst. Sci., 16, 1379-1387, https://doi.org/10.5194/hess-16-1379-2012, 2012.


ED: Reconsider after major revisions (02 Mar 2018) by Julia Hall

Dear Authors,
Thank you for submitting a revised version of your manuscript, which incorporated most of the referees' comments.

Following the latest reviews provided by the referees and the referee reports in the previous round of the discussion, it becomes clear that the data source used in this analysis is still a major concern.
In particular, the comments provided by Anonymous Referee #2 raise important points in this regard. For example, if no date is mentioned in the data, the middle of the month is used. This certainly affects the calculation of the flood duration and any subsequent analysis of the time series.
Both referees recommend that the uncertainties need to be better explained and highlighted and that results obtained should at least be partly validated with additional data sets such as the one from the GRDC.
Therefore, to improve the confidence in the analysis, the authors should validate their results using a different data set. Regions for validation should be selected with particular focus on where the data quality in the DFO is perceived to be low.

Moreover, please take into account all the other points highlighted by the two referees, as they both raise valid concerns.

Additionally, the authors mention several times the changes before and after ~2000. This is an interesting finding; however, it again raises concerns about the data homogeneity/reliability, as the authors themselves mention that consistent information in the data exists mainly since 1999. This fact needs to be further investigated and discussed in detail in the discussion section.

The abstract strongly highlights the results about trend results. Given that the DFO data is used (which has known data quality issues and all the concerns raised by the referees) a statement regarding the uncertainties associated with this data needs to be included. Additionally, in the data section 2.1 a more detailed discussion on data quality is needed so that the reader is aware of the potential issues BEFORE the data is analyzed and the results are presented.

Section 4.2.1. More detail needs to be added to the results by linking the statistical results obtained to the actual physical mechanisms, explaining how these interconnections interact with the floods and why their effects differ across the indices analyzed.

Additional comments:

P1L14-15: Add reference to this statement.

P2L 22: Unclear. Please elaborate more how 'temporal trend' and 'regime like behavior' are related.

P3L6: Add sentence on the country scale analysis.

P4L15: Add already a sentence in this section why these 4 countries were selected.

Section 3.2: Please add some elaboration to each subsection on what these changes mean for the floods in a physical way, e.g. what does it mean when 'the asymmetrical/symmetrical behavior of the distribution … changes from 5 to ~8'?

Section 3.4: Much detail on the methods presented in this section should be moved to the methods section.

P11L19-20: Please provide more detail on this data on damages (e.g. how was it derived in the original data set)

Section 3.4: Please add a paragraph summarizing the most important findings from this section. Currently this is not clear.

Section 4.1. Please discuss the possible effects on the trends obtained and the uncertainties associated with the in-homogeneity of the data pre- and post- inclusion of the MODIS product.

Figure 8a) The figure is difficult to read due to the thin line width of the gray points. Please increase the quality of the figure.


AR by Nasser Najibi on behalf of the Authors (07 Apr 2018) Author's response Manuscript

ED: Referee Nomination & Report Request started (17 Apr 2018) by Julia Hall

RR by Anonymous Referee #2 (30 Apr 2018)

Suggestions for revision or reasons for rejection

The authors have taken on board my previous concerns by providing additional clarification and most importantly now include some validation of the DFO dataset in the Appendix. I believe this step was absolutely necessary. The uncertainty about the validity of flood duration in DFO has been assessed and the flood frequency patterns have been corroborated by the EM-DAT data at the global scale between 1985-2015. The map of the overlap between GRDC stations and reported flood events in the DFO dataset across 1985-2015 in the response to referee #2 just shows the limited global (open) availability of in situ discharge observations – there is no substitute to good quality observations, but given the limited availability this study provides a worthwhile contribution to the literature. I commend the authors on their efforts to address previous concerns during the peer-review process and recommend for publication in the special issue. Below are some suggested technical corrections:

Pg1; L2: Suggest changing “detect the significant trends” to “explore evidence of trends”, or something similar.
Pg1; L2: Change “at the global and the latitudinal scales” to “at global and latitudinal scales”
Pg1; L4: Please remove “(H1, H2, and H3)” as these are not explicitly (nor need to be) defined in the abstract
Pg1; L5: Please remove reference to H4, suggest to change to “We also evaluated if trends could be related to large-scale atmospheric teleconnections using a Generalized Linear Model framework”.
Pg14; L1: Change “ground-based” to “in situ”
Pg18; L2: Please change “fairly” to “reasonably”
Pg20; L24: No need to add “(it is significant at 5% significance level)” given you provide the p-value anyway, please make sure to remove this in any other instance throughout the manuscript


ED: Publish subject to technical corrections (16 May 2018) by Julia Hall

Dear Authors,
Thank you for submitting a revised version of your manuscript.

The manuscript improved during the discussion phase, particularly due to the additional analysis performed to identify/account for possible data shortcomings/inaccuracies and the increased attention to the physical interpretation of the results obtained.

After the revisions there is one point remaining that should be better accounted for/discussed and communicated to the reader:
From the responses and the figures provided by the authors to the reviewers, it becomes evident that there is a likely bias of the start date of floods that is reported in the DFO.
The authors highlight that only 11% of all flood events start on the 1st (6.5%) and the 15th (4.5%) of a month. However, the flood counts on these dates are considerably higher (more than double and almost double, respectively) compared to the ~2.8% of flood events starting on an ‘average day’.
This indicates the existence of a bias in the reporting of the start date in the DFO database.
Additionally, when examining the distribution of flood dates across the month, it also becomes apparent that days 1, 5, 10, 15, 20 and 25 have an increased number of flood counts (roughly 4%), which also suggests a reporting bias (likely rounding of the start date to a number that can be divided by 5).
Therefore, the authors need to discuss the possibility of a reporting bias in the date in the data section, together with the possible implications of such a bias for the results of the analysis in the discussion section (and provide the figure with the counts per day in the appendix).
Additionally, the authors also need to check and report on this for the end date of the floods.
The authors should also note in the data section (as they have already done in their response to the referees) that there seems to be no spatial pattern for these apparent reporting biases.
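
A check of this kind can be sketched in a few lines. The example below is only an illustration with invented records, not the DFO data or the authors' procedure: it computes the share of events starting on each day of the month and flags days whose share exceeds an assumed threshold of 1.5 times the median share. The same function could be applied to end dates.

```python
import statistics
from collections import Counter
from datetime import date

def day_of_month_shares(dates):
    """Share of events falling on each day of the month, keyed by day."""
    counts = Counter(d.day for d in dates)
    total = len(dates)
    return {day: counts[day] / total for day in sorted(counts)}

def overrepresented_days(shares, factor=1.5):
    """Days whose share exceeds `factor` times the median share (assumed rule)."""
    typical = statistics.median(shares.values())
    return [day for day, share in shares.items() if share > factor * typical]

# Hypothetical start dates: 10 events on each of days 1-28, plus extra events
# piled onto the 1st and the 15th to mimic rounding in the reports.
start_dates = []
for day in range(1, 29):
    start_dates += [date(2000, 1, day)] * 10
start_dates += [date(2000, 1, 1)] * 15
start_dates += [date(2000, 1, 15)] * 8

shares = day_of_month_shares(start_dates)
flagged_days = overrepresented_days(shares)
print(flagged_days)
```

Using the median share as the baseline keeps the reference value from being inflated by the very days under suspicion; the factor of 1.5 is arbitrary and would need to be justified for real data.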

When conducting the revision, please also incorporate the technical corrections suggested by Referee #2.

Additional comments:
P4 L9: Please provide the reference to the publication in which the DFO reports this statement.

P14 L1-5: Please describe in more detail what ‘very little errors’ are (i.e. quantify what ‘very little’ entails in number of days). Additionally, given the discussed shortcomings of the DFO data, please replace ‘evidently reliable’ with a term that better describes the outcome of the comparison.

P18L1-2 Please specify in more detail what ‘fairly reliable’ entails.

Figure 8a: The grey color used for the point data is too light, which makes it very difficult to distinguish the shapes used to indicate the different countries. Maybe the grey color used for the points could be replaced by the colors used for the countries. Additionally, the two different shades of red are too similar. Please consider using different colors.


AR by Nasser Najibi on behalf of the Authors (23 May 2018) Author's response Manuscript

Short summary

A global assessment of flood events using the Dartmouth Flood Observatory (DFO) database is performed here to explore the planetary nature of the trends in the frequency and duration of floods (short, moderate, and long). This comprehensive study is the very first global study of actual flood events which identifies temporal changes in frequencies and characteristics of probability distribution of flood durations to understand the changing organization of the local to global dynamical systems.