Generative deep learning for data generation in natural hazard analysis: motivations, advances, challenges, and opportunities

Abstract

Data mining and analysis are critical for preventing or mitigating natural hazards. However, data availability in natural hazard analysis is experiencing unprecedented challenges due to economic, technical, and environmental constraints. Recently, generative deep learning has become an increasingly attractive solution to these challenges, which can augment, impute, or synthesize data based on these learned complex, high-dimensional probability distributions of data. Over the last several years, much research has demonstrated the remarkable capabilities of generative deep learning for addressing data-related problems in natural hazards analysis. Data processed by deep generative models can be utilized to describe the evolution or occurrence of natural hazards and contribute to subsequent natural hazard modeling. Here we present a comprehensive review concerning generative deep learning for data generation in natural hazard analysis. (1) We summarized the limitations associated with data availability in natural hazards analysis and identified the fundamental motivations for employing generative deep learning as a critical response to these challenges. (2) We discuss several deep generative models that have been applied to overcome the problems caused by limited data availability in natural hazards analysis. (3) We analyze advances in utilizing generative deep learning for data generation in natural hazard analysis. (4) We discuss challenges associated with leveraging generative deep learning in natural hazard analysis. (5) We explore further opportunities for leveraging generative deep learning in natural hazard analysis. This comprehensive review provides a detailed roadmap for scholars interested in applying generative models for data generation in natural hazard analysis.

Global high-resolution growth projections dataset for rooftop area consistent with the shared socioeconomic pathways, 2020–2050

Abstract

Assessment of current and future growth in the global rooftop area is important for understanding and planning for a robust and sustainable decentralised energy system. These estimates are also important for urban planning studies and designing sustainable cities thereby forwarding the ethos of the Sustainable Development Goals 7 (clean energy), 11 (sustainable cities), 13 (climate action) and 15 (life on land). Here, we develop a machine learning framework that trains on big data containing ~700 million open-source building footprints, global land cover, road, and population datasets to generate globally harmonised estimates of growth in rooftop area for five different future growth narratives covered by Shared Socioeconomic Pathways. The dataset provides estimates for ~3.5 million fishnet tiles of 1/8 degree spatial resolution with data on gross rooftop area for five growth narratives covering years 2020–2050 in decadal time steps. This single harmonised global dataset can be used for climate change, energy transition, biodiversity, urban planning, and disaster risk management studies covering continental to conurbation geospatial levels.

Downscaling algorithms for CMIP6 GCM daily rainfall over India

Abstract

The global climate models (GCMs) are sophisticated tools for determining how the climate system will respond. However, the output of GCMs has a coarse resolution, which is unsuitable for basin-level modelling. Global climate models need to be downscaled at a local/basin scale to determine the impacts of climate change on hydrological responses. The present study attempted to evaluate how effectively various large-scale predictors could reproduce local-scale rain in 35 different locations in India using artificial neural networks (ANN), change-factors (CF), K-nearest neighbour (KNN), and multiple linear regression (MLR). The selection of predictors is made based on the correlation value. As potential predictors, air temperature, geo-potential height, wind velocity component, and relative humidity at specific mean sea-level pressure are selected. The comparison of four different downscaling methods concerning the reproduction of various statistics such as mean, standard deviation at chosen locations, quantile–quantile plots, cumulative distribution function, and kernel density estimation of the PDFs of daily rainfall for selected stations is examined. The CF approach outperforms the other methods at almost all sites (R2 = 0.92–0.99, RMSE = 1.37–28.88 mm, and NSE = –16.55–0.99). This also closely resembles the probability distribution pattern of IMD data.

Uniformly elevated future heat stress in China driven by spatially heterogeneous water vapor changes

Abstract

The wet bulb temperature (Tw) has gained considerable attention as a crucial indicator of heat-related health risks. Here we report south-to-north spatially heterogeneous trends of Tw in China over 1979-2018. We find that actual water vapor pressure (Ea) changes play a dominant role in determining the different trend of Tw in southern and northern China, which is attributed to the faster warming of high-latitude regions of East Asia as a response to climate change. This warming effect regulates large-scale atmospheric features and leads to extended impacts of the South Asia high (SAH) and the western Pacific subtropical high (WPSH) over southern China and to suppressed moisture transport. Attribution analysis using climate model simulations confirms these findings. We further find that the entire eastern China, that accommodates 94% of the country’s population, is likely to experience widespread and uniform elevated thermal stress the end of this century. Our findings highlight the necessity for development of adaptation measures in eastern China to avoid adverse impacts of heat stress, suggesting similar implications for other regions as well.

MAUNet: a max-average neural network architecture for precipitation downscaling

Abstract

Most operational weather and climate models carry out forecasts or simulations of atmospheric variables at low spatial resolutions, but we often need high-resolution projections of such variables. The process of mapping low-resolution projections to high-resolution projections is called spatial downscaling. This is analogous to the computer vision task of single-image super-resolution (SISR). In recent studies, convolution-based architectures including UNet and its variants have emerged as a good choice for SISR. Since the gridded spatial map of any climate variable is analogous to an image, we can use the SISR-based models for spatial downscaling. In this paper, we present a novel UNet-based architecture called max-average UNet (MAUNet) to downscale the precipitation. We have proposed Max-Average Units (MAUs) that include a max-pooling, an average-pooling, and an averaging unit for the encoding part of the UNet. For the decoding part, we develop UpSampler Unit (USU), which too utilizes averaging. We demonstrate the importance of max-averaging units through toy experiments on the MNIST and CelebA dataset. We also examine the role of dropouts for this task and experimentally demonstrate their skill to improve the model’s performance on different datasets. The task examined here is to obtain fourfold resolution enhancement of monsoon precipitation over the Indian landmass and also over Southeast Continental USA (CONUS) region. The models are evaluated using RMSE, PSNR, MSSIM, and correlation coefficient as evaluation matrices. We have made detailed comparisons to show that MAUNet can produce superior downscaling than standard interpolation techniques as well as previously used deep learning models like UNet, EDSR, and SRDRN.

Spatiotemporal variations of ecosystem services in the Aral Sea basin under different CMIP6 projections

Abstract

The Aral Sea, located in Central Asia, has undergone significant reduction in surface area owing to the combined impacts of climate change and human activities. This reduction has led to a regional ecological crisis and profound repercussions on ecosystem services. Investigating the spatiotemporal variations and synergistic trade-offs of ESs in the Aral Sea basin is crucial for fostering the integrated development of the region’s socioeconomic ecology. This study utilizes the Future Land-Use Simulation and InVEST models to analyze future land-use scenarios, integrating CMIP6 projections to assess the quality of four key ecosystem services: water production, soil conservation, carbon storage, and habitat quality over two timeframes: the historical period (1995–2020) and the projected future (2021–2100). Employing Spearman correlation, the study explores the trade-offs and synergies among these ecosystem services. Findings reveal that the primary forms of land-use change in the Aral Sea basin are the reduction in water area (− 49.59%) and the rapid expansion of urban areas (+ 504.65%). Temporally, habitat quality exhibits a declining trend, while carbon storage shows an increasing trend, and water production and soil retention fluctuate initially decreasing and then increasing. Spatially, water production and carbon storage demonstrate an increasing trend from the northwest to the southeast. Habitat quality exhibits a higher spatial pattern in the southeast and south, contrasting with lower spatial patterns in the north and west. Low-level soil conservation is predominantly distributed in the northwest, while medium to low-level soil conservation is prevalent in the east of the basin. The trade-off and synergy analysis indicates that between 1995 and 2020, a trade-off relationship existed between carbon storage and habitat quality and water production, whereas synergies were observed between soil conservation and carbon storage, water production and habitat quality, and soil conservation. The correlation between water production and soil conservation emerges as the strongest, whereas the correlation between carbon storage and habitat quality appears to be the weakest. The dynamic spatiotemporal changes, trade-offs, and collaborative relationships of ESs constitute major aspects of ecosystem service research, holding substantial implications for the effective management of the regional ecological environment.

An evaluation of the reliability of the Weather Research Forecasting (WRF) model in predicting wind data: a case study of Burundi

Abstract

Background

The Weather Research and Forecasting (WRF) Model is an exceptional software for mesoscale climate modeling. It is extensively used to simulate key meteorological variables, including temperature, rainfall, and wind.

Methods

This study thoroughly examined the effectiveness of the WRF model in generating precise wind data for assessing the potential of wind power in Burundi. A meticulous evaluation of various combinations of model physics parameterization schemes was conducted to ensure accuracy.

By comparing the simulated data with measurements from four meteorological stations and utilizing statistical metrics such as root-mean-square error (RMSE) and bias, the accuracy of the WRF model was determined.

Results

 The findings of the study uncovered that utilizing WRF Single-Moment 3-Class (WSM3) for microphysics, Grell-Devenyi ensemble for cumulus physics, and Yonsei University for planetary boundary layer yields highly accurate wind data results for Burundi.

Furthermore, the WRF model was utilized to create detailed seasonal and annual mean wind maps with a high resolution.

Conclusion

These maps demonstrated that the western part of Burundi experiences higher wind speeds (ranging from 4 to 9.7 m/s) during the dry seasons revealing the potential for wind energy harvesting in the different areas of Burundi.

An evaluation of the reliability of the Weather Research Forecasting (WRF) model in predicting wind data: a case study of Burundi

Abstract

Background

The Weather Research and Forecasting (WRF) Model is an exceptional software for mesoscale climate modeling. It is extensively used to simulate key meteorological variables, including temperature, rainfall, and wind.

Methods

This study thoroughly examined the effectiveness of the WRF model in generating precise wind data for assessing the potential of wind power in Burundi. A meticulous evaluation of various combinations of model physics parameterization schemes was conducted to ensure accuracy.

By comparing the simulated data with measurements from four meteorological stations and utilizing statistical metrics such as root-mean-square error (RMSE) and bias, the accuracy of the WRF model was determined.

Results

 The findings of the study uncovered that utilizing WRF Single-Moment 3-Class (WSM3) for microphysics, Grell-Devenyi ensemble for cumulus physics, and Yonsei University for planetary boundary layer yields highly accurate wind data results for Burundi.

Furthermore, the WRF model was utilized to create detailed seasonal and annual mean wind maps with a high resolution.

Conclusion

These maps demonstrated that the western part of Burundi experiences higher wind speeds (ranging from 4 to 9.7 m/s) during the dry seasons revealing the potential for wind energy harvesting in the different areas of Burundi.

Statistical downscaling of precipitation in northwestern Iran using a hybrid model of discrete wavelet transform, artificial neural networks, and quantile mapping

Abstract

Downscaling of daily precipitation from Global Circulation Models (GCMs)-predictors at a station level, especially in arid and semi-arid regions, has remained a formidable challenge yet. The current study aims at proposing a coupled model of Discrete Wavelet Transform (DWT), Artificial Neural Networks (ANNs), and Quantile Mapping (QM) for statistical downscaling of daily precipitation. Given the historic (1978–2005) and future (2006–2100) predictors of eight-selected GCMs under Representative Concentration Pathways (RCPs) 2.6, 4.5, and 8.5, a viable DWT-ANN model was developed for each station. Subsequently, we linked QM to DWT-ANN for bias correction and drizzle effect postprocessing of the DWT-ANN-historic/future projected precipitation. The skill of DWT-ANN-QM was demonstrated using various evaluation metrics, including Taylor diagram, Quantile–Quantile plot, Empirical Cumulative Distribution Function, and wet/dry spell analysis. We appraise the efficacy of the coupled model at 12 weather stations over the Gharehsoo River Basin (GRB) in northwestern Iran. Compared to the observed wet/dry spells, the dry-spells were better simulated via DWT-ANN-QM rather than the wet-spells wherein length and exceedance probability of the spells were overestimated. Results indicated that the future precipitation across the GRB will rise, on average, from 10 to 17% depending on weather station. Seasonal spatial distribution of the middle future (2041–2070) precipitation illustrated that an increase for fall and winter, especially, is expected, whereas the amount of the future spring and summer precipitation is projected to be declined. Having been developed and tested in a semi-arid basin, the efficacy of the coupled model should be further assessed in more humid climates.

Modeling deficit irrigation water demand of maize and potato in Eastern Germany using ERA5-Land reanalysis climate time series

Abstract

ERA5-Land reanalysis (ELR) climate time series has proven useful in (hydro)meteorological studies, however, its adoption for local studies is limited due to accuracies constraints. Meanwhile, local agricultural use of ELR could help data-scarce countries by addressing gaps in (hydro)meteorological variables. This study aimed to evaluate the first applicability of the ELR climate time series for modeling maize and potato irrigation water demand (IWD) at field scale and examined the performance of ELR precipitation with bias correction (DBC) and without bias correction (WBC). Yield, actual evapotranspiration (ETa), irrigation, water balance, and crop water productivity (CWP) were evaluated using the deficit irrigation toolbox. The study found that maize (13.98–14.49 ton/ha) and potato (6.84–8.20 tons/ha) had similar mean seasonal yield under different irrigation management strategies (IMS). The Global Evolutionary Technique for OPTimal Irrigation Scheduling (GET-OPTIS_WS) IMS had the highest mean seasonal yields under DBC and WBC, while rainfall and constant IMS had the most crop failures. DBC had a higher mean seasonal ETa than WBC, except for the potato FIT and rainfall IMS. Global Evolutionary Technique for OPTimal Irrigation Scheduling: one common schedule per crop season (GET-OPTIS_OS) and GET-OPTIS_WS IMS outperformed conventional IMS in IWD by 44%. Overall, GET-OPTIS_OS and GET-OPTIS_WS performed best for maize and potato CWP in terms of IWD, scheduling, and timing. Therefore, adoption of ELR climate time series and advanced irrigation optimization strategies such as GET-OPTIS_OS and GET-OPTIS_WS can be beneficial for effective and efficient management of limited water resources, where agricultural water allocation/resource is limited.