Abstract
Xinjiang Uygur Autonomous Region is a typical inland arid region in China with a sparse and uneven distribution of meteorological stations, limited access to precipitation data, and significant water scarcity. Evaluating and integrating precipitation datasets from different sources to accurately characterize precipitation patterns has become a challenge to provide more accurate and alternative precipitation information for the region, which can even improve the performance of hydrological modelling. This study evaluated the applicability of widely used five satellite-based precipitation products (Climate Hazards Group InfraRed Precipitation with Station (CHIRPS), China Meteorological Forcing Dataset (CMFD), Climate Prediction Center morphing method (CMORPH), Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks-Climate Data Record (PERSIANN-CDR), and Tropical Rainfall Measuring Mission Multi-satellite Precipitation Analysis (TMPA)) and a reanalysis precipitation dataset (ECMWF Reanalysis v5-Land Dataset (ERA5-Land)) in Xinjiang using ground-based observational precipitation data from a limited number of meteorological stations. Based on this assessment, we proposed a framework that integrated different precipitation datasets with varying spatial resolutions using a dynamic Bayesian model averaging (DBMA) approach, the expectation-maximization method, and the ordinary Kriging interpolation method. The daily precipitation data merged using the DBMA approach exhibits distinct spatiotemporal variability, with an outstanding performance, as indicated by low root mean square error (RMSE=1.40 mm/d) and high Person’s correlation coefficient (CC=0.67). Compared with the traditional simple model averaging (SMA) and individual product data, although the DBMA-fused precipitation data are slightly lower than the best precipitation product (CMFD), the overall performance of DBMA is more robust. The error analysis between DBMA-fused precipitation dataset and the more advanced Integrated Multi-satellite Retrievals for Global Precipitation Measurement Final (IMERG-F) precipitation product, as well as hydrological simulations in the Ebinur Lake Basin, further demonstrate the superior performance of DBMA-fused precipitation dataset in the entire Xinjiang region. Our results showed that the proposed framework for solving the fusion problem of multi-source precipitation data with different spatial resolutions is feasible for application in inland arid regions, and aids in obtaining more accurate regional hydrological information and improving regional water resources management capabilities and meteorological research in these regions.