Articles | Volume 11, issue 2
https://doi.org/10.5194/ascmo-11-203-2025
https://doi.org/10.5194/ascmo-11-203-2025
09 Oct 2025
 | 09 Oct 2025

A spatio-temporal weather generator for the temperature over France

Caroline Cognot, Liliane Bel, David Métivier, and Sylvie Parey
Abstract

Stochastic weather generators are efficient statistical models producing synthetic weather series by replicating key statistical properties without the computational cost of physical models. However, for applications requiring temperature simulation over a large area, challenges arise due to non-stationarity over time and spatial-temporal dependencies. This paper introduces a daily stochastic weather generator for temperature with arbitrary spatial resolution. The non-stationarity issue is addressed using a decomposition method to separate deterministic terms (trends and seasonality), from the stochastic part representing the underlying climate variability. We extend the existing local decomposition method to extrapolate to any point in space. The spatial-temporal dependence is modeled through a Gaussian field with a non-separable covariance function, accommodating complex interactions between time and space. Our generator, calibrated on a few French weather stations, is validated using several spatio-temporal indicators. First, we evaluate the generator's performance at the fitting stations, comparing simulated and observed indicators. Subsequently, we compare our spatial simulations to a high resolution gridded observation dataset. Results demonstrate that the proposed generator accurately captures the observed spatio-temporal statistics, even for extreme events such as large scale persistent heat waves.

Share
1 Introduction

Many fields are impacted by climate change, from agriculture and health to energy management. These impacts can be studied through risk evaluation, which often relies on probabilistic models that use meteorological variables as inputs. While historical records can provide such data, they are often limited in both space and time and may not represent future conditions accurately. Another limitation is that they typically offer only a single realization, which can lead to an underestimation of variability.

Physical climate models can generate additional data, but they are computationally expensive, making it difficult to perform many simulations. Moreover, their coarse spatial resolution makes them unsuitable for local studies. In this context, stochastic weather generators have emerged as powerful tools for simulating weather data that captures both spatial and temporal variability (see IPCC2023, Sect. 10.3.3.7). These generators consider observations of meteorological variables as one of the infinite possible trajectories of a stochastic model. The primary objective is to simulate numerous plausible sequences of the desired variables that share common statistical properties with the observations. These simulations are fast and allow for a more comprehensive sampling of climate variables than physical models.

In the context of climate adaptation in the energy sector, it is crucial for decision-makers to understand the potential risks associated with events that affect large areas simultaneously or involve a set of specific sites distributed across the area, such as production sites. A desirable model must effectively capture the spatial dependence in temperature and be capable of generating simulations for sites where no data is available. Another key objective is to separate the contribution of the changing climate (long-term trend) from stationary effects. This distinction allows the model, once calibrated to the present climate, to be adapted for future climate conditions.

Since the development of the first single-site weather generator by Richardson (1981), many others have been proposed using various approaches to produce spatially correlated series. Resampling methods do not require a model for the spatial relationship, instead, the values of all variables at all sites at time t+1 are generated based on the nearest neighbors at time t (Rajagopalan and Lall1999; Buishand and Brandsma2001; Prairie et al.2006). The resampling procedure can be conditioned on weather states (Apipattanavis et al.2007) or circulation variables (Beersma and Buishand2003). More recent techniques use analogues (Yiou2014). Although these non-parametric methods have the advantage of being distribution-free, their limitation lies in their inability to generate new, unobserved values, which often leads to simulations that are overly similar to historical events and underestimate variability. Semi-parametric procedures mitigate this problem by fitting single-site margins, generate simulations, and reconstruct the relationships between all variables and sites using a shuffling method, related to empirical copulas (Clark et al.2004; Li2014; Li and Babovic2019; Li et al.2019; Wang et al.2021). Recent works have also explored the use of neural networks (Ji et al.2024; Sha et al.2024).

When constructing a model for the spatial relationship, two different approaches can be separated. Multisite models focus on a small number of distributed sites and treat each location as a separate variable and rely on multivariate statistics to represent inter-site correlations, employing methods such as multivariate autoregressive models (Wilks2009; Dubrovsky et al.2020) or copulas (Kroiz et al.2021; Dawkins et al.2022). These methods have two main limitations: they cannot generate data for new locations, and they often require estimating a large number of parameters, making them difficult to scale. While feasible for small numbers of sites, they do not efficiently utilise spatial information and become impractical for larger grids. The use of Empirical Orthogonal Functions (EOFs) as in Sparks et al. (2018) offers dimensionality reduction but does not resolve location dependence issues. Contrary to that, approaches making use of spatial statistics focus on dense, local grids. They use Gaussian process models, which treat dependence as a spatial or spatio-temporal field, allowing for data generation at unobserved locations once the model is fitted (Wilks2009; Kleiber et al.2012; Baxevani and Lennartsson2015; Verdin et al.2015; Bourotte et al.2016; Sparks et al.2018; Verdin et al.2019). However, their application as of now has mostly been limited to specific regions, while broader applications require simulations across larger areas.

The temporal dependence for each location is usually modeled using the same autoregressive (AR) structure introduced in Richardson (1981). It captures both the temporal dependence and the distribution of the variables (Wilks2009; Dubrovsky et al.2020). The AR model can be modified to accomodate various distributions such as the skew exponential power distribution used by Evin et al. (2019), to better represent the tails of the distribution. The parameters can depend on time to represent the spatial-temporal interactions. In a spatial model, the AR approach can be replaced by a covariance model that quantifies variability between two points separated in space, for every time difference, and the space-time interactions.

Weather variables have strong non-stationarity. When available, covariates can be introduced in the model to tackle both spatial and temporal instationarity using Generalised Linear Models (GLMs), Generalised Additive Models (GAMs), or related methods (Furrer and Katz2007; Holsclaw et al.2016; Chandler2020; Dawkins et al.2022). Common covariates for this purpose include large-scale information such as sea surface temperature (SST) or climate indices. One drawback of these approaches is that they may be unable to generate events that differ significantly from observations, as they constrain generation to observed patterns. A proxy for this large-scale information is the use of weather types which are fitted directly on the weather data; see Allard et al. (2015) for a review, Gobet et al. (2025) for a recent application. The temporal non-stationarity include seasonal effects. They can be addressed by fitting a separate set of parameters for each month, as in Verdin et al. (2015), or by using smooth periodic functions (Evin et al.2019). Global warming can be taken into account by using a linear model to represent the increase in mean temperature over the past century. This increase can include a breakpoint to capture recent acceleration (Touron2019; Evin et al.2019).

In this work, we build from the preprocessing approach of Hoang (2010) that decomposes the temperature series for one location into deterministic and stochastic terms. We propose a spatial-temporal model for temperature by extending this concept to a spatial grid.

The chosen approach uses non-parametric LOESS (LOcally Estimated Scatterplot Smoothing; Cleveland1979) regression to extract long-term trends from station data spread across France. Compared to simpler approaches based on linear trends, it better represents interannual variability caused by large-scale atmospheric patterns. It also captures the varying pace of warming over time more effectively. The seasonality of the temperature is obtained through parametric regression with trigonometric functions, in the mean and the variance. We propose an interpolation method to obtain spatial trends and seasonality from components computed separately for each location.

Residuals from the decomposition are assumed to be a centered Gaussian process for each location. Here, they are modeled as a spatio-temporal stationary Gaussian process. The interactions between space and time are modeled using a non-separable covariance function as in Allard et al. (2022). While previous applications of such a model have focused on smaller regions, we show that this approach allows to account for the strong spatial dependence in temperature across a large area.

We also propose indicators to validate the model, focusing on important properties of the temperature. We compare these indicators computed on observations to simulations generated with our model. These indicators include correlations in space and time. We show that our model accurately reproduces these correlations, showing a good capacity to reproduce the spatial-temporal relationships. We develop a particular focus on extremes, using first pairwise threshold exceedance. We also develop an indicator for spatial events called the quantile exceedance ratio, that is used to represent the spatial size of heat events, and show that the simulations from our model slightly underestimate the real size of spatial events, producing heat episodes that are quite as large as observed events.

The paper is organised as follows. In Sect. 2, we describe the data used to train our model. The model is described in detail in Sect. 3. In Sect. 4, we first simulate and test the model on weather stations used during the training. Then, we validate it on a high resolution gridded dataset at the scale of France. Finally, we conclude with a discussion of future work in Sect. 5.

2 Data

The generator developed in this work is applied to French daily average temperature data. We use weather station data from the ECA&D dataset to fit the model parameters and validate it by comparing gridded spatial simulations to the E-OBS gridded dataset.

2.1 The ECA&D dataset

The ECA&D dataset (https://www.ecad.eu/, last access: March 2023; Klein Tank2002) is a high-quality European dataset of observed daily station data, covering periods that range from centuries for some stations to decades for others. We used the February 2023 version of the dataset, which includes 2068 stations across Europe with less than 8 % missing data for the period 1985–2015. From these, we selected 41 stations located in France, as shown in Fig. 1, forming the set of locations in space 𝒮stations⊂𝒟, where 𝒟= [6° W; 10° E] × [42° N; 52° N].

These stations are distributed regularly in space, with distances between pairs ranging from 70.95 to 1263.78 km, as illustrated in Fig. 2 (left). They are mostly situated in low-elevation areas (less than 500 m), as depicted in the histogram in Fig. 2 (right).

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f01

Figure 1Selected 41 stations from the ECA&D dataset.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f02

Figure 2Histograms of the distances between stations (a) and of the elevation within the dataset (b).

Download

For each location, missing values in the temperature series were replaced with the average value for the corresponding calendar day. As there are only few stations with missing data, and at most 6.99 % of missing values in that case, using this method does not have a significant impact on the variance of the data. Days corresponding to the 29 February were removed for simplification. The remaining data is indexed by day tT=[[1,nT]], with nT= 11 315 the number of days in the series.

2.2 The E-OBS dataset

The E-OBS dataset (Cornes et al.2018) is a gridded version of the ECA&D dataset. It is obtained from the station data by statistical learning methods. Each day is interpolated independently first by fitting a spatial trend with a Generalised Additive Model (GAM) on the coordinates and background field, and then a kriging of the residuals with an exponential variogram and a fixed nugget parameter representing measurement uncertainty. The background field represents the monthly values of temperature. The GAM uses smoothing splines in order to model the large-scale spatial trends. The E-OBS dataset exists in 2 versions with different resolutions, 0.1 or 0.25°, we use the 0.25° version of April 2023 (version 27.0e). In the following, the set of grid points will be named 𝒮grid⊂𝒟, with |Sgrid|=1071.

3 Methodology

In this section, we present the methodology for constructing a spatio-temporal model for daily average temperature. This model extends the single-site temporal model introduced by Hoang (2010) to a spatial context. We adapt the procedures for estimation, simulation, and validation to accommodate the spatio-temporal framework.

3.1 Model

Let X(s,t) be the temperature field at location s∈𝒟 and day t∈𝒯. X(s,t) is written as a combination of deterministic and stochastic terms. As in the very first weather generator of Richardson (1981), the temperature series is standardised by its variance. Following Hoang (2010), our decomposition also accounts for the annual cycle of variance and its temporal trend.

(1) X ( s , t ) = T m ( s , t ) + S m ( s , t ) + T σ 2 ( s , t ) S σ 2 ( s , t ) Z ( s , t ) , s D , t T ,

where Tm(s,t) and Sm(s,t) are the trend and the seasonality in the mean, Tσ2(s,t) and Sσ2(s,t) are the trend and seasonality in the variance. Z is the stochastic part called residuals. The trend terms are interpreted as representing long-term climate variations, and the seasonality terms represent the cyclic influence of the sun's radiation. Both of them represent the deterministic components of the temperature, while the stochastic part represents the intra-annual climate variability. The main assumption made in this work is that spatial and temporal non-stationarities are in the margins, while the dependence structure can be assumed stationary. Consequently, all the non stationarities are taken into account in the trend which is a function of time and space, and the dependence structure is handled in the residuals covariance which is stationary.

3.1.1 Deterministic terms

The trends in temperature over time for a given point in space s, Tm(s,),Tσ2(s,) are commonly modeled using linear regression or constants. However, to better capture long-term dynamics, we prefer as in Hoang (2010) to use LOESS (LOcally Estimated Scatterplot Smoothing; Cleveland1979). LOESS regression fits local linear regression lines at time t to the data at time t1,,tn based on a chosen span parameter sps, with tri-cubic weighting proportional to (1-|t-tisps|3)3. This non-parametric approach provides a more accurate representation of long-term variations compared to a simple linear trend.

The seasonality terms are modeled as trigonometric polynomials of the following form:

(2) S θ β θ ( s , t ) = β θ , 1 ( s ) + i = 1 d θ [ β θ , 2 i ( s ) cos 2 i π t 365 + β θ , 2 i + 1 ( s ) sin 2 i π t 365 ] , s D , t T ,

with βθR2dθ+1 the seasonality parameters, with θ=m or σ2 either the mean or the variance, and dθ the degree of the trigonometric polynomial. The value of dθ has to be chosen large enough to capture seasonal details (summer, winter, transitions) but not too large to avoid overfitting.

3.1.2 A spatio-temporal model for the residuals

Due to the decomposition of Eq. (1), we assume that the spatio-temporal residuals Z(s,t),sD,tT is a Gaussian random field (Cressie and Wikle2015) with zero mean, stationary, and isotropic properties. Under these assumptions, the residuals are fully characterised by their covariance function

(3) C ( h , u ) = Cov ( Z ( s + h , t + u ) , Z ( s , t ) ) , h R + , u R + ,

where h is the spatial distance and u is the temporal lag.

Choosing a covariance model that reflects the properties of the data is not an easy task. There are numerous covariance models that can describe spatio-temporal dependence, see Porcu et al. (2021).

A straightforward approach to constructing a spatio-temporal covariance function is to use the product of a spatial covariance function CS and a temporal covariance function CT:

(4) C sep ( h , u ) = C S ( h ) C T ( u ) .

This model is known as separable and is computationally convenient because the full covariance matrix can be expressed as the Kronecker product of two smaller matrices. However, as noted in the literature (De Iaco et al.2019; Porcu et al.2021), separable models may not adequately capture complex interactions between space and time, particularly for environmental or meteorological variables. In such cases, more sophisticated models that account for space-time interactions are required.

Non-separable spatio-temporal covariance functions can be constructed easily from spatial and temporal covariance functions. For instance the product-sum model is obtained using the two covariance functions CS and CT and positive weights a1,a2,a3 by constructing the spatio-temporal function CST(h,u)=a1CS(h)+a2CT(u)+a3CS(h)CT(u). Another construction can be made using a single covariance function C and two positive scaling factors a1,a2 by CST(h,u)=C(a12h2+a22u2). a12h2+a22u2 can be seen as a space-time distance. This construction considers time as a 3rd dimension of space, which oversimplifies the fundamentally different behaviors and units of space and time.

To tackle this problem, Gneiting (2002) proposes a construction method for classes of non-separable covariance function, that is, covariance functions that do not satisfy Eq. (4). It was extended by Allard et al. (2020) for the class of Gneiting-type covariance functions. Given ϕ a completely monotone function, γ a continuous variogram , i.e., a conditionally negative semidefinite function (Chilès and Delfiner2012),

(5) C γ , ϕ ( h , u ) = σ 2 ( γ ( u ) + 1 ) ϕ | | h | | 2 γ ( u ) + 1 , h R 2 , u R ,

is a spatio-temporal non-separable covariance function that propose more flexibility in the interaction between the decay of spatial correlation over time than separable models. Usually, the spatial function ϕ(h) is of the form ϕ0(h/r) with r a spatial scale parameter. In the case of Eq. (5), the space-time relationship can be considered as using a spatial scale parameter that varies with the temporal lag, with the variogram γ representing how the temporal distance interacts with the spatial distance. Another advantage of this construction is the existence of ad hoc simulation methods as described in Sect. 3.3.2.

We use the following functions for ψ, γ as in Allard et al. (2022):

ϕ(x)=M(x;r,ν),

where M(,r,ν) is the Matérn function with spatial range r>0, regularity ν>0.

γ(t)=ta2α+1b-1,a>0,b[0,1],α[0,1]

This leads to the model defined as:

(6) C γ , ϕ ( h , u ) = σ 2 u a 2 α + 1 b M h u a 2 α + 1 b ; r ; ν

The parameter b is called the separability parameter. When it equals to 0, the model reduces to a separable function. When b>0, the model is non-separable.

One drawback of this model is that the temporal covariance C(0,u), which describes the covariance at a fixed spatial location, can be overly constrained by the chosen parameters. A solution to this problem is to multiply Eq. (6) by a purely temporal covariance function of the form ua2α+1-δ. By introducing a slight reparametrization, τ=δ+b, this yields the Gneiting-Matern covariance model Gneiting (2002, Eq. 16)

(7) C GM ( h , u ) = σ 2 ( 1 - η 2 ) u a 2 α + 1 τ M h u a 2 α + 1 b ; r ; ν + σ 2 η 2 1 h = 0 , u = 0

The nugget effect η2 is added to account for small-scale variability and measurement error, as a proportion of the total variance.

This model has recently found applications and extensions, particularly for meteorological variables. Gneiting et al. (2010) introduced a multivariate spatial version of the model. Subsequently, Bourotte et al. (2016) and Allard et al. (2022) refined it by proposing multivariate spatio-temporal classes.

3.2 Estimation

Estimating the parameters of the trend and seasonality terms βθ and the parameters σ2,η2,α,a,b,r,δ,ν of the covariance function simultaneously is theoretically possible maximising the whole likelihood function. However, estimating that many parameters at the same time may cause computational challenges due to identifiability issues. We prefer proceeding by a multi-stage approach and estimate “deterministic terms” first (site-by-site), introducing robustness in the estimation. This approach comes down to estimate marginal parameters first, then the covariance parameters, ensuring a stable and interpretable decomposition of the spatiotemporal structure.

3.2.1 Deterministic terms

The different components describing the temperature X(s,t) in Eq. (1) are first obtained for each station location s∈𝒮stations, for tT=[[1:nT]] in 4 steps:

  1. First, a LOESS regression on X(s,) is used to get Tm(s,). The span parameter sps is chosen using the Modified Partitioned Cross-Validation (MPCV) of Hoang (2010).

  2. The trigonometric polynomial coefficients βm(s) (Eq. 2) involved in the seasonality of the mean Smβm(s,), are obtained by linear regression on X(s,)-Tm(s,). The order of the polynomial dm(s) is chosen using the Akaike criterion.

  3. The variance term defined as V(s,)=X(s,)-Tm(s,)-Sm(s,)2. Tσ2(s,t), is obtained by LOESS regression on the variance V(s,) with the same span parameter sps as in the trend in mean.

  4. The trigonometric polynomial coefficients βσ2(s) in Sσ2βσ2(s,), are obtained by linear regression on V(s,)Tσ2(s,). The order of the polynomial dσ2(s) is also chosen with the Akaike criterion and can be different from dm(s).

This decomposition approach is described in detail in Hoang (2010). It relies on a modified partitioned validation criterion suited to heteroscedastic and dependent data to choose the optimal smoothing parameter. The obtained residuals Z are zero-mean, stationary in time for each site s and with variance that averages to 1 over the year. The order in which the components are estimated bears some importance: if the smoothing parameter is fixed, estimating seasonality first then trends leads to the same results as estimating trends first then seasonality. However, if seasonality is identified first, it includes some part of the trend, and the optimal parameter may not be the same as the one obtained when identifying the trend first.

3.2.2 Residuals

Once the trends and seasonality components are obtained, the values of the residual field Z(s,t)sSstations,t[[1,nT]] can be calculated and used to estimate the parameters of the Gneiting-Matern covariance model in Eq. (7).

The covariance matrix of the vector residuals Z is of size |Sstations|×nT2 , and computing its inverse becomes infeasible when |Sstations| and nT exceed a few tens, which limits the use of full likelihood methods. Composite Likelihood methods (Lindsay1988; Varin et al.2011), which involve only pairwise likelihoods, are much easier to compute and exhibit good convergence properties. For a dataset (y1,,yn) and a vector of parameters θ, this approach means replacing the optimisation of the likelihood L(y1,,yn;θ) by the pairwise likelihood PL(y1,,yn;θ)=i<jL(yi,yj;θ). To further reduce the number of terms in the pairwise likelihood, we employ the Weighted Composite Likelihood (WCL), assigning a weight of 0 to pairs of sites that are far apart based on a threshold distance, chosen as half the maximum observed distance (around 650 km in this study) and a weight of 1 otherwise. This choice of half the observed distance is a usual practice in the field of geostatistics. Bourotte et al. (2016) for example have studied the relative efficiency of the pairwise likelihood estimator, showing that it first increases then decreases as a function of this cutoff parameter.

In the pairwise estimation procedure, the score function L(y1,,yn;θ) is replaced by the composite score PL(y1,,yn;θ). In that case, the classical Fisher information matrix is no longer appropriate for variance estimation. Instead, we rely on the Godambe information matrix (Lindsay1988) defined as

(8) G ( θ ) = H ( θ ) - 1 J ( θ ) H ( θ ) - 1 ,

where H(θ) is the sensitivity matrix (the negative expected derivative of the pairwise score function) and J(θ) is the variability matrix (the variance of the pairwise score). Because these components are not directly available from the pairwise likelihood, they are estimated via empirical averages of the observed scores and their derivatives (Varin et al.2011).

3.3 Simulation

Simulations are to be performed over an arbitrary set of points in space 𝒮, such as a grid of points 𝒮grid covering the country, according to Eq. (1), where the deterministic components are extrapolated on each point, and the residuals are simulated as a spatio-temporal Gaussian field with covariance function determined by Eq. (7).

3.3.1 Spatial extension of the deterministic terms

Spatial extrapolation of parameters computed at station locations is achieved using ordinary kriging (Cressie1991) a statistical interpolation method that leverages spatial correlation between data points. The key to kriging is the variogram model γ(h), which quantifies the spatial dependence between points at distance h. For a spatial variable W, the variogram is defined as γ(h)=12EW(x)-W(x+h)2. Prediction at a location x0 is a weighted sum of the observed values W*(x0)=i=1nλiW(xi), where the weights depend on the chosen variogram model.

Seasonality

The seasonality coefficients βm(s),βσ(s) for s∈𝒮 are obtained by ordinary kriging of the coefficients calculated at locations s∈𝒮stations using a Gaussian variogram fitted to the observed data. The Gaussian variogram is defined as follows:

(9) γ g ( h ) = g 1 + g 2 1 - exp - h 2 g 3 2 , g 1 , g 2 , g 3 0 .

The annual cycles are then obtained by plugging these estimated coefficients in the seasonality functions defined in Eq. (2).

Trends

To provide the trends at each grid point, we first correct the trend in mean at sites s∈𝒮stations using the International Standard Atmosphere (ISA) model (ISO1975) which describes the evolution of the standard temperature as a function of altitude:

(10) T ( z ) = T 0 - c z ,

where c=6.5×10-3 °C m−1 and T0 is the temperature at sea level (0 elevation). An equivalent temperature trend at z=0 is given by

(11) T eq ( s , t ) = T m ( s , t ) + c z ( s ) , s S stations .

Teq is interpolated from 𝒮stations to 𝒮 by kriging with a linear variogram γl(h)=g+h,g0. The trend at any grid point s is then recovered by:

(12) T m ( s , t ) = T eq ( s , t ) - c z ( s ) , s S .

3.3.2 Spatio-temporal Gaussian process simulation

Simulating from the covariance function in Eq. (7) at sites 𝒮 and times t∈𝒯 means simulating from a Multivariate Gaussian process with |S|×|T| components. Working with large spatiotemporal grids makes it unmanageable to simulate a Gaussian process using standard procedures like the Cholesky decomposition, which has in general a complexity in O(|S|3×|T|3), and requires the manipulation of matrices of size |S|2×|T|2. To address this issue, we use two different methods: an iterative method and a spectral method.

Iterative simulation

We take advantage of the temporal structure of the data, noting that time steps far apart are unlikely to be dependent. Using this observation, simulations at time t are obtained iteratively, conditioned only on the previous time steps, according to the conditional distribution of Gaussian vectors. The choice of is consistent to the temporal range parameter estimated in the covariance model. The Cholesky decomposition is then computed on matrices of size (|S|×)2 at most, which is reasonable if is not too large.

Spectral simulation

Allard et al. (2020) propose various algorithms based on spectral decomposition for simulating spatial and spatio-temporal processes with covariance models of Gneiting type, such as in Eq. (5). We use the algorithm known as the substitution approach.

Let Y be the spatio-temporal process defined by

(13) Y ( s , t ) = - 2 log ( U ) cos ( 2 R Ω , x ( s ) + | Ω | / 2 W ( t ) + Φ ) , s S , t T ,

where, with the notations of Eq. (5), x(s) are the cartesian coordinates of s, UU([0,1]), Rμ, ΩN(0,I2), Wγ, ΦU([0,2π]) and the measure μ is the measure related to the Bernstein representation of ϕ. This process has the covariance function in Eq. (5). For p copies of such a process Yi,i[[1,,p]], the Central Limit Theorem states that

(14) Z ̃ ( s , t ) = 1 p i = 1 p Y i ( s , t ) , s S , t T ,

converges to a Gaussian process with the covariance function of Eq. (5) when p→∞.

This approach separates the simulation of the spatial component (simulated from μ) and the temporal component (simulated from γ), thereby reducing the dimensionality of the problem in both space and time. It is more scalable than the iterative method, which is faster for smaller grids but becomes very slow as the grid size increases. Conversely, the spectral simulation remains tractable even for larger grids, with a complexity in O(p|S|×|T|)

3.4 Validation

To assess the validity of the weather generator, several indicators are computed for the simulations and compared to those obtained from the observations. We will compare the envelope of the simulated values with the observed values to evaluate the generator.

For spatio-temporal models, informative indicators are correlation between pairs of stations (si,sj)Sstations2 both in the temperature Cor(X(si,),X(sj,)) and in the residuals Cor(Z(si,),Z(sj,)) (Sparks et al.2018; Wilks1998).

Because of the non-separable assumption, it is also important to evaluate the spatio-temporal correlations, between different stations and at different time lags: Cor(Z(si,+Δt),Z(sj,)).

Finally, to evaluate the ability of the model to generate joint spatial extremes, we consider the pairwise conditional threshold exceedances for high (or low) quantiles, pi,jα (and pi,jα): for 0<α<1 and qα(i) such that α=P(X(si,)<qα(i)) and a pair of stations (si,sj)Sstations2

(15) p i , j α = P ( X ( s i , ) < q α ( i ) | X ( s j , ) < q α ( j ) )  and  p i , j α = P ( X ( s i , ) > q 1 - α ( i ) | X ( s j , ) > q 1 - α ( j ) ) ,

They provide insight into how pairs of stations behave during extreme temperature events and are empirically estimated by

(16) p ^ i , j α = t T 1 X ( s i , t ) < q α ( i ) X ( s j , t ) < q α ( j ) t T 1 X ( s j , t ) < q α ( j )  and  p ^ i , j α = t T 1 X ( s i , t ) > q 1 - α ( i ) X ( s j , t ) > q 1 - α ( j ) t T 1 X ( s j , t ) > q 1 - α ( j ) .

The values of p^i,jα for the winter season and p^i,jα for the summer season, for all pairs {i,j}, are then compared with the values from the simulations. Note that qα(i) is not necessarily equal to qα(j).

In order to better understand the capacity of the model to show spatial dynamics, let us define the quantile exceedance ratio Rα(t). It represents the proportion of locations in a set of locations 𝒮 exceeding a given quantile at any time. For t∈𝒯, 0<α<1, this proportion is written as

(17) R α ( t ) = s S 1 X ( s , t ) > q 1 - α ( i ) | S | .

This indicator serves two main purposes. First, its survival function P(Rα(t)), computed over all time samples, describes the model's ability to generate “spatial hot days”. These are days when at least a certain (high) percentage of the domain exceeds their (high) quantile of a specified order. Second, the indicator can be used to examine the temporal evolution of this proportion, which relates to persistent heat waves. Heat waves with the most significant impact (large spatial extent over several days) are expected to exhibit a plateau for most quantiles, including even higher ones.

It is similar to the heatwave indicator of Miloshevich et al. (2023), where a measure of persistence and amplitude of anomalies is given by

(18) A ( t ) = 1 T t t + T 1 D D X ( t , s ) - E ( X ( t , s ) ) d s d t ,

where T is the time scale of interest, 𝒟 the spatial domain, X the temperature. E(X(t,s)) is the expected value of temperature at day t. This indicator averages the temperature anomaly over both space and a time period of length T. Compared to our quantile exceedance ratio of Eq. (17), it can yield non-zero values even for mild events, as long as the temperature remains above its expected value over the given period and space. However, the resulting value may be low if extreme temperatures occur only in part of the domain, while the rest of the area experiences mild or even cool temperatures, making it difficult to distinguish between different types of events. In contrast, our indicator reflects the presence of extreme values within part of the area. Additionally, this indicator is limited to detecting events that occur on a time scale close to T. Isolated hot days may go undetected.

4 Application: temperature over France

4.1 Estimated parameters

4.1.1 Deterministic components

The decomposition for each station in the ECA&D dataset is computed on the 1960–2015 data when available, or otherwise on the longest available period. Because of the initial choice of stations, it is ensured that at least the 1985–2015 period is covered. The decomposition produces the non-parametric trends and the seasonality parameters for each station.

An example of single-site decomposition is shown in Fig. 3 for the Toulouse-Blagnac station. The seasonality reveals the typical winter/summer oscillation. The trend in the mean indicates a general increase with occasional low peaks, showing an average increase of 2° over the years of observations. This highlights the necessity of including a trend and using a non-linear model. For this station, as for the others, the seasonal variance is higher in winter compared to summer. The trend in variance reflects contrasting patterns in winter and summer, along with interannual variability. The seasonality term in the variance acts as a modulating factor on the trend component, together forming the overall variance structure. The trend captures long-term changes and can be considered approximately constant throughout a single year. The seasonal component adjusts this baseline, meaning that the variance is more pronounced in winter than in summer. The slight increase in variance observed during summer in Toulouse in Fig. 3 likely reflects an increasing trend in summer compared to the annual trend, possibly because extreme events with higher amplitude tend to occur more frequently in summer in the context of French climatology. Other sites may not have this increase, instead have something like a low plateau in the summer.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f03

Figure 3Decomposition at the Toulouse station (south-west of France). From left to right and top to bottom: seasonality in mean, trend in mean, 3 years of residuals Z, seasonality in variance, trend in variance and monthly boxplots of the residuals Z.

Download

4.1.2 Spatial model

The decomposition into deterministic and stochastic terms has removed most of the seasonality from the signal. We assume hence that the model is stationary in time. However, initial investigations revealed differences between seasons in the empirical spatial covariance structure of the residuals Z. To address this, the model for the residuals is built separately for two seasons: the extended winter (October, November, December, January, February, March – ONDJFM) and the extended summer (April, May, June, July, August, September – AMJJAS). The residuals Z are assumed to be stationary in those seasons.

The estimation of covariance parameters was conducted for all years at once, for each season, by implementing the pairwise likelihood in Julia. The optimization was done using the LBFGS algorithm and automatic differentiation (AD) using an appropriate package compatible with AD for the Matérn function (Geoga et al.2022). A maximum distance cutoff of 650 km in space and 5 d in time was applied to reduce the number of pairs and avoid rare spatial differences.

We also estimated the variance of our pairwise likelihood estimator, using the Godambe information matrix (see Varin et al.2011). The H matrix in Eq. (8) is estimated using the Hessian of the objective function for the observations. The J matrix is estimated by computing the variance-covariance matrix of the gradient on 100 simulations.

To check the robustness of the parameter estimation, the training data is split into 4 groups of equal size, with each group removed once from the estimation stage to construct 4 models in addition to the complete model estimated using all available station data. This provides insight on the impact of different input data on the estimation and simulations. The obtained parameters are shown in Table 1. Except for the spatial range parameter r, there are not many differences between estimations using the full data or estimations removing a fourth of the data.

Table 1Estimated covariance parameters and their estimated standard deviation using a bootstrapped Godambe information matrix.

Download Print Version | Download XLSX

Several strategies for estimation were conducted by fixing different sets of parameters to better understand their relationships, the likelihood, and the corresponding covariance function.

The first thing to notice is that adding the flexibility in the temporal covariance with the parameter δ was in the end not necessary. In practice, the best value for this parameter is 0. As this is the minimum allowed value, estimating the hessian and gradient is not relevant and the estimated standard deviation for this parameter is not meaningful.

The spatial shape parameter ν is estimated close to 1 in both seasons, meaning the spatial regularity is not very different in the two seasons. It is interesting to see that the Matérn model for ν=1 corresponds to a linear × exponential covariance model, which is coherent with the observed empirical curves. The fact that ν is close between the seasons allows the comparison of the spatial scale parameter r, related to the typical correlation distance. It is estimated at around 800 km for both seasons, though higher in the winter than in the summer. In a Matérn model with ν≃1, the correlation drops below 0.05 at around h=5.6r. In our case, this means that for u=0 the stations become uncorrelated for a distance of more than 4000 km, which is quite large, but expected as the temperature variable exhibits strong large-scale correlation. The estimated spatial scale is inherently dependent on the data used to estimate it. When removing a fourth of the stations from the dataset, the estimated r can be quite increased in some cases, however when that happens, the summer and winter values stay coherent.

The temporal scale parameter a is estimated as higher in the winter than in the summer, but the associated standard deviation of the estimate is also quite larger. This highlights the difficulty to fit a model in the winter, where the data has both large-scale episodes and sudden variations.

The separability parameter b[0,1] is estimated as 0.663 in the summer and 0.883 in the winter. Both of these values are far from 0 and indicate a clear non-separable behaviour of the spatio-temporal dependence.

The obtained variogram functions Γ(h,u)=12Var[Z(s,t)-Z(s+h,t+u)], along with the empirical function computed on the observed data Γ^(h,u), are shown in Fig. 4. The Gneiting-Matérn model fits the data better than the separable model, which struggles to represent space-time interactions even for small time differences. The nugget effect accounts for approximately 10 % of the total variability and reflects short-range variation caused by several factors, including measurement error, micro-topographic features, and spatial variability at scales smaller than the sampling resolution.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f04

Figure 4Variogram functions corresponding to the fitted parameter in winter (ONDJFM) and in summer (AMJJAS) for the separable model (Exp-Exp) and the Gneiting-Matérn model (Gneiting-Matern). The x-axis represents the distance in space, the y-axis holds the values of the variogram for several values of temporal distances u denoted each by a colored curve. The empirical variogram in the data is represented by the circle points. The vertical line indicates the spatial cutoff, beyond which the pairs were not taken into account in estimation. The value for h=u=0 is missing from the empirical values because it should be the value of the nugget effect and cannot be computed in the empirical variogram.

Download

4.2 Simulation at the training sites

Temperature series were simulated 100 times at each of the 41 fitting data points by first simulating residuals over 31 summers and 31 winters from the fitted spatio-temporal covariance model (Eq. 7) and then adding the deterministic parts in Eq. (1).

For the stochastic component, different simulation methods were employed. The Gneiting-Matérn model was simulated using both the iterative and spectral methods. Additionally, the iterative method was applied to simulate from two alternative models: first, a purely temporal version of the covariance model (no spatial component), aimed at assessing how much spatial information is conveyed by the deterministic trend and seasonality versus the spatial model in the stochastic component. A separable exponential model was also used to demonstrate the limitations of an overly simplified model.

4.2.1 An example of simulation

An example of 4 consecutive days in one simulation is presented in Fig. 5. The first column displays the observations, the second column shows a simulation based on a separable spatio-temporal covariance model, and the next two columns present simulations generated using the non-separable covariance function (via the iterative and spectral methods, respectively). The last column illustrates a simulation according to a spatially independent model. While a direct comparison of the values is not meaningful, we can evaluate whether the range of values is reasonable and whether the spatial pattern is adequately reproduced. Obviously, the last column shows a more discontinuous pattern compared to the others, highlighting the importance of incorporating a spatial component in the model.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f05

Figure 5Observations and simulations using the separable model (Exp-Exp), the 10 d-memory simulation algorithm (GM-iterative), the spectral method (GM-spectral) and the purely temporal model (Temporal).

4.2.2 Marginal properties

Figure 6 presents the violin plots of the observed and simulated temperatures at several stations for both extended seasons (summer on the top row, winter in the bottom row). Across all models and simulation methods, the distribution of simulated temperatures closely matches the distribution of observed temperatures, demonstrating the model's ability to accurately reproduce the temperature distribution.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f06

Figure 6Violin-plot of the temperature in the two extended seasons (summer on the top row, winter in the bottom row) for 4 stations, with their corresponding locations. For simulations, 100 simulations were made using a temporal only model, a separable model and the non-separable model using the methods described in Sect. 3.3.2.

Download

4.2.3 Spatial correlation

Figure 7 (left) illustrates the pairwise correlation coefficients between stations, comparing simulated correlation in the residuals Cor(Z(si,),Z(sj,)) (right) and temperatures Cor(X(si,),X(sj,)) (left) with their observed counterpart. For both simulation methods based on the spatio-temporal Gneiting-Matérn model – conditional simulation over the previous 10 d and spectral simulation – the observed and simulated correlations align closely with the 1 : 1 line. This indicates that this choice of correlation function effectively captures the spatial dependence structure in the data. In contrast, using a purely temporal model results in a correlation of zero for the residuals (Z). Although the corresponding simulated temperature values are non-zero due to the deterministic components, they deviate significantly from the observed values, emphasizing the benefit of incorporating a spatial model. The separable model performs similarly to the non-separable model but overestimates lower correlations. Although the significance of the non-separability property is not immediately evident in this specific graph, its advantages may become clearer when analyzing more complex spatial patterns or longer time scales.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f07

Figure 7Correlations and conditional probabilities of threshold exceedance. x-axis: observed values, y-axis: median simulated values for 100 simulations. Left: Pairwise correlation coefficient for all pairs of stations, left in the full temperature Cor(X(si,),X(sj,)) and right in the residuals Cor(Z(si,),Z(sj,)). Top row represents the result in the extended summer, second row in the winter. Right: Empirical pairwise conditional probabilities from Eq. (15). Top: P(X(si,)<qα(i)|X(sj,)<qα(j)) for α=0.01,0.02,0.05. Bottom: P(X(si,)>q1-α(i)|X(sj,)>q1-α(j)), for α=0.01,0.02,0.05.

Download

To illustrate where the non-separability property enhances the model's performance, Fig. 8 presents the covariance function C(h,u) for the residuals Z at various time lags u. This function is computed from both the observations and one simulation for each of the three models: temporal only, separable, and non-separable. The purely temporal model fails to reproduce the observed correlations, as expected. Because the observed spatial covariance is scattered, making direct comparison difficult, a LOESS smoothing line is added to help visualise the difference between the separable and non-separable models. For u=0, both spatio-temporal models (separable and non-separable) overestimate the covariance for stations that are far apart, reflecting the previous observations from pairwise correlations. However, the non-separable model demonstrates a modest improvement from the separable model in these cases. This overestimation likely results from the fact that far-apart station pairs are underrepresented in the dataset and were not used during model training. For non-zero time lags, the separable model tends to overestimate high covariances and underestimate low covariances compared to the non-separable model. This discrepancy emphasizes the necessity of using a non-separable model, as further detailed in Appendix B, where we investigate empirically the separability of the data.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f08

Figure 8Empirical covariance in the reduced variable Z, C(h,u), for different time lags u=0,1,2,3. Each dot denotes the covariance between a pair of stations separated by the distance h. The small dots are the covariance for each pair, the solid line the loess values. The colored dots correspond to 100 simulations for the observations and all 3 possible models: temporal-only, separable, non-separable.

Download

Figure 7 (right) shows the pairwise conditional probabilities of threshold exceedance in the observations compared to the simulations. For all models except the spatially uncorrelated model, the values in simulation are in the range of the observations. For the highest quantile (0.99), the small probabilities are overestimated, and the large probabilities are underestimated. This means that the dependance is overestimated when it is high, and underestimated when low. Such behaviour was to be expected as the Gaussian distribution is known to be asymptotically independent (Coles2001). Nonetheless, once the effects of trends and seasonalities are accounted for, the resulting extremal dependence in the simulated temperatures aligns reasonably well with the observed data, except at the very most extreme quantiles. Enhancing the model accordingly would require at least two improvements. First, adopting a distribution that better captures the behavior in the extremes, such as (extended) Pareto distributions. However, this shift would introduce new complexities during parameter estimation. An alternative would be to explicitly model the extremes in a separate statistical framework. Second, choosing a dependance structure for one that exhibits stronger dependence in the extremes such as max-stable models which are asymptotically dependent or one intermediate.

4.2.4 Quantile Exceedance Ratio

Focusing on the summer months, with quantiles qi(α) and Quantile Exceedance Ratios Rα(t) (as defined in Eq. 17) computed only on June, July and August (JJA) data and simulations, the survival function of Rα(t) for several values of α is represented in Fig. 9. There is about 10 % chance that at least 15 out of the 41 stations exceed their 90 % quantile at the same time. In the simulations, it has about 9 % chance. Overall, the simulations tend to underestimate the probability to exceed a threshold for several locations at the same time. This is coherent with the previous observations on pairwise threshold exceedance probabilities, where we found that the simulations underestimated the high values.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f09

Figure 9Survival function of the Quantile Exceedance Ratio, ℙ(Rα(t)≥y) for several quantiles α, computed on the observations (ECA&D) and simulations from the non-separable model.

Download

4.3 Simulations on a grid

This section shows the use of the generator trained on the station data for simulation over the E-OBS grid. The simulations are then compared to the E-OBS data that serves as a reference.

4.3.1 1985–2015 simulations

We generated 100 simulations over the same 1985–2015 period. Examples of these simulations are shown in Fig. 10, alongside the corresponding values from the E-OBS database. Among the 100 simulations, we selected examples closest to the median temperature of E-OBS, as well as the coldest and warmest simulations. Although direct comparison of day-to-day values is not meaningful, the overall patterns are similar. The range of simulated temperatures is broad, accommodating both cool and hot days with strong temporal correlation.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f10

Figure 10From left to right: 4 d in E-OBS, an example of the same simulated 4 d, coldest and warmest simulation for the same days.

The simulated temperatures appear less smooth than those from the E-OBS dataset, which is expected for several reasons. First, E-OBS is an interpolated product based on station measurements, resulting in inherently smoother spatial fields. In contrast, our simulation model includes a nugget effect, which explicitly captures short-range variability – such as measurement error or micro-scale environmental effects – as a proportion of the total variance.

The spatial resolution of E-OBS is approximately 25 km, while the minimum distance between stations in the training data was closer to 100 km. This relatively coarse station density limited the model's ability to capture very short-range spatial structure, likely leading to an overestimation of the nugget effect and, consequently, rougher simulations.

However, the presence of a nugget effect in our model is not necessarily a drawback. It reflects a realistic component of spatial variability at small scales and ensures that simulations account for local heterogeneity. With denser station coverage, this short-range behavior could be estimated more precisely, potentially yielding smoother simulated fields. Alternatively, removing the nugget effect would produce smoother outputs, but at the risk of under-representing local variability and generating unrealistically homogeneous fields.

The quantile exceedance ratio Rα(t) can be analyzed in the same manner. Figure 11 shows the survival function for various quantiles. These quantiles are extracted from the E-OBS database for each grid cell. The simulated survival function is represented by a ribbon spanning the 0.1–0.9 quantile range of the simulations, with a dotted line indicating the median value, and the survival function in E-OBS is represented by a full line. The simulations tend to overestimate the number of heat days over smaller areas while underestimating the frequency of events covering larger areas, compared to the observed values.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f11

Figure 11Survival function ℙ(Rα(t)≥y) for several quantiles α, computed on E-OBS and gridded simulations.

Download

It is also important to reproduce not only isolated hot days but also sequences of hot days with length comparable to the observed events. Figure 12 illustrates the evolution of Rα(t) for several quantiles in some observed summers and some simulated summers. It illustrates the model's ability to reproduce summer heatwave characteristics in terms of duration, intensity, and spatial extent. Each panel shows the proportion of the domain exceeding a temperature quantile (color-coded) over the days of summer. The two left panels show observed summers of 2003 and 2009, with 2003 notable for its intense and long-lasting heatwave affecting much of France, used here as a reference. Its exceptional spatial extent and duration are highlighted with arrows. The other panels show examples of simulated summers. These demonstrate the model's capacity to generate heat events of varying types: short and localised events (all graphs), short events that are large in space (bottom right), events resembling the 2003 heatwave in both intensity and duration (top middle and top right), and even a long-duration heatwave with slightly less intensity than 2003 (bottom middle). These examples support the model’s ability to capture realistic variability in heatwave behavior.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f12

Figure 12Spatial coverage of extreme heat over time, Rα(t), for several quantiles α, across two observed summers and four simulated ones.

Download

We consider spatial heat episodes as sequences of days when the temperature exceeds a specified threshold in at least a given proportion of the domain. Figure 13 shows the distribution of heat episode lengths for both E-OBS (red dots) and the 100 simulations (black bars).

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f13

Figure 13Length distribution of heat events for the observations and for 100 simulations. From left to right : spatial size of event (cumulative % of surface over the quantile threshold), from top to bottom: 0.85,0.9, and 0.95 quantile. For each plot, the x-axis represents the episode length and the y-axis holds the number of events of this length in one 1985–2015 series. Log-scale on the y-axis. The dotted bars indicate the whole range of values found in the simulations, while the full bars represent the 0.1–0.9 quantile range of simulated values. The red dots indicate the E-OBS values, the circled dots represent the 2003 heatwave.

Download

For heat episodes that cover at least 40 % or 60 % of the domain, and for moderate temperature quantiles such as the 0.85 or 0.9 quantiles, the observed values fall within the range of simulated values. The model even succeeds in generating events longer than those observed. However, for higher spatial proportions, the simulations struggle more to generate extended episodes, indicating some limitations in replicating the most extreme spatial heat events. This is coherent with the conclusion given by the previous Fig. 11.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f14

Figure 14Distribution of the length of cold events in the observations and in 100 simulations. From left to right: spatial size of event (cumulative % of surface over the quantile threshold), from top to bottom: considered quantile. For each plot, the x-axis represent the episode length and the y-axis holds the number of events of this length in one 1985–2015 series. Log-scale on the y-axis. The dotted bars indicate the whole range of values found in the simulations, while the full bars represent the 0.1–0.9 quantile range of simulated values. The red dots indicate the E-OBS values.

Download

Cold events can be similarly defined as sequences of days when the temperature in winter (December–January–February) is under a given threshold for a given proportion of the domain. Figure 14 shows the distribution of the length of cold events in the observations and the simulations. For events that are below the 0.05 quantile, the model generates an appropriate number of single-day events for all considered spatial extents, but does not reproduce events with high temporal length. However, for such a low quantile and for a large spatial extent, the observations contain only a few events. For milder quantiles, the model is able to generate longer events but overestimates the number of single day events.

Using the E-OBS observational database as a reference, we analyze the model’s ability to replicate heat and cold events, defined by the exceedance of specific temperature quantiles over large spatial areas. The model captures the overall behavior of temperature extremes, including heatwave sequences similar to the notable 2003 European heatwave. However, while it successfully reproduces the frequency and duration of moderate heat episodes, the model tends to overestimate the occurrence of heat events in smaller regions and underestimates those covering larger areas. For cold episodes, the model generates realistic short-term events but struggles to reproduce longer events, particularly those covering large spatial extents.

4.3.2 Projections for 2016–2023

In this section, we demonstrate how the complete model can be utilised for near future projections. The following reasonable assumptions are made:

  • The seasonality in mean and variance will remain the same in the near future. This is because they are estimated mainly as the result of the incoming solar radiation linked to earth rotation. The temperature change over time is included in the trend part which is removed first.

  • The spatio-temporal dependence in the residuals Z will remain unchanged in the future, as far as only the near future is considered.

  • Future trends in mean and variance have been derived here from the observations, but can be obtained from climate projections using various scenarios if farther future periods are targeted.

  • Should the projections concern farther future periods, the seasonality in mean and variance and the spatio-temporal structure of the residuals may have to be fitted again on climate projections.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f15

Figure 15Mean temperature over the France grid for 2016–2022. The 100 simulations are obtained using a model fitted on the 1985–2015 period. Observed temperatures are shown in red. Simulations are represented with dotted black lines for the quantiles of order 0.025 and 0.975 of the simulated temperatures, and a solid black line for the median simulated temperature.

Download

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f16

Figure 16Distribution of the length of heat events in the observations (red dots) and in 100 simulations (black bars), for the period 2016–2022.

Download

The stochastic model is calibrated using data from 1985 to 2015. To validate the model's projection capabilities, we apply it to the period from 2016 to 2022. Trends in mean and variance for this future period can be derived using the same decomposition method, but with updated data from 1985 to 2022, while retaining the seasonal patterns from the earlier period.

We generated 100 simulations based on the updated trends and previous seasonality and stochastic models. The resulting mean temperatures averaged across the France grid are illustrated in Fig. 15 by their medians and 95 % intervals, and compared with the E-OBS values. The coverage rate of the interval is 92.21 %, which is close to the expected 95 %, with a small underestimation of the real variability. This demonstrates the model's capability to accurately project future temperatures, provided that the trends are known. This shows that future climate projected trends can be plugged into the model if farther future is considered.

The year 2022 experienced many important heatwaves. In Fig. 16, the distribution of the length of spatial heat episodes are represented for different spatial thresholds. The quantiles used to define the “hot days” are the 1985–2015 quantiles.

Most observed episodes fall within the range of the simulations. For all considered quantiles and spatial extents, the model successfully generates heat episodes that are longer than those observed. This suggests that the model can account for the possibility of longer heat waves than those recorded.

5 Conclusions

The spatio-temporal stochastic weather generator presented in this study and applied to French station data is a tool to generate realistic temperature series across a large area. By leveraging spatial statistics, the model effectively captures spatial-temporal correlations, enabling the simulation of plausible large-scale temperature events across the study area. This capability is particularly valuable for applications requiring temperature modeling across various locations, whether data is available for specific points, and even when those points are spread out spatially. To our knowledge, previous studies using spatial statistics have focused on more localised regions and have not addressed temperature trends with the same level of precision.

To study a changing climate, the trend components of the model can be adjusted based on climate simulation outputs and various scenarios. By either keeping the seasonal cycles and spatial covariance structure stationary, or retraining them based on corrected model output, this approach allows for generating numerous simulations corresponding to different scenarios or specific areas of interest. This method significantly reduces computation time compared to physical models, thereby complementing physical simulations and providing valuable insights into future climate conditions.

Further investigation into the marginal distributions may be necessary to improve the representation of extreme conditions, such as extreme heat events. For many applications, accurately sampling these unfavorable conditions is crucial. Some further investigation on the margins may also be needed. For instance, the skewed-exponential-power distribution used in Evin et al. (2019) could provide a better fit for the temperature distribution, especially in the tails.

In this paper, while the Gaussian model was used, and although it is known to lack extremal dependence, the spatial extremes were found to be reasonably well represented in the simulations. However, there was a slight underestimation of the high-temperature extremal pairwise correlations. Future research should explore different non-Gaussian dependence structures, as the Gaussian approach, while adequate for temperature alone, might not be suitable for other variables or for modeling cross-correlations.

Additionally, there are plans to extend this model to cover the entire European continent. This expansion may require addressing issues of non-stationarity due to varying climates across different regions.

Many applications require modeling additional variables such as precipitation, wind speed, or solar radiation, which necessitates accounting for their spatial and temporal cross-correlations. While the Gneiting-Matérn model used in this study is well-suited for temperature, extending it to multiple variables presents challenges. Authors such as Bourotte et al. (2016) and Allard et al. (2022) have proposed flexible multivariate covariance functions that extend the univariate Gneiting-Matérn class. However, these models have not been tested over large areas like ours and may not be fully suitable. As for the marginals, they cannot be considered Gaussian. For precipitation amounts, while mixtures of Gamma or Exponential distributions are commonly used, they may not always adequately capture the right tail of the distribution. The Extended Pareto distribution, as proposed by Naveau et al. (2016), could be a valuable alternative, as it is designed to model both the bulk and the tail of the distribution effectively.

Another approach involves incorporating weather types into the model. Weather type modeling breaks down the dependence into different types of days, allowing for simpler models to be applied to each type (Allard et al.2015; Stoner and Economou2020; Dawkins et al.2022; Gobet et al.2025). The sequence of weather types can be modeled using a Markov chain. Weather types can be learned from the data, with the simplest classification being dry/wet states, while more complex models might include hidden or unobserved states. Future work will build on the extensive literature on multi-site and spatial weather type models to integrate the temperature generator from this study into a comprehensive multivariate generator.

Appendix A: Data locations

The Table A1 presents the stations used in this study: station identifier STAID, name STANAME, coordinates in the LON/LAT format and elevation. Additionnal columns hold the start and end date of the temperature file, and the column “missing” holds the proportion of missing data over the period 1985–2015.

Table A1List of the French ECA&D stations.

Download Print Version | Download XLSX

Appendix B: Separability

The choice of covariance structure reflects the non-separability in the data.

The separability property holds if the covariance function of the process under study satisfies

(B1) C ( h , u ) = C ( h , 0 ) C ( 0 , u )

where C(h,0)=CS(h) is the covariance in space and C(0,u)=CT(u) is the covariance in time.

Tests have been proposed to study this assumption such as in Li et al. (2007), but they fail in practice with our temperature data because of the strong correlation in time. However, it can be empirically seen in Fig. B1 that for most considered h,u[1,,T]×D, C(h,u)C(h,0)C(0,u). This means that not only is the covariance of the data non-separable, but also that according to the definition of De Iaco and Posa (2013), the non-separability is said to be “positive”.

Conveniently, any model of the Gneiting class satisfy this property, which is why this class was chosen.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f17

Figure B1Comparison between the empirical spatial-temporal covariance function (black) and the product between the spatial and temporal covariance function (red). Each dot denotes the covariance between a pair of stations separated by a distance h.

Download

Appendix C: Parameter estimates using station data

C1 Gridded trends

The Fig. C1 shows the interpolated temperature trend using the equivalent temperature that takes into account the elevation, compared to trends obtained from the much more complete E-OBS database. The addition of geographical information seems to correct quite well the values of the trend in mean.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f18

Figure C1Trend in mean interpolated from site-by-site decomposition using ordinary kriging and altitude equivalent temperature, compared with trends obtained directly from the E-OBS database.

Figure C2 shows the average maps over 5 years of the standard deviation trend, using the kriging method. The stations are indicated by dots. The variance is stable in mean over 5 years.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f19

Figure C2Trend in standard deviation, interpolated from site-by-site decomposition using ordinary kriging.

C2 Gridded seasonality coefficients

Kriging with a Gaussian variogram model was used to interpolate each seasonality coefficient. Resulting coefficients maps are given in Figs. C3, C4. Similar to what is observed when working with the trends, removing stations from the training set sometimes introduced significant error. It is mostly the case when several stations from the borders of the domain are missing, or in the case of the mountainous stations pointed out previously.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f20

Figure C3Seasonality coefficient for the mean, interpolated from site-by-site decomposition using ordinary kriging.

https://ascmo.copernicus.org/articles/11/203/2025/ascmo-11-203-2025-f21

Figure C4Seasonality coefficient for the standard deviation interpolated from site-by-site decomposition using ordinary kriging.

Code and data availability

Most of the code will be made available soon on GitHub. Both weather station data (ECA&D) and gridded data were downloaded from the European Climate Assessment & Dataset website (https://www.ecad.eu/, last access: March 2023; Klein Tank2002). In this paper, we used the March 2023 version of both datasets.

Author contributions

CC, LB and SP conceptualised the original idea. CC, LB, developped the mathematical methodology with domain knowledge from SP. CC performed the validation of the model and wrote the paper with feedback from all the authors. LB, SP and DM supervised the project. The original code was partly written in R by CC, with the estimation procedure implemented in Julia by CC and DM. Part of the gridded simulation algorithm was also implemented in Julia by DM.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Also, please note that this paper has not received English language copy-editing. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The authors thank Denis Allard for the useful feedback at various points in the work, as well as the implementation of the spectral simulation method.

Financial support

This research has been supported by the Association Nationale de la Recherche et de la Technologie (grant no. 2023/0061).

Review statement

This paper was edited by Soutir Bandyopadhyay and reviewed by three anonymous referees.

References

Allard, D., Ailliot, P., Monbet, V., and Naveau, P.: Stochastic weather generators: an overview of weather type models, Journal de la Societe Française de Statistique, 156, 101–113, 2015. a, b

Allard, D., Emery, X., Lacaux, C., and Lantuéjoul, C.: Simulating space-time random fields with nonseparable Gneiting-type covariance functions, Statistics and Computing, 30, 1479–1495, https://doi.org/10.1007/s11222-020-09956-4, 2020. a, b

Allard, D., Clarotto, L., and Emery, X.: Fully nonseparable Gneiting covariance functions for multivariate space–time data, Spatial Statistics, 52, 100706, https://doi.org/10.1016/j.spasta.2022.100706, 2022. a, b, c, d

Apipattanavis, S., Podestá, G., Rajagopalan, B., and Katz, R. W.: A semiparametric multivariate and multisite weather generator, Water Resources Research, 43, https://doi.org/10.1029/2006WR005714, 2007. a

Baxevani, A. and Lennartsson, J.: A spatiotemporal precipitation generator based on a censored latent Gaussian field, Water Resources Research, 51, 4338–4358, https://doi.org/10.1002/2014WR016455, 2015. a

Beersma, J. and Buishand, T.: Multi-site simulation of daily precipitation and temperature conditional on the atmospheric circulation, Climate Research, 25, 121–133, https://doi.org/10.3354/cr025121, 2003. a

Bourotte, M., Allard, D., and Porcu, E.: A flexible class of non-separable cross-covariance functions for multivariate space–time data, Spatial Statistics, 18, 125–146, https://doi.org/10.1016/j.spasta.2016.02.004, 2016. a, b, c, d

Buishand, T. A. and Brandsma, T.: Multisite simulation of daily precipitation and temperature in the Rhine Basin by nearest-neighbor resampling, Water Resources Research, 37, 2761–2776, https://doi.org/10.1029/2001WR000291, 2001. a

Chandler, R. E.: Multisite, multivariate weather generation based on generalised linear models, Environmental Modelling & Software, 134, 104867, https://doi.org/10.1016/j.envsoft.2020.104867, 2020. a

Chilès, J. and Delfiner, P.: Geostatistics: Modeling Spatial Uncertainty, Wiley Series in Probability and Statistics, Wiley, 1st edn., ISBN 978-0-470-18315-1, https://doi.org/10.1002/9781118136188, 2012. a

Clark, M., Gangopadhyay, S., Hay, L., Rajagopalan, B., and Wilby, R.: The Schaake Shuffle: A Method for Reconstructing Space–Time Variability in Forecasted Precipitation and Temperature Fields, Journal of Hydrometeorology, 5, https://doi.org/10.1175/1525-7541(2004)005<0243:TSSAMF>2.0.CO;2, 2004. a

Cleveland, W. S.: Robust Locally Weighted Regression and Smoothing Scatterplots, Journal of the American Statistical Association, 74, 829–836, https://doi.org/10.1080/01621459.1979.10481038, 1979. a, b

Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer Series in Statistics, Springer London, London, ISBN 978-1-84996-874-4, https://doi.org/10.1007/978-1-4471-3675-0, 2001. a

Cornes, R. C., Van Der Schrier, G., Van Den Besselaar, E. J. M., and Jones, P. D.: An Ensemble Version of the E‐OBS Temperature and Precipitation Data Sets, Journal of Geophysical Research: Atmospheres, 123, 9391–9409, https://doi.org/10.1029/2017JD028200, 2018. a

Cressie, N. and Wikle, C. K.: Spatio-Temporal Statistics, John Wiley & Sons, ISBN 978-0-471-69274-4, 2015. a

Cressie, N. A. C.: Statistics for spatial data, Wiley series in probability and mathematical statistics, Wiley, New York, ISBN 978-0-471-84336-8, 1991. a

Dawkins, L. C., Osborne, J. M., Economou, T., Darch, G. J. C., and Stoner, O. R.: The Advanced Meteorology Explorer: a novel stochastic, gridded daily rainfall generator, Journal of Hydrology, 607, 127478, https://doi.org/10.1016/j.jhydrol.2022.127478, 2022. a, b, c

De Iaco, S. and Posa, D.: Positive and negative non-separability for space–time covariance models, Journal of Statistical Planning and Inference, 143, 378–391, https://doi.org/10.1016/j.jspi.2012.07.006, 2013. a

De Iaco, S., Posa, D., Cappello, C., and Maggio, S.: Isotropy, symmetry, separability and strict positive definiteness for covariance functions: A critical review, Spatial Statistics, 29, 89–108, https://doi.org/10.1016/j.spasta.2018.09.003, 2019. a

Dubrovsky, M., Huth, R., Dabhi, H., and Rotach, M. W.: Parametric gridded weather generator for use in present and future climates: focus on spatial temperature characteristics, Theoretical and Applied Climatology, 139, 1031–1044, https://doi.org/10.1007/s00704-019-03027-z, 2020. a, b

Evin, G., Favre, A.-C., and Hingray, B.: Stochastic generators of multi-site daily temperature: comparison of performances in various applications, Theoretical and Applied Climatology, 135, 811–824, https://doi.org/10.1007/s00704-018-2404-x, 2019. a, b, c, d

Furrer, E. and Katz, R.: Generalized linear modeling approach to stochastic weather generators, Climate Research, 34, 129–144, https://doi.org/10.3354/cr034129, 2007. a

Geoga, C. J., Marin, O., Schanen, M., and Stein, M. L.: Fitting Matérn Smoothness Parameters Using Automatic Differentiation, 33, 48, https://doi.org/10.1007/s11222-022-10127-w, 2022. a

Gneiting, T.: Nonseparable, Stationary Covariance Functions for Space–Time Data, Journal of the American Statistical Association, 97, 590–600, https://doi.org/10.1198/016214502760047113, 2002. a, b

Gneiting, T., Kleiber, W., and Schlather, M.: Matérn Cross-Covariance Functions for Multivariate Random Fields, Journal of the American Statistical Association, 105, 1167–1177, https://doi.org/10.1198/jasa.2010.tm09420, 2010. a

Gobet, E., Métivier, D., and Parey, S.: Interpretable seasonal multisite hidden Markov model for stochastic rain generation in France, Adv. Stat. Clim. Meteorol. Oceanogr., 11, 159–201, https://doi.org/10.5194/ascmo-11-159-2025, 2025. a, b

Hoang, T. T. H.: Modeling of non-stationary, non-linear time series. Application to the definition of trends on mean, variability and extremes of air temperatures in Europe, PhD thesis, Université Paris Sud – Paris XI, https://theses.hal.science/tel-00531549 (last access: 30 May 2025), 2010. a, b, c, d, e, f

Holsclaw, T., Greene, A. M., Robertson, A. W., and Smyth, P.: A Bayesian Hidden Markov Model of Daily Precipitation over South and East Asia, Journal of Hydrometeorology, https://doi.org/10.1175/JHM-D-14-0142.1, 2016. a

Intergovernmental Panel On Climate Change (IPCC): Climate Change 2021 – The Physical Science Basis: Working Group I Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, Cambridge University Press, 1st edn., ISBN 978-1-009-15789-6, https://doi.org/10.1017/9781009157896, 2023. a

International Organization for Standardization (ISO): Standard Atmosphere, ISO 2533:1975, https://www.iso.org/standard/7472.html (last access: 4 April 2024), 1975. a

Ji, H. K., Mirzaei, M., Lai, S. H., Dehghani, A., and Dehghani, A.: Implementing generative adversarial network (GAN) as a data-driven multi-site stochastic weather generator for flood frequency estimation, Environmental Modelling & Software, 172, 105896, https://doi.org/10.1016/j.envsoft.2023.105896, 2024. a

Kleiber, W., Katz, R. W., and Rajagopalan, B.: Daily spatiotemporal precipitation simulation using latent and transformed Gaussian processes: PRECIPITATION SIMULATION USING GAUSSIAN PROCESSES, Water Resources Research, 48, https://doi.org/10.1029/2011WR011105, 2012. a

Klein Tank, A. A. C.: Daily dataset of 20th-century surface air temperature and precipitation series for the European Climate Assessment, Int. J. Climatol., 22, 1441–1453, 2002. a, b

Kroiz, G. C., Majumder, R., Gobbert, M. K., Neerchal, N. K., Markert, K., and Mehta, A.: Daily Precipitation Generation using a Hidden Markov Model with Correlated Emissions for the Potomac River Basin, PAMM, 20, e202000117, https://doi.org/10.1002/pamm.202000117, 2021. a

Li, B., Genton, M. G., and Sherman, M.: A Nonparametric Assessment of Properties of Space–Time Covariance Functions, Journal of the American Statistical Association, 102, 736–744, https://doi.org/10.1198/016214507000000202, 2007. a

Li, X. and Babovic, V.: Multi-site multivariate downscaling of global climate model outputs: an integrated framework combining quantile mapping, stochastic weather generator and Empirical Copula approaches, Climate Dynamics, 52, 5775–5799, https://doi.org/10.1007/s00382-018-4480-0, 2019. a

Li, Z.: A new framework for multi-site weather generator: a two-stage model combining a parametric method with a distribution-free shuffle procedure, Climate Dynamics, 43, 657–669, https://doi.org/10.1007/s00382-013-1979-2, 2014. a

Li, Z., Li, J. J., and Shi, X. P.: A Two-Stage Multisite and Multivariate Weather Generator, Journal of Environmental Informatics, https://doi.org/10.3808/jei.201900424, 2019. a

Lindsay, B. G.: Composite likelihood methods, in: Contemporary Mathematics, edited by: Prabhu, N. U., Vol. 80, 221–239, American Mathematical Society, Providence, Rhode Island, ISBN 978-0-8218-5087-9, https://doi.org/10.1090/conm/080/999014, 1988. a, b

Miloshevich, G., Cozian, B., Abry, P., Borgnat, P., and Bouchet, F.: Probabilistic forecasts of extreme heatwaves using convolutional neural networks in a regime of lack of data, Physical Review Fluids, 8, 040501, https://doi.org/10.1103/PhysRevFluids.8.040501, 2023. a

Naveau, P., Huser, R., Ribereau, P., and Hannart, A.: Modeling jointly low, moderate, and heavy rainfall intensities without a threshold selection, Water Resources Research, 52, 2753–2769, https://doi.org/10.1002/2015WR018552, 2016. a

Porcu, E., Furrer, R., and Nychka, D.: 30 Years of space–time covariance functions, WIREs Computational Statistics, 13, e1512, https://doi.org/10.1002/wics.1512, 2021. a, b

Prairie, J. R., Rajagopalan, B., Fulp, T. J., and Zagona, E. A.: Modified K-NN Model for Stochastic Streamflow Simulation, Journal of Hydrologic Engineering, 11, 371–378, https://doi.org/10.1061/(ASCE)1084-0699(2006)11:4(371), 2006. a

Rajagopalan, B. and Lall, U.: A k-nearest-neighbor simulator for daily precipitation and other weather variables, Water Resources Research, 35, 3089–3101, https://doi.org/10.1029/1999WR900028, 1999. a

Richardson, C. W.: Stochastic simulation of daily precipitation, temperature, and solar radiation, Water Resources Research, 17, 182–190, https://doi.org/10.1029/WR017i001p00182, 1981. a, b, c

Sha, J., Chen, X., Chang, Y., Zhang, M., and Li, X.: A spatial weather generator based on conditional deep convolution generative adversarial nets (cDCGAN), Climate Dynamics, 62, 1275–1290, https://doi.org/10.1007/s00382-023-06971-9, 2024.  a

Sparks, N. J., Hardwick, S. R., Schmid, M., and Toumi, R.: IMAGE: a multivariate multi-site stochastic weather generator for European weather and climate, Stochastic Environmental Research and Risk Assessment, 32, 771–784, https://doi.org/10.1007/s00477-017-1433-9, 2018. a, b, c

Stoner, O. and Economou, T.: A coupled hidden markov model for daily rainfall at multiple sites, in: Proceedings of the 35th International Workshop on Statistical Modelling, 20–24 July 2020, Bilbao, Basque Country, Spain, ISBN 978-84-1319-267-3, 228–232, https://addi.ehu.eus/handle/10810/45863 (last access: 9 September 2024), 2020. a

Touron, A.: Modélisation multivariée de variables météorologiques, PhD thesis, Université Paris-Saclay, https://theses.hal.science/tel-02319170 (last access: 8 June 2023), 2019. a

Varin, C., Reid, N., and Firth, D.: An Overview of Composite Likelihood Methods, Statistica Sinica, 5–42, http://www.jstor.org/stable/24309261 (last access: 29 September 2025), 2011. a, b, c

Verdin, A., Rajagopalan, B., Kleiber, W., and Katz, R. W.: Coupled stochastic weather generation using spatial and generalized linear models, Stochastic Environmental Research and Risk Assessment, 29, 347–356, https://doi.org/10.1007/s00477-014-0911-6, 2015. a, b

Verdin, A., Rajagopalan, B., Kleiber, W., Podestá, G., and Bert, F.: BayGEN: A Bayesian Space-Time Stochastic Weather Generator, Water Resources Research, 55, 2900–2915, https://doi.org/10.1029/2017WR022473, 2019. a

Wang, Y., Xie, J., You, Y., Wang, Y., Xu, Y.-P., and Guo, Y.-X.: A new multi-site multi-variable stochastic model with inter-site and inter-variable correlations, low frequency attributes and stochasticity: A case study in the lower Yellow River basin, Journal of Hydrology, 599, 126365, https://doi.org/10.1016/j.jhydrol.2021.126365, 2021. a

Wilks, D. S.: Multisite generalization of a daily stochastic precipitation generation model, Journal of Hydrology, 210, 178–191, https://doi.org/10.1016/S0022-1694(98)00186-3, 1998. a

Wilks, D. S.: A gridded multisite weather generator and synchronization to observed weather data, Water Resources Research, 45, https://doi.org/10.1029/2009WR007902, 2009. a, b, c

Yiou, P.: AnaWEGE: a weather generator based on analogues of atmospheric circulation, Geosci. Model Dev., 7, 531–543, https://doi.org/10.5194/gmd-7-531-2014, 2014. a

Download
Short summary
Weather generators efficiently create realistic weather data based on historical records. This study introduces a daily temperature generator for large regions, separating deterministic factors (trends, seasonality) from random variations modeled using space-time interactions. Validated on French weather station data, it replicates observed patterns, including heatwaves. It offers a practical solution for generating realistic weather data, for applications such as climate impact assessments.
Share