Articles | Volume 11, issue 2
https://doi.org/10.5194/ascmo-11-159-2025
08 Sep 2025

Interpretable seasonal multisite hidden Markov model for stochastic rain generation in France

Emmanuel Gobet, David Métivier, and Sylvie Parey
Abstract

We present a lightweight stochastic weather generator (SWG) based on a multisite hidden Markov model (HMM) trained on a large area with French weather station data. Our model captures spatiotemporal precipitation patterns with a strong emphasis on seasonality and the accurate reproduction of dry and wet spell distributions. The hidden states serve as interpretable large-scale weather regimes, learned directly from the data without requiring exogenous inputs. Compared to existing approaches, it offers a robust balance between interpretability and performance, particularly for extremes. The model architecture enables seamless integration of additional weather variables. Finally, we demonstrate its application to future climate scenarios, highlighting how parameter evolution and extreme event distributions can be analyzed in a changing climate.

1 Introduction

1.1 Context

The current context of climate change necessitates a careful analysis of industrial resilience in future climate conditions to anticipate adaptation needs. This includes estimating extreme hydrometeorological conditions, such as the frequency of long-lasting dry spells, which are critical for hydropower and nuclear generation. Parliamentary missions in France (Christophe and Pompili2018) have highlighted the need to quantify hydro-stress impacts on nuclear power generation. Similarly, understanding future hydrometeorological conditions is essential for farmers to develop robust agricultural strategies (see Pascual et al.2017; Zhao et al.2017; Parent et al.2018, and references therein).

Rainfall can trigger natural hazards with diverse spatiotemporal characteristics, ranging from short-duration, localized intense showers to prolonged meteorological droughts affecting vast regions. As a result, precipitation modeling must be adapted to the specific hazard being addressed. Global and regional climate models can simulate the climate system and project its evolution under different forcing scenarios. However, they remain computationally expensive, limiting the number of simulations that can be performed. In practice, climate projections are made available in public repositories, such as the French national project DRIAS (Soubeyroux et al.2021), which provides around 30 projections (see Sect. 8 for details). Yet, for accurate risk assessments, additional scenarios may be needed, leading to challenges in data augmentation and scenario resampling. Another limitation of climate models is their inability to fully capture local extremes, despite advancements in spatial resolution and process modeling (Luu et al.2022). Consequently, stochastic weather generators remain widely used in impact studies. The recent IPCC Working Group 1 report of the 6th assessment (Arias et al.2021) emphasizes the importance of such tools, stating that “Methodologies such as statistical downscaling, bias adjustment, and weather generators are beneficial as an interface between climate model projections and impact modeling and for deriving user-relevant indicators.” Unlike climate models, which represent the physical mechanisms governing climate evolution, stochastic weather generators are calibrated to reproduce the statistical properties of climate variables, including distributions, spatial and temporal correlations, and inter-variable dependencies in multivariate models.

Generating statistically coherent weather series in time and space is a challenging problem. Mathematically, these series form multivariate time-dependent data that are far from being independent and identically distributed. In simple terms, today's weather is strongly influenced by past conditions and spatial correlations with surrounding locations. Additionally, weather patterns evolve throughout the year and under climate change, making it essential to accurately reproduce both typical weather conditions and extreme events such as heavy precipitation and intense heatwaves.

For industries operating spatially scattered installations, a key concern is the simultaneous exposure of multiple facilities to the same extreme event. In electricity generation, prolonged, spatially extensive droughts can complicate grid management, making it crucial to assess their frequency to anticipate appropriate adaptation measures (International Energy Agency2022, e.g., Chap. 3 – Nuclear Power). Large-scale weather regimes can also significantly impact renewable energy production (van der Wiel et al.2019). However, estimating the occurrence and intensity of such situations is not straightforward. Historical observations provide only a single realization among many possible trajectories influenced by climate variability. Climate models can expand the range of possible events but often fail to produce enough extreme cases for robust statistical analysis. For example, Lang and Poschlod (2024, Sect. 4.1) discuss an ensemble of 50 climate model simulations used to estimate return periods of heavy rainfall, while Fischer et al. (2023) show that an ensemble of 30 climate model simulations over 31 years each could not reproduce heatwaves of the same magnitude as the 2021 Pacific Northwest event. To address this limitation, they propose ensemble boosting methods to enrich simulations of extreme heatwaves. Stochastic weather generators (SWGs) offer a way to overcome this limitation by increasing the sample size, enabling better extreme event statistics at a manageable computational cost.

Beyond hazard modeling, stochastic weather generators are essential tools for climate stress testing (Robertson et al.2007; Manzano and Ines2020; Ranger et al.2022). By generating realistic large ensembles of weather simulations under a given climate scenario, they help decision-makers anticipate climate variability and implement proactive adaptation measures. Given the increasing impact of climate change, leveraging these models is crucial for building resilience and ensuring sustainability.

The purpose of this paper is to develop an interpretable parametric model that efficiently simulates spatially coherent rainfall patterns – both occurrences and amounts – across France, incorporating self-taught large-scale weather patterns. Unlike pure generative models based on neural networks (Goodfellow et al.2014), the stochastic generator developed in this study benefits from easily interpretable parameters, enabling the incorporation of climate change factors (see Sect. 8.3). Recent studies (Miloshevich et al.2024) compare the use of stochastic weather generators and deep learning models for extreme heatwave sampling, highlighting the complementary nature of these approaches. Other studies have shown that classical generative models are not adapted to learn heavy-tailed distributions (like rainfall), requiring specialized architecture dedicated to extremes (Allouche et al.2022). Finally, enforcing physical constraints within such models remains a challenge, which is critical for generating long-term realistic weather simulations (Dueben and Bauer2018).

1.2 Background literature

Many stochastic weather generators are devoted to the generation of precipitation time series, as precipitation is a crucial variable for many impact studies in agriculture or hydrology. For reviews, see Wilks and Wilby (1999), Chen and Brissette (2014), Ailliot et al. (2015a), and Nguyen et al. (2023). Stochastic weather generator (SWG) development dates back to the 1980s, with the model proposed by Richardson (1981) to generate long samples of precipitation, minimum and maximum temperature, and solar radiation.

They can have different spatial scales, e.g., single-site models (Richardson1981), multisite models for closely located stations (Benoit et al.2018), gridded-resolution models (Wilks2009; Dawkins et al.2022), or models for widely separated stations (≳100 km) (Zucchini and Guttorp1991; Robertson et al.2004). The temporal resolution can also vary, typically from sub-hourly (Cowpertwait et al.2007) to hourly (Stoner and Economou2020) or daily rainfall amounts. Multisite daily stochastic generators will be the main focus of this work. Another class of models relies on nonparametric approaches, typically resampling methods, or on hybrid techniques that mix parametric and resampling steps (e.g., Boé and Terray2008). The main drawback of these methods is that they cannot produce samples outside the observed distribution, limiting their usefulness for extreme value analysis. In such cases, parametric models are typically preferred.

The first important modeling choice is whether rain occurrence and rain amount should be generated separately. To simulate both rain occurrence and amounts simultaneously, one typically uses latent Gaussian variables, e.g., censored Gaussian latent variables (Bardossy and Plate1992; Ailliot et al.2009; Baxevani and Lennartsson2015) and complex covariance structures (Flecher et al.2010; Benoit et al.2018; Bennett et al.2018). These approaches conveniently model spatiotemporal dependencies within a common latent space using a covariance structure. They are highly flexible, allowing, for example, the easy inclusion of past dependence or other weather variables such as temperature. However, these approaches assume an underlying normal dependence, which is not always satisfied. Their computational complexity makes them very hard to train. Moreover, they tend to misrepresent dry and wet spells (e.g., Bennett et al.2018, Fig. 4, or Baxevani and Lennartsson2015, Figs. 7 and 19), as they treat spatiotemporal dependence in rain occurrence and rain amounts the same way. One can argue that these are two distinct processes. In that case, modeling rain occurrences separately has typically been done using first-order Markov chains (Richardson1981), which can be easily extended to higher orders to better reproduce dry and wet spells (Srikanthan and Pegram2009). To model subsequent rain amounts, parametric distributions with light or heavy tails are used (see Chen and Brissette2014, for a comparison). In this second approach, addressing spatial dependence becomes challenging since it must be handled separately for rain amount and occurrence. To this end, a class of models referred to in this paper as WGEN models was proposed in Wilks (1998). It introduces a latent Gaussian copula to model correlations between rain occurrences at different sites. While this approach effectively reproduces pairwise correlations, it treats time and space dependence separately and, as we will show in Sect. 5.3, fails to capture large-scale, temporally persistent dry states.

Another option is to include spatial dependence using meteorologically defined weather types (e.g., dry, wet, or atmospheric circulation patterns), also referred to as weather regimes or circulation patterns. In this case, weather regimes are a finite set of large-scale patterns that characterize the weather of a given day. They can either be predefined and incorporated into the model as exogenous variables or inferred as latent variables; see Vaittinada Ayar et al. (2016) and Gutiérrez et al. (2019) for comparison of different approaches. For example, Vrac et al. (2007) identified weather types a priori through the classification of either precipitation data or exogenous atmospheric variables and trained a Markov-like precipitation model conditional on these inputs. Nguyen et al. (2024) applied a similar approach using a multivariate autoregressive model and a Gaussian latent model for rain occurrence and precipitation. Known dependence can also be incorporated using the generalized linear model (GLM) framework to make model parameters dependent on covariates such as previous-day dependence, known weather regimes, month of the year, and so on (see Yang et al.2005; Chandler2020). See Holsclaw et al. (2016), Verdin et al. (2019), and Stoner and Economou (2020) for Bayesian versions of these GLM approaches. These approaches have two main drawbacks. First, they require selecting the relevant weather types or exogenous variables, along with the stochastic properties of precipitation conditional on weather types. See Philipp et al. (2016), Beck et al. (2016), and Huth et al. (2016) for comparisons and discussions on the impact of the choice of weather regimes based on synoptic variables. Additionally, Najibi et al. (2021) study the quality of a weather generator conditioned on different predefined weather patterns that were obtained by different methods. Second, the exogenous variables or weather regimes need to be specified by the user or accurately modeled in order to utilize the weather generator, which may pose limitations in certain applications.

To circumvent these difficulties, hidden Markov models (HMMs) introduce weather types as latent variables (Zucchini and Guttorp1991; Ailliot et al.2009; Sansom and Thomson2010), which are directly inferred from the station and variable of interest without requiring additional data. This approach has the advantage of identifying weather regimes specifically adjusted to the weather characteristics of the area of interest (Najibi et al.2021). The weather regimes are typically modeled using a latent Markov chain, and the distribution of the observations is conditioned on these latent states.

From a probabilistic point of view, it is not surprising that introducing latent variables to model complex dependencies can simplify the analysis; see, for example, Kim et al. (2019) and Yamanishi (2023) for machine and deep learning reviews or Ghassempour et al. (2014) for time series clustering. In some very specific settings, it is even possible to prove that general exchangeable random variables can be realized as a mixture of product distributions (for some latent distribution) thanks to De Finetti's theorem (Diaconis and Freedman1980). In our setting, the exchangeability of the rain variables is not satisfied (e.g., rain occurrence probabilities differ at different weather stations); however, this case is still inspiring for modeling dependencies.

Coming back to multisite rain occurrence models, spatial HMMs have been proposed by Zucchini and Guttorp (1991), with the latent variable identified during inference shown to yield meaningful large-scale patterns (Robertson et al.2004). In these papers, the main limitation is the conditional independence assumption, which states that multisite rain occurrences are independent given the weather states. This makes these generators suitable for widely separated stations where the assumption holds. In this paper, we adopt and verify this hypothesis a posteriori.

Other studies have incorporated exogenous weather variables into spatial HMMs to introduce more spatial correlations beyond those produced by the conditional independence assumption while also making them sensitive to other spatial phenomena. This is referred to in the literature as nonhomogeneous HMM (Hughes and Guttorp1994a, b; Hughes et al.1999; Bellone et al.2000; Greene et al.2011) and used for statistical downscaling. As previously mentioned, selecting these additional dependencies is challenging and thus might not always be beneficial, as shown in Hughes and Guttorp (1994b, Table 4), where the authors, using an HMM-based model, compare a version with exogenous variables and one without. The conclusion is that the best model is the one without external forcing.

See Hughes and Guttorp (1994a) for a comparison of different model choices. In Kirshner (2005), the author provides an overview and tests different options for multivariate distributions, ranging from conditional independence to complex dependence structures, including tree structures.

Models incorporating spatial HMM conditional independence with rain amounts directly using exogenous variables have been explored (Bellone et al.2000; Neykov et al.2012; Holsclaw et al.2016). In Kroiz et al. (2020), the model is first fitted under the conditional independence assumption without external variables, and then a Gaussian copula is applied conditional on the weather states to correlate rain amounts.

Finally, most approaches described above assume constant parameters over a period of interest, e.g., a month or a season. Therefore, time nonhomogeneity in the HMM, i.e., a transition matrix that depends on time, has also been proposed, for instance, to introduce wind diurnal cycles (Ailliot and Monbet2012; Ailliot et al.2015b) or, in Touron (2019a), for a multivariate (temperature and precipitation) single-site HMM.

1.3 Our contribution

We introduce the seasonal hierarchical hidden Markov model (SHHMM), a lightweight seasonal model based on a hidden Markov model (HMM) for generating multisite and temporally realistic weather series, specifically precipitation. As in Touron (2019b), our model is fully time-nonhomogeneous, with parameters varying periodically throughout the year. The first layer consists of an autonomous spatial HMM for rain occurrences, similar to Zucchini and Guttorp (1991) and Robertson et al. (2004), while the second layer models multisite seasonal rainfall amounts conditioned on the learned hidden states.

1.3.1 Capturing large-scale dependencies

Our approach decomposes distributions using conditional independence with respect to hidden states, effectively capturing complex spatiotemporal dependencies without requiring external synoptic data. This ensures that large-scale weather regimes emerge naturally from the data. Moreover, they are shown to be robust across different station selections. To help with the station selection, we propose a simple metric to evaluate conditional independence. Rather than modeling rain amounts directly within the HMM, we focus on discrete rain occurrences. While previous attempts (Kroiz et al.2020; Holsclaw et al.2016) struggled to capture meaningful spatial correlations when including rain amounts in the hidden states, our results show that the inferred states remain interpretable and relevant for both rain occurrences and related meteorological variables such as mean sea level pressure. Unlike approaches that introduce additional spatial correlation structures (Hughes and Guttorp1994b; Kirshner et al.2004), our model enforces conditional independence, ensuring that spatial dependencies are fully learned by the hidden states. This prevents ambiguity in model identification, where correlations could otherwise be absorbed by multiple components. Section 2.6 argues for the relevance of learning weather regimes with rain occurrences and conditional independence. In Sect. 2.5, the statistical identifiability of our model is discussed, showing in particular that a minimal number of stations is needed to be identifiable. Lastly, we propose a simple heuristic to initialize the SHHMM in the expectation maximization inference algorithm, which is known to be prone to local maxima (Cappé et al.2005).

1.3.2 Improved temporal persistence and rainfall representation

To better reproduce wet and dry spell persistence, we introduce an additional autoregressive (local memory) Markov dependence (Cappé et al.2005; Kirshner2005) to improve the simulation of spatiotemporal statistics. After generating rain occurrences, a Gaussian copula is used to conditionally add rain amounts, yielding significantly improved results over Kroiz et al. (2020). Our work contributes to the development of multisite SWGs for rainfall amounts, based on self-taught weather regimes, while explicitly decomposing the rain occurrence and amount processes.

1.3.3 Validation, comparison, and applications

We extensively validate the model's ability to reproduce key hydrometeorological statistics, including dry spell distributions and extreme rainfall accumulations. The model is compared with WGEN-type models (Wilks1998; Srikanthan and Pegram2009; Evin et al.2018), which rely on latent Gaussian structures for spatial dependence and on high-order Markov models for local temporal dependence. We show that our approach is more scalable in terms of complexity and better captures large-scale dry spells.

We also illustrate its usefulness in two climate-related applications: (i) estimating climate variability through multiple trajectory sampling, thereby showing how this can be used to compare climate models (used in IPCC reports) more accurately than with a single historical trajectory, and (ii) training our model on climate change scenarios and analyzing the evolution in terms of parameters and extremes.

Compared to existing multisite HMMs, our model uniquely combines local memory, seasonal parameter variation at low computational cost, and interpretable conditional rainfall generation. These properties make it suitable for studying large-scale risks such as prolonged droughts and extreme precipitation events, relevant in many applications.

Note that this multisite model generates weather data only at the training sites and hence cannot produce high-resolution fields. Nevertheless, multisite simulation remains highly useful in many operational contexts. For instance, in the energy sector, a critical question is the likelihood of prolonged dry spells affecting a large portion of the territory simultaneously. Such events can stress multiple power plants at once – particularly nuclear plants whose cooling systems depend on river flows, which are impacted by large-scale droughts. In this context, estimating the frequency of co-occurring dry episodes across regions is more relevant than reproducing detailed spatial variability. See Sect. 9 for a more detailed discussion on the perspective of this work. Similar challenges arise in large-scale agricultural production planning.

1.3.4 Software

The model and its code are available in the Julia package StochasticWeatherGenerators.jl (Métivier2024). It contains a reproducible step-by-step tutorial in its documentation describing all the data loading, training process, and simulations of the model described in this paper. Most figures in this paper can be exactly reproduced using the tutorial.

1.4 Organization of the paper

In Sect. 2, we describe the construction of the SHHMM. We explain in Sect. 3 the procedure to infer and select the model. Section 4 is entirely dedicated to the interpretation of the model parameters; in particular, the trained hidden states are interpreted as weather regimes for France and will be compared to other well-known weather regimes such as the North Atlantic Oscillation (NAO). In Sect. 5 we show simulation results for the spatiotemporal rain occurrence sequences with a special focus on extreme dry/wet sequences; we also compare our model to a WGEN-like model (Wilks1998). The actual rain amounts are then added on top of the previous model in Sect. 6 and tested in simulations in Sect. 7. In Sect. 8, we train our model with data from climate models on a reference historical period and on future climate change scenarios and discuss the results.

1.5 Notations used in the paper

For a positive integer M, we set [[1:M]] := {1, 2, …, M−1, M}. If Θ is a finite set, |Θ| denotes its cardinality. We make the distinction between t for a day and n for a date; see Sect. 2.2. The number of days is T = 366.

2 Hidden Markov chain modeling

In this section, we introduce the statistical models considered in this work. The underlying mathematical framework is based on hidden Markov models (HMMs), which we develop and adapt with a focus on their application to stochastic weather generation.

2.1 Data

Daily rainfall observation time series are extracted from the European Climate Assessment & Dataset (ECA&D) (Klein Tank2002). We focus only on stations in France and close by. Among the available ECA&D weather stations in France and Luxembourg, 66 stations have 100 % valid data from 1 January 1956 to 31 December 2019, i.e., a 64-year range and 23 376 data rows. We select S = 10 of these stations in all of France (and Luxembourg): these weather stations are indexed with s ∈ 𝒮 := {1, …, S}.

This weather generator scales to the size of France and aims to capture large-scale, interpretable weather patterns. Hence, we select the stations to be as representative as possible of French weather. As explained in Sect. 2.2, where the conditional independence hypothesis is presented, the major limitation of the model is its requirement for some degree of independence between stations. In principle, we should define a criterion to optimize the selection of S = 10 stations among the 66 available. Even with a simple approach, such as maximizing the mean distance between station pairs, the problem is computationally intractable, as there are (66 choose 10) ≈ 2 × 10^11 possibilities. A more relevant yet even costlier criterion is the MSECI in Eq. (19), which is the mean square error (MSE) between observed and simulated rain occurrence correlations. To select 𝒮, we start with a reasonable initial configuration and iteratively perturb it by changing a few stations, retaining the set with the minimum MSECI. In Appendix C, we show that replacing 8 out of the 10 stations in 𝒮 with nearby ones does not affect the model's interpretation in terms of weather patterns. We show in Fig. 1 all the selected stations; in addition, we report in the heatmap scale the historical maximum of consecutive days without rain – dry spell – at each location. One of the goals of our modeling is to reproduce similar records. In Sect. 8, we will investigate how our model (and its parameters) evolves when historical data are replaced by future projection data according to some Representative Concentration Pathway (RCP) scenarios.
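To make the selection procedure concrete, the following Julia sketch outlines the perturbation loop. Here mse_ci is a placeholder for the MSECI criterion of Eq. (19), which requires fitting and simulating the model and is therefore not spelled out; the function name and signature are ours, not those of the package.

```julia
using Random

# Sketch of the station-selection heuristic: starting from an initial set of
# S stations, repeatedly swap a few of them with unused stations and keep the
# set with the smallest MSE_CI.  `mse_ci` is a placeholder for the criterion
# of Eq. (19) and is assumed to be provided by the user.
function select_stations(rng, all_stations, initial, mse_ci; iterations=100, nswap=2)
    best = collect(initial)
    best_score = mse_ci(best)
    for _ in 1:iterations
        candidate = copy(best)
        for _ in 1:nswap                         # perturb the current set
            outside = setdiff(all_stations, candidate)
            i = rand(rng, 1:length(candidate))
            candidate[i] = rand(rng, outside)
        end
        score = mse_ci(candidate)
        if score < best_score                    # keep the best configuration
            best, best_score = candidate, score
        end
    end
    return best, best_score
end
```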


Figure 1. The 10 selected stations with their respective dry spell historical records over the period 1956–2019 (length in number of days).

The N=23 376 consecutive weather observations are labeled with nD:=[[1,N]]. The multisite rain amount (MRA for short) is calculated as

(1) $R(n) := (R_1(n), \ldots, R_S(n)) \in \mathbb{R}_+^S$,

for date n, where R_s(n) (in mm) is the daily rainfall amount at site s. Note that rainfall is measured in millimeters (mm) of depth, typically using a rain gauge, which physically corresponds to a volume per unit area – that is, 1 mm of rainfall equals 1 L m−2. It is the RR variable in the ECA dataset. Similarly, the multisite rain occurrence (MRO for short) is

(2) $Y(n) := (Y_1(n), \ldots, Y_S(n)) \in \mathcal{I} := \mathcal{I}_s^S$,

where each Y_s(n) ∈ ℐ_s := {dry, wet}, where dry means no rain and wet means nonzero rain, i.e., R_s(n) ≥ 0.1 mm of daily cumulated rain.
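For illustration, the thresholding that defines the MRO from the rain amounts can be written in a few lines of Julia (the names below are ours, not those of StochasticWeatherGenerators.jl):

```julia
# Build the multisite rain occurrence (MRO) matrix from an N x S matrix of
# daily rain amounts R (in mm), using the 0.1 mm wet/dry threshold above.
const WET_THRESHOLD = 0.1  # mm of daily cumulated rain

rain_occurrence(R::AbstractMatrix{<:Real}) = R .>= WET_THRESHOLD

R = [0.0  2.3;
     0.05 0.0;
     1.2  0.4]
Y = rain_occurrence(R)  # 3x2 BitMatrix: wet = true, dry = false
```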

In Sect. 4.1.2, to connect the weather patterns inferred by the model with physically meaningful atmospheric patterns, sea level pressure reanalysis data from the ERA5 reanalysis (Hersbach et al.2020) are used. ERA5 is the latest climatic reanalysis produced by the ECMWF (European Centre for Medium-Range Weather Forecasts), providing hourly time series for various atmospheric, oceanic, and land surface parameters over the historical period from 1940 onward. It utilizes a 4D-Var data assimilation process to produce data on a 0.25° spatial resolution grid and is freely available on the Copernicus Climate Change Service Climate Data Store.

Lastly, in Sect. 8, two climate projections provided by the French climate service DRIAS (Soubeyroux et al.2021), operated by Météo-France and Institut Pierre-Simon Laplace, are used. Developed to provide the best climate change information at the French national level for practitioners, DRIAS offers projections based on the Euro-CORDEX regionalization initiative, further statistically downscaled over France at an 8km resolution. A total of 42 simulations are available: 12 for the historical period (1951–2005) and 30 for the future (2006–2100), with 12 using the RCP8.5 scenario, 10 using RCP4.5, and 8 using RCP2.6. Among this set of projections, only two are used here, covering the historical period and the RCP8.5 scenario:

  • CNRM-ALADIN63, regionalizing the CNRM-CM5 global climate model with the ALADIN63 regional climate model

  • IPSL-WRF381P, regionalizing the IPSL-CM5A-MR global climate model with the Weather Research and Forecasting (WRF) regional climate model

The next subsections are devoted to the design of the model for the evolution of the MRO. The actual nonzero rain amount will be added on top of the model after it is trained in Sect. 6. Our approach relies on a hidden Markov model: generally speaking, it is made of a hidden component {Z(n) : n ≥ 1} (that should be inferred) and of an observed one {Y(n) : n ≥ 1} (here the MRO). All processes are discrete-time processes. See Cappé et al. (2005) for a general account about hidden Markov models.

2.2 Seasonal hidden Markov model (SHMM), model 𝒞0

For the sake of clarity, we start with a simplified model, which will be extended hereafter. See Zucchini and MacDonald (2009) for an introduction to hidden Markov models for time series. Consider first the hidden component Z, common to all stations s ∈ 𝒮: it can take discrete values in 𝒦 := [[1:K]] that will be later interpreted as climate states for the region of interest, here France. We will thus refer to this variable as a weather regime (WR), as often done in the literature (e.g., van der Wiel et al.2019). Note that other equivalent names also exist in the literature, such as weather types, weather patterns, or atmospheric circulation patterns.

The time evolution of {Z(n) : n ≥ 1} follows a nonhomogeneous Markov chain on the state space 𝒦, with initial distribution ξ = (ξ_1, …, ξ_K), i.e., ξ_k = P(Z(1) = k), and transition matrix Q_n ∈ ℝ^{K×K} for n ≥ 1,

(3) $Q_n(k, k') = \mathbb{P}\left(Z(n+1) = k' \mid Z(n) = k\right)$.

To fit the climate context, we assume that the transition matrix Q_n is a T-periodic function of n with T = 366, i.e., Q_{n+T} = Q_n; we will thus refer to the Markov chain as a seasonal Markov chain. In that case, we will distinguish between the label day of the year t ∈ 𝒯 := [[1:T]] and the label full date n used to denote the position in the sequence. Each n corresponds to one t, but for each t there are as many dates n ∈ 𝒟 as the number of periods in the sequence: the matrices Q depend on time only through the day t. If T were equal to 1, this SHMM would be a regular homogeneous HMM, i.e., with a constant matrix Q(k, k') = P(Z(n+1) = k' | Z(n) = k) for all n ∈ 𝒟. Next, we design the model for the time evolution of the MRO Y. The intuition behind the choice of well-spread stations is that local weather variables Y, conditional on weather regimes Z, are independent. In addition, we assume that the conditional distribution of Y(n) does not depend on the past of Y and is also periodic. All is summarized in the following assumption.

(H-𝒞0)

Z evolves as a seasonal Markov chain with period T = 366. Conditional on the process {Z(n) : n ≥ 1}, the spatial components Y_1(n), …, Y_S(n) are independent and, furthermore, the conditional distribution of each Y_s(n) only depends on Z(n). This is a Bernoulli distribution describing the probability of rain at a station s and date n, conditional on Z(n) = k: it is denoted as f_{k,n,s} (called emission distribution in the HMM literature) and assumed to be T-periodic, i.e., f_{k,n+T,s} = f_{k,n,s}, and thus represented as

(4) $f_{k,t,s}(y_s) = \mathbb{P}\left(Y_s(n) = y_s \mid Z(n) = k\right) = \lambda_{k,t,s}\,\mathbb{1}_{y_s=\mathrm{wet}} + (1-\lambda_{k,t,s})\,\mathbb{1}_{y_s=\mathrm{dry}}$

for some parameters λ_{k,t,s} ∈ [0, 1].

The above model for {(Z(n), Y(n)) : n ≥ 1} is referred to as 𝒞0 and called a seasonal hidden Markov model (SHMM), with period T, initial distribution ξ = P(Z(1) = ⋅), transition matrix Q_t, and distributions f_{k,t,s}. This SHMM terminology is borrowed from Touron (2019a).
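To make the generative mechanism of 𝒞0 explicit, the following Julia sketch draws one sequence of hidden states and rain occurrences from given T-periodic transition matrices and Bernoulli parameters. It is a minimal illustration under an assumed array layout, not the implementation of StochasticWeatherGenerators.jl.

```julia
using Random

# Minimal sketch of a draw from the C0 SHMM.  Assumed (illustrative) inputs:
#   xi        : length-K initial distribution of Z(1)
#   Q         : Q[t] is the K x K transition matrix for day of year t
#   lambda    : K x T x S array of Bernoulli "wet" probabilities lambda[k, t, s]
#   dayofyear : day label t_n in 1:366 for each date n = 1..N
function simulate_shmm(rng, xi, Q, lambda, dayofyear)
    K, T, S = size(lambda)
    N = length(dayofyear)
    Z = zeros(Int, N)
    Y = falses(N, S)
    Z[1] = findfirst(cumsum(xi) .>= rand(rng))          # draw Z(1) from xi
    for n in 1:N
        t = dayofyear[n]
        if n > 1
            # per Eq. (3), the transition from date n-1 to date n uses Q at day t_{n-1}
            p = Q[dayofyear[n-1]][Z[n-1], :]
            Z[n] = findfirst(cumsum(p) .>= rand(rng))
        end
        for s in 1:S
            Y[n, s] = rand(rng) < lambda[Z[n], t, s]     # Bernoulli emission, Eq. (4)
        end
    end
    return Z, Y
end
```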

The SHMM chain is illustrated in Fig. 2.


Figure 2. A seasonal hidden Markov process (ξ, Q_t, f_{k,t})_{k∈𝒦, t∈𝒯}, where Z represents the hidden variables (weather regimes), Y the observed multisite rain occurrence (MRO), Q the transition matrix, and f the distribution of the observations.


A few remarks are in order before going further.

  • This model accounts for leap years: for instance, the date n = 59 + 366 = 425 corresponds to 28 February 1957, i.e., to the day t = 59, while the next date, n = 426, 1 March 1957, is the day t = 61. All 29 February dates are labeled with t = 60 (see the sketch after these remarks). With this convention, the estimation of parameters for t = 60 will be performed with 3 times fewer data than for other dates; nevertheless, it will have only a minor impact on the procedure because of the time smoothing of parameters discussed in Sect. 2.4.

  • The annual periodicity of the distributions Qt and ft,k,s is questionable. On the one hand, for obvious reasons of statistical inference, it is not possible to try to estimate as many distributions (parameterized by n∈𝒟) as there are data available, which leads to the reasonable assumption of annual stationarity as in Touron (2019b). On the other hand, annual stationarity is probably not accurate considering climate change. In our methodology, the calibrated parameters should be understood as valid over the data horizon used. We will see in Sect. 8 that shifting the data period into the future (using climate projection under different RCPs) will cause some parameters to evolve. Let us mention some tests in Touron (2019b, Chap. V) showing that the effect of climate change on precipitation is not easily identifiable (unlike for temperatures), supporting the stationarity hypothesis of our model. Including nonstationary effects with spatial HMM would require modifying the model to allow exogenous variables like in Bellone et al. (2000), Greene et al. (2011), and Dawkins et al. (2022) and will not be explored in this paper.
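As announced in the first remark, the day-of-year labeling with 29 February fixed at t = 60 can be made explicit with Julia's Dates standard library (our own small helper, not part of the package):

```julia
using Dates

# Day label t in 1:366, with 29 February always labeled t = 60 and, in
# non-leap years, 1 March labeled t = 61.
function day_label(d::Date)
    t = dayofyear(d)
    if !isleapyear(year(d)) && t >= 60
        t += 1          # skip the slot reserved for 29 February
    end
    return t
end

day_label(Date(1957, 2, 28))  # 59
day_label(Date(1957, 3, 1))   # 61
day_label(Date(1956, 2, 29))  # 60
```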

As a consequence of the spatial independence assumption in Sect. 2.2, the conditional likelihood of the MRO at date n is given by

(5) $f_{k,t}(y) := \mathbb{P}\left(Y(n) = y \mid Z(n) = k\right) = \prod_{s\in\mathcal{S}} f_{k,t,s}(y_s)$.

This probability depends on n only through the corresponding day t. This assumption forces the model to learn spatial features (and spatial dependence) through the hidden states.

Later in this paper, we show (see Fig. B2) that this SHMM produces, in general, shorter dry or wet spells than the ones observed, suggesting that the Markov dynamics of the weather regime Z are not enough to stochastically explain the temporal evolution of the MRO Y. Indeed, Z is a weather regime over all of France and does not take into account the local dynamics of rain occurrence Y_s(n); i.e., in addition to being influenced by the global weather, local weather should also depend on the local previous days' MRO Y(n−1), Y(n−2), …. Hence, it makes sense to define the dynamics of the MRO conditional on several previous days. This is the raison d'être of the next models 𝒞m with m > 0.

We end this section with another set of remarks considering our model assumptions, which will also apply to 𝒞m>0 models.

  • The conditional independence hypothesis in Sect. 2.2 is discussed in Sect. 2.6.

  • Note that conditional independence does not imply independence between stations. The model will learn, through the hidden states, “long-range correlation”, whereas conditional independence will mean that there is no short-range correlation. The actual correlations between the selected stations can be seen in Fig. 13 for MRO and range between 0 and ≃0.5.

  • In Hughes and Guttorp (1994b, Fig. 3), the pairwise correlation conditional on hidden states (and synoptic forcing) is shown to decrease very quickly with distance. Typically, the characteristic decay length is around 50 km for most station pairs.

2.3 Seasonal hierarchical hidden Markov model (SHHMM), model 𝒞m with m>0

To better reproduce the dry and wet spell distributions, we consider additional local conditioning. Different lengths of this additional local conditioning, Y_s(n) | (Z(n), Y_s(n−1), Y_s(n−2), …, Y_s(n−m)), will correspond to different models 𝒞m (with some memory parameter m = 1, 2, …). Intuitively, models with history 𝒞m>0 should display better temporal persistence than the 𝒞0 model; i.e., consecutive day sequence statistics should be replicated better. On the other hand, these models 𝒞m>0 require more parameters to be fitted for the same number of data, and thus one should expect statistically less accurate estimates if m is too large.

Given m>0, we introduce the history variable

$H(n) := (Y(n-1), Y(n-2), \ldots, Y(n-m)) \in \mathcal{H}(m) := \mathcal{I}^m$

and its local analog $H_s(n) := (Y_s(n-1), Y_s(n-2), \ldots, Y_s(n-m)) \in \mathcal{H}_s(m) := \mathcal{I}_s^m$. The following hypothesis (H-𝒞m) summarizes the model.

(H-𝒞m)

Z evolves as a seasonal Markov chain with period T = 366. Conditional on the hidden variable {Z(n) : n ≥ 1} and the local history H(n), the spatial components Y_1(n), …, Y_S(n) are independent, and, furthermore, the conditional distribution of each Y_s(n) only depends on Z(n) and H_s(n). This distribution is a Bernoulli distribution describing the probability of rain at a station s and date n, conditional on Z(n) = k and H_s(n) = h_s: it is denoted as f_{k,n,s,h_s}, assumed to be T-periodic, and represented as

(6) $f_{k,t,s,h_s}(y_s) := \mathbb{P}\left(Y_s(n) = y_s \mid Z(n) = k, H_s(n) = h_s\right) = \lambda_{k,t,s,h_s}\,\mathbb{1}_{y_s=\mathrm{wet}} + (1-\lambda_{k,t,s,h_s})\,\mathbb{1}_{y_s=\mathrm{dry}}$

for some parameters λ_{k,t,s,h_s} ∈ [0, 1] depending on n only through the associated day t, the hidden state k, and the m previous days' observation value h_s at the station s.

As a consequence, and similarly to Eq. (5),

(7) $f_{k,t,h}(y) := \mathbb{P}\left(Y(n) = y \mid Z(n) = k, H(n) = h\right) = \prod_{s\in\mathcal{S}} f_{k,t,s,h_s}(y_s)$.

The 𝒞m>0 models are defined by (ξ, Q_t, f_{k,t,h})_{k∈𝒦, t∈𝒯, h∈ℋ(m)}, where the law of the first observations, ξ = P(H(1) = ⋅, Z(1) = ⋅) with H(1) = (Y(0), …, Y(1−m)), is added.

Regarding the usual terminology of hidden Markov chains, the model 𝒞0 is a standard (periodic) HMM (Cappé et al.2005, Sect. 2.2) since the observed variables {Y(n) : n ≥ 0} are independent given the hidden variables {Z(n) : n ≥ 0}. For the other models 𝒞1, 𝒞2, …, because of the dependence with respect to previous days through Y(n−1), Y(n−2), …, we are rather in the presence of autoregressive HMMs as described in Kirshner (2005, Sect. 3.1.1) (also discussed in Cappé et al.2005, Sect. 2.2.3, under the name hierarchical HMMs): conditional on {Z(n) : n ≥ 0}, the MRO process {Y(n) : n ≥ 0} evolves as a Markov chain with memory m. This is a significant difference from other precipitation models in the literature, such as Touron (2019a), Holsclaw et al. (2016), and Kroiz et al. (2020).

In the remainder of the article, we will use the term seasonal hierarchical hidden Markov model (SHHMM) to refer to the model 𝒞m>0. Note that we will also use the same term to describe the full model, i.e., 𝒞m>0 with added rainfall amounts (see Sect. 6).
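For m = 1, the only change to the generative step relative to 𝒞0 is that the Bernoulli parameter is indexed by the previous day's local occurrence. A minimal Julia sketch of the emission step of Eq. (6), with an assumed array layout λ[k, t, s, h] where h = 1 encodes a dry and h = 2 a wet previous day:

```julia
# Emission step of the C1 SHHMM (Eq. 6) for one date, given the hidden state k,
# the day of year t, and the previous day's occurrences Yprev.  The layout
# lambda[k, t, s, h] (h = 1: previous day dry, h = 2: wet) is illustrative.
function emit_day!(rng, Ynow::AbstractVector{Bool}, Yprev::AbstractVector{Bool},
                   lambda, k::Integer, t::Integer)
    for s in eachindex(Ynow)
        h = Yprev[s] ? 2 : 1                      # local history H_s(n) for m = 1
        Ynow[s] = rand(rng) < lambda[k, t, s, h]
    end
    return Ynow
end
```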

We illustrate the 𝒞m=1 model in Fig. 3.


Figure 3. Illustration of a seasonal hierarchical hidden Markov model with 1 d memory (m = 1).


Note that a time-independent autoregressive HMM was proposed in the PhD work of Kirshner (2005, Sect. 6.1.1) as a promising option but, to the best of our knowledge, has not been explored further. However, the combination of HMM with local memory, seasonality (see Sect. 2.4), and the subsequent addition of rainfall amount (see Sect. 6) appears to be new.

2.4 Hypothesis and modeling of the time regularity of parameters

The previous models 𝒞0 and 𝒞m>0 depend on the T-periodic functions (Q_t, f_{k,t,h})_{k∈𝒦, h∈ℋ(m)}. A quick inspection of the number of scalar parameters to estimate on each day t∈𝒯 gives

  • K(K−1) coefficients for the transition matrix Qt and

  • K × S × 2^m coefficients for the Bernoulli distribution parameters λ_{k,t,s,h_s} for all k, s, and h_s.

For S=10, K=4, and m=1, it gives 92 scalar parameters, which is larger than the number of available data at each day t (64 for usual days and 16 for 29 February). On the one hand, estimating the parameters by maximizing the observed likelihood independently at each day t∈𝒯 is conceptually simple. On the other hand, the estimated parameters would suffer from high variance as there are too few data at each day t. Therefore, in the inference procedure that will be exposed in Sect. 3, a time regularity constraint will be imposed. This procedure (detailed later) will be essential to recover interpretable and meaningful results.

Let us argue in more detail. Intuitively, the timescale of variation of the model parameters should be of the order of magnitude of a month (30 d). Hence, once fitted, the parameters should evolve as a smooth function of day t. The advantages of imposing a smoothing are multiple:

  1. This avoids unrealistic, erratic day-to-day changes in the parameters while allowing for a physically realistic seasonal evolution.

  2. It helps to overcome the lack of data at each day t; indeed, the smoothing implies that the data from neighboring days t-1,t+1,t-2,t+2, are accounted for when making an inference at day t.

  3. In terms of identifiability of the model, it is well-known that HMMs are identifiable up to relabeling of the hidden states. In the case of the SHMM, the model is identifiable only up to relabelings of the hidden states that may differ at each day t (Touron2019a). Thus, it is very likely that a naive likelihood optimization routine gives quite different parameters on consecutive days, whereas for obvious interpretability reasons, we seek a smooth evolution as a function of the day t of the calendar year.

A popular choice in the literature is to use trigonometric polynomials (Langrock and Zucchini2011; Papastamatiou et al.2018; Touron2019b) to parameterize the parameters as a function of the day t ∈ 𝒯 (see Eqs. 8–9b below) and directly infer the new coefficients. The final SHMM or SHHMM is then only identifiable up to a global relabeling common to all t. Other methods, such as cyclic penalized splines (Feldmann et al.2023; Dawkins et al.2022), could have been considered. Thus, each parameter (Q_t, λ_{k,t,s,h_s})_{k∈𝒦, s∈𝒮, h_s∈ℐ_s} is composed with a trigonometric polynomial as follows: given some coefficients c_0, c_1, …, set

(8) $P_c(t) := c_0 + \sum_{d=1}^{\mathrm{Deg}} \left[ c_{2d-1}\cos\!\left(\tfrac{2\pi d}{T}t\right) + c_{2d}\sin\!\left(\tfrac{2\pi d}{T}t\right) \right]$

for some degree Deg. For all k∈𝒦 the transition matrices are given by

(9a) $Q_t(k,l) = \dfrac{e^{P_{c_{k,l}}(t)}}{1 + \sum_{l'=1}^{K-1} e^{P_{c_{k,l'}}(t)}} \quad \text{for } 1 \le l < K, \qquad Q_t(k,K) = \dfrac{1}{1 + \sum_{l'=1}^{K-1} e^{P_{c_{k,l'}}(t)}},$

and the Bernoulli parameters in Eqs. (4)–(6) by

(9b) $\lambda_{k,t,s,h_s} = \dfrac{1}{1 + e^{P_{c_{k,s,h_s}}(t)}}.$

The parameterization of Q_t corresponds to the log-ratio transformation well-known in compositional data analysis (Pawlowsky-Glahn and Buccianti2011). These definitions ensure 0 < λ_{k,t,s,h_s} < 1, 0 < Q_t(k,l) < 1, and Σ_{l∈𝒦} Q_t(k,l) = 1 for all t ∈ 𝒯 and k ∈ 𝒦. A model with a high degree Deg will be able to capture shorter and shorter sub-seasonal/monthly/sub-monthly phenomena.
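A minimal Julia sketch of Eqs. (8)–(9b), evaluating the trigonometric polynomial and then one transition-matrix row and one Bernoulli parameter for a given day t (the coefficient layout is illustrative):

```julia
# Trigonometric polynomial P_c(t) of Eq. (8); c = [c_0, c_1, ..., c_{2*Deg}].
function P(c::AbstractVector{<:Real}, t::Integer; T::Integer=366)
    Deg = (length(c) - 1) ÷ 2
    val = c[1]
    for d in 1:Deg
        val += c[2d] * cos(2π * d * t / T) + c[2d + 1] * sin(2π * d * t / T)
    end
    return val
end

# Row k of Q_t from Eq. (9a): c_row[l] is the coefficient vector c_{k,l} for
# l = 1, ..., K-1 (the last column carries the normalization).
function transition_row(c_row::Vector{<:AbstractVector}, t::Integer)
    e = [exp(P(c, t)) for c in c_row]
    return vcat(e, 1.0) ./ (1.0 + sum(e))    # length K, positive, sums to 1
end

# Bernoulli parameter of Eq. (9b) for one (k, s, h_s).
bernoulli_param(c::AbstractVector{<:Real}, t::Integer) = 1 / (1 + exp(P(c, t)))
```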

A quick inspection of the number of parameters (the coefficients c) gives (for S=10, K=4, m=1, and Deg=1 corresponding to roughly four seasons) 92×3=276 scalar parameters (for all t∈𝒯) instead of 92×366=33672 in the previous day-by-day parameterization. The gain is quite significant. However, the maximization step has no analytical solutions: the subsequent numerical optimization is heavy because now (Qt,λk,t,s,hs)kK,sS,hIs is not independent with respect to t∈𝒯. The resulting parametric problem is of lower dimensions but more complex to solve than the T individual problems.

In the rest of the paper, we denote by θ all the coefficients appearing in Eqs. (8)–(9b), and those are to be optimized:

(10) $\theta := \left\{ c_{k,l} \in \mathbb{R}^{2\mathrm{Deg}+1},\; c_{k,s,h_s} \in \mathbb{R}^{2\mathrm{Deg}+1} : k \in \mathcal{K},\; l \in [[1:K-1]],\; s \in \mathcal{S},\; h_s \in \mathcal{I}_s \right\}.$

2.5 Identifiability

For the inference problem to make sense, the model must be identifiable. Latent models are known to be only identifiable up to label swapping. Moreover, Bernoulli mixtures are known to be non-identifiable (Gyllenberg et al.1994). However, they are identifiable under a weaker notion of generic identifiability up to label swapping if the following condition holds (Allman et al.2009, Corollary 5):

(11) $2\log_2(K) + 1 \le S.$

Generic identifiability (Allman et al.2009) implies in particular that the set of points for which identifiability does not hold has measure zero. Hence, for the applications, this notion is enough. For our application, we explore K at most 8, which requires S ≥ 7.
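For reference, the condition in Eq. (11) translates into a minimal number of stations for a given number of hidden states; a one-line check in Julia:

```julia
# Smallest integer S satisfying the generic identifiability condition
# 2*log2(K) + 1 <= S of Eq. (11).
min_stations(K::Integer) = ceil(Int, 2 * log2(K) + 1)

min_stations(4)  # 5
min_stations(8)  # 7, consistent with exploring K at most 8 with S >= 7
```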

In Touron (2019a, Theorem 1), the identifiability up to label swapping of the seasonal hidden Markov model is proven under the following assumptions.

  1. For 1 ≤ t ≤ T, the transition matrices Q_t^⋆ are invertible and irreducible.

  2. The matrix Q_1^⋆⋯Q_T^⋆ is ergodic, and its unique stationary distribution ξ^⋆ is the distribution of Z(1).

  3. For each t ∈ [[1:T]], the K distributions (ν_k^⋆(t))_{k∈{1,…,K}} are linearly independent.

The star (⋆) denotes the true parameters. The irreducibility and ergodicity are satisfied under the parametric assumption for Q_t since all the matrix coefficients are strictly positive. The invertibility of Q_t is proven to hold up to a negligible set of parameters (Touron2019a, Sect. 2.4.1) for our parametric choice. For the second condition, since the coefficients of Q_t^⋆ are strictly positive, those of Q_1^⋆⋯Q_T^⋆ are as well, and therefore Q_1^⋆⋯Q_T^⋆ is irreducible and aperiodic. To prove that the third assumption is satisfied in our case, we use the equivalence (Yakowitz and Spragins1968, theorem in Sect. 3) between linear independence of K distributions (ν_k)_{k∈𝒦} and the identifiability of the mixture Σ_k w_k ν_{k,t} for some weights (w_k)_{k∈𝒦}. Together with the condition in Eq. (11), it follows that the model 𝒞m=0 is generically identifiable up to a global relabeling. For higher-order models 𝒞m>0, the local memory (autoregressive structure) of the distribution prevents direct application of the previous results; however, one can reasonably expect a similar condition to hold.

2.6 Model justification

We end the modeling section with a discussion on our choice of having inferred weather regimes using rain occurrences only.

Training hidden state models with binary variables such as wet/dry is well-established in machine learning classification techniques (see Bishop2006). Hence, rather than using a complex distribution (rain amount) in the HMM, we first focus on discrete rain occurrences, as in a Bernoulli mixture. Discrete distributions might seem like a simplification compared to existing methods, where rain amounts are directly expressed as a mixture of an atomic and continuous distribution (Touron2019a) or modeled using censored Gaussian distributions (Ailliot et al.2009; Baxevani and Lennartsson2015), or in the context of Markov switching models where complex weather variables are modeled (Ailliot and Monbet2012; Ailliot et al.2015b; Monbet and Ailliot2017; Ailliot et al.2020). However, it can be argued that rain occurrences and amounts are very distinct processes with different statistical properties (Wilks1998; Dunn2004; Yang et al.2019). For example, Vaittinada Ayar et al. (2020) use a spatial censored latent Gaussian model (conditioned on predefined weather regimes, but that is not the point) with the rain amount R directly. Hence, it assumes the same spatial correlation coefficient for the variables R>0 and Y. Similarly, while in our model we induce autoregressive Markov local memory for rain occurrences Y, their model assumes temporal memory using a MAR(1) model for R.

As mentioned previously, attempts to directly include spatial rain amounts within the hidden states have not been completely satisfactory in terms of learned correlations (e.g., Kroiz et al.2020, Fig. 1, Ailliot et al.2009, Model Cγ, or Holsclaw et al.2016), where no correlation check seems to have been performed. Our approach produces fully interpretable hidden states that are relevant not only for rain occurrence but also for other variables such as rain amounts and mean sea level pressure. This is made possible by our assumption described in Sect. 2.2, which is both a strength and a limitation of the model: it requires a sparse station distribution but forces the hidden states to learn spatial patterns with temporal Markov dependence. Hence, at smaller scales (or for a denser station distribution), this assumption might not hold, and other, often less interpretable, methods may be required.

In fact, we argue that more complex HMMs can be increasingly difficult to train and lack interpretability. See Pohle et al. (2017) and de Chaumaray et al. (2023) for discussions on how imperfect parametric distributions can, for example, lead to an overestimation of the number of hidden states. For instance, extreme precipitation events often fall outside the reach of standard parametric rain distributions and could affect the weather regimes. Hence, learning hidden states directly from rain amounts might affect their quality. Moreover, a higher number of states could be necessary, but this would come at the expense of robustness since they are identified from the same amount of data. The choice of Bernoulli distributions for binary variables is, however, exact, suggesting that our model will likely pick a smaller number of hidden states, i.e., more interpretable.

Moreover, we also argue that breaking conditional independence, as in Hughes and Guttorp (1994b) and Kirshner et al. (2004) (which are the only two attempts we found in that direction), must be done carefully, as it complicates model identification. Specifically, spatial correlation can either be learned by the hidden states or by the added correlation structure. The proportion of dependence captured by each component is not explicitly controlled, and in some cases, all correlations may be learned through the additional correlation structure, rendering the hidden states irrelevant. Enforcing conditional independence in our model ensures that all spatial dependencies are learned exclusively by the hidden states and is validated a posteriori (see Sect. 5.2.2). In addition, the complexity of models like that in Hughes and Guttorp (1994b) is such that it is not clear how many more stations could be added (with respect to the conditional independence model) before reaching computational limits.

3 Fitting the SHHMM and selecting the hyperparameters

In Touron (2019a), the maximum likelihood estimator is shown to be a consistent estimator for the seasonal HMM, i.e., 𝒞m=0. Proving the consistency for the autoregressive model 𝒞m>0 is outside the scope of this paper; however, we will still use the maximum likelihood estimator to infer the model parameters.

Maximizing the likelihood of a latent model is usually done with the expectation maximization (EM) algorithm. See McLachlan and Krishnan (2007) for a general review of the EM algorithm and its extensions. To maximize the log-likelihood of the SHHMM, we will use a heterogeneous version of the Baum–Welch algorithm, which is a special kind of EM algorithm for hidden Markov models. The details of the algorithm can be found in Appendix E. Note that in this paper, we do not consider Bayesian inference as in Stoner and Economou (2020) and Verdin et al. (2019). Hence, the estimated parameters will be deterministic, and the resulting SWG model will solely be responsible for the climate variability, i.e., the uncertainty in the parameters' estimation will not be accounted for. A known issue of EM algorithms is that they can converge to local maxima. As we will illustrate, a naive random initialization of the algorithm without a good guess will likely land in some meaningless local maxima – even if multiple random initial conditions are tried – and/or take a very long time to converge.

Hence, before fitting the SHHMM with the Baum–Welch algorithm, we will first find a crude estimator of the SHHMM by solving many simpler subproblems, using the procedure described below.

3.1 Initialization: the slice estimate

The idea is to first treat the MRO observations of each day of the year t ∈ 𝒯 separately. On each day t, the distributions {f̃_{1,t}, …, f̃_{K,t}} form a mixture model that can be fitted with a standard EM algorithm. Once this is done, we relabel the hidden states at each day t to ensure some continuity in the estimated parameters θ̃_{k,t,h,s}. Finally, by identifying the most likely a posteriori states on each date n, we obtain an estimated sequence, {z̃(n) : n ∈ 𝒟}, which we use to fit the transition matrices Q̂(t). The whole procedure is described in Appendix F. In Appendix F7, we show the gain in terms of likelihood and number of iterations when using the slice estimate compared to random initialization.

3.2 Baum–Welch algorithm for SHHMM

The previous section provides an estimated SHHMM that we use as a starting point in the Baum–Welch algorithm. The algorithm alternates between expectation (E) and maximization (M) steps to converge to a local maximum of the observed likelihood, defined for the SHHMM (ξ, Q_t, f_{k,t,h})_{k∈𝒦, t∈𝒯, h∈ℋ(m)} with m ≥ 1 (see Sect. 2.3) by

(12) $L\left(y(1:N); \theta\right) = \mathbb{P}\left(Y(1:N) = y(1:N)\right) = \sum_{z(1),\ldots,z(N) \in \mathcal{K}^N} \xi_{z(1),h(1)}\, f_{z(1),t_1}\!\left(y(1) \mid h(1)\right) \prod_{n=2}^{N} Q_{t_n}\!\left(z(n-1), z(n)\right)\, f_{z(n),t_n}\!\left(y(n) \mid h(n)\right),$

where for the sake of simplicity we assume that h(1) is known, so that ξ_{z(1),h(1)} = P(Z(1) = z(1), H(1) = h(1)) = P(Z(1) = z(1)). Note that this is the case in practice, as we have a few extra days of data to define h(1). We briefly detail each step of the EM algorithm in Algorithm 1, and more details can be found in Appendix E.

Algorithm 1. EM algorithm for the SHHMM 𝒞m.

Result: An SHHMM (ξ, Q_t, f_{k,t,h})_{k∈𝒦, t∈𝒯, h∈ℋ(m)} with parameters θ̂^(i_stop).
Initialization:
An initial set of parameters θ^(0) is given; as mentioned, we use the slice estimate SHHMM described in Sect. 3.1.
Step (i > 0):
E-step: Compute the smoothing probabilities $\pi_{n\mid N}^{\theta^{(i)}}(k) = \mathbb{P}_{\theta^{(i)}}\left(Z(n) = k \mid Y(1:N)\right)$ and $\pi_{n,n+1\mid N}^{\theta^{(i)}}(k,l) = \mathbb{P}_{\theta^{(i)}}\left(Z(n) = k, Z(n+1) = l \mid Y(1:N)\right)$ under the current parameter θ^(i). These probabilities can be computed using the forward–backward procedure (Appendix E).
M-step: Maximize the function $R(\theta, \theta^{(i)}) = \mathbb{E}_{\theta^{(i)}}\left[\log L\left(Y(1:N), Z(1:N); \theta\right) \mid Y(1:N)\right]$ with respect to θ.
Stop:
The iterations stop at i = i_stop when $L(\hat{\theta}^{(i+1)}) - L(\hat{\theta}^{(i)}) < \epsilon_{\mathrm{atol}}$.

Note that at M-step, the maximization can be done independently for the transition matrices and the distributions of the observations (and initial distributions). However, since we enforce the coefficients θ(i) as periodic functions of the day of the year t, the maximization step cannot be done explicitly even for a simple Bernoulli distribution and is thus done numerically.
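The E-step relies on the forward–backward recursions of Appendix E. For illustration, the forward pass alone already evaluates the observed log-likelihood of Eq. (12) without summing over 𝒦^N explicitly; the Julia sketch below uses illustrative inputs (the emission probabilities of Eq. 7 are supplied as a function) and a standard scaling to avoid numerical underflow.

```julia
# Forward algorithm for the SHHMM: evaluates the observed log-likelihood of
# Eq. (12).  Illustrative inputs:
#   xi        : length-K initial distribution of Z(1) (h(1) assumed known)
#   Q         : Q[t] is the K x K transition matrix for day of year t
#   emis      : emis(k, n) = P(Y(n) = y(n) | Z(n) = k, H(n) = h(n)), Eq. (7)
#   dayofyear : day label t_n for each date n = 1..N
function loglikelihood_forward(xi, Q, emis, dayofyear)
    K = length(xi)
    N = length(dayofyear)
    alpha = [xi[k] * emis(k, 1) for k in 1:K]
    c = sum(alpha); alpha ./= c
    ll = log(c)                                   # scaled forward recursion
    for n in 2:N
        t_prev = dayofyear[n-1]                   # transition n-1 -> n uses Q_{t_{n-1}}
        alpha = [sum(alpha[l] * Q[t_prev][l, k] for l in 1:K) * emis(k, n) for k in 1:K]
        c = sum(alpha); alpha ./= c
        ll += log(c)
    end
    return ll
end
```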

In all our numerical applications, the stopping criterion is ϵ_atol = 10^−3. For the settings K = 4, m = 1, and Deg = 1 and the historical data, the log-likelihood at convergence is typically L(θ̂^(i_stop)) ≈ −117 127, i.e., ϵ_atol/|L(θ̂^(i_stop))| ≈ 10^−8. We also check that this stopping criterion is relevant for the θ parameters, as we have max|θ^(i_stop) − θ^(i_stop−1)| ≈ 10^−3, where the max is taken as the largest difference between two iterations over all the parameters θ in Eq. (10).

To avoid being trapped in a poor local maximum, we run the algorithm 10 times with initial conditions randomized around the initial state θ^(0) provided in Sect. 3.1; see Appendix F6 for more details. We then keep the run with the maximum likelihood.

3.3 Hidden state inference: the Viterbi algorithm

Once the SHHMM parameters θ̂ are found, the most likely hidden state sequence given the observed data, {ẑ(n) : n ∈ 𝒟}, i.e.,

(13) $(\hat{z}(1), \ldots, \hat{z}(N)) = \arg\max_{z(1:N)} \mathbb{P}\left(Z(1:N) = z(1:N) \mid Y(1:N) = y(1:N)\right),$

can be inferred with the Viterbi algorithm (Viterbi1967). In this algorithm, to estimate Eq. (13), for n∈𝒟 and k∈𝒦, the quantity

(14) $\delta_n(k) = \max_{z(1:n-1)} \mathbb{P}\left(Z(1:n-1) = z(1:n-1),\, Z(n) = k,\, Y(1:n) = y(1:n)\right)$

is estimated recursively. For a homogeneous HMM with no local memory, the recursion for Eq. (14) is simply

$\delta_{n+1}(k) = \max_{l\in\mathcal{K}} \left[\delta_n(l)\, Q(l,k)\right] \mathbb{P}\left(Y(n+1) = y(n+1) \mid Z(n+1) = k\right).$

See Viterbi (1967). For an SHHMM, the recursion becomes

$\delta_{n+1}(k) = \max_{l\in\mathcal{K}} \left[\delta_n(l)\, Q_n(l,k)\right] \mathbb{P}\left(Y(n+1) = y(n+1) \mid Z(n+1) = k,\, H(n+1) = h(n+1)\right).$

This can be shown by a straightforward adaptation of the original proof.
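For completeness, a minimal Julia sketch of the Viterbi decoder for the SHHMM, in log space and with the same illustrative inputs as the forward-algorithm sketch of Sect. 3.2 (emis(k, n) again denotes the history-conditioned emission probability of Eq. 7):

```julia
# Viterbi decoding of the most likely hidden state sequence for an SHHMM,
# in log space.  Inputs follow the forward-algorithm sketch in Sect. 3.2.
function viterbi(xi, Q, emis, dayofyear)
    K = length(xi)
    N = length(dayofyear)
    logdelta = [log(xi[k]) + log(emis(k, 1)) for k in 1:K]
    backptr = zeros(Int, N, K)
    for n in 2:N
        t_prev = dayofyear[n-1]
        nextdelta = similar(logdelta)
        for k in 1:K
            scores = [logdelta[l] + log(Q[t_prev][l, k]) for l in 1:K]
            backptr[n, k] = argmax(scores)
            nextdelta[k] = maximum(scores) + log(emis(k, n))
        end
        logdelta = nextdelta
    end
    z = zeros(Int, N)                 # backtrack z_hat(1:N)
    z[N] = argmax(logdelta)
    for n in N-1:-1:1
        z[n] = backptr[n+1, z[n+1]]
    end
    return z
end
```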

This algorithm provides a very efficient way to decode the whole hidden state sequence corresponding to the observations, allowing us to match historical weather events to hidden state sequences. This is illustrated in Sect. 4.4.

3.4 Model selection

We introduced three hyperparameters to our model: the local memory length m = 0, 1, 2, …, the number of hidden states (weather regimes) K = 1, 2, 3, 4, …, and the degree Deg = 0, 1, 2, … of the trigonometric expansion in Eq. (8). In particular, the number of hidden states K must be large enough to reproduce spatial correlations but low enough to avoid overfitting and loss of interpretability. In this model, we fix m and Deg to be the same for all stations and variables.

In the literature, several methods have been used to select the hyperparameters of HMMs, such as information criteria like the Bayesian information criterion (BIC) and cross-validation; see de Chaumaray et al. (2023) and references therein. From a theoretical point of view, no result guarantees the quality of these estimators for the SHHMM. To select the hyperparameter K, we use the integrated complete-data likelihood (ICL) criterion, as it favors nonoverlapping hidden states and shows better empirical performance with HMMs than other model selection methods (Celeux and Durand2008; Pohle et al.2017). It is based on the complete-data likelihood L_C(y(1:N), z(1:N); θ) = P(Z(1:N) = z(1:N), Y(1:N) = y(1:N); θ), which is not accessible in practice. The estimate

L^C(y(1:N),z^(1:N);θ^)=PZ(1:N)=z^(1:N),Y(1:N)=y(1:N);θ^

uses the fitted parameter θ^ and the decoded Viterbi most likely hidden state sequence (z^(n))nD. The ICL is then computed as

(15) ICL ( m , Deg , K ) = log ( L ^ C ( y ( 1 : N ) , z ^ ( 1 : N ) ; θ ^ ) ) - log ( N ) 2 | θ ^ | .

The optimal {m,Deg,K} set is obtained by maximizing ICL(m,Deg,K). In Fig. 4, we see that K=4,m=1, and Deg=1 maximize the ICL. Hence, for the rest of the paper, unless specified otherwise, we will choose these parameters. Note that in Robertson et al. (2004), K=4 hidden states were also found for northeast Brazil using cross-validation. Note that, in principle, we could use different ms at each station s∈𝒮, as well as different degrees Degs for each type of variable and station (transition matrix coefficient, Bernoulli parameter, etc.). We tested configurations where some stations had a larger local memory (ms=2 or 3), but this consistently resulted in a lower ICL. This suggests that while the ICL criterion is well-suited for selecting the number of states K, it may not be optimal for choosing other hyperparameters, as some stations, such as La Hague, show signs of higher-order temporal dependence (see Figs. 11 and 12). Alternative criteria such as BIC (Katz1981) or parsimonious higher-order Markov models (e.g., Raftery1985) might be considered. For the remainder of the paper, we fix m=1 at all stations.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f04

Figure 4ICL for different values of the hyperparameters. The model with K=4,m=1, and Deg=1 is the maximizer.

Download

3.5 Comparison with multisite WGEN-type models

In this section, we introduce another multisite rain occurrence model that will be used for comparison. This model was first proposed by Wilks (1998) using first-order Markov models to simulate rain occurrence with a Gaussian latent model to generate spatially correlated amounts. Srikanthan and Pegram (2009) later extended it to fourth-order Markov models to better reproduce dry/wet spell distributions. This class of weather generators is also referred to as WGEN-type models (Nguyen et al.2023) and is typically used alongside rainfall amount models (e.g., Evin et al.2018). Note that in the literature, the acronym WGEN has also been used to refer to other models. Seasonality is accounted for by assuming that the parameters remain constant within each month.

Mathematically, at each station s∈𝒮 and for a month mth[[1:12]], rain occurrence follows a Markov model of order mW, with transition probabilities given by

(16) f mth , s W ( y s h s ) = P Y n ( s ) = y H s ( n ) = h s ,

where Hs(n)=(Ys(n-1),Ys(n-2),,Ys(n-m))Hs(mW):=IsmW is the history variable of order mW, introduced in Sect. 2.3.

The multisite correlations are modeled using an unobserved Gaussian process U. At each day n∈𝒟 and for a given month mth[[1:12]],

(17) U ( n ) N ( 0 , Ω mth ) ,

where U(n)={Us(n)}sS, and Ωmth={ωs,smth} is an S×S positive-definite correlation matrix.

The rain occurrence Ys(n) at site s is determined by the value of Us(n). Given a history hs(n)IsmW and the Bernoulli probability pW=fmth,sW(ys=weths(n)),

(18) Y n ( s ) = wet , if  U s ( n ) Φ - 1 ( p W ) dry , otherwise ,

where Φ−1 is the quantile function of the standard normal distribution.

As in Srikanthan and Pegram (2009) and Evin et al. (2018), we set the Markov model order to mW=4. The correlation matrix is estimated following the previous references by simulating each site pair for each month to determine ωs,smth that yields the observed correlations, cor({Ys(n)}nDmth,{Ys(n)}nDmth) where 𝒟mth is the set of all days in month mth.

The model is fitted, yielding 2mW×S×12 parameters for the Markov chains and S(S-1)/2×12 for the correlation matrices.

The biggest advantage of this model is that it is not limited by the conditional independence hypothesis; i.e., stations can be as close or as far apart as needed. Moreover, the fitting procedure is slightly simpler, as no expectation maximization algorithm is required. The seasonality treatment differs slightly, as WGEN-type models assume parameters to be constant per month, while in our setting, they evolve smoothly throughout the year. Ignoring this minor difference, the complexity of this model is significantly greater than ours. The number of correlation coefficients grows as S2, whereas our model scales as K2 for the spatial part, with typically KS. Thus, while our model is limited in the number of stations due to the conditional independence assumption, WGEN-type models may be constrained in practice by computational complexity. Moreover, as we will show, setting our local history to m=1 provides good results in general, whereas WGEN-type models typically require larger mW values to adequately reproduce dry/wet sequences (see Sect. 5.2.1). This suggests that part of the temporal dependency is captured by the hidden states, simplifying the local Markov models. Additionally, large-scale dry spells will not be accurately represented, as described in Sect. 5.3. This is not entirely surprising, as WGEN-type models only account for pairwise correlations. Note that higher orders could, in principle, be added at a much greater computational cost.

4 Interpretability: making sense of the hidden states

One of the main messages of this paper is to show that the resulting hidden states are fully interpretable, both spatially and temporally. In particular, forcing conditional independence (see Eqs. 5 and 7) forces all spatial correlations to be in the hidden states.

The discrete latent variable Z∈𝒦 used here corresponds to weather regimes, sometimes referred to as weather types or patterns, which represent a finite set of possible atmospheric states acting as quasi-stationary, persistent, and recurrent large-scale flow patterns. They are commonly used in weather generators to characterize the daily atmospheric circulation (e.g., Garavaglia et al.2010), which influences the values of the generated variables at the daily timescale. Various methods exist for identifying these weather regimes, with hidden Markov models being one such approach. In our case, these hidden states are not constrained by any external variables and will be interpreted as specialized weather states for France. To our knowledge, no previous approach using spatial HMM has been applied to infer and interpret weather regimes over France (or western Europe). Generally, only a few attempts (e.g., Robertson et al.2004, Sect. 4) identify and interpret weather regimes without using exogenous variables.

We describe in this section different points of view to give a sense of these hidden states that we also refer to as weather regimes. In the following, all plots and interpretations are done for the model 𝒞m=1 with K=4 and Deg=1, which was the model selected in Sect. 3.4.

4.1 Spatial features

The hidden states have been introduced to give correlated rain events across France. Hence, we expect the hidden states to form some spatial patterns specific to French weather; typically, the south is generally drier than the north.

4.1.1 Rain probability

In Fig. 5, we show the rain probability given the hidden state k and that the previous day was dry, averaged over the year. The Z=1 state corresponds to a high probability of rain over all of France, Z=2 corresponds to a rainy climate in the north and drier in the south, and Z=3 is more or less the opposite, while in the state Z=4 the probabilities of rain are low all over France. The trained model satisfactorily recovers known regional features of the French climate. For higher-order models K>4, the spatial features are more and more specific to peculiar regimes, e.g., rainy only in Bastia. It can also be a signal of overfitting.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f05

Figure 5Yearly mean rain probability T-1tTλk,t,s,h for m=1 and h=dry, i.e., the probability of rain at a location s, conditional on the hidden state Z=k[1,K=4] and on a previous dry day.

4.1.2 North Atlantic pressure maps

To interpret the model beyond the stations it was trained on and beyond the rain occurrence variable, we will look at the hidden states in terms of weather patterns over the North Atlantic with pressure maps. This is a common practice in climatology and is used to classify weather patterns. For example, the four North Atlantic weather regimes are large-scale weather regimes over the North Atlantic Ocean responsible for most of the climate variability (Woollings et al.2010). There are various definitions of the regimes of the North Atlantic weather, and it is typically done using clustering methods such as K-means on the daily maps of anomalous 500 hPa geopotential height (e.g., van der Wiel et al.2019). Note that, as noted in Garavaglia et al. (2010), “it is almost impossible to assert that a given classification is the best” – hence, the comparisons presented in this section are mostly qualitative, as our model classifies weather regimes using only rain occurrence, while other weather variables or classification techniques would yield different regimes.

In Fig. 6, we show how the weather regimes are relevant in terms of pressure maps. We consider the mean sea level pressure (MSP) from the reanalysis ERA5 hourly data on single levels (Hersbach et al.2020) from 1979 to 2017. The pressure map is averaged over all winter days DW=D{December,January,February} conditional on the hidden state (inferred before via the Viterbi algorithm (see Sect. 3.3), giving a pressure anomaly map ΔMSE=EtDWMSP(t)z^t=k-EtDWMSP(t) at each longitude and latitude. The geographical area where the pressure maps are computed is longitude[80°W,40°E] and latitude[25°N,80°N]. It is much larger than France and corresponds roughly to the North Atlantic area. The results are shown in Fig. 6.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f06

Figure 6Winter (December–January–February) mean sea level pressure (Pa) anomalies (difference) between the average of all winter days in Z=k state and the average of all winter days. The relative frequency of occurrence of each state during winter in the historical data is shown in parentheses.

The four maps clearly show four distinct regimes with different pressure anomalies. It is remarkable that coherent large-scale structures over mean sea level pressure are found with a model only trained over S=10 stations, all located in France, with rain occurrences as training data.

As a first comparison, we consider the four Euro-Atlantic regimes defined in Cassou (2004, Fig. 7). We display the same differential pressure map as they do over the winter months for a similar spatial domain. In Cassou (2004) and van der Wiel et al. (2019), the four regimes are NAO, Atlantic Ridge, blocking, and NAO+. The two NAO regimes correspond to the reinforcement or attenuation of the Icelandic low and Azores high, leading to a strengthened or weakened westerly flow over France. The two other regimes correspond to different deviations of this flow, having different consequences for French weather, depending on the season.

In our model selection, we found the same number of hidden states K=4; see Sect. 3.4. If to some extent the states defined by the SHHMM (Fig. 6) look similar to the Euro-Atlantic regimes (see Cassou2004, Fig. 7) in terms of the order of magnitude of the mean pressure anomalies (∼10 hPa) and patterns, e.g., Z=4 can be seen as a blocking-like pattern.

However, a close inspection shows important differences: the structures we found are more centered toward France and slightly more intense. This is to be expected as the training data are only in France. A fairer comparison can be found with Boé and Terray (2008), where the weather types (WTs) found with rain amounts in France are also coherent with ours and easier to compare as they represent, like us, the MSP anomalies. They define eight WTs over extended winter months (November to March). WT2 (25 % relative frequency of occurrence) is very close to our Z=4 (31.8 %) in terms of pattern and amplitude, even far outside France. WT6 (12.5 %) is also very close to our Z=3 (15.5 %), i.e., a milder depression centered on the Azores and an anticyclone centered in northern Europe. The other two rainier hidden states Z=1 and Z=2 probably cannot be viewed as simple combinations of the remaining WTs. This is consistent with Boé and Terray (2008), who use rainfall amounts for clustering.

In Garavaglia et al. (2010), the weather patterns (WPs) are found using stations mostly located in the southeast of France and a more complex rain variable. Note that their main figure (Fig. 3) shows the WP geopotential height at 1000hPa averaged all year long, making the comparison with our Fig. 6 harder. Their anticyclonic state (WP8) (25 %) is close to our Z=4, with a pressure high over the north of France and south of Great Britain.

4.2 Seasonality

The SHHMM's transition matrix and distributions have periodic coefficients varying across the year. A consequence is that the hidden states are not fixed in time but can also vary. We expect variations to be smooth enough so that weather regime Z=k has a similar interpretation during the whole year.

4.2.1 Transition matrix

We display, in Fig. 7, the 16 coefficients of the transition matrix Qt. The “dry” state Z=4 is the state where the probability of staying in the same state is the highest. Hence, we expect longer global dry sequences than the other regimes. The probability of remaining in the same state is the lowest in states Z=2 and Z=3; hence, these can be seen as transitional states. Moreover, state Z=4 has a very low probability of switching directly to state Z=1 (and vice versa), confirming that an intermediate state is required for this to happen. This makes sense with the intuition that a dry day all over the country is rarely followed by a wet day all over France. During some seasons, e.g., summer (June, July, August, September), state Z=2 will prefer to transition to a dry state Z=4 rather than the wet state Z=1. This is the opposite situation in the rest of the year. Again, this is consistent with the fact that during summer we expect the state Z=1 to be less frequent.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f07

Figure 7Temporal variation of the transition matrix Q(t) for the SHHMM K=4, Deg=1, and m=1.

Download

4.2.2 Rain probability

We plot, in Fig. 8, the rain probabilities as a function of the station and climate variable Z=k. In almost all stations, the extreme states Z=1 (4) are where it rains most (least) often. As seen in Fig. 5, states Z=2 and 3 are different in the north and south.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f08

Figure 8Estimated λk,t,s,h probability for m=1 and h=dry, i.e., the probability of rain at the location s, conditional on the hidden state k[[1,K]] and on a previous dry day. The stations are sorted by latitude from northernmost (top left) to the southernmost (bottom right).

Download

The success of the SHHMM fit can be observed as, at each station, most states are completely separated all year long. That is, in general, the rain probability conditioned on different states Z does not cross states or become equal, showing meaningful states. When converging to local minima, we would typically observe such state crossings during the year, indicating potential issues in the fit.

4.3 Mean rain amount

Even if the model training does not involve any rain amounts Rs(n), the hidden states Z=k should still be meaningful for these. In Fig. 9, we plot the daily mean rain amount Rk,s(t)>0 for each station and state k. The values obtained are smoothed with a periodic moving average of time window ±15 d; see Appendix D for the definition. The “rainy” weather regime k=1 is the state where it rains the most at almost every location and all year long. Similarly, the “dry” regime k=K is where it rains the least. Interestingly, the intermediate regimes, k=2 and k=3, are rainier in the north in different seasons. Southern stations have a different behavior as expected.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f09

Figure 9Daily mean strictly positive rain quantity R>0 (mm) at every station per kth component. We smooth the results as in Eq. (D1). We use the model 𝒞1 with K=4 to get a posteriori the most likely state associated with each date n; see Eq. (F3).

Download

4.4 Weather regime spells

To illustrate the dynamics of the weather regimes, we show in Fig. 10 for different years the Viterbi estimated hidden states (z^) (see Sect. 3.3). As previously noticed, dry and wet spells last longer in general than in other states. For historical events such as the drought in the of summer 1976, we observe a long dry sequence (27 d in a row in state Z=4 starting from 3 June). The famous 2003 heatwave from 1 to 15 August also corresponds to a 15 d dry spell.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f10

Figure 10Estimated hidden state sequence for a selection of years. Each color corresponds to a hidden state: Z=1 is blue, Z=2 is green, Z=3 is purple, and Z=4 is orange.

Download

5 Simulations: multisite rain occurrence

Now that the model is fully inferred and interpreted, we will test its validity. To do so, we will sample multiple independent and identically distributed (IID) realizations of the training period 1956 to 2019 and compare several spatiotemporal statistics with the historical data.

5.1 Simulation algorithm of the SHHMM

We first sample the hidden states (z(n):nD) according to the nonhomogeneous periodic transition matrix Qtn and initial distribution ξ, and then we draw the MRO (y(n):nD) from the conditional distributions fz(n),tn,s,hs(n). The procedure is summarized in Algorithm 2.

Algorithm 2Simulation of the SHHMM

Result: Sequence hidden states z(n), sequence of MRO y(n)
z(n=1)ξ
for n∈𝒟 do
z(n)Qtn(z(n-1),)
end for
y(n=m-1:0)=yinin=m-1:0
for n∈𝒟 do
for s∈𝒮 do
ys(n)fz(n),tn,s,hs(n)()
end for
end for

In the simulations, we choose the initial date as 1 January 1956. Our final date is 31 December 2019 so that the total simulated range is 64 years, which corresponds to our dataset span. We choose ξ=(1,0,0,0), i.e., z(1)=1, which is a rainy weather regime, because it was rainy all over France on that day. We assume that the MROs before the first simulation day yinin=m-1:0 are observed and use them as input to draw ys(1)fz(1),t,s,hs(1).

The model is lightweight, allowing fast generation – which can easily be parallelized – of many sequences to study climate variability. On a standard computer, generating one 64-year sequence of rain occurrences and hidden states using the StochasticWeatherGenerators.jl package (Métivier2024) takes approximately ∼0.01 s.

5.2 Results

In the following, we will use J=5×103 IID. realizations (stochastic simulations) of the SHHMM over a 64-year span and compare its statistics with the 64-year observed sequence and denote by {ys(n)}nD(j) the j[[1,J]] realizations. When the j is dropped, it refers to the ensemble of all simulations.

5.2.1 Dry/wet state sequence

The dry spell sequences are of particular interest to estimate risks associated with droughts. We show the observed dry (wet) spell distributions, i.e., probability mass function (PMF), in Fig. 11 (and Fig. 12) at all the stations and compare them to the simulated spells for the J realization. When the historical distribution is contained in the simulations' envelope, we may conclude that the model does a good job of reproducing the dry (wet) spells: note that this works systematically well, except for La Hague station (bordering the Channel Sea) at a few data points. In general, for Lille, La Hague, and Chassiron, the observed PMFs deviate from the center of the envelope, suggesting that higher m>1 might be required.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f11

Figure 11Dry spell distribution (in number of days) at every station and for a time range 𝒟 of the historical data (blue) and of the J=103 simulated wet spell distribution. The gray envelope covers the full range (q0,100) of the simulations, while the red envelope covers the interquartile range (q25,75), and the line (red) is the median. Simulations are obtained over the same time range 𝒟 and using the model K=4,Deg=1, and 𝒞m=1.

Download

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f12

Figure 12Wet spell distribution (in number of days) at every station and for a time range 𝒟 of the historical data (blue) and of the J=5×103 simulated wet spell distribution. The gray envelope covers the full range (q0,100) of the simulations, while the red envelope covers the interquartile range (q25,75), and the line (red) is the median. Simulations are obtained over the same time range 𝒟 and using the model K=4,Deg=1, and 𝒞m=1.

Download

In Appendix B, we show and discuss the distribution obtained using the memoryless 𝒞m=0 model to highlight the gain of the model 𝒞m=1 in both the center and the tails of spell distributions; see Figs. B1 and B2. We note that even though wet spells are in general much shorter than dry spells, having m=1 is necessary to accurately reproduce the wet spells. Note that the WGEN model presented in Sect. 3.5 is also expected, for a sufficiently high order, to perform well for dry/wet spells, as it is trained at each station to learn the temporal dependence of dry/wet sequences.

5.2.2 Spatial correlations

We compare in Fig. 13 the observed and simulated S(S-1)/2 correlation coefficients between all sites cor({Ys(n)}nD,{Ys(n)}nD) for all ssS. Most correlations are well-reproduced, showing that the conditional independence hypothesis in Sect. 2.2 (or Sect. 2.3) is valid.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f13

Figure 13 Observed pair correlations cor({Ys(n)}nD,{Ys(n)}nD) for all ssS compared with the correlations computed from the simulations (we average the J=5×103 pair correlations of our simulations). The conditional independence metric MSECI in Eq. (19), is displayed in the figure.

Download

To measure this quantitatively, we define the following conditional independence metric:

(19) MSE CI = s < s S cor ( { Y s ( n ) } n D , { Y s ( n ) } n D ) - 1 J j = 1 J cor ( { y s ( n ) } n D ( j ) , { y s ( n ) } n D ( j ) ) 2 .

The closer to zero, the better the conditional independence hypothesis will be satisfied. This criterion helps compare different choices of stations 𝒮; see Sect. 2.1 and Appendix C for more details. This is the biggest limitation of this work: to produce meaningful hidden states that correctly learn spatial correlations, the MSECI must be small. For example, a pair of stations at the center of Paris and Orly (only ≃13 km apart) would not satisfy the conditional independence hypothesis. Note, however, that the conditional independence between stations is not necessarily isotropic; hence, station configurations with better MSECI are not necessarily those with the largest pairwise station distances.

To give an upper bound to this metric, we trained the model, while keeping seasonality, with K=1 state, i.e., no hidden states (meaning completely independent stations), and found MSECI(K=1)=0.096.

5.3 Spatiotemporal spells

To check the spatiotemporal properties of the model, we can focus on the temporal properties of the following spatial quantity: the rain occurrence rate (ROR). Note that a similar quantity is considered in Baxevani and Lennartsson (2015). On a given day n, it is defined as the fraction of stations above some precipitation threshold Rth,

(20) ROR R th ( n ) = s S 1 R s ( n ) > R th | S | .

In general, the precipitation threshold could be made station-dependent, e.g., a given quantile. Such quantities have been used to study large-scale phenomena, e.g., temperature heatwaves (Miloshevich et al.2023; Cognot et al.2025). Here, we focus on dry or wet events relevant, for example, for droughts, choosing Rth=0. The previous quantity simplifies to

(21) ROR ( n ) = s S Y s ( n ) | S | .

To evaluate large-scale lasting dry events, we consider spells of ROR(n)≤0.2; i.e., only 20 % or less of the stations are rainy. We show in Fig. 14 the distribution (PMF) of the spells for each season, i.e., December–January–February (DJF), March–April–May (MAM), June–July–August (JJA), and September–October–November (SON). We observe that the SHHMM is able to reproduce short spells as well as the distribution tails for every season. It can even produce longer spells than observed. On the contrary, the WGEN model overestimates short spell durations while underestimating longer spells. This clearly shows that simple correlation models are not adapted to produce correct large-scale weather events, even though they perform well for pairwise correlation and local dry/wet spells. Using censored Gaussian models, Kleiber et al. (2012) and Serinaldi and Kilsby (2014) consider similar quantities but obtain rather poor results. In contrast, Vaittinada Ayar et al. (2020) achieve good results; however, their model is conditioned on synoptic weather regimes and tested on a much smaller area.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f14

Figure 14 Distribution of spells of ROR(n) lower than or equal to 0.2 for each season. We show in blue the observed spells and in gray the simulations obtained with the SHHMM; in green are the simulations obtained with the WGEN model with Markov chains of order 4. In both cases, the gray envelope shows the quantiles q5,95 of the J=5×103 simulations, and the line is the median (q50).

Download

6 Modeling: precipitation amount

In this section, we attach to the rain occurrence model 𝒞m an add-on: a multisite precipitation amount generator. The procedure is carried out hierarchically, i.e., without modifying or retraining the original model. In fact, other variables such as temperature and solar irradiance could be attached similarly to what will be presented in this section. To do so, one only needs a generator for the new variable, e.g., AR(1) model for temperature, and to allow its parameters to depend on the weather regimes Z(n)=k and to evolve smoothly (as in Sect. 2.4) with the day of the year t. We hypothesize that the new variable has some dependence on both the weather regime and the season. We discussed in Sect. 4 various spatiotemporal interpretations of the weather regime, and thus it makes sense to consider how this global variable is relevant for other weather variables. Hence, the resulting add-on generator should generate a variable at least partially correlated with the original SHHMM. This makes the model very modular, allowing easy extensions without affecting its original performances and interpretations. Figure 9 highlights the rain amount dependence on the weather regime k and seasonality. This principle is applied in this section to build an add-on rainfall generator.

The multisite rain amount (MRA for short) is denoted R(n) as in Eq. (1). Building an MRA generator directly is hard because of the ambivalent probabilistic nature of rain, being neither a discrete nor a continuous variable. Here we can just focus on strictly positive rain amounts R>0 because the SHHMM directly indicates when R=0 or R>0.

To train the rain amount generator, we will use the hidden states Z(n)=z^(n) found in Sect. 3.3. The schematic of the resulting model is shown in Fig. 15.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f15

Figure 15SHHMM with rain amounts. gk,t denotes the MRA generator with respect to the weather regime k and day of the year t.

Download

6.1 Marginal rain distributions

The rain amount generator we use to fit the marginal distributions Rs>0 at each station is a mixture g(r) of two exponential distributions, with density

(22) g ( r ) = w e - r ϑ 1 ϑ 1 + ( 1 - w ) e - r ϑ 2 ϑ 2 .

This choice is widely adopted in the literature, e.g., Kirshner (2005), Touron (2019b), and Kroiz et al. (2020), and has only three parameters denoted by γ={ϑ1,ϑ2,w}. It is flexible enough to be used for different climate types and locations. Other popular choices such as gamma (Kroiz et al.2020; Holsclaw et al.2016) or heavy-tail distributions (Baxevani and Lennartsson2015; Naveau et al.2016; Tencaliec et al.2020) could be used at specific locations s or weather regimes k when needed. Note that these heavy-tail rainfall distributions are notoriously hard to estimate (Evin et al.2016), so we will not consider them in the present paper. See Chen and Brissette (2014) for a review of univariate precipitation models. For example, precipitation in the south of France is less frequent than in the north but more intense, leading to extreme events which are better described with heavy-tailed distributions. In the simulation part, Sect. 7, we show that despite being light-tailed, this choice of generator g(r) trained with respect to weather regimes and seasonality is able to reproduce both the bulk and the tails of most observed rain distributions well.

As in Sect. 2.4, the parameters of the mixture are periodic functions γ(t)={ϑ1(t),ϑ2(t),w(t)}, where ϑ1or2(t)=eP1or2(t)>0, w(t)=1/(1+ePθ(t)) and the P functions are trigonometric polynomials (see Eq. 8).

To fit the mixtures gk,t,s for each station s and hidden state k we use the classical EM algorithm. The maximization step has to be performed with numerical optimization as in Sect. 3. Note that optimization can be done separately for each weather regime k and station s.

6.2 Multisite distribution: Gaussian copula

After training the marginal distributions gk,t,s at each hidden state and site, we now focus on generating correlated multisite rainfall amounts (MRAs). To generate multisite rain occurrences (MROs), we used the conditional independence with respect to the hidden state (and possibly local history). For a vector of Bernoulli (dry/wet) random variables, this was enough to approximate the observed correlation matrix well (see Fig. 13). However, for a vector of non-discrete random variables, such as rain amounts, mixtures of conditionally independent distributions typically underestimate the joint distribution (Holsclaw et al.2016). It means that despite the hidden states carrying some part of the MRA correlations, we have to add correlations in another way. A classical approach is to use copulas (Nelsen2006). Amongst the various families of copula, the Gaussian copula is the easiest to train and manipulate and has been used for weather models (Pandey et al.2018; Kroiz et al.2020). In this paper, we will thus train and use a Gaussian copula conditional on the hidden states to generate multisite (strictly) positive rain amounts.

Let (ρs,s)s,s be the correlations between a pair of stations (s, s) for joint rainy events, i.e., (ρs,s)s,s=Cor(Rs|Rs>0,Rs|Rs>0). To reproduce the correct observed (Pearson) correlation (ρ)s,s, we train a Gaussian copula. A Gaussian copula takes a correlation matrix Σ(G)={ρs,s(G)}s,sS2 and the marginal distributions gs as input. The matrix Σ(G) is not directly observed, but for an elliptic copula, there is a relationship between the correlation ρ(G) and the Kendall (rank) correlations (Fang et al.2002, Theorem 3.1),

(23) ρ ( G ) = sin π 2 ρ Kendall .

Hence, to compute ρ(G), we use the observed Kendall correlation ρKendall , which is preserved under monotonic transformation, such as quantile and cumulative distribution functions (CDFs).

We estimate the correlation matrices Σk(G)={ρk,s,s(G)}s,sS2 conditional on the hidden state Z=k. Indeed, we expect and observe that the weather regime impacts the correlation. For the driest state Z=K a rain event should be largely independent, and in the rainy state precipitation should be correlated. We actually enforce the conditional independence when k=K; i.e., ΣK(G) is a diagonal covariance matrix. This choice is also motivated by the lack of observations of joint rain events in the state k=K.

In this work, we also assume, for simplicity, that the correlation matrices have no seasonality dependence, i.e., are independent of the day t. Moreover, we also do not model local temporal correlations for rain amounts. This shortcoming could be overcome using, for example, a spatiotemporal covariance matrix (Benoit et al.2018).

6.2.1 Simulation procedure

To simulate the rainfall amounts, we first simulate the SHHMM chain (z(n),y(n):nD); see Algorithm 2. Then for all the stations where rain is predicted, Swet(n)={s:Ys(n)=wet,sS}, the rain amounts Rs(n)>0 are generated conditionally using the Gaussian copula with marginal gz(n),tn,s and correlation matrix Σ(n)={ρz(n),s,s}s,s(Swet(n))2.

Remark. In Appendix A, we visually show and with an approximate χ2 test that the Gaussian copula model is a valid model for most station pairs. Note that this Gaussian copula can underestimate the joint extreme rain amount (Renard and Lang2007), e.g., for close stations. In that case, other copulas might be used, as in Dawkins et al. (2022), for example, but will not be explored in this paper.

7 Simulations: multisite rain amount

In this section, we will test the full multisite model combining the SHHMM and the rain amounts. We will test the marginal distributions, the spatial and temporal correlations, and the seasonality. Note that all previous results of the MRO simulations (see Sect. 5) are still valid since the addition of rain amount is done “on top” of the SHHMM.

In the simulations, we use the parameters obtained in Sect. 3.4: m=1 local memory, K=4 hidden states, and Deg=1 order of trigonometric polynomial. We use the SHHMM transition matrix and Bernoulli distributions obtained in Sect. 3.2 and the rain amount marginal and copula obtained in Sect. 6.

7.1 Correlations

We first compare the spatial correlation of MRA over the 64 years of data; i.e., for all pairs of stations s and s we estimate Cor(Rs,Rs). The results are shown in Fig. 16 (left), where observed correlations are compared to simulations. In Fig. 16 (right), we perform a similar comparison for the symmetric tail correlation (or upper tail dependence) (Nelsen2006) defined by

(24) ( ρ T ) s , s ( q ) = ( ρ T ) s s ( q ) + ( ρ T ) s s ( q ) / 2 with  ( ρ T ) s s ( q ) = P R > F R s - 1 ( q ) R s > F R s - 1 ( q )

for q[0,1]. The tail correlation indicates how extreme events are correlated at different stations. We observe a good match for most stations; however, for stations with larger tail correlation, ≳0.2, the tail correlation is underestimated by the simulations. This can be an indication that the Gaussian copula is not enough for these pairs of stations. Improvement using the Student copula (whose manipulation is less easy but more capable of generating tail dependence) is a possibility that should be explored in future work.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f16

Figure 16Comparison of the multisite correlation (a) and symmetric tail correlation in Eq. (24) with q=0.95 (b) from the observed and simulated data. The correlations computed from simulations are averaged over J=5×103 realizations.

Download

7.2 Rain amounts

7.2.1 Distribution and autocorrelation of precipitation

Figure 17 shows the nonzero rain amount distributions Rs>0 at each station s all year long during the 64-year span of data. It shows the historical distributions (blue) and 5×103 realizations of our generator (gray). The model reproduces the bulk of the distributions as well as the tails. The observed distribution is in the interquartile range (red envelope) of the simulations up to PDF values 3×10-4 for all stations (except for a few points). Even for southernmost stations (Toulouse, Marignane, Bastia) with heavier tails, our model is able to capture most extremes; i.e., the observations lie in the full simulation envelope (gray). This might be due to the seasonal training of the marginals in Eq. (22), allowing the distributions to be more extreme in late summer when heavy storms are common in the south of France. This good performance is interesting in light of recent efforts to use more complex distributions, e.g., generalized Pareto distribution (Naveau et al.2016; Tencaliec et al.2020). At some stations like Embrun and Toulouse, it can even generate extremes twice as large as the current maximal value observed, which might be questionable. Note that it cannot reproduce the most extreme rain event at Chassiron (that occurred on 14 August 1972), which in this case shows the limitation of short-tailed exponential distributions.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f17

Figure 17 Distribution of the nonzero precipitation amount R>0 (mm) at every station and for all years 𝒯. The distribution of the historical data is shown by the blue line. The gray envelope covers the full range (q0,100) of the J=5×103 simulations, while the red envelope covers the interquartile range (q25,75), and the line is the median. To ensure correct representation, the bin intervals of all histograms are set to 0,2,4,,Rmax(s)(mm), where Rmax(s) is the maximum rainfall recorded in observations or simulated among the J simulations at the station s.

Download

In Fig. 18, we perform a temporal dependence check for the rainfall amounts (zero and nonzero) by showing the autocorrelation function (ACF) at different lags. The results show that the ACF with lag=1 is always underestimated. This is expected as the current model does not include explicit dependence on the previous day's rainfall, as in Benoit et al. (2018), where the quantity Rn+1Rn is explicitly modeled. The only dependence in our model comes from the weather regimes as well as the autoregressive rain occurrences. Larger lags (>1), however, are better reproduced at most stations. The model fails at La Hague and Chassiron, which are both located on the coast (Channel Sea and Atlantic Ocean, respectively), where the observed dry and wet spells also deviate more from the simulation interquartile range than for other stations.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f18

Figure 18 Estimated autocorrelation function (ACF) of the daily rainfall amount R (mm) for lag 0 to 15 at every station and for all years 𝒯. The ACF of the historical data is shown by the blue line. The gray envelope covers the full range (q0,100) of the J=5×103 simulated ACF.

Download

In the context of flood analysis, a more relevant statistic is the distribution of aggregated rainfall amounts over a period of time. It is crucial, as, for example, heavy rainfall over 5 consecutive days poses a greater risk of flooding than the same amount of rain dispersed over several months. Some models specifically train on these aggregated events (e.g., Evin et al.2018), while here the only temporal dependence is provided by the underlying HMM. To this end, we display in Fig. 19 the distribution of cumulated rain sequences (non-overlapping) of 5 d. The results again show that the bulk of the distribution (inset) is very well-reproduced, while the tails of the observations generally lie up to 10−3 within the interquartile range of the simulations. Once again, the model is able to produce extremes that were not observed, e.g., at Embrun and Bourges. At Chassiron, it fails again to envelope the most extreme observation (which is due to the same extreme event as seen in Fig. 17). The worst results are obtained for La Hague and Chassiron. Again, this is an indication that these two stations require a more complex local model, e.g., higher local memory and dependency on past rainfall amounts.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f19

Figure 19 Distribution of cumulated 5 d precipitation amount R (mm) at every station and for all years 𝒯. The distribution of the historical data is shown by the blue line. The gray envelope covers the full range (q0,100) of the J=5×103 simulations, while the red envelope covers the interquartile range (q25,75), and the line is the median. In the inset, we show in regular scale a zoom of the bulk of the distribution. To ensure correct representation, the bin intervals of all histograms are set to 0,3,6,,Rmax(s)mm, where Rmax(s) is the maximum of aggregated rainfall recorded in observations or simulated among the J simulations at the station s.

Download

7.2.2 Precipitation during the year

To test the seasonality of the model, we show the quantiles 0.1,0.5, and 0.9 of the accumulated monthly amount at every location. This tests how the model performs in each region and for each month under different regimes (very dry, median, and rainy months). Note that each blue point (historical data) at each station and month is obtained using the observed quantile over 64 points. Hence, it is prone to greater estimation error than most other statistics studied in this paper that use daily observables. Most observed points are located within the envelope of the J=5×103 simulations (colored regions), indicating a fair match with the model. At the southernmost stations, Marignane and Bastia, the observed seasonality does not seem to match that of the model: at the beginning of summer, the model appears to produce rainier July months, while at the beginning of autumn, in October, the generated months are drier than observed. This suggests that the rainfall amount model might not be fully adapted; for example, a higher seasonal dependence (larger Deg) could be required.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f20

Figure 20The (0.1,0.5,0.9) quantile of the cumulated monthly mean rain amount in millimeters per month (orange, gray, and light blue, respectively). Historical data (dark blue). For each quantile, we show the envelope of the J=5×103 simulations and in darker colors the [25,75] percentiles, and the line is the median of the simulations.

Download

8 Application to climate change projections

So far, we have trained and validated the SHHMM using historical data. Using the same hyperparameters as found in the model selection (see Sect. 3.4), we can train the model on other datasets. In this section, we show how the stochastic weather generator developed in this paper can be used to study climate change impacts. The focus of the paper is not to perform an in-depth analysis but rather to show, as a proof of concept, how SWG could be useful in that context. To this end, we will train the model with projection data made available by climate model institutes participating in the scientific projects coordinated in the IPCC framework (Arias et al.2021). A new Coupled Model Intercomparison Project is launched for each new IPCC cycle, and each participating institute runs the latest versions of their global climate model or Earth system model under prescribed radiative forcing conditions. Because these simulations are global and present biases compared to local observations (Tootoonchi et al.2023), we will use the downscaled and bias-adjusted projections provided by the French climate service DRIAS. It is based on a selection of regional projections made in the international CORDEX initiative based on CMIP5 global projections (projections made in the framework of the 5th IPCC assessment report).

8.1 DRIAS data

We use the DRIAS website (Soubeyroux et al.2021) that aggregates different regional (European) projections forced by some chosen global projections made by different institutes. DRIAS-2020 provides 30 climate projections (2006–2100) with three scenarios (RCP2.6, RCP4.5, and RCP8.5) and 12 historical simulations (1951–2005). These simulations are further downscaled and bias-adjusted over France using the ADAMONT method (Verfaillie et al.2017) with SAFRAN reanalysis (Vidal et al.2010) covering France with 8 km resolution for many daily variables. We select the closest grid points to the S=10 considered stations and extract the precipitation amount. The exact grid point choice should not matter too much, since the reanalyzed simulations are smoothly interpolated. Since these physical models tend to overestimate the frequency of light rain amounts R, we set to R=0 all amounts smaller than 0.1 mm to match what is done at the experimental weather stations.

8.2 Direct comparison of models with the reference period

Climate models provide historical simulations (1951–2005) to be able to validate models against observed data. Because a model does not simulate the same interannual variability as observed, the evaluations are based on the statistical properties of the variables rather than on their chronology. For example, in Fig. 21, we compare the monthly rain quantiles computed as in Sect. 7.2 obtained from the historical climate simulations and from the SHHMM simulations (the same as previously trained on historical observation). We use here two climate models, Aladin (CNRM-ALADIN63 – CNRM-CERFACS-CNRM-CM5) and IPSL (IPSL-WRF381P – IPSL-IPSL-CM5A-MR), as an example. Using an SWG allows a better sampling of the natural climate variability because it is possible to run many more realizations than can be done with climate models. This sample can then be used to check how climate model simulations are positioned. For example, at Lille station, in July for the 0.9 quantile, we observe that the historical point around ≃120 mm (blue) is far from the two climate models (orange and green): ≃100 mm. However, when looking at the predicted statistical envelope, the climate models are exactly at the median, while the historical observation is actually an extreme value. When comparing the two climate models, we observe that the IPSL model produces more points outside the statistical envelope than the Aladin model, suggesting that the model may present stronger biases.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f21

Figure 21 Same as Fig. 20 with added monthly rain quantile for the model Aladin (CNRM-ALADIN63 – CNRM-CERFACS-CNRM-CM5) and IPCC (IPSL-WRF381P – IPSL-IPSL-CM5A-MR) for the reference period 1952 to 2006.

Download

We can perform the same comparison task on dry spell distributions with the Aladin model; see Fig. 22. Again, the SWG samples allow for comparison of the climate model within the predicted variability. At a lot of stations, e.g., Marignane, Orly, and Toulouse, the results are within the interquartile range. However, we can also observe that in the tails, the climate model is always under or equal to the historical curve. This raises the question of whether climate models are able to produce yet unseen extremes.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f22

Figure 22 Same as Fig. 11 with the dry spell distribution of Aladin (CNRM-ALADIN63 – CNRM-CERFACS-CNRM-CM5) for the reference period 1952 to 2006.

Download

8.3 Training on RCP scenarios

Once the comparison has been made for the historical period, in this section, we will study how the spatial rainfall may evolve in the future by fitting the SHHMM on climate model projections under different RCP scenarios. The RCP scenarios are designed to represent differentiated trajectories of greenhouse gas and aerosol emissions that drive climate change until the end of the century (and beyond in some cases). To do so, we select the data over a 64-year range, here 2032–2096, which simplifies the statistical comparison with the 64-year range of the historical data we considered.

In Fig. 23, the transition matrices obtained when training on historical and IPSL-RCP8.5 data are compared. The aim here is only to highlight the ability of the SHHMM to be used in climate change conditions, not to conduct an impact study; that is why only one climate model is used. The two matrices are still close, meaning that the hidden states of our model are robust to parameter evolutions. However, we can observe interesting differences. For example, Q3→3 and Q1→1 are significantly larger in summer months. The weather regimes 1 and 3 were interpreted as rainy all over France and heavy rain in the south, respectively. This means that the IPSL model under RCP8.5 projects longer stretches of heavy rain. Figure 24 shows the analog of Fig. 20 with simulations from the trained SHHMM with IPSL-RCP8.5 data and in blue the historical data. It clearly shows that the IPSL model under the RCP8.5 scenario projects rainier periods. In fact, it is known that the regional climate model for the EURO-Coordinated Regional Downscaling Experiment used in all DRIAS models presents this type of bias (Boé et al.2020; Vautard et al.2021). In particular, summer periods are consistently rainier, even for the 0.1 quantile of the monthly mean rain amount. This example shows how the proposed SWG can be used to analyze and compare models: either directly interpreting the coefficients change or sampling from the fitted model to study extreme behaviors.

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f23

Figure 23Temporal variation of the transition matrix Qt trained on historical data (plain line) and on RCP8p5 from IPSL-WRF381P data (dashed line) for the period 2032–2096.

Download

https://ascmo.copernicus.org/articles/11/159/2025/ascmo-11-159-2025-f24

Figure 24The (0.1,0.5,0.9) quantile of the cumulated monthly mean rain amount in millimeters per month (orange, green, and light blue, respectively). Historical data (dark blue) and J=103 realization of the model trained with the RCP8p5-IPSL-WRF381P data for the period 2032–2096. For each quantile, we show the envelope of the J simulations and in darker colors the [25,75] percentiles and the median. Note that the color palette has been slightly modified with respect to Fig. 20 to highlight that these envelopes are obtained with simulations from the model trained on an RCP scenario.

Download

9 Conclusions

In this paper, we define a multisite stochastic weather generator for precipitation named the seasonal hierarchical hidden Markov model (SHHMM). Even though it is based on a hidden Markov model introduced for weather applications in the 1990s, we propose an original combination of features based on two important assumptions: (a) the conditional independence for the multivariate rain occurrence (MRO) variable (see Eqs. 57) and (b) the imposed smooth seasonal evolution of most model parameters (see Sect. 2.4). Assumption (a) forces the model to learn spatial correlations, leading to fully interpretable hidden states (weather regimes). This differs from part of the literature, where hidden states and correlation coefficients are trained jointly with continuous variables or additionally conditioned on predefined synoptic-scale variables. Thanks to the discrete nature of MRO and the station locations, we checked the validity of hypothesis (a); see Fig. 13. Assumption (b) is natural and has been introduced before in Touron (2019a); it stabilizes training, i.e., removes a lot of identifiability issues that occur while training nonhomogeneous HMMs, and leverages the relatively small number of observation years. To capture more of the local weather, i.e., station-wise, we introduced an autoregressive dependence of rain occurrence on past weather, allowing better temporal correlations.

The model inference and identifiability were analyzed. In particular, we proposed an efficient heuristic initialization in Sect. 3.1 to accelerate model training during the Baum–Welch expectation maximization algorithm and additionally demonstrated that the model must satisfy specific identifiability conditions to remain meaningful in Sect. 2.5. To the best of our knowledge, these aspects have not been previously discussed in the literature. The model selection was performed using the integrated complete-data likelihood criteria, leading in particular to the selection of four hidden states extensively interpreted as France-wide weather regimes in Sect. 4. In particular, we were able to showcase how the hidden states found can be compared to several Euro-Atlantic and France-centered weather regimes defined in other works. These hidden states were shown to be robust to different choices of stations; see Appendix C. The model was extensively tested with simulations. Its performance in reproducing dry and wet spells, as well as the amount of precipitation, is very good, even in the tails of the distributions at most stations. Our model was compared with WGEN, a widely used model in the literature, and was found to better reproduce the distribution of areal dry spells.

The model's hierarchical structure allows for easily adding other weather variables on top of the HMM without modifying the hidden states. In fact, new variables, such as rainfall amount, benefit from the trained hidden states and can be adjusted conditionally on them. The rainfall amount was also tested in terms of distribution, autocorrelation, 5 d aggregated distribution, monthly aggregated quantile, and spatial correlation, with good performance.

Eventually, we showed how this generator can be used with climate change models. One can evaluate climate models on the reference period by comparing them to the estimated climate variability obtained with many simulations of the SWG, as in Figs. 21 and 22. This approach can give more relevant results than only comparing the climate models to a single historical observation. Additionally, training the SWG model on future climate projections allows for interpreting changes across all model parameters (see Fig. 23) or resampling from these future scenarios to better estimate variability and extremes; see Fig. 24.

The major limitation of the model is the assumption (a) that forces careful choice of the stations, as noted already in Zucchini and Guttorp (1991). Even though we show in Appendix C that the precise station choice does not significantly affect the hidden states as long as the MRECI is low, finding a valid station set can be a hard problem. This first model lays the groundwork for a fully seasonal spatiotemporal SWG working on a large scale with no external exogenous variables.

Having inferred the weather regime only from rain occurrences and not amounts might also not be completely satisfying, as other clustering methods using rainfall amounts or geopotential have found more complex patterns, particularly for extreme rain. Rain occurrences might limit the number of meaningful patterns, as Robertson et al. (2004) also found four hidden states using a spatial HMM for northeast Brazil, which has a completely different climate. To circumvent these limitations, one could try to train a spatial HMM with rain amounts and/or more complex variables. However, we caution against such complexities, as they can tend to produce hidden states very dependent on the chosen imperfect parametric distribution; see Pohle et al. (2017) and de Chaumaray et al. (2023). Another solution, keeping only rain occurrences, could be to break conditional independence to allow many more stations in the region of interest, which would be able to discriminate between more complex patterns. It was explored partially in Hughes and Guttorp (1994b) and Kirshner et al. (2004) by allowing pairwise correlations or tree structures with the hidden states. Keeping interpretable hidden states while breaking conditional independence is a challenge left for future research.

Many smaller improvements could also be considered, such as different models for rain amount or seasonality at different locations to account for regional specificities. A temporal correlation with previous rain amounts is also possible with a spatiotemporal correlation matrix (Benoit et al.2018; Bennett et al.2018) to improve the precipitation autocorrelation function (ACF). Another question is how to incorporate nonstationarity in the model's hidden states, such as climate change trends. So far, these have been added using the generalized linear model framework with exogenous variable dependence on the parameters (e.g., Greene et al.2011, or Dawkins et al.2022).

Extending the model with new weather variables, such as temperature on top of the current model (and its hidden states), is another challenging problem that can be addressed with this model. Indeed, the weather regimes identified here are likely relevant for other weather variables such as temperature, solar radiation, and wind. They could therefore be used to train conditional models for these additional variables (hierarchical dependence). In fact, this idea was tested by training an SWG on five daily weather variables (rainfall amount, minimum and maximum temperature, solar irradiance, and evapotranspiration) using the same hidden states identified in this paper. The simulation outputs were then coupled with a crop model to produce a hackathon dataset; see Métivier et al. (2025) and references therein. Note that this is ongoing work. In a similar spirit, another option to include more densely packed stations could be to couple the SHHMM and its pre-trained large-scale weather regimes with a local high-resolution model. In this case, the hidden states would serve as exogenous variables. However, compared to other models where these inputs are fixed over a training period, our hidden states are modeled specifically for the large-scale area of interest and can be regenerated as needed to explore climate variability over the period of interest, e.g., with future climate scenario data.

Adding new weather variables and higher-resolution models at specific locations to extend the SHHMM could be useful in many large-scale risk analysis studies. For example, river water (temperature and flow) used to cool power plants can be modeled based on the weather (typically temperature and rainfall) over the relevant catchment areas (e.g., Nguyen et al.2023). With an extended SHHMM, one could assess the resilience of the entire French nuclear power plant fleet under climate change projections, e.g., by estimating the probability of having half of the plant generation simultaneously limited due to long-lasting, large-scale droughts. Exploring this extension is left for future work.

Appendix A: Gaussian copula

To check the Gaussian copula approximation for the joint rain events between station pairs, we transform our data into an empirical bivariate distribution with normal margins to test its quantiles against those of a true bivariate normal distribution. Note that these checks are mostly qualitative since we apply the procedure to time series, meaning we are outside the IID framework where these kinds of tests are valid.

In detail, given a pair of stations (s, s′) and a hidden state Z = k, we consider the joint positive rain amounts R_{s,s′,k} = (R_s > 0, R_{s′} > 0) | Z = k for all dates n, so we can drop the superscript n. We first transform the observations into pseudo-observations, i.e., R_{s,s′,k} ∈ ℝ₊² → η → (u_{s,k}, u_{s′,k}) ∈ [0,1]². Several transformations η are possible, e.g., the estimated marginal CDF or ordinal ranking. We use the latter, as done in the package Copulas.jl (Laverny and Jimenez, 2024) that we use for all our copula simulations. These pseudo-observations are then transformed to normal margins using X_{s,s′,k} = (ϕ⁻¹(u_{s,k}), ϕ⁻¹(u_{s′,k})), where ϕ⁻¹ is the quantile function of the standard normal distribution. For a vector x ∈ ℝⁿ and an n × n correlation matrix Σ_M, the squared Mahalanobis distance is defined as

(A1) $D_M(\mathbf{x}) = \mathbf{x}^\top \Sigma_M^{-1}\, \mathbf{x}.$

We use the correlation coefficients ρ^(G)_{s,s′,k} obtained in Sect. 6.2 to build the 2 × 2 matrix Σ_M and compute the Mahalanobis distance for all samples. For a true bivariate normal distribution, D_M follows a χ²(ν) distribution with ν = 2 degrees of freedom. In Fig. A1, we compare the quantiles of the χ²(ν=2) distribution with those of D_M(X_{s,s′,k}) for two pairs of stations and each hidden state Z = k. Note that we simplify the analysis by considering only one covariance matrix instead of the four fitted in Sect. 6.2. The correspondence is good, even for relatively close pairs, meaning that Gaussian copulas are adequate as soon as stations are far enough apart. For Z = 1, i.e., the rainiest weather, only 3 out of 45 station pairs fail the one-sided Kolmogorov–Smirnov test at the 95 % confidence level comparing the theoretical χ²(ν=2) distribution with the observed squared Mahalanobis distances. These are the pairs Bourges–Orly, Lille–Orly, and Lille–Luxembourg, which are among the closest pairs; see, e.g., Fig. A1 (left). Interestingly, for a slightly larger distance (334 km), the pair Orly–La Hague passes the test; see Fig. A1 (middle). This indicates anisotropy in the spatial distribution of the correlations. For the other weather regimes Z > 1, the Gaussian copula hypothesis also works well for most stations with enough data.
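For concreteness, the transformation and the Mahalanobis-distance check can be sketched in a few lines of Julia. This is a minimal illustration under our reading of the procedure; the function name, the rank-based pseudo-observation step, and the call to HypothesisTests.jl are assumptions of this sketch and not part of the released package.

```julia
using Distributions, StatsBase, HypothesisTests

function mahalanobis_check(r1::Vector{Float64}, r2::Vector{Float64}, ρ::Float64)
    n = length(r1)
    # pseudo-observations via ordinal ranks (as done with Copulas.jl in the paper)
    u1 = ordinalrank(r1) ./ (n + 1)
    u2 = ordinalrank(r2) ./ (n + 1)
    # transform to standard normal margins
    x1 = quantile.(Normal(), u1)
    x2 = quantile.(Normal(), u2)
    Σ = [1.0 ρ; ρ 1.0]
    Σinv = inv(Σ)
    # squared Mahalanobis distance of Eq. (A1) for each date
    D = [[x1[i], x2[i]]' * Σinv * [x1[i], x2[i]] for i in 1:n]
    # Kolmogorov–Smirnov comparison with the theoretical χ²(2) distribution
    return ExactOneSampleKSTest(D, Chisq(2))
end
```

The sorted distances D can also be plotted against the χ²(2) quantiles to obtain QQ plots analogous to Fig. A1.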


Figure A1. Three examples of QQ plots to test the Gaussian copula hypothesis: the squared Mahalanobis distances between station pairs vs. the χ²(ν=2) distribution. A good match means that the Gaussian copula hypothesis used to generate the pairs (R_s > 0, R_{s′} > 0) | Z = k is satisfied.


Appendix B: Comparison with the memoryless model 𝒞m=0

In Sect. 2.2 and 2.3 we define the models 𝒞_{m=0} and 𝒞_{m>0}. We later selected 𝒞_{m=1} using the ICL criterion; see Fig. 4. We show here the performance of the 𝒞_{m=0} model in terms of dry/wet spells in Figs. B1 and B2, where the observed distributions are displayed together with the quantile envelopes of the J = 5 × 10³ simulations. These figures are to be compared with Figs. 11 and 12 produced by the 𝒞_{m=1} model. In the bulk of the spell distributions, i.e., for short spells with higher probability, the difference is substantial (note that the log scale tends to visually minimize the effect), e.g., at Embrun and Marignane for the dry spells and at all stations for the wet spell distributions. This indicates that the model 𝒞_{m=0}, without local memory, overestimates very short wet spells (and dry spells to a lesser degree). At some stations, it also underestimates the longer spells (tails), e.g., Bastia for wet spells.


Figure B1. Dry spell distribution (in number of days) at every station for the time range 𝒟 of the historical data (blue) and for the J = 5 × 10³ simulated dry spell distributions. The gray envelope covers the full range (q₀, q₁₀₀) of the simulations, the red envelope covers the interquartile range (q₂₅, q₇₅), and the line is the median. Simulations are obtained over the same time range 𝒟 using the memoryless model with K = 4, Deg = 1, and 𝒞_{m=0}.



Figure B2. Wet spell distribution (in number of days) at every station for the time range 𝒟 of the historical data (blue) and for the J = 5 × 10³ simulated wet spell distributions. The gray envelope covers the full range (q₀, q₁₀₀) of the simulations, the red envelope covers the interquartile range (q₂₅, q₇₅), and the line is the median. Simulations are obtained over the same time range 𝒟 using the memoryless model with K = 4, Deg = 1, and 𝒞_{m=0}.


Appendix C: Comparison with other station choices

In this Appendix, we show that the hidden states, i.e., weather regimes, found in the main text with our choice of stations 𝒮 (see Sect. 2.1) are robust to station changes. We replaced each station with its closest neighbor within a 150 km radius; when no neighbor was found, the station was kept. With our pool of 66 stations and the original set, 8 out of the 10 stations were replaced. The resulting station set, 𝒮_moved, is given in Table C1. Note that the Luxembourg_A station is different from the Luxembourg station (at the airport) used in the paper.
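As an illustration, the station-replacement rule can be sketched as follows (a minimal sketch with hypothetical names; the great-circle distance is computed with the haversine formula, and the candidate list is assumed to exclude the original station):

```julia
# Great-circle distance in km between two points given in degrees.
haversine_km(lat1, lon1, lat2, lon2) =
    2 * 6371 * asin(sqrt(sind((lat2 - lat1) / 2)^2 +
                         cosd(lat1) * cosd(lat2) * sind((lon2 - lon1) / 2)^2))

# Return the index of the closest candidate within `radius` km, or `nothing`
# if no neighbor is found (in which case the original station is kept).
function move_station(lat0, lon0, candidates; radius = 150.0)
    d = [haversine_km(lat0, lon0, lat, lon) for (lat, lon) in candidates]
    imin = argmin(d)
    return d[imin] ≤ radius ? imin : nothing
end
```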

We trained our SHHMM with the stations in 𝒮moved using the same procedure as before. The resulting transition matrix is shown in Fig. C1. The new transition matrix (dotted line) is almost identical to the original one (full line) found in Sect. 4.2.1. The mean probabilities conditional on the hidden states of 𝒮moved (crosses) are also very close to those of 𝒮 (circles), exhibiting the same spatial pattern; see Fig. C2.

Table C1Original station set 𝒮 and set 𝒮moved with the closest available station within a 150 km radius. In bold are the stations we kept for which no neighboring station was available in the given radius.



Figure C1. Temporal variation of the transition matrix Q(t) for the SHHMM with K = 4, Deg = 1, and m = 1. The full line corresponds to the model trained on the original set 𝒮, and the dotted lines to the model trained on the set 𝒮_moved.



Figure C2. Yearly mean rain probability T⁻¹ Σ_{t∈𝒯} λ_{k,t,s,h} for m = 1 and h = dry, i.e., the probability of rain at a location s conditional on the hidden state Z = k ∈ [1, K = 4] and on a previous dry day. Circles show the results obtained when training with the 𝒮 set, and crosses the results when training with the 𝒮_moved set. Bourges and Orly appear in both sets, causing circle and cross to overlap. Additionally, Luxembourg and Luxembourg_A are located nearly on top of each other.

Appendix D: Periodic moving average

We define the periodic moving average used in Fig. 9. Given a T-periodic observable X(t) for t∈𝒯, the associated moving average X¯(t) is given by

(D1) $\bar{X}(t) = \dfrac{\sum_{h=-H}^{H} \mathcal{K}\!\left(\frac{h}{H}\right) X(t+h)}{\sum_{h=-H}^{H} \mathcal{K}\!\left(\frac{h}{H}\right)},$

where t ∈ [[1, T]], X(t ± T) = X(t), and 𝒦 is a kernel. In the paper, we choose the window size H = 15 with the Epanechnikov kernel 𝒦(u) = (3/4)(1 − u²) 𝟙_{|u|≤1} and T = 366.
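For illustration, Eq. (D1) can be implemented in a few lines of Julia (a minimal sketch; the function names are ours and not part of the released package):

```julia
epanechnikov(u) = abs(u) ≤ 1 ? 3 / 4 * (1 - u^2) : 0.0

function periodic_moving_average(X::AbstractVector; H::Int = 15, kernel = epanechnikov)
    T = length(X)                         # e.g., T = 366 for a daily climatology
    w = [kernel(h / H) for h in -H:H]     # kernel weights 𝒦(h/H)
    X̄ = zeros(T)
    for t in 1:T
        # periodic indexing implements X(t ± T) = X(t)
        X̄[t] = sum(w[h + H + 1] * X[mod1(t + h, T)] for h in -H:H) / sum(w)
    end
    return X̄
end
```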

Appendix E: Baum–Welch algorithm for seasonal hierarchical HMM

We use the same model 𝒞m>0 as described in Sect. 2 and will describe the inference procedure using the Baum–Welch algorithm for a seasonal hierarchical HMM (SHHMM).

We recall that θ stands for all the SHHMM parameters (ξ, Q_t, f_{k,t,h})_{k∈𝒦, t∈𝒯, h∈ℋ(m)} (see Sect. 2.3). To fit the model, we must find the θ maximizing the observed likelihood,

(E1)
$\begin{aligned}
\mathcal{L}_\theta\bigl(y^{(1:N)}\bigr) &= \mathbb{P}_\theta\bigl(Y^{(1:N)} = y^{(1:N)}\bigr) \\
&= \sum_{z^{(1)},\dots,z^{(N)}} \mathbb{P}_\theta\bigl(Y^{(1:N)} = y^{(1:N)},\, Z^{(1:N)} = z^{(1:N)}\bigr) \\
&= \sum_{z^{(1)},\dots,z^{(N)}} \mathcal{L}_\theta\bigl(y^{(1:N)}, z^{(1:N)}\bigr) \\
&= \sum_{z^{(1)},\dots,z^{(N)}} f_{z^{(N)},t_N}\bigl(y^{(N)} \mid h^{(N)}\bigr)\, \mathbb{P}_\theta\bigl(Y^{(1:N-1)} = y^{(1:N-1)},\, Z^{(1:N)} = z^{(1:N)}\bigr) \\
&= \sum_{z^{(1)},\dots,z^{(N)}} f_{z^{(N)},t_N}\bigl(y^{(N)} \mid h^{(N)}\bigr)\, Q_{z^{(N-1)},z^{(N)}}(t_N)\, \mathbb{P}_\theta\bigl(Y^{(1:N-1)} = y^{(1:N-1)},\, Z^{(1:N-1)} = z^{(1:N-1)}\bigr) \\
&= \sum_{z^{(1)},\dots,z^{(N)}} \xi_{z^{(1)},h^{(1)}}\, f_{z^{(1)},t_1}\bigl(y^{(1)} \mid h^{(1)}\bigr) \prod_{n=2}^{N} Q_{z^{(n-1)},z^{(n)}}(t_n)\, f_{z^{(n)},t_n}\bigl(y^{(n)} \mid h^{(n)}\bigr),
\end{aligned}$

where the indices z^{(n)} ∈ [[1, K]] for all n ∈ 𝒟.

The Baum–Welch algorithm is an iterative expectation–maximization (EM) procedure in which the likelihood increases at each step, i.e., at each step (i) of the algorithm, ℒ_{θ^{(i)}} ≤ ℒ_{θ^{(i+1)}}.

Let us detail the procedure and show that the classical elements of the Baum–Welch proof for a (homogeneous) HMM remain valid for an SHHMM. The first step is to consider the conditional expectation of the complete log-likelihood for parameter θ with respect to the parameter at step (i), θ^{(i)},

(E2)
$\begin{aligned}
\mathcal{R}\bigl(\theta, \theta^{(i)}\bigr) &= \mathbb{E}_{\theta^{(i)}}\Bigl[\log \mathcal{L}\bigl(Y^{(1:N)}, Z^{(1:N)}; \theta\bigr) \mid Y^{(1:N)}\Bigr] \\
&= \sum_{k,l=1}^{K} \sum_{n=1}^{N-1} \pi_{n,n+1\mid N}^{\theta^{(i)}}(k,l)\, \log Q_{k,l}(t_n) + \sum_{k=1}^{K} \sum_{n=1}^{N} \pi_{n\mid N}^{\theta^{(i)}}(k)\, \log f_{k,t_n}\bigl(y^{(n)} \mid h^{(n)}\bigr) + \sum_{k=1}^{K} \pi_{1\mid N}^{\theta^{(i)}}(k)\, \log \xi_k,
\end{aligned}$

where we recall the smoothing probabilities under the current parameter θ(i),

(E3a) $\pi_{n\mid N}^{\theta^{(i)}}(k) = \mathbb{P}_{\theta^{(i)}}\bigl(Z^{(n)} = k \mid Y^{(1:N)}\bigr),\ n \in [1, N]$, (E3b) $\pi_{n,n+1\mid N}^{\theta^{(i)}}(k,l) = \mathbb{P}_{\theta^{(i)}}\bigl(Z^{(n)} = k,\, Z^{(n+1)} = l \mid Y^{(1:N)}\bigr),\ n \in [1, N-1].$

These probabilities can be computed using the forward–backward procedure, which is also valid for periodic hierarchical HMM.

The E and M steps alternate as follows:

  1. Initialization. We initialize the algorithm with an initial HMM of parameters θ(0).

  2. E-step: Compute ℛ(θ,θ(i)), which corresponds here to getting the smoothing probabilities for the current parameter θ(i).

  3. M-step: Maximize ℛ(θ, θ^{(i)}) with respect to θ. Thanks to the sum structure of ℛ, this step can be done independently for each group of parameters θ = (ξ, Q, f). In particular, one can update the distributions f_{k,t}(y^{(n)} | h^{(n)}) independently of the transition matrices. If we did not assume a periodic parametric form for the transition matrices, the maximization over each Q_t could be done analytically and independently.

  4. Steps E and M are repeated iteratively until the observed likelihood has converged to a local maximum (a schematic version of this loop is sketched below).
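Schematically, the iteration can be written as the following generic loop. This is a sketch only; e_step, m_step, and loglikelihood are placeholders for the SHHMM-specific routines (e.g., the forward–backward procedure of Sect. E3 and the periodic-parameter updates), not functions of the released package.

```julia
function em_fit(θ0, Y; maxiter = 500, tol = 1e-6, e_step, m_step, loglikelihood)
    θ = θ0
    ℓ_old = loglikelihood(θ, Y)
    for i in 1:maxiter
        smoothing = e_step(θ, Y)          # E-step: π_{n|N}, π_{n,n+1|N} under the current θ
        θ = m_step(smoothing, Y)          # M-step: update ξ, Q_t and f_{k,t,h} independently
        ℓ_new = loglikelihood(θ, Y)
        ℓ_new - ℓ_old < tol && return θ   # non-decreasing likelihood is guaranteed by Eq. (E6)
        ℓ_old = ℓ_new
    end
    return θ
end
```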

E1 Fundamental inequality of the EM algorithm

To prove that increasing ℛ(θ, θ^{(i)}) also increases the observed likelihood ℒ_θ(Y^{(1:N)}), we first rewrite the observed likelihood as

(E4) $\log \mathcal{L}_\theta\bigl(Y^{(1:N)}\bigr) = \log \mathcal{L}_\theta\bigl(Y^{(1:N)}, Z^{(1:N)}\bigr) - \log \mathcal{L}_\theta\bigl(Z^{(1:N)} \mid Y^{(1:N)}\bigr).$

Taking the conditional expectation of log ℒ_θ with respect to the current parameter θ^{(i)}, for all θ and θ^{(i)}, gives

(E5)
$\begin{aligned}
\mathbb{E}_{\theta^{(i)}}\Bigl[\log \mathcal{L}_\theta\bigl(Y^{(1:N)}\bigr) \mid Y^{(1:N)}\Bigr] &= \log \mathcal{L}_\theta\bigl(Y^{(1:N)}\bigr) \\
&= \mathbb{E}_{\theta^{(i)}}\Bigl[\log \mathcal{L}_\theta\bigl(Y^{(1:N)}, Z^{(1:N)}\bigr) - \log \mathcal{L}_\theta\bigl(Z^{(1:N)} \mid Y^{(1:N)}\bigr) \mid Y^{(1:N)}\Bigr] \\
&= \mathcal{R}\bigl(\theta, \theta^{(i)}\bigr) - \sum_{z^{(1:N)}} \mathbb{P}_{\theta^{(i)}}\bigl(Z^{(1:N)} = z^{(1:N)} \mid Y^{(1:N)}\bigr)\, \log \mathbb{P}_\theta\bigl(Z^{(1:N)} = z^{(1:N)} \mid Y^{(1:N)}\bigr) \\
&= \mathcal{R}\bigl(\theta, \theta^{(i)}\bigr) + \tilde{\mathcal{R}}\bigl(\theta, \theta^{(i)}\bigr).
\end{aligned}$

The Gibbs inequality ensures that the second term satisfies ℛ̃(θ, θ^{(i)}) ≥ ℛ̃(θ^{(i)}, θ^{(i)}), so that we obtain

(E6) $\log \mathcal{L}_\theta\bigl(Y^{(1:N)}\bigr) - \log \mathcal{L}_{\theta^{(i)}}\bigl(Y^{(1:N)}\bigr) \geq \mathcal{R}\bigl(\theta, \theta^{(i)}\bigr) - \mathcal{R}\bigl(\theta^{(i)}, \theta^{(i)}\bigr).$

Hence, when we maximize (or increase) ℛ(θ,θ(i)) with respect to θ, we also increase the observed log-likelihood.

E2 Smoothing and filtering probabilities

The smoothing probabilities can be expressed as

(E7)
$\begin{aligned}
\pi_{n\mid N}(k) &= \mathbb{P}_\theta\bigl(Z^{(n)} = k \mid Y^{(1:N)} = y^{(1:N)}\bigr) = \frac{\mathbb{P}_\theta\bigl(Z^{(n)} = k,\, Y^{(1:N)} = y^{(1:N)}\bigr)}{\mathbb{P}_\theta\bigl(Y^{(1:N)} = y^{(1:N)}\bigr)} \\
&= \frac{\mathbb{P}_\theta\bigl(Z^{(n)} = k,\, Y^{(1:n)} = y^{(1:n)}\bigr)\, \mathbb{P}_\theta\bigl(Y^{(n+1:N)} = y^{(n+1:N)} \mid Z^{(n)} = k,\, Y^{(1:n)} = y^{(1:n)}\bigr)}{\mathbb{P}_\theta\bigl(Y^{(1:N)} = y^{(1:N)}\bigr)} \\
&= \frac{\alpha_n(k)\, \beta_n(k)}{\sum_{l=1}^{K} \alpha_n(l)\, \beta_n(l)},
\end{aligned}$

with

(E8a) $\alpha_n(k) = \mathbb{P}_\theta\bigl(Z^{(n)} = k,\, Y^{(1:n)} = y^{(1:n)}\bigr)$, (E8b) $\beta_n(k) = \mathbb{P}_\theta\bigl(Y^{(n+1:N)} = y^{(n+1:N)} \mid Z^{(n)} = k,\, Y^{(1:n)} = y^{(1:n)}\bigr) = \mathbb{P}_\theta\bigl(Y^{(n+1:N)} = y^{(n+1:N)} \mid Z^{(n)} = k,\, Y^{(n-m+1:n)} = y^{(n-m+1:n)}\bigr).$

Similarly,

(E9) $\pi_{n,n+1\mid N}(k,l) = \mathbb{P}_\theta\bigl(Z^{(n)} = k,\, Z^{(n+1)} = l \mid Y^{(1:N)} = y^{(1:N)}\bigr)$ (E10) $= \dfrac{\alpha_n(k)\, \beta_{n+1}(l)\, f_{l,t_{n+1}}\bigl(y^{(n+1)} \mid h^{(n+1)}\bigr)\, Q_{k,l}(t_n)}{\mathbb{P}_\theta\bigl(Y^{(1:N)} = y^{(1:N)}\bigr)}.$

E3 Forward–backward procedure

The forward α and backward β variables are computed iteratively.

(E11a) $\alpha_1(k) = f_{k,t_1}\bigl(y^{(1)} \mid h^{(1)}\bigr)\, \xi_k, \qquad \alpha_n(k) = f_{k,t_n}\bigl(y^{(n)} \mid h^{(n)}\bigr) \sum_{l=1}^{K} Q_{l,k}(t_{n-1})\, \alpha_{n-1}(l), \quad \text{for } 1 < n \leq N,$
(E11b) $\beta_N(k) = 1, \qquad \beta_n(k) = \sum_{l=1}^{K} f_{l,t_{n+1}}\bigl(y^{(n+1)} \mid h^{(n+1)}\bigr)\, Q_{k,l}(t_n)\, \beta_{n+1}(l), \quad \text{for } 1 \leq n < N.$
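A direct transcription of the recursions (E11a)–(E11b) and of the smoothing probabilities (E7) could look as follows. This is a minimal sketch with illustrative data structures: f[n] stands for the vector (f_{k,t_n}(y^{(n)} | h^{(n)}))_k and Q[n] for the K × K matrix Q(t_n) governing the transition from day n to day n+1. In practice, the quantities are rescaled or computed in log space to avoid numerical underflow.

```julia
function forward_backward(ξ::Vector{Float64}, Q::Vector{Matrix{Float64}},
                          f::Vector{Vector{Float64}})
    N, K = length(f), length(ξ)
    α = [zeros(K) for _ in 1:N]
    β = [ones(K) for _ in 1:N]
    α[1] = f[1] .* ξ                          # (E11a), n = 1
    for n in 2:N
        α[n] = f[n] .* (Q[n-1]' * α[n-1])     # α_n(k) = f_{k,t_n} Σ_l Q_{l,k}(t_{n-1}) α_{n-1}(l)
    end
    for n in (N-1):-1:1
        β[n] = Q[n] * (f[n+1] .* β[n+1])      # (E11b)
    end
    # smoothing probabilities π_{n|N}(k) of Eq. (E7)
    π_smooth = [α[n] .* β[n] ./ sum(α[n] .* β[n]) for n in 1:N]
    return α, β, π_smooth
end
```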
Appendix F: Initialization of the HMM fitting: the slice estimate algorithm

In Sect. 3.2, we use the slice estimate to initialize the Baum–Welch algorithm with parameters θ(0). We detail in this Appendix the inference of this slice estimate. Indeed, a random choice of θ(0) in the Baum–Welch algorithm could lead to bad local maxima or longer convergence time.

F1 The EM algorithm for the distribution of the observations

The 64 years of data provide, for each day t, a sample of size 64, considered independent and identically distributed. Hence, for each t ∈ 𝒯, independently of the others, we use a standard EM algorithm to fit the distributions {f_{1,t}, …, f_{K,t}}. For a given date t, e.g., 28 February, the sample consists of all 28 February observations in the dataset, i.e., from 1956 to 2019. The ensemble of dates n corresponding to the same calendar day t is denoted 𝒩_t. To enrich each of these small datasets, we add the observations of the days t ± 6 and t ± 12 to each 𝒩_t (with periodicity T = 366). These additional days should come from distributions very similar to that of date t, as implied by the smoothness assumption (see Sect. 2.4), but are far enough apart to be considered independent samples. For our dataset, each day t thus gathers all dates n with associated t_n ∈ {t, t ± 6, t ± 12}, which gives |𝒩_t^+| = 320 samples for each day t (see footnote 2), where 𝒩_t^+ denotes the enriched dataset.
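For illustration, the enriched index set 𝒩_t^+ can be built as follows (a minimal sketch with hypothetical names; dayofyear is assumed to map each date n to its calendar day t_n):

```julia
function enriched_dates(dayofyear::Vector{Int}, t::Int; T = 366,
                        offsets = (-12, -6, 0, 6, 12))
    # calendar days contributing to N_t^+, with the period T = 366 wrapped around
    targets = Set(mod1.(t .+ collect(offsets), T))
    return [n for n in eachindex(dayofyear) if dayofyear[n] in targets]
end
```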

On a day t, the mixture probability of an observation vector y = (y_1, …, y_S) with history h = (h_1, …, h_S) is written as

$\begin{aligned}
f_t(y \mid h) &= \mathbb{P}\bigl(Y^{(n)} = y \mid H^{(n)} = h\bigr) = \sum_{k=1}^{K} \mathbb{P}\bigl(Z^{(n)} = k\bigr)\, \mathbb{P}\bigl(Y^{(n)} = y \mid H^{(n)} = h,\, Z^{(n)} = k\bigr) \\
&= \sum_{k=1}^{K} \pi_{k,t} \prod_{s=1}^{S} \mathbb{P}\bigl(Y_s^{(n)} = y_s \mid H_s^{(n)} = h_s,\, Z^{(n)} = k\bigr) = \sum_{k=1}^{K} \pi_{k,t} \prod_{s=1}^{S} f_{k,t,s}(y_s \mid h_s),
\end{aligned}$

where we use conditional independence (see Sect. 2.3) and denote the weight ℙ(Z^{(n)} = k) by π_{k,t}. The parameters to fit are the mixture weights π_{k,t} and the Bernoulli parameters λ_{k,t,h,s} for k ∈ 𝒦, s ∈ 𝒮, and all possible histories h. We denote the estimated parameters with a hat and a tilde, i.e., π̂_{k,t} and θ̃_{k,t,h,s}.

F2 Algorithm

The expectation (E) and maximization (M) steps of the algorithm are standard. The mixture to fit is composed of products of Bernoulli distributions given the history vector h. Such mixtures appear frequently in classification problems, for example in digit recognition (Bishop, 2006, Sect. 9.3.3).
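A minimal sketch of such an EM step, written here for the memoryless case 𝒞_0 (products of independent Bernoullis over the S stations, without history conditioning), could look as follows; all names are illustrative, Y is the |𝒩_t^+| × S binary occurrence matrix for a given calendar day t, and the actual package implementation may differ:

```julia
function bernoulli_mixture_em(Y::AbstractMatrix{Bool}, K::Int; iters = 200)
    n, S = size(Y)
    p = fill(1 / K, K)                  # mixture weights π_{k,t}
    λ = 0.1 .+ 0.8 .* rand(K, S)        # Bernoulli rain probabilities λ_{k,t,s}
    γ = zeros(n, K)
    for _ in 1:iters
        # E-step: responsibilities γ[i,k] ∝ p_k Π_s λ_{k,s}^{y_is} (1 - λ_{k,s})^{1 - y_is}
        for i in 1:n, k in 1:K
            γ[i, k] = p[k] * prod(Y[i, s] ? λ[k, s] : 1 - λ[k, s] for s in 1:S)
        end
        γ ./= sum(γ, dims = 2)
        # M-step: closed-form updates of the weights and the Bernoulli parameters
        Nk = vec(sum(γ, dims = 1))
        p = Nk ./ n
        λ = (γ' * Y) ./ Nk              # λ[k,s] = Σ_i γ[i,k] y_is / Σ_i γ[i,k]
    end
    return p, λ, γ
end
```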

F3 Random initialization

We choose 10 random initial parameters (π_{k,t}^{(0)}, λ_{k,t,h,s}^{(0)}), run the algorithm, and select the converged point with the largest observed likelihood, defined as

(F1) $\mathcal{L}_{\text{slice}}\bigl(y \mid h;\, \tilde{\theta}_{k,t,s,h}, \hat{\pi}_{k,t}\bigr) = \log \prod_{n \in \mathcal{N}_t^+} \mathbb{P}\bigl(Y^{(n)} = y^{(n)} \mid H^{(n)} = h^{(n)}\bigr)$ (F2) $= \sum_{n \in \mathcal{N}_t^+} \log \sum_{k=1}^{K} \hat{\pi}_{k,t} \prod_{s=1}^{S} \tilde{f}_{k,t,s}\bigl(y_s^{(n)} \mid h_s^{(n)}\bigr),$

where f̃k,t,s denotes the distribution with the estimated parameters θ̃k,t,s,h.

F4 Ordering the hidden states

A mixture distribution is identifiable only up to relabeling of its components, meaning the mixture defined by (π_k, λ_{k,t,h,s}) cannot be distinguished from the mixture (π_{σ(k)}, λ_{σ(k),t,h,s}), where σ is a permutation of 𝒦. In our case, we need to ensure that the parameters evolve coherently with t so that each label k always refers to the same hidden state. To do so, we select one reference station, Bourges, and for all t ∈ 𝒯 relabel as follows:

  • Model 𝒞_0. Sort the probabilities of rain for k ∈ {1, …, K} at the reference station Bourges in decreasing order:

    θ̃_{k=(1),t,s=Bourges} > θ̃_{k=(2),t,s=Bourges} > … > θ̃_{k=(K),t,s=Bourges}.
  • Model 𝒞_m. Sort the probabilities of rain for k ∈ {1, …, K}, conditional on the driest history variable h_dry = (dry, …, dry), at the reference station Bourges in decreasing order:

    θ̃_{k=(1),t,s=Bourges,h_dry} > θ̃_{k=(2),t,s=Bourges,h_dry} > … > θ̃_{k=(K),t,s=Bourges,h_dry}.

This sorting provides a natural interpretation of each hidden state: k = (1) corresponds to a "rainy" climate where the probability of rain is the largest in Bourges and, hopefully, in the rest of the métropole (continental France). The state k = (K) corresponds to a "dry" climate where the probability of no rain is the largest.

The choice of Bourges to extract the hidden variable is heuristically justified by the fact that this station is located roughly at the center of the geographic area under study and that its parameters θ̃_{k,t,s=Bourges,h_dry} are well separated for different k.
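In code, the relabeling amounts to a simple sort (a sketch for the memoryless case with hypothetical names; s_ref is the column index of the reference station):

```julia
function relabel_states(p::Vector{Float64}, λ::Matrix{Float64}, s_ref::Int)
    σ = sortperm(λ[:, s_ref]; rev = true)   # k = (1) becomes the rainiest state at s_ref
    return p[σ], λ[σ, :]
end
```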


Figure F1. Relative improvement (L_slice − L_rand)/|L_slice| of the final log-likelihood obtained with slice vs. random initialization. The slice estimate always gives a better or equal log-likelihood.


F5 Transition matrices

To finish the SHHMM inference, we estimate the transition matrices Q(t). To do so, we first infer the filtered probabilities of the hidden states given the model and the observations, using f̃_{k,t} and π̃_{k,t},

(F3) $\gamma_k^{(n)} = \mathbb{P}\bigl(Z^{(n)} = k \mid Y^{(n)} = y^{(n)},\, H^{(n)} = h^{(n)}\bigr) = \dfrac{\tilde{\pi}_{k,t} \prod_{s=1}^{S} \tilde{f}_{k,t,s}\bigl(y_s^{(n)} \mid h_s^{(n)}\bigr)}{\sum_{l=1}^{K} \tilde{\pi}_{l,t} \prod_{s=1}^{S} \tilde{f}_{l,t,s}\bigl(y_s^{(n)} \mid h_s^{(n)}\bigr)}.$

The maximum a posteriori estimator is then

(F4) $\tilde{z}^{(n)} = \underset{k \in \{1,\dots,K\}}{\operatorname{argmax}}\ \gamma_k^{(n)}.$

This yields the sequence of hidden states {z̃^{(n)} : n ∈ 𝒟}. The transition matrices can then be estimated by counting the number of transitions, on a day t, from a state k to a state l, divided by the total number of transitions from k,

(F5) $\tilde{Q}_{kl}(t) = \dfrac{\sum_{n \in \mathcal{N}_t} \mathbb{1}_{\{\tilde{z}^{(n)} = k,\, \tilde{z}^{(n+1)} = l\}}}{\sum_{n \in \mathcal{N}_t} \sum_{l'=1}^{K} \mathbb{1}_{\{\tilde{z}^{(n)} = k,\, \tilde{z}^{(n+1)} = l'\}}}.$
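A sketch of Eq. (F5), with illustrative names, could be as follows; z is the MAP sequence z̃ and Nt the index set 𝒩_t of the calendar day t:

```julia
function empirical_transition(z::Vector{Int}, Nt::Vector{Int}, K::Int)
    counts = zeros(K, K)
    for n in Nt
        n + 1 ≤ length(z) || continue        # skip the last date of the record
        counts[z[n], z[n+1]] += 1
    end
    # row normalization gives Q̃_{kl}(t); rows with no observed transition need a convention
    return counts ./ sum(counts, dims = 2)
end
```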

F6 Multiple random initialization

To prevent the EM procedure from converging to a poor local maximum, we run the algorithm 10 times with noise added around the initial state. Each coefficient c ∈ θ^{(slice)} is randomized as c_rand = c(1 + σZ), where we take σ = 0.5 and Z ∼ 𝒩(0, 1).
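In code, this perturbation is simply (a one-line sketch with an illustrative name):

```julia
perturb(c; σ = 0.5) = c * (1 + σ * randn())   # c_rand = c (1 + σ Z), Z ~ N(0, 1)
```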

F7 Slice estimate initialization vs. naive random initialization

Here we show the improvement given by the slice estimate compared to pure random initialization. The log-likelihood obtained with this initialization is compared with 10 purely random initializations, where all coefficients are drawn as β_rand ∼ 𝒩(0, 0.5). The relative improvement is plotted in Fig. F1. It is greater than 10⁻⁴ for 21 of the 36 models shown here and equal for the other cases, including the small models with K = 2 and 3, where inference is easier. Note that even an improvement of a percent can lead to quite different models, in particular regarding the interpretability of the hidden states. This can be seen in the model selection in Fig. 4, where the difference between models is typically of the order of a percent or less. Furthermore, the number of steps i_stop needed for Algorithm 1 to converge is smaller in most cases; e.g., for 3 ≤ K ≤ 6 and for the best selected models, the mean of i_stop^(rand) − i_stop^(slice) ≃ 226.

Code availability

The programming language used to develop the model and produce the results of the paper is Julia (Bezanson et al.2017). The code to build, fit, and use the SHHMM and WGEN models is available as a Julia package: StochasticWeatherGenerators.jl (Métivier2024). The package was constructed with an emphasis on computational efficiency and ease of use. It is maintained by one of the authors of this paper to facilitate ease of use by others. To perform the numerical expectation and maximization steps of the Baum–Welch algorithm, the JuMP.jl (Lubin et al.2023) modeling framework was utilized alongside the Ipopt (Wächter and Biegler2006) solver. Figures 1, 5, 6, and C2 were made using GeoMakie.jl (Danisch and Krumbiegel2021), and all other plots were made with Plots.jl (Christ et al.2023). The other main packages used in this work are Distributions.jl (Besançon et al.2021), Copulas.jl (Laverny and Jimenez2024), and DataFrames.jl (Bouchet-Valat and Kamiński2023).

The ECA&D rain data from the S = 10 weather stations are directly available in the GitHub repository of the package. A tutorial built through continuous integration – ensuring compatibility and reproducibility – allows one to reproduce, step by step, most of the figures of this work. The complete tutorial, including loading the packages, downloading the station data, training, simulating, and generating and saving the figures, takes only ≃11 min to execute (for J = 10³) on a regular laptop. Note that the package offers an option to use parallel (distributed) computing during the training phase, significantly speeding up computations. For the sake of storage space, the datasets extracted from DRIAS and ERA5 are not included in the package repository, and the associated figures are therefore not reproduced. Figures 4 and F1, which take the longest to compute as they require training multiple models, are also not included in the tutorial.

Data availability

The data for the weather stations are extracted from the European Climate Assessment & Dataset (Klein Tank, 2002). In Sect. 8, the data from the climate models and scenarios were requested from the French DRIAS portal (Soubeyroux et al., 2021). The ERA5 reanalysis hourly data on single levels (Hersbach et al., 2020) for the 1979–2017 dataset used in Fig. 6 were requested from the Copernicus website at https://cds-beta.climate.copernicus.eu/datasets/reanalysis-era5-single-levels?tab=download (Hersbach et al., 2018). The data were downloaded at the end of 2021.

Author contributions

DM and EG developed the mathematical model with domain knowledge from SP. DM wrote the paper with feedback and contributions from all the authors. The application and visualization were performed by DM with suggestions from SP for the interpretation and EG for the statistical analysis. The code and software for StochasticWeatherGenerators.jl were developed by DM.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors thank Valérie Monbet and Mathieu Vrac for the feedback on an early version of this work, as well as Marc Lavielle for useful discussions.

Financial support

This research has been supported by the Fondation de l'École Polytechnique (Chaire Stress Test: Risk management and Financial Steering (BNP Paribas, École polytechnique)) and the EDF Energy Research and Development (Chaire Énergies Durables (CEA-EDF-École polytechnique)).

Review statement

This paper was edited by Christopher Paciorek and reviewed by Oliver Stoner and one anonymous referee.

References

Ailliot, P. and Monbet, V.: Markov-Switching Autoregressive Models for Wind Time Series, Environ. Model. Softw., 30, 92–101, https://doi.org/10.1016/j.envsoft.2011.10.011, 2012. a, b

Ailliot, P., Thompson, C., and Thomson, P.: Space–Time Modelling of Precipitation by Using a Hidden Markov Model and Censored Gaussian Distributions, J. Roy. Stat. Soc. Ser. C, 58, 405–426, 2009. a, b, c, d

Ailliot, P., Allard, D., Monbet, V., and Naveau, P.: Stochastic weather generators: an overview of weather type models, Journal de la société française de statistique, 156, 101–113, 2015a. a

Ailliot, P., Bessac, J., Monbet, V., and Pène, F.: Non-Homogeneous Hidden Markov-Switching Models for Wind Time Series, J. Stat. Plan. Infer., 160, 75–88, https://doi.org/10.1016/j.jspi.2014.12.005, 2015b. a, b

Ailliot, P., Boutigny, M., Koutroulis, E., Malisovas, A., and Monbet, V.: Stochastic Weather Generator for the Design and Reliability Evaluation of Desalination Systems with Renewable Energy Sources, Renew. Energ., 158, 541–553, https://doi.org/10.1016/j.renene.2020.05.076, 2020. a

Allman, E. S., Matias, C., and Rhodes, J. A.: Identifiability of Parameters in Latent Structure Models with Many Observed Variables, Ann. Stat., 37, 3099–3132, 2009. a, b

Allouche, M., Girard, S., and Gobet, E.: EV-GAN: Simulation of Extreme Events with ReLU Neural Networks, J. Mach. Learn. Res., 23, 1–39, 2022. a

Arias, P., Bellouin, N., Coppola, E., Jones, R., Krinner, G., Marotzke, J., Naik, V., Palmer, M., Plattner, G.-K., Rogelj, J., Rojas, M., Sillmann, J., Storelvmo, T., Thorne, P., Trewin, B., Achuta Rao, K., Adhikary, B., Allan, R., Armour, K., Bala, G., Barimalala, R., Berger, S., Canadell, J., Cassou, C., Cherchi, A., Collins, W., Collins, W., Connors, S., Corti, S., Cruz, F., Dentener, F., Dereczynski, C., Di Luca, A., Diongue Niang, A., Doblas-Reyes, F., Dosio, A., Douville, H., Engelbrecht, F., Eyring, V., Fischer, E., Forster, P., Fox-Kemper, B., Fuglestvedt, J., Fyfe, J., Gillett, N., Goldfarb, L., Gorodetskaya, I., Gutierrez, J., Hamdi, R., Hawkins, E., Hewitt, H., Hope, P., Islam, A., Jones, C., Kaufman, D., Kopp, R., Kosaka, Y., Kossin, J., Krakovska, S., Lee, J.-Y., Li, J., Mauritsen, T., Maycock, T., Meinshausen, M., Min, S.-K., Monteiro, P., Ngo-Duc, T., Otto, F., Pinto, I., Pirani, A., Raghavan, K., Ranasinghe, R., Ruane, A., Ruiz, L., Sallée, J.-B., Samset, B., Sathyendranath, S., Seneviratne, S., Sörensson, A., Szopa, S., Takayabu, I., Tréguier, A.-M., van den Hurk, B., Vautard, R., von Schuckmann, K., Zaehle, S., Zhang, X., and Zickfeld, K.: Technical Summary, in: Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, edited by: Masson-Delmotte, V., Zhai, P., Pirani, A., Connors, S., Péan, C., Berger, S., Caud, N., Chen, Y., Goldfarb, L., Gomis, M., Huang, M., Leitzell, K., Lonnoy, E., Matthews, J., Maycock, T., Waterfield, T., Yelekçi, O., Yu, R., and Zhou, B., Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, 33–144, https://doi.org/10.1017/9781009157896, 2021. a, b

Bardossy, A. and Plate, E. J.: Space-Time Model for Daily Rainfall Using Atmospheric Circulation Patterns, Water Resour. Res., 28, 1247–1259, https://doi.org/10.1029/91WR02589, 1992. a

Baxevani, A. and Lennartsson, J.: A Spatiotemporal Precipitation Generator Based on a Censored Latent Gaussian Field, Water Resour. Res., 51, 4338–4358, 2015. a, b, c, d, e

Beck, C., Philipp, A., and Streicher, F.: The Effect of Domain Size on the Relationship between Circulation Type Classifications and Surface Climate, Int. J. Climatol., 36, 2692–2709, https://doi.org/10.1002/joc.3688, 2016. a

Bellone, E., Hughes, J. P., and Guttorp, P.: A Hidden Markov Model for Downscaling Synoptic Atmospheric Patterns to Precipitation Amounts, Clim. Res., 15, 1–12, https://doi.org/10.3354/cr015001, 2000. a, b, c

Bennett, B., Thyer, M., Leonard, M., Lambert, M., and Bates, B.: A Comprehensive and Systematic Evaluation Framework for a Parsimonious Daily Rainfall Field Model, J. Hydrol., 556, 1123–1138, https://doi.org/10.1016/j.jhydrol.2016.12.043, 2018. a, b, c

Benoit, L., Allard, D., and Mariethoz, G.: Stochastic Rainfall Modeling at Sub-kilometer Scale, Water Resour. Res., 54, 4108–4130, 2018. a, b, c, d, e

Besançon, M., Papamarkou, T., Anthoff, D., Arslan, A., Byrne, S., Lin, D., and Pearson, J.: Distributions.Jl: Definition and Modeling of Probability Distributions in the JuliaStats Ecosystem, J. Stat. Softw., 98, 1–30, https://doi.org/10.18637/jss.v098.i16, 2021. a

Bezanson, J., Edelman, A., Karpinski, S., and Shah, V. B.: Julia: A Fresh Approach to Numerical Computing, SIAM Review, 59, 65–98, https://doi.org/10.1137/141000671, 2017. a

Bishop, C. M.: Pattern Recognition and Machine Learning, Information science and statistics, Springer, New York, ISBN 978-0-387-31073-2, 2006. a, b

Boé, J. and Terray, L.: A Weather-Type Approach to Analyzing Winter Precipitation in France: Twentieth-Century Trends and the Role of Anthropogenic Forcing, J. Climate, 21, 3118–3133, https://doi.org/10.1175/2007JCLI1796.1, 2008. a, b, c

Boé, J., Somot, S., Corre, L., and Nabat, P.: Large Discrepancies in Summer Climate Change over Europe as Projected by Global and Regional Climate Models: Causes and Consequences, Clim. Dynam., 54, 2981–3002, 2020. a

Bouchet-Valat, M. and Kamiński, B.: DataFrames.Jl: Flexible and Fast Tabular Data in Julia, J. Stat. Softw., 107, 1–32, https://doi.org/10.18637/jss.v107.i04, 2023. a

Cappé, O., Moulines, E., and Rydén, T.: Inference in hidden Markov models, Springer Series in Statistics, Springer, New York, https://doi.org/10.1007/0-387-28982-8, 2005. a, b, c, d, e

Cassou, C.: Du changement climatique aux régimes de temps : l'oscillation nord-atlantique [prix Prud'homme 2002], La Météorologie, 2004, 21–32, https://doi.org/10.4267/2042/36039, 2004. a, b, c

Celeux, G. and Durand, J.-B.: Selecting hidden Markov model state number with cross-validated likelihood, Comput. Stat., 23, 541–564, https://doi.org/10.1007/s00180-007-0097-1, 2008. a

Chandler, R. E.: Multisite, Multivariate Weather Generation Based on Generalised Linear Models, Environ. Model. Softw., 134, 104 867, https://doi.org/10.1016/j.envsoft.2020.104867, 2020. a

Chen, J. and Brissette, F. P.: Stochastic Generation of Daily Precipitation Amounts: Review and Evaluation of Different Models, Climate Res., 59, 189–206, 2014. a, b, c

Christ, S., Schwabeneder, D., Rackauckas, C., Borregaard, M. K., and Breloff, T.: Plots.Jl – A User Extendable Plotting API for the Julia Programming Language, J. Open Res. Softw., 11, https://doi.org/10.5334/jors.431, 2023. a

Christophe, P. and Pompili, B.: Rapport fait au nom de la commission d'enquête sur “la sûreté et la sécurité des installations nucléaires”, Assemblée Nationale, 1122, http://www.assemblee-nationale.fr/15/rap-enq/r1122-tI.asp (last access: 28 August 2025), 2018. a

Cognot, C., Bel, L., Métivier, D., and Parey, S.: A spatio-temporal weather generator for the temperature over France, Adv. Stat. Clim. Meteorol. Oceanogr., in preparation, 2025. a

Cowpertwait, P., Isham, V., and Onof, C.: Point Process Models of Rainfall: Developments for Fine-Scale Structure, P. Roy. Soc. A-Math., 463, 2569–2587, https://doi.org/10.1098/rspa.2007.1889, 2007. a

Danisch, S. and Krumbiegel, J.: Makie.Jl: Flexible High-Performance Data Visualization for Julia, J. Open Source Softw., 6, 3349, https://doi.org/10.21105/joss.03349, 2021. a

Dawkins, L. C., Osborne, J. M., Economou, T., Darch, G. J. C., and Stoner, O. R.: The Advanced Meteorology Explorer: A Novel Stochastic, Gridded Daily Rainfall Generator, J. Hydrol., 607, 127478, https://doi.org/10.1016/j.jhydrol.2022.127478, 2022. a, b, c, d, e

de Chaumaray, M. D. R., Kolei, S. E., Etienne, M.-P., and Marbac, M.: Estimation of the Order of Non-Parametric Hidden Markov Models Using the Singular Values of an Integral Operator, Journal of Machine Learning Research, 25, 1–37, 2023. a, b, c

Diaconis, P. and Freedman, D.: Finite Exchangeable Sequences, Ann. Probab., 8, 745–764, https://doi.org/10.1214/aop/1176994663, 1980. a

Dueben, P. D. and Bauer, P.: Challenges and design choices for global weather and climate models based on machine learning, Geosci. Model Dev., 11, 3999–4009, https://doi.org/10.5194/gmd-11-3999-2018, 2018. a

Dunn, P. K.: Occurrence and Quantity of Precipitation Can Be Modelled Simultaneously, Int. J. Climatol., 24, 1231–1239, https://doi.org/10.1002/joc.1063, 2004. a

Evin, G., Blanchet, J., Paquet, E., Garavaglia, F., and Penot, D.: A Regional Model for Extreme Rainfall Based on Weather Patterns Subsampling, J. Hydrol., 541, 1185–1198, 2016. a

Evin, G., Favre, A.-C., and Hingray, B.: Stochastic generation of multi-site daily precipitation focusing on extreme events, Hydrol. Earth Syst. Sci., 22, 655–672, https://doi.org/10.5194/hess-22-655-2018, 2018. a, b, c, d

Fang, H.-B., Fang, K.-T., and Kotz, S.: The Meta-elliptical Distributions with Given Marginals, J. Multivariate Anal., 82, 1–16, 2002. a

Feldmann, C. C., Mews, S., Coculla, A., Stanewsky, R., and Langrock, R.: Flexible Modelling of Diel and Other Periodic Variation in Hidden Markov Models, J. Stat. Theor. Pract., 17, 45, https://doi.org/10.1007/s42519-023-00342-7, 2023. a

Fischer, E. M., Beyerle, U., Bloin-Wibe, L., Gessner, C., Humphrey, V., Lehner, F., Pendergrass, A. G., Sippel, S., Zeder, J., and Knutti, R.: Storylines for Unprecedented Heatwaves Based on Ensemble Boosting, Nat. Commun., 14, 4643, https://doi.org/10.1038/s41467-023-40112-4, 2023. a

Flecher, C., Naveau, P., Allard, D., and Brisson, N.: A Stochastic Daily Weather Generator for Skewed Data, Water Resour. Res., 46, W07519, https://doi.org/10.1029/2009WR008098, 2010. a

Garavaglia, F., Gailhard, J., Paquet, E., Lang, M., Garçon, R., and Bernardara, P.: Introducing a rainfall compound distribution model based on weather patterns sub-sampling, Hydrol. Earth Syst. Sci., 14, 951–964, https://doi.org/10.5194/hess-14-951-2010, 2010. a, b, c

Ghassempour, S., Girosi, F., and Maeder, A.: Clustering Multivariate Time Series Using Hidden Markov Models, Int. J. Environ. Res. Pub. He., 11, 2741–2763, https://doi.org/10.3390/ijerph110302741, 2014. a

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y.: Generative adversarial nets, Advances in Neural Information Processing Systems, Curran Associates, Inc., 27, 2014. a

Greene, A. M., Robertson, A. W., Smyth, P., and Triglia, S.: Downscaling Projections of Indian Monsoon Rainfall Using a Non-Homogeneous Hidden Markov Model, Q. J. Roy. Meteor. Soc., 137, 347–359, https://doi.org/10.1002/qj.788, 2011. a, b, c

Gutiérrez, J. M., Maraun, D., Widmann, M., Huth, R., Hertig, E., Benestad, R., Roessler, O., Wibig, J., Wilcke, R., Kotlarski, S., San Martín, D., Herrera, S., Bedia, J., Casanueva, A., Manzanas, R., Iturbide, M., Vrac, M., Dubrovsky, M., Ribalaygua, J., Pórtoles, J., Räty, O., Räisänen, J., Hingray, B., Raynaud, D., Casado, M. J., Ramos, P., Zerenner, T., Turco, M., Bosshard, T., Štěpánek, P., Bartholy, J., Pongracz, R., Keller, D. E., Fischer, A. M., Cardoso, R. M., Soares, P. M. M., Czernecki, B., and Pagé, C.: An Intercomparison of a Large Ensemble of Statistical Downscaling Methods over Europe: Results from the VALUE Perfect Predictor Cross-Validation Experiment, Int. J. Climatol., 39, 3750–3785, https://doi.org/10.1002/joc.5462, 2019. a

Gyllenberg, M., Koski, T., Reilink, E., and Verlaan, M.: Non-Uniqueness in Probabilistic Numerical Identification of Bacteria, J. Appl. Probab., 31, 542–548, 1994. a

Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 hourly data on single levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS) [data set], https://doi.org/10.24381/cds.adbb2d47, 2018. a

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 Global Reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, 2020. a, b, c

Holsclaw, T., Greene, A. M., Robertson, A. W., and Smyth, P.: A Bayesian Hidden Markov Model of Daily Precipitation over South and East Asia, J. Hydrometeorol., 17, 3–25, https://doi.org/10.1175/JHM-D-14-0142.1, 2016. a, b, c, d, e, f, g

Hughes, J. P. and Guttorp, P.: A Class of Stochastic Models for Relating Synoptic Atmospheric Patterns to Regional Hydrologic Phenomena, Water Resour. Res., 30, 1535–1546, https://doi.org/10.1029/93WR02983, 1994a. a, b

Hughes, J. P. and Guttorp, P.: Incorporating Spatial Dependence and Atmospheric Data in a Model of Precipitation, J. Appl. Meteorol. Clim., 33, 1503–1515, https://doi.org/10.1175/1520-0450(1994)033<1503:ISDAAD>2.0.CO;2, 1994b. a, b, c, d, e, f, g

Hughes, J. P., Guttorp, P., and Charles, S. P.: A Non-Homogeneous Hidden Markov Model for Precipitation Occurrence, J. Roy. Stat. Soc. Ser. Cs, 48, 15–30, https://doi.org/10.1111/1467-9876.00136, 1999. a

Huth, R., Beck, C., and Kučerová, M.: Synoptic-Climatological Evaluation of the Classifications of Atmospheric Circulation Patterns over Europe, Int. J. Climatol., 36, 2710–2726, https://doi.org/10.1002/joc.4546, 2016. a

International Energy Agency: Climate Resilience for Energy Security, Tech. rep., IEA, Paris, 2022. a

Katz, R. W.: On Some Criteria for Estimating the Order of a Markov Chain, Technometrics, 23, 243–249, https://doi.org/10.2307/1267787, 1981. a

Kim, Y., Wiseman, S., and Rush, A. M.: A Tutorial on Deep Latent Variable Models of Natural Language, arXiv [preprint], https://doi.org/10.48550/arXiv.1812.06834, 2019. a

Kirshner, S.: Modeling of Multivariate Time Series Using Hidden Markov Models, Ph.D. thesis, California State University at Long Beach, USA, ISBN 0496986732, 2005. a, b, c, d, e

Kirshner, S., Smyth, P., and Robertson, A. W.: Conditional Chow-Liu Tree Structures for Modeling Discrete-Valued Vector Time Series, in: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI '04, AUAI Press, Arlington, Virginia, USA, 317–324, ISBN 978-0-9749039-0-3, 2004. a, b, c

Kleiber, W., Katz, R. W., and Rajagopalan, B.: Daily Spatiotemporal Precipitation Simulation Using Latent and Transformed Gaussian Processes, Water Resour. Res., 48, W01523, https://doi.org/10.1029/2011WR011105, 2012. a

Klein Tank, A. a. C.: Daily dataset of 20th-century surface air temperature and precipitation series for the European Climate Assessment, Int. J. Climatol., 22, 1441–1453, 2002. a, b

Kroiz, G. C., Majumder, R., Gobbert, M. K., Neerchal, N. K., Markert, K., and Mehta, A.: Daily Precipitation Generation Using a Hidden Markov Model with Correlated Emissions for the Potomac River Basin, PAMM, 20, e202000117, https://doi.org/10.1002/pamm.202000117, 2020. a, b, c, d, e, f, g, h

Lang, A. and Poschlod, B.: Updating Catastrophe Models to Today's Climate – An Application of a Large Ensemble Approach to Extreme Rainfall, Clim. Risk Manage., 44, 100594, https://doi.org/10.1016/j.crm.2024.100594, 2024. a

Langrock, R. and Zucchini, W.: Hidden Markov Models with Arbitrary State Dwell-Time Distributions, Comput. Stat. Data Anal., 55, 715–724, https://doi.org/10.1016/j.csda.2010.06.015, 2011. a

Laverny, O. and Jimenez, S.: Copulas.Jl: A Fully Distributions.Jl-Compliant Copula Package [code], J. Open Source Softw., 9, 6189, https://doi.org/10.21105/joss.06189, 2024. a, b

Lubin, M., Dowson, O., Garcia, J. D., Huchette, J., Legat, B., and Vielma, J. P.: JuMP 1.0: Recent Improvements to a Modeling Language for Mathematical Optimization, Math. Programm. Comput., 15, 581–589, 2023. a

Luu, L. N., Vautard, R., Yiou, P., and Soubeyroux, J.-M.: Evaluation of convection-permitting extreme precipitation simulations for the south of France, Earth Syst. Dynam., 13, 687–702, https://doi.org/10.5194/esd-13-687-2022, 2022. a

Manzano, V. J. P. and Ines, A. V.: Downscaling Seasonal Climate Forecasts for Risks Management of Rice Production in the Philippines, Indian Journal of Science and Technology, 13, 1–17, https://doi.org/10.17485/ijst/2020/v13i01/147074, 2020. a

McLachlan, G. J. and Krishnan, T.: The EM Algorithm and Extensions, John Wiley & Sons, ISBN 978-0-470-19160-6, 2007. a

Métivier, D.: StochasticWeatherGenerators.jl: A Julia package to generate weather sequences with Stochastic Weather Generators, Github [code], https://github.com/dmetivie/StochasticWeatherGenerators.jl, 2024. a, b, c

Métivier, D., Allouche, M., Saux, M., Gobet, E., and Pachebat, J.: Data from Hackathon “GenHack 3 Generative Modeling Challenge”: Predicting maize crop yield distribution under stochastic weather [data set], Recherche Data Gouv., https://doi.org/10.57745/C3FNBY, 2025. a

Miloshevich, G., Cozian, B., Abry, P., Borgnat, P., and Bouchet, F.: Probabilistic Forecasts of Extreme Heatwaves Using Convolutional Neural Networks in a Regime of Lack of Data, Phys. Rev. Fluids, 8, 040501, https://doi.org/10.1103/PhysRevFluids.8.040501, 2023. a

Miloshevich, G., Lucente, D., Yiou, P., and Bouchet, F.: Extreme Heat Wave Sampling and Prediction with Analog Markov Chain and Comparisons with Deep Learning, Environ. Data Sci., 3, e9, ISSN 2634-4602, https://doi.org/10.1017/eds.2024.7, 2024. a

Monbet, V. and Ailliot, P.: Sparse Vector Markov Switching Autoregressive Models. Application to Multivariate Time Series of Temperature, Comput. Stat. Data Anal., 108, 40–51, https://doi.org/10.1016/j.csda.2016.10.023, 2017. a

Najibi, N., Mukhopadhyay, S., and Steinschneider, S.: Identifying Weather Regimes for Regional-Scale Stochastic Weather Generators, Int. J. Climatol., 41, 2456–2479, https://doi.org/10.1002/joc.6969, 2021. a, b

Naveau, P., Huser, R., Ribereau, P., and Hannart, A.: Modeling Jointly Low, Moderate, and Heavy Rainfall Intensities without a Threshold Selection, Water Resour. Res., 52, 2753–2769, 2016. a, b

Nelsen, R. B.: An Introduction to Copulas, Springer Series in Statistics, Springer, New York, NY, ISBN 978-0-387-28659-4, 2006. a, b

Neykov, N., Neytchev, P., Zucchini, W., and Hristov, H.: Linking Atmospheric Circulation to Daily Precipitation Patterns over the Territory of Bulgaria, Environ. Ecol. Stat., 19, 249–267, https://doi.org/10.1007/s10651-011-0185-9, 2012. a

Nguyen, T. H. T., Bennett, B., and Leonard, M.: Evaluating Stochastic Rainfall Models for Hydrological Modelling, J. Hydrol., 627, 130381, https://doi.org/10.1016/j.jhydrol.2023.130381, 2023. a, b, c

Nguyen, V. D., Vorogushyn, S., Nissen, K., Brunner, L., and Merz, B.: A non-stationary climate-informed weather generator for assessing future flood risks, Adv. Stat. Clim. Meteorol. Oceanogr., 10, 195–216, https://doi.org/10.5194/ascmo-10-195-2024, 2024. a

Pandey, P. K., Das, L., Jhajharia, D., and Pandey, V.: Modelling of Interdependence between Rainfall and Temperature Using Copula, Model. Earth Syst. Environ., 4, 867–879, 2018. a

Papastamatiou, Y. P., Watanabe, Y. Y., Demšar, U., Leos-Barajas, V., Bradley, D., Langrock, R., Weng, K., Lowe, C. G., Friedlander, A. M., and Caselle, J. E.: Activity Seascapes Highlight Central Place Foraging Strategies in Marine Predators That Never Stop Swimming, Movement Ecol., 6, 9, https://doi.org/10.1186/s40462-018-0127-3, 2018. a

Parent, B., Leclere, M., Lacube, S., Semenov, M. A., Welcker, C., Martre, P., and Tardieu, F.: Maize Yields over Europe May Increase in Spite of Climate Change, with an Appropriate Use of the Genetic Variability of Flowering Time, P. Natl. Acad. Sci. USA, 115, 10642–10647, https://doi.org/10.1073/pnas.1720716115, 2018. a

Pascual, D., Pla, E., Fons, J., and Abdul-Malak, D.: Climate change impacts on water availability and human security in the intercontinental biosphere reserve of the mediterranean (Morocco-Spain), in: Environmental Change and Human Security in Africa and the Middle East, Springer, 75–93, ISBN 978-3-319-45648-5, https://doi.org/10.1007/978-3-319-45648-5_4, 2017. a

Pawlowsky-Glahn, V. and Buccianti, A.: Compositional data analysis: Theory and applications, John Wiley & Sons, https://doi.org/10.1002/9781119976462, 2011. a

Philipp, A., Beck, C., Huth, R., and Jacobeit, J.: Development and Comparison of Circulation Type Classifications Using the COST 733 Dataset and Software, Int. J. Climatol., 36, 2673–2691, https://doi.org/10.1002/joc.3920, 2016. a

Pohle, J., Langrock, R., van Beest, F. M., and Schmidt, N. M.: Selecting the Number of States in Hidden Markov Models: Pragmatic Solutions Illustrated Using Animal Movement, J. Agr. Biol. Environ. Stat., 22, 270–293, 2017. a, b, c

Raftery, A. E.: A Model for High-Order Markov Chains, J. Roy. Stat. Soc. Ser. B, 47, 528–539, https://doi.org/10.1111/j.2517-6161.1985.tb01383.x, 1985. a

Ranger, N. A., Mahul, O., and Monasterolo, I.: Assessing Financial Risks from Physical Climate Shocks: A Framework for Scenario Generation, World Bank, Washington, DC, https://hdl.handle.net/10986/37041 (last access: 29 August 2025), 2022. a

Renard, B. and Lang, M.: Use of a Gaussian Copula for Multivariate Extreme Value Analysis: Some Case Studies in Hydrology, Adv. Water Resour., 30, 897–912, 2007. a

Richardson, C. W.: Stochastic Simulation of Daily Precipitation, Temperature, and Solar Radiation, Water Resour. Res., 17, 182–190, 1981. a, b, c

Robertson, A. W., Kirshner, S., and Smyth, P.: Downscaling of Daily Rainfall Occurrence over Northeast Brazil Using a Hidden Markov Model, J Climate, 17, 4407–4424, https://doi.org/10.1175/JCLI-3216.1, 2004. a, b, c, d, e, f

Robertson, A. W., Ines, A. V. M., and Hansen, J. W.: Downscaling of Seasonal Precipitation for Crop Simulation, J. Appl. Meteorol. Clim., 46, 677–693, https://doi.org/10.1175/JAM2495.1, 2007. a

Sansom, J. and Thomson, P.: A hidden seasonal switching model for high-resolution breakpoint rainfall data, Water Resour. Res., 46, 8, ISSN 1944-7973, https://doi.org/10.1029/2009WR008602, 2010. a

Serinaldi, F. and Kilsby, C. G.: Simulating Daily Rainfall Fields over Large Areas for Collective Risk Estimation, J. Hydrol., 512, 285–302, https://doi.org/10.1016/j.jhydrol.2014.02.043, 2014. a

Soubeyroux, J.-M., Bernus, S., Corre, L., Drouin, A., Dubuisson, B., Etchevers, P., Gouget, V., Josse, P., Kerdoncuff, M., Samacoits, R., and Tocquer, F.: Les Nouvelles Projections Climatiques de Référence DRIAS-2020 Pour La Métropole, Tech. rep., Météo-France, 2021. a, b, c, d

Srikanthan, R. and Pegram, G. G. S.: A Nested Multisite Daily Rainfall Stochastic Generation Model, J. Hydrol., 371, 142–153, https://doi.org/10.1016/j.jhydrol.2009.03.025, 2009. a, b, c, d

Stoner, O. and Economou, T.: An Advanced Hidden Markov Model for Hourly Rainfall Time Series, Comput. Stat. Data Anal., 152, 107045, https://doi.org/10.1016/j.csda.2020.107045, 2020. a, b, c

Tencaliec, P., Favre, A.-C., Naveau, P., Prieur, C., and Nicolet, G.: Flexible Semiparametric Generalized Pareto Modeling of the Entire Range of Rainfall Amount, Environmetrics, 31, e2582, https://doi.org/10.1002/env.2582, 2020. a, b

Tootoonchi, F., Todorović, A., Grabs, T., and Teutschbein, C.: Uni- and Multivariate Bias Adjustment of Climate Model Simulations in Nordic Catchments: Effects on Hydrological Signatures Relevant for Water Resources Management in a Changing Climate, J. Hydrol., 623, 129807, https://doi.org/10.1016/j.jhydrol.2023.129807, 2023. a

Touron, A.: Consistency of the Maximum Likelihood Estimator in Seasonal Hidden Markov Models, Stat. Comput., 29, 1055–1075, 2019a. a, b, c, d, e, f, g, h, i

Touron, A.: Modélisation multivariée de variables météorologiques, Ph.D. thesis, Université Paris-Saclay (ComUE), https://tel.archives-ouvertes.fr/tel-02319170 (last access: 29 August 2025), 2019b. a, b, c, d, e

Vaittinada Ayar, P., Vrac, M., Bastin, S., Carreau, J., Déqué, M., and Gallardo, C.: Intercomparison of Statistical and Dynamical Downscaling Models under the EURO- and MED-CORDEX Initiative Framework: Present Climate Evaluations, Clim. Dynam., 46, 1301–1329, https://doi.org/10.1007/s00382-015-2647-5, 2016. a

Vaittinada Ayar, P., Blanchet, J., Paquet, E., and Penot, D.: Space-Time Simulation of Precipitation Based on Weather Pattern Sub-Sampling and Meta-Gaussian Model, J. Hydrol., 581, 124451, https://doi.org/10.1016/j.jhydrol.2019.124451, 2020. a, b

van der Wiel, K., Bloomfield, H. C., Lee, R. W., Stoop, L. P., Blackport, R., Screen, J. A., and Selten, F. M.: The Influence of Weather Regimes on European Renewable Energy Production and Demand, Environ. Res. Lett., 14, 094010, https://doi.org/10.1088/1748-9326/ab38d3, 2019. a, b, c, d

Vautard, R., Kadygrov, N., Iles, C., Boberg, F., Buonomo, E., Bülow, K., Coppola, E., Corre, L., van Meijgaard, E., Nogherotto, R., Sandstad, M., Schwingshackl, C., Somot, S., Aalbers, E., Christensen, O. B., Ciarlo, J. M., Demory, M.-E., Giorgi, F., Jacob, D., Jones, R. G., Keuler, K., Kjellström, E., Lenderink, G., Levavasseur, G., Nikulin, G., Sillmann, J., Solidoro, C., Sørland, S. L., Steger, C., Teichmann, C., Warrach-Sagi, K., and Wulfmeyer, V.: Evaluation of the Large EURO-CORDEX Regional Climate Model Ensemble, J. Geophys. Res.-Atmos., 126, e2019JD032344, https://doi.org/10.1029/2019JD032344, 2021. a

Verdin, A., Rajagopalan, B., Kleiber, W., Podestá, G., and Bert, F.: BayGEN: A Bayesian Space-Time Stochastic Weather Generator, Water Resour. Res., 55, 2900–2915, https://doi.org/10.1029/2017WR022473, 2019. a, b

Verfaillie, D., Déqué, M., Morin, S., and Lafaysse, M.: The method ADAMONT v1.0 for statistical adjustment of climate projections applicable to energy balance land surface models, Geosci. Model Dev., 10, 4257–4283, https://doi.org/10.5194/gmd-10-4257-2017, 2017. a

Vidal, J.-P., Martin, E., Franchistéguy, L., Baillon, M., and Soubeyroux, J.-M.: A 50-Year High-Resolution Atmospheric Reanalysis over France with the Safran System, Int. J. Climatol., 30, 1627–1644, 2010. a

Viterbi, A.: Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm, IEEE T. Inform. Theory, 13, 260–269, 1967. a, b

Vrac, M., Stein, M., and Hayhoe, K.: Statistical Downscaling of Precipitation through Nonhomogeneous Stochastic Weather Typing, Climate Res., 34, 169–184, https://doi.org/10.3354/cr00696, 2007. a

Wächter, A. and Biegler, L. T.: On the Implementation of an Interior-Point Filter Line-Search Algorithm for Large-Scale Nonlinear Programming, Math. Programm., 106, 25–57, 2006. a

Wilks, D. S.: Multisite Generalization of a Daily Stochastic Precipitation Generation Model, J. Hydrol., 210, 178–191, https://doi.org/10.1016/S0022-1694(98)00186-3, 1998. a, b, c, d, e

Wilks, D. S.: A Gridded Multisite Weather Generator and Synchronization to Observed Weather Data, Water Resour. Res., 45, https://doi.org/10.1029/2009WR007902, 2009. a

Wilks, D. S. and Wilby, R. L.: The weather generation game: a review of stochastic weather models, Prog. Phys. Geogr., 23, 329–357, 1999. a

Woollings, T., Hannachi, A., Hoskins, B., and Turner, A.: A Regime View of the North Atlantic Oscillation and Its Response to Anthropogenic Forcing, J. Climate, 23, 1291–1307, https://doi.org/10.1175/2009JCLI3087.1, 2010. a

Yakowitz, S. J. and Spragins, J. D.: On the Identifiability of Finite Mixtures, Ann. Math. Stat., 39, 209–214, 1968.  a

Yamanishi, K.: Latent Variable Model Selection, in: Learning with the Minimum Description Length Principle, edited by: Yamanishi, K., Springer Nature, Singapore, 137–183, ISBN 978-981-9917-90-7, https://doi.org/10.1007/978-981-99-1790-7_4, 2023. a

Yang, C., Chandler, R. E., Isham, V. S., and Wheater, H. S.: Spatial-Temporal Rainfall Simulation Using Generalized Linear Models, Water Resour. Res., 41, https://doi.org/10.1029/2004WR003739, 2005. a

Yang, J., Jun, M., Schumacher, C., and Saravanan, R.: Predictive Statistical Representations of Observed and Simulated Rainfall Using Generalized Linear Models, J. Climate, 32, 3409–3427, https://doi.org/10.1175/JCLI-D-18-0527.1, 2019. a

Zhao, C., Liu, B., Piao, S., Wang, X., Lobell, D. B., Huang, Y., Huang, M., Yao, Y., Bassu, S., Ciais, P., Durand, J.-L., Elliott, J., Ewert, F., Janssens, I. A., Li, T., Lin, E., Liu, Q., Martre, P., Müller, C., Peng, S., Peñuelas, J., Ruane, A. C., Wallach, D., Wang, T., Wu, D., Liu, Z., Zhu, Y., Zhu, Z., and Asseng, S.: Temperature Increase Reduces Global Yields of Major Crops in Four Independent Estimates, P. Natl. Acad. Sci. USA, 114, 9326–9331, https://doi.org/10.1073/pnas.1701762114, 2017. a

Zucchini, W. and Guttorp, P.: A Hidden Markov Model for Space-Time Precipitation, Water Resour. Res., 27, 1917–1923, https://doi.org/10.1029/91WR01403, 1991. a, b, c, d, e

Zucchini, W. and MacDonald, I. L.: Hidden Markov Models for Time Series: An Introduction Using R, Chapman and Hall/CRC, New York, ISBN 978-0-429-13953-6, https://doi.org/10.1201/9781420010893, 2009. a

1

The index m = 0 in 𝒞_0 refers to the non-dependence of Y on its past.

2

Except for 29 February (as well as 17 and 23 February and 6 and 12 March).

Short summary
Stochastic weather generators (SWGs) are statistical models used to study climate variability. We design an interpretable multisite SWG for precipitation, capable of learning large-scale weather regimes solely from French observational data. The model reproduces extreme events like droughts and heavy rain and is applied to climate models under historical and Representative Concentration Pathway (RCP) scenarios. This type of model aims to assess large-scale weather risks, such as those impacting energy systems and agriculture.