<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">ASCMO</journal-id><journal-title-group>
    <journal-title>Advances in Statistical Climatology, Meteorology and Oceanography</journal-title>
    <abbrev-journal-title abbrev-type="publisher">ASCMO</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Adv. Stat. Clim. Meteorol. Oceanogr.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">2364-3587</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/ascmo-12-149-2026</article-id><title-group><article-title>Improving multisite precipitation generators based on generalised linear models</article-title><alt-title>Improving multisite precipitation generators</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Wessel</surname><given-names>Jakob Benjamin</given-names></name>
          <email>j.wessel@exeter.ac.uk</email>
        <ext-link>https://orcid.org/0000-0003-2621-2477</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Chandler</surname><given-names>Richard E.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-1116-222X</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Mathematics and Statistics, University of Exeter, Exeter, United Kingdom</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Statistical Science, University College London, London, United Kingdom</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Jakob Benjamin Wessel (j.wessel@exeter.ac.uk)</corresp></author-notes><pub-date><day>5</day><month>May</month><year>2026</year></pub-date>
      
      <volume>12</volume>
      <issue>1</issue>
      <fpage>149</fpage><lpage>172</lpage>
      <history>
        <date date-type="received"><day>21</day><month>November</month><year>2025</year></date>
           <date date-type="rev-recd"><day>6</day><month>April</month><year>2026</year></date>
           <date date-type="accepted"><day>24</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Jakob Benjamin Wessel</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026.html">This article is available from https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026.html</self-uri><self-uri xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026.pdf">The full text article is available as a PDF file from https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e92">Precipitation generators are statistical models used to produce synthetic sequences of (multisite) precipitation for hydrological applications such as flood risk assessment and water resource management. Among these, approaches based on generalised linear models (GLMs) are widely used and often perform competitively with state-of-the-art alternatives, but they face limitations in representing seasonal variation in extremes and in flexibly capturing covariate effects on the precipitation distribution. In this paper, we extend the GLM framework in two directions. First, we introduce generalised additive models for location, scale and shape (GAMLSS) for precipitation generation. These models allow the use of spline-based model terms to flexibly capture covariate effects and allow covariates to influence multiple distributional parameters, thereby increasing flexibility in representing variation in both the mean and variance of the precipitation distribution. Second, we adapt a transformed Gaussian fields approach to jointly account for spatial dependence in both precipitation occurrence and intensity, thus allowing for potential cross-dependence between the two. A further contribution is to investigate the sensitivity of model performance to data resolution, highlighting that rounding in data pre-processing can substantially affect the reproduction of extremes. Using a well-studied daily precipitation dataset, we demonstrate that these extensions improve the realism of simulated sequences, particularly with respect to extremes, and capture spatial dependence well in both occurrence and intensity.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Engineering and Physical Sciences Research Council</funding-source>
<award-id>2696930</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e104">Precipitation is a key driver of most hydrological systems, so that the representation of precipitation variability in both space and time is often critical when studying the behaviour of such systems – for example in flood risk assessment or water resource management. To characterise this variability and its effect, a common approach is to use stochastic or statistical models to generate synthetic sequences of precipitation and other relevant quantities, and to use the resulting sequences as inputs to hydrological models; see <xref ref-type="bibr" rid="bib1.bibx12" id="text.1"/>, for example.</p>
      <p id="d2e110">The development of techniques for producing sequences of precipitation and other variables, preserving relationships between them, can be traced back to the “weather generator” of <xref ref-type="bibr" rid="bib1.bibx68" id="text.2"/>. This built on a large body of literature focused on the development of stochastic and statistical models specifically for precipitation without considering other variables: such models are referred to henceforth as “precipitation generators”. Some reviews, from differing perspectives, are given by <xref ref-type="bibr" rid="bib1.bibx56 bib1.bibx22" id="text.3"/> and <xref ref-type="bibr" rid="bib1.bibx63" id="text.4"/>.</p>
      <p id="d2e122">In the original weather generator of <xref ref-type="bibr" rid="bib1.bibx68" id="text.5"/>, daily precipitation occurrence is modelled as a Markov chain with wet-day precipitation intensities generated independently using either exponential or gamma distributions. The approach has several known deficiencies, including difficulty in reproducing the distributions of extreme precipitation and of wet and dry spell durations. Several alternative approaches have been proposed, some designed explicitly to overcome these deficiencies. For example, the LARS-WG generator of <xref ref-type="bibr" rid="bib1.bibx73" id="text.6"/> models spell length distributions directly, while <xref ref-type="bibr" rid="bib1.bibx31" id="text.7"/> use heavy-tailed distributions for wet-day intensity to improve the representation of extremes. Other approaches include those based on resampling the historical record <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx10" id="paren.8"/>, those based on transformed Gaussian distributions <xref ref-type="bibr" rid="bib1.bibx77" id="paren.9"/>, and those based on Hidden Markov Models (HMMs) in which different precipitation distributions are associated with each of a collection of underlying “weather states” <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx1 bib1.bibx42 bib1.bibx27" id="paren.10"/>.</p>
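      <p>As a minimal sketch of the structure just described (not the implementation of any cited generator), the following Python fragment simulates occurrence as a first-order two-state Markov chain and draws wet-day intensities independently from a gamma distribution. The function name and all parameter values are hypothetical and are chosen purely for illustration.</p>
      <preformat><![CDATA[
```python
import random


def simulate_daily_precipitation(n_days, p_wet_after_dry, p_wet_after_wet,
                                 gamma_shape, gamma_scale, seed=None):
    """Simulate a single-site daily series (mm) with Markov occurrence.

    All arguments are illustrative assumptions, not fitted values.
    """
    rng = random.Random(seed)
    wet = False  # assume the day before the simulation starts was dry
    series = []
    for _ in range(n_days):
        # Occurrence: the wet-day probability depends only on yesterday's state.
        p_wet = p_wet_after_wet if wet else p_wet_after_dry
        wet = rng.random() < p_wet
        # Intensity: independent gamma draw on wet days, zero on dry days.
        amount = rng.gammavariate(gamma_shape, gamma_scale) if wet else 0.0
        series.append(amount)
    return series


# Hypothetical parameter values for a one-year simulation.
sim = simulate_daily_precipitation(365, 0.25, 0.60, 0.8, 6.0, seed=1)
print(sum(1 for x in sim if x > 0), "wet days out of", len(sim))
```
]]></preformat>
      <p>Because intensities are drawn independently of one another and from a single light-tailed family, a scheme of this kind illustrates why spell durations and extremes can be difficult to reproduce.</p>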
      <p id="d2e144">Many studies have been carried out to compare subsets of these approaches, by evaluating their ability to reproduce key features of observed weather sequences using frameworks such as that of <xref ref-type="bibr" rid="bib1.bibx57" id="text.11"/>. The results from some of these studies should be treated with caution, however, because the effective use of some methods requires a high degree of technical awareness: in the hands of a trained user, such methods are therefore likely to outperform the “off-the-shelf” implementations that are sometimes used in comparison studies. Our own experience is that when implemented with insight and understanding, several of the more sophisticated modelling approaches are capable of reproducing a wide range of properties of observed precipitation sequences over a range of spatial scales, including basic summary statistics, measures of temporal persistence and extremal behaviour.</p>
      <p id="d2e151">One of these “more sophisticated” approaches is based on generalised linear models (GLMs) which, notwithstanding the caveats above regarding comparison studies, are often found to perform well in absolute terms and to compete favourably with other state-of-the-art approaches such as HMMs <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx50 bib1.bibx61 bib1.bibx5 bib1.bibx6 bib1.bibx24 bib1.bibx78" id="paren.12"><named-content content-type="pre">e.g.</named-content></xref>. Indeed, some other approaches can be regarded as special cases of GLMs <xref ref-type="bibr" rid="bib1.bibx37" id="paren.13"><named-content content-type="pre">e.g.</named-content></xref>. As well as the potential to produce sequences with realistic spatial, temporal and inter-variable dependence structures over a range of scales, the GLM-based approach is able to represent systematic variation over space and time, including under scenarios of climate change (see Sect. <xref ref-type="sec" rid="Ch1.S2.SS2"/>); to generate sequences at locations for which no data are available; and to cope with missing values and unequal record lengths in the historical records used for model calibration <xref ref-type="bibr" rid="bib1.bibx17" id="paren.14"/>.</p>
      <p id="d2e169">Despite the strengths of the GLM-based approach to precipitation generation, it has some known deficiencies. For example, it typically struggles to capture seasonal variation in extreme precipitation – a phenomenon first noted by <xref ref-type="bibr" rid="bib1.bibx89" id="text.15"/>. Other precipitation generators suffer from similar problems: indeed, to address the issue in applications for which the reproduction of extremes is critical, some authors have advocated the use of models in which precipitation intensities are considered to be drawn from distributions with two separate parts, one for the main body of the data and the other for the extremes <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx31" id="paren.16"><named-content content-type="pre">e.g.</named-content></xref>. Arguably, however, this addresses the symptom of the problem rather than its cause, which, for GLMs at least, is likely to be partly a lack of flexibility in representing variation in different aspects of the precipitation distribution. This is because, in standard applications of the GLM methodology to daily precipitation, non-zero precipitation intensities are modelled using gamma distributions in which the standard deviation is proportional to the mean, as discussed in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/> below. This restriction is made for mathematical and computational convenience, but it lacks physical justification: it is therefore worth investigating whether the reproduction of extremal behaviour can be improved by relaxing it.</p>
      <p id="d2e182">Another feature of GLMs is that the means of the daily precipitation distributions are related to linear combinations of predictors (referred to as “covariates” below), representing systematic variation associated with seasonality, regional variation, large-scale atmospheric conditions, previous days' weather, and so forth: again, Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/> provides details. A further potential enhancement is to relax the requirement for the covariates to be combined linearly and, instead, to let the data determine the most appropriate form of the relationship. Generalised Additive Models (GAMs) allow this, via smooth semiparametric representations of the covariate effects on the means of the daily precipitation distributions. GAMs have been developed for daily precipitation series by, for example, <xref ref-type="bibr" rid="bib1.bibx45 bib1.bibx9" id="text.17"/> and <xref ref-type="bibr" rid="bib1.bibx80" id="text.18"/> – although, again, they impose a fixed relationship between the mean and standard deviation of the intensity distribution.</p>
      <p id="d2e193">Against this background, the primary contribution of the present paper is to examine the effect of adopting a more flexible modelling framework in which covariates can influence multiple aspects of the daily precipitation distribution, both parametrically and semiparametrically. Our working hypothesis is that the known deficiencies of GLM-based generators – in particular their tendency to misrepresent seasonal precipitation extremes – reflect primarily a lack of flexibility in representing variation in the scale of the intensity distribution and in the functional form of covariate effects, rather than a fundamental misspecification of the distributional family itself; we therefore retain the widely used gamma distribution for precipitation intensities throughout. To test this hypothesis, we use the class of Generalised Additive Models for Location, Scale and Shape <xref ref-type="bibr" rid="bib1.bibx70" id="paren.19"><named-content content-type="pre">GAMLSS;</named-content></xref> which, in principle, allow both parametric and semiparametric representation of covariate effects on the first four moments of precipitation intensity. GAMLSS do not appear to have been used previously as precipitation generators, although they have been used in related areas such as the linking of local precipitation climatologies to large-scale atmospheric conditions <xref ref-type="bibr" rid="bib1.bibx82 bib1.bibx67" id="paren.20"><named-content content-type="pre">e.g.</named-content></xref>; the analysis of trends in extreme precipitation <xref ref-type="bibr" rid="bib1.bibx38 bib1.bibx52" id="paren.21"><named-content content-type="pre">e.g.</named-content></xref>; and the characterisation of nonstationarity in the frequency and severity of floods and droughts <xref ref-type="bibr" rid="bib1.bibx81 bib1.bibx53 bib1.bibx54 bib1.bibx84 bib1.bibx66" id="paren.22"><named-content content-type="pre">e.g.</named-content></xref>.</p>
      <p id="d2e216">While evaluating the performance of the GLM- and GAMLSS-based approaches in the case study described in Sect. <xref ref-type="sec" rid="Ch1.S2"/>, the reproduction of extreme precipitation was found to be surprisingly sensitive to the recording resolution of the data. For example, the results can change appreciably if the data are rounded to the nearest 0.5 <inline-formula><mml:math id="M1" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula> prior to analysis, as is sometimes done to reduce the effect of inconsistent recording practices over time or between stations <xref ref-type="bibr" rid="bib1.bibx3" id="paren.23"><named-content content-type="pre">e.g.</named-content></xref>. Although it is known that modelling results can be sensitive to the way in which small precipitation values are recorded <xref ref-type="bibr" rid="bib1.bibx90" id="paren.24"/>, this more general effect of rounding is, to our knowledge, not widely appreciated. A secondary contribution of the paper is therefore to examine this effect.</p>
      <p id="d2e237">GLM-based precipitation generators treat precipitation occurrence and intensity separately (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>). This is unrestrictive when considering a single location, because the cumulative distribution function of the precipitation amount (<inline-formula><mml:math id="M2" display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula>, say) can always be written as

          <disp-formula id="Ch1.Ex1"><mml:math id="M3" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mi>Y</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>:=</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

        where the notation <inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mo>|</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> denotes the conditional probability of an event <inline-formula><mml:math id="M5" display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> given that <inline-formula><mml:math id="M6" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> has occurred. The expression <inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the probability of precipitation occurrence, while the conditional probability <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> can be deduced from a model for non-zero precipitation intensity and, trivially, <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>Y</mml:mi><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>|</mml:mo><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> for all <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:mi>y</mml:mi><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
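      <p>The decomposition above can be sketched numerically as follows. The sketch assumes, purely for illustration, an exponential distribution for the non-zero intensities (a gamma distribution with shape parameter 1); the function names and the values of the occurrence probability and wet-day mean are hypothetical.</p>
      <preformat><![CDATA[
```python
import math


def intensity_cdf(y, mean_wet):
    """P(Y <= y | Y > 0) under an exponential intensity model (an assumption)."""
    if y <= 0:
        return 0.0
    return 1.0 - math.exp(-y / mean_wet)


def precipitation_cdf(y, p_wet, mean_wet):
    """F_Y(y) = P(Y <= y | Y = 0) P(Y = 0) + P(Y <= y | Y > 0) P(Y > 0)."""
    if y < 0:
        return 0.0
    p_dry = 1.0 - p_wet
    # P(Y <= y | Y = 0) equals 1 for all y >= 0, so the dry term is P(Y = 0).
    return 1.0 * p_dry + intensity_cdf(y, mean_wet) * p_wet


# Hypothetical values: 40 % wet days and a mean wet-day amount of 5 mm.
print(precipitation_cdf(0.0, p_wet=0.4, mean_wet=5.0))
print(precipitation_cdf(10.0, p_wet=0.4, mean_wet=5.0))
```
]]></preformat>
      <p>The jump of size P(Y = 0) at zero is supplied entirely by the occurrence model, so the intensity model needs to describe only the continuous part of the distribution.</p>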
      <p id="d2e501">When considering daily precipitation at multiple locations, however, it is necessary to consider the dependence between different sites on the same day. To model this dependence, intensities are again treated separately from occurrence in most GLM-based precipitation generators. When the resulting models are used to generate synthetic multisite precipitation sequences, therefore, the “dry” sites are ignored when generating non-zero intensities at “wet” sites, so that the methodology is not capable of (for example) systematically producing lower intensities at locations close to dry sites. The extent to which this is a problem is probably application-specific, but it is avoided in some other approaches, such as those building on the work of <xref ref-type="bibr" rid="bib1.bibx7" id="text.25"/>, which uses transformations of Gaussian random fields. A final contribution of the present paper is thus to adapt the transformed Gaussian fields approach to the context of GLMs and GAMLSS, so as to represent inter-site dependence in precipitation occurrence and intensity simultaneously.</p>
      <p id="d2e507">The extensions of the basic GLM methodology are demonstrated using a dataset similar to that used by <xref ref-type="bibr" rid="bib1.bibx89" id="text.26"/>: this allows for a straightforward evaluation of their effects in a setting where the performance of GLMs is well understood. The dataset is described in the next section, after which Sect. <xref ref-type="sec" rid="Ch1.S3"/> provides a brief review of GLM-based precipitation generators and describes the proposed extensions. The performance of the methods is assessed and compared in Sect. <xref ref-type="sec" rid="Ch1.S4"/>; Sect. <xref ref-type="sec" rid="Ch1.S5"/> concludes.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Study area and data</title>
      <p id="d2e527">The study area considered in <xref ref-type="bibr" rid="bib1.bibx89" id="text.27"/> has dimensions around 50 <inline-formula><mml:math id="M11" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M12" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 40 <inline-formula><mml:math id="M13" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> and is centred on the catchment of the river Blackwater, a subtributary of the Thames in southern England. As illustrated in Fig. <xref ref-type="fig" rid="F1"/>, the region has modest topographical variation: altitudes range from near sea level in the Thames floodplain in the north-east to almost 300 <inline-formula><mml:math id="M14" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> at the highest points of two escarpments running roughly east-west. The region has a humid temperate oceanic climate <xref ref-type="bibr" rid="bib1.bibx8" id="paren.28"/>, with a mean annual precipitation of around 750 <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula> and precipitation occurring throughout the year. Much of the precipitation is associated with large-scale synoptic systems transporting moisture from the Atlantic, although precipitation extremes in summer are often associated with more localised convective events <xref ref-type="bibr" rid="bib1.bibx58" id="paren.29"/>.</p>

      <fig id="F1"><label>Figure 1</label><caption><p id="d2e583">Topographical map of the study area (solid rectangle), with locations of the 51 gauges providing data used in the study. The inset shows the location of the study area within the British Isles.</p></caption>
        <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f01.png"/>

      </fig>

<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Precipitation data</title>
      <p id="d2e599">The present study uses a more extensive daily precipitation dataset than that considered by <xref ref-type="bibr" rid="bib1.bibx89" id="text.30"/>, from a network of 51 stations at locations shown in Fig. <xref ref-type="fig" rid="F1"/>. These data are a subset of the MIDAS precipitation dataset <xref ref-type="bibr" rid="bib1.bibx60" id="paren.31"/>. Records cover the period from 1923 to 2022, although few stations were operational during the first few decades of this period. Moreover, data on atmospheric covariates (see below) are only available from 1959 onwards: we therefore restrict attention to the period 1959–2022. The complete MIDAS precipitation dataset includes data from 171 stations that were operational at some time during this period; the 51 selected stations were identified based on extensive quality checks. Specifically, a station's record was retained only if the station was nominally operational for at least 10 years and fewer than 10 % of its values were missing. Furthermore, since operational difficulties and data quality problems during a given period can often lead to anomalies in the recorded proportions of wet days at a site <xref ref-type="bibr" rid="bib1.bibx21" id="paren.32"><named-content content-type="pre">e.g.</named-content></xref>, such anomalies were identified from the residuals of a two-way Analysis of Variance (ANOVA) fitted to the annual proportions of wet days at each location. Periods of dubious data quality identified from this exercise were also excluded from the subsequent analysis, as were several values that were (i) unusually large, (ii) recorded on the last day of a calendar month, and (iii) the only value recorded at a site during that month: these values were considered likely to be monthly totals that were included in the dataset erroneously.</p>
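      <p>The retention rule and the check for suspected monthly totals can be sketched as follows. This is an illustrative reading of the criteria stated above rather than the code used in the study: the function names are hypothetical, and the cut-off <monospace>large_mm</monospace> for an “unusually large” value is an assumed placeholder, since no numerical definition is given.</p>
      <preformat><![CDATA[
```python
import calendar
from datetime import date


def passes_screening(years_operational, fraction_missing):
    """Retention rule from the text: at least 10 years, fewer than 10 % missing."""
    return years_operational >= 10 and fraction_missing < 0.10


def is_suspected_monthly_total(day, value_mm, n_values_in_month, large_mm=50.0):
    """Flag a value that meets conditions (i) to (iii).

    large_mm is a hypothetical cut-off for "unusually large".
    """
    last_day = calendar.monthrange(day.year, day.month)[1]
    return (value_mm > large_mm              # (i) unusually large
            and day.day == last_day          # (ii) last day of the calendar month
            and n_values_in_month == 1)      # (iii) only value recorded that month


# Hypothetical examples.
print(passes_screening(years_operational=25, fraction_missing=0.04))
print(is_suspected_monthly_total(date(1975, 2, 28), 63.2, n_values_in_month=1))
```
]]></preformat>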
      <p id="d2e615">The recording resolution of the data has changed over time, partly due to a change from imperial to metric measurement units in the early 1970s: most values prior to this are recorded to a resolution of around 0.3 <inline-formula><mml:math id="M16" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula>, while the resolution is typically 0.1 <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula> for more recent data. Such inconsistencies can give rise to spurious trends in rates of precipitation occurrence in particular, which can also be sensitive to inter-site differences in observer practice when recording small non-zero precipitation amounts <xref ref-type="bibr" rid="bib1.bibx90" id="paren.33"><named-content content-type="pre">“trace values”;</named-content></xref>. To reduce the effects of such issues, one option is to treat all non-zero values below some threshold <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>≥</mml:mo></mml:mrow></mml:math></inline-formula> 0.3 <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula> as being known only to lie in the range (<inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="italic">τ</mml:mi></mml:mrow></mml:math></inline-formula>), so that the exact recorded values are not used: this can be considered as a problem of censored data, as discussed in more detail by <xref ref-type="bibr" rid="bib1.bibx19" id="text.34"/>, who also provide reasons for not adopting such an approach.
Instead, we follow <xref ref-type="bibr" rid="bib1.bibx90" id="text.35"/> in transforming the original values <inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="true">̃</mml:mo></mml:mover><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> to

            <disp-formula id="Ch1.Ex2"><mml:math id="M22" display="block"><mml:mrow><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mo movablelimits="false">max⁡</mml:mo><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="true" mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          where the threshold <inline-formula><mml:math id="M23" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula> is set to 0.5 <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula> (here and subsequently, lower- and upper-case letters denote observed values and the associated random variables, respectively). This choice of threshold is hydrologically insignificant, in the sense that evapotranspiration rates in this area are well in excess of 0.5 <inline-formula><mml:math id="M25" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> in almost all months of the year, so that precipitation amounts below this can be considered as effectively zero: see, for example, Fig. 1 of <xref ref-type="bibr" rid="bib1.bibx88" id="text.36"/>, noting that the rates shown there are in units of mm per week.</p>
      <p id="d2e756">After fitting models to the transformed values, synthetic precipitation sequences on the original scale can be produced by first simulating from the fitted model and then adding <inline-formula><mml:math id="M26" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula> to any non-zero simulated value. The resulting sequences will contain no values between <inline-formula><mml:math id="M27" display="inline"><mml:mn mathvariant="normal">0</mml:mn></mml:math></inline-formula> and <inline-formula><mml:math id="M28" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula>; in most applications, however, this is unproblematic provided that <inline-formula><mml:math id="M29" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula> is small enough to be hydrologically insignificant, as discussed above.</p>
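      <p>The thresholding transformation and its back-transformation can be sketched as follows, using the threshold of 0.5 mm adopted above. The function names and the example gauge readings are hypothetical.</p>
      <preformat><![CDATA[
```python
TAU_MM = 0.5  # threshold tau used in the text, in mm


def to_model_scale(recorded_mm, tau=TAU_MM):
    """Apply y = max(y_tilde - tau, 0) to one recorded daily value."""
    return max(recorded_mm - tau, 0.0)


def to_original_scale(simulated_mm, tau=TAU_MM):
    """Add tau back to non-zero simulated values; zeros remain zero."""
    return simulated_mm + tau if simulated_mm > 0 else 0.0


recorded = [0.0, 0.3, 0.5, 2.0, 12.5]  # hypothetical gauge readings in mm
transformed = [to_model_scale(v) for v in recorded]
restored = [to_original_scale(v) for v in transformed]
print(transformed)
print(restored)
```
]]></preformat>
      <p>Recorded values at or below the threshold map to zero and are not recovered by the back-transformation, which is why the resulting sequences contain no values between 0 and the threshold.</p>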
      <p id="d2e787">In the present application, the thresholding of trace values is designed in part to mitigate the effect of changes in recording resolution on precipitation occurrence. Its use to account for inconsistent recording practices is widespread more generally, however. For example, resolution changes may also affect the analysis of precipitation intensity, albeit to a lesser extent. One approach for mitigating this effect is to round the data to a common resolution prior to analysis <xref ref-type="bibr" rid="bib1.bibx3" id="paren.37"><named-content content-type="pre">e.g.</named-content></xref>. As noted in Sect. <xref ref-type="sec" rid="Ch1.S1"/>, however, and demonstrated via simulation in Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>, we find that such rounding can have a surprisingly large impact on fitted GLMs and other models for precipitation intensity, with particular sensitivity in the upper tails of the distribution. For most of the analyses reported below, therefore, the change in recording resolution is handled not by rounding the data, but rather by including appropriate adjustments within the fitted models.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Atmospheric covariates</title>
      <p id="d2e808">In many applications, the effects of potential changes in climate must be incorporated into synthetic precipitation sequences. This is typically achieved by conditioning on indices of large-scale atmospheric structure that are known (i) to be related to local-scale precipitation; (ii) to capture the relevant climate change signal; and (iii) to be well represented by climate models such as atmosphere-ocean global circulation models (GCMs) <xref ref-type="bibr" rid="bib1.bibx55" id="paren.38"><named-content content-type="pre">e.g.</named-content><named-content content-type="post">Sect. 11.5</named-content></xref>. Generators can therefore be calibrated using historical information on large-scale covariates, after which “future” simulations can be produced by replacing the historical covariate information with the corresponding outputs from climate model simulations. Precipitation generators that incorporate such conditioning are among the more sophisticated tools available for statistical downscaling of climate model outputs <xref ref-type="bibr" rid="bib1.bibx56" id="paren.39"/>.</p>
      <p id="d2e821">In the work reported below, the required conditioning is obtained by including atmospheric indices as covariates in the statistical models. The covariates are all derived from the ERA5 Reanalysis dataset <xref ref-type="bibr" rid="bib1.bibx40" id="paren.40"/>, averaged daily and over all grid cells with centres contained within the study area. The indices considered are 2 <inline-formula><mml:math id="M30" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> temperature, 2 <inline-formula><mml:math id="M31" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> dewpoint temperature, standardised mean sea level pressure, and 10 <inline-formula><mml:math id="M32" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> wind speed derived from the 10 <inline-formula><mml:math id="M33" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M34" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula>- and <inline-formula><mml:math id="M35" display="inline"><mml:mi>v</mml:mi></mml:math></inline-formula>-wind components. These indices are similar to those used in previous studies <xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx89 bib1.bibx16 bib1.bibx56 bib1.bibx39 bib1.bibx17" id="paren.41"/>, and are consistent with guidance on covariate selection for precipitation downscaling, which emphasises the importance of indices representing moisture availability and airflow <xref ref-type="bibr" rid="bib1.bibx85" id="paren.42"><named-content content-type="pre">e.g.</named-content></xref>.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e885">Overview of the parametric and semi-parametric modelling approaches (GLM, GAM and GAMLSS).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left" colsep="1"/>
     <oasis:colspec colnum="2" colname="col2" align="left" colsep="1"/>
     <oasis:colspec colnum="3" colname="col3" align="left" colsep="1"/>
     <oasis:colspec colnum="4" colname="col4" align="left" colsep="1"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry namest="col2" nameend="col3" align="center" colsep="1">Parametric </oasis:entry>
         <oasis:entry namest="col4" nameend="col5" align="center" colsep="0">Semi-parametric </oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">GLM</oasis:entry>
         <oasis:entry colname="col3">GAMLSS</oasis:entry>
         <oasis:entry colname="col4">GAM</oasis:entry>
         <oasis:entry colname="col5">GAMLSS</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Sect. 3.1)</oasis:entry>
         <oasis:entry colname="col3">(Sect. 3.2)</oasis:entry>
         <oasis:entry colname="col4">(Sect. 3.2)</oasis:entry>
         <oasis:entry colname="col5">(Sect. 3.2)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Occurrence model</oasis:entry>
         <oasis:entry namest="col2" nameend="col5" colsep="0">Logistic regression </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Amounts model</oasis:entry>
         <oasis:entry colname="col2">Location-only gamma model</oasis:entry>
         <oasis:entry colname="col3">Location-scale gamma model</oasis:entry>
         <oasis:entry colname="col4">Location-only gamma model</oasis:entry>
         <oasis:entry colname="col5">Location-scale gamma model</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Effect types</oasis:entry>
         <oasis:entry namest="col2" nameend="col3" colsep="1">Linear </oasis:entry>
         <oasis:entry namest="col4" nameend="col5" colsep="0">Linear and splines </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Estimation</oasis:entry>
         <oasis:entry namest="col2" nameend="col3" colsep="1">Maximum likelihood </oasis:entry>
         <oasis:entry namest="col4" nameend="col5" colsep="0">Penalised maximum likelihood </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model selection</oasis:entry>
         <oasis:entry namest="col2" nameend="col3" colsep="1">Adjusted AIC, see Sect. 3.1, Eq. (4) </oasis:entry>
         <oasis:entry namest="col4" nameend="col5" colsep="0">Resampling-based WIC, see Sect. 3.3, Eq. (9) </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Spatial dependence</oasis:entry>
         <oasis:entry namest="col2" nameend="col5" colsep="0">Latent Gaussian field linked to the marginal models via Eq. (10). </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry namest="col2" nameend="col5" colsep="0">Estimation: maximum likelihood for each pair of stations, with a Matérn covariance function fitted to the resulting pairwise estimates. See Sect. 3.4. </oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>


</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Methods</title>
      <p id="d2e1048">This section provides a brief summary of the GLM-based approach to daily precipitation modelling, and then describes the extensions that are the main contribution of the present paper. For more details of GLMs in this context, see <xref ref-type="bibr" rid="bib1.bibx17" id="text.43"/> and references therein. Table <xref ref-type="table" rid="T1"/> provides a high-level overview of the modelling approaches.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>GLMs for daily precipitation</title>
      <p id="d2e1063">As noted in Sect. <xref ref-type="sec" rid="Ch1.S1"/>, the standard GLM-based approach treats occurrence and intensity separately. Specifically, if the random variable <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the (possibly thresholded – see above) precipitation intensity for the <inline-formula><mml:math id="M37" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th case in a dataset, then the probability of a “wet” day is modelled using logistic regression:

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M38" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mtext> say</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mtext>with </mml:mtext><mml:mi>log⁡</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">η</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mtext>say</mml:mtext><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d2e1196">Here, <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> is a row vector of covariates and <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> is a corresponding column vector of coefficients: given these values, the probability of precipitation can be calculated as <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>exp⁡</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msubsup><mml:mi mathvariant="italic">η</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:msup><mml:mo>]</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>.</p>
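      <p>As an illustration of the inverse-logit calculation above, the following minimal sketch evaluates the wet-day probability for a single case. The covariate values and coefficients are hypothetical, as is the function name <monospace>wet_day_probability</monospace>; the leading 1 in the covariate vector is assumed to represent a constant term.</p>
<preformat preformat-type="code"><![CDATA[
```python
import math


def wet_day_probability(x, beta):
    """Return pi_i = 1 / (1 + exp(-eta_i)), where eta_i is the dot product x . beta."""
    eta = sum(xj * bj for xj, bj in zip(x, beta))
    return 1.0 / (1.0 + math.exp(-eta))


# Hypothetical covariate row (leading 1 for the constant term) and
# hypothetical occurrence-model coefficients.
x_i = [1.0, 0.5, -1.2]
beta_pi = [-0.3, 0.8, 0.4]
pi_i = wet_day_probability(x_i, beta_pi)
```
]]></preformat>
      <p>Applying the logit transformation to <monospace>pi_i</monospace> recovers the linear predictor, which is a convenient check on any implementation.</p>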
      <p id="d2e1280">If <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> (corresponding to a “wet” day), then the precipitation intensity is modelled as being drawn from a gamma distribution with mean <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (say) and shape parameter <inline-formula><mml:math id="M44" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula>. The mean is linked to another covariate vector <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, as

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M46" display="block"><mml:mrow><mml:mi>log⁡</mml:mi><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">η</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mtext> say</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          so that <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>exp⁡</mml:mi><mml:mo>[</mml:mo><mml:msubsup><mml:mi mathvariant="italic">η</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. The shape parameter <inline-formula><mml:math id="M48" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> is not linked to covariates, however: it is common to all cases, which implies that the standard deviations (<inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, say) of the distributions are proportional to their means with <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msqrt><mml:mi mathvariant="italic">ψ</mml:mi></mml:msqrt></mml:mrow></mml:math></inline-formula>. This is analogous to the assumption of a constant residual variance in linear regression models.</p>
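      <p>The mean and shape parameterisation of the gamma distribution can be made concrete with a short sketch. It assumes the shape and scale convention of Python's standard <monospace>random.gammavariate</monospace>, for which the mean is the product of shape and scale; the parameter values and function names are hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
import random


def sample_intensity(mu, psi, rng):
    """Draw one wet-day intensity from a gamma distribution with mean mu and shape psi."""
    # gammavariate takes (shape, scale) and has mean shape * scale,
    # so a mean of mu requires scale = mu / psi.
    return rng.gammavariate(psi, mu / psi)


def gamma_sd(mu, psi):
    """Standard deviation implied by mean mu and shape psi: sigma = mu / sqrt(psi)."""
    return mu / psi ** 0.5


rng = random.Random(1)
psi = 0.7           # hypothetical shape parameter, common to all cases
means = [2.0, 8.0]  # hypothetical means mu_i = exp(eta_i)

# The coefficient of variation sigma_i / mu_i is the same for every case.
cvs = [gamma_sd(mu, psi) / mu for mu in means]
draws = [sample_intensity(mu, psi, rng) for mu in means]
```
]]></preformat>
      <p>Both entries of <monospace>cvs</monospace> equal the reciprocal of the square root of the shape parameter, which is the proportionality between standard deviation and mean described above.</p>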
      <p id="d2e1468">The covariate vectors <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) and (<xref ref-type="disp-formula" rid="Ch1.E2"/>) need not be the same, although both of them typically include a “1” representing a constant term (so that the corresponding elements of <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> are analogous to the intercepts in linear regression models), as well as quantities representing seasonal and regional variation, transformations of previous days' precipitation (accounting for temporal dependence in precipitation sequences) and indices of large-scale atmospheric structure. Interaction terms can also be included in situations where some covariates may modulate the effects of others – for example, in the case study of Sect. 
<xref ref-type="sec" rid="Ch1.S2"/> one might reasonably anticipate temporal dependence to be weaker in summer than in winter, due to the seasonally-varying proportion of convective versus frontal precipitation events.</p>
      <p id="d2e1547">The coefficient vectors <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> can be estimated from a dataset <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>:=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mi mathvariant="normal">T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, containing daily precipitation observations together with the corresponding covariate vectors <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>. 
This estimation is typically done using maximum likelihood, under the assumption that the associated random variables <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> are conditionally independent of each other given the covariates. In this case, standard theory (and software output) also provides estimated standard errors for the coefficient estimates: these provide a way to determine whether the effect of each covariate is estimated sufficiently precisely to justify its inclusion in a model. By placing priors on the model parameters, Bayesian estimation might also be possible, as for example in <xref ref-type="bibr" rid="bib1.bibx36" id="text.44"/>. This potentially allows parameter uncertainty to be propagated into the synthetic sequences produced using the fitted models. However, given the large datasets used for estimation, the effect of parameter uncertainty is generally fairly small, particularly compared to the daily precipitation variability of interest.</p>
      <p id="d2e1674">To choose covariates, an alternative to the inspection of standard errors is to add terms to an existing model and determine whether they lead to a sufficiently large increase in log-likelihood. This can be done either by using a likelihood ratio test or by choosing the model with the smallest value of a quantity such as the Akaike Information Criterion (AIC). For a model with generic coefficient vector <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>:=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mi mathvariant="normal">T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, the AIC is defined as

            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M62" display="block"><mml:mrow><mml:mrow class="chem"><mml:mi mathvariant="normal">AIC</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>k</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the log-likelihood constructed using the data <inline-formula><mml:math id="M64" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> and evaluated at the maximum likelihood estimate, <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> say, of <inline-formula><mml:math id="M66" display="inline"><mml:mi mathvariant="bold-italic">θ</mml:mi></mml:math></inline-formula> (the reason for the notation will become clear below). By this criterion, a “good” model is one with a high log-likelihood (and hence a good fit to the data) despite having a small number <inline-formula><mml:math id="M67" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> of parameters: for further justification of the precise definition, see the Supplement.</p>
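      <p>A comparison of two nested models using the AIC definition above might look as follows. The log-likelihood values and parameter counts are invented purely for illustration.</p>
<preformat preformat-type="code"><![CDATA[
```python
def aic(log_likelihood, k):
    """Akaike Information Criterion: -2 log L + 2k, with k the number of parameters."""
    return -2.0 * log_likelihood + 2.0 * k


# Hypothetical maximised log-likelihoods for a base model and for an
# extension that adds two covariates.
base = aic(log_likelihood=-1050.0, k=5)
extended = aic(log_likelihood=-1046.5, k=7)

# The model with the smaller AIC is preferred.
preferred = "extended" if extended < base else "base"
```
]]></preformat>
      <p>Here the two extra parameters raise the log-likelihood by 3.5, which outweighs the additional penalty, so the extended model is preferred.</p>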
      <p id="d2e1812">The use of likelihood ratio tests or AIC-type comparisons allows a structured approach to model-building in which simple models are gradually extended by adding related groups of covariates, at each stage evaluating the expanded model using a combination of formal comparisons and less formal model diagnostics, such as residual plots, that are designed to assess whether the modelling assumptions are broadly satisfied: see <xref ref-type="bibr" rid="bib1.bibx17" id="text.45"/>. However, the associated calculations are valid only under the conditional independence assumption underpinning the calculation of the (log-)likelihoods themselves. For the analysis of data from a single spatial location, this assumption can be justified provided that the covariate vectors contain appropriate information on previous days' precipitation values <xref ref-type="bibr" rid="bib1.bibx20" id="paren.46"/>, but it is usually unrealistic in a multisite context: in the case study of Sect. <xref ref-type="sec" rid="Ch1.S2"/>, for example, large-scale frontal weather systems are likely to affect most or all sites simultaneously. To account for the resulting dependence in a multisite context, the standard errors and likelihood ratio tests can be adjusted as described in <xref ref-type="bibr" rid="bib1.bibx18" id="text.47"/>. The AIC can be adjusted similarly (see the Supplement), as

            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M68" display="block"><mml:mrow><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">AIC</mml:mi></mml:mrow><mml:mtext>adj</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mfenced open="[" close="]"><mml:mrow><mml:mo>-</mml:mo><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mtext> trace </mml:mtext><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold">GH</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M69" display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula> is the negative Hessian matrix of the log-likelihood at its maximum and <inline-formula><mml:math id="M70" display="inline"><mml:mi mathvariant="bold">G</mml:mi></mml:math></inline-formula> is the covariance matrix of the gradient vector there: both of these quantities also feature in the adjustments to standard errors and likelihood ratio tests, and can be estimated straightforwardly as by-products of the usual algorithm for maximising the log-likelihood for a GLM. The quantity in Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>) is sometimes called the <italic>network information criterion</italic> or NIC <xref ref-type="bibr" rid="bib1.bibx26" id="paren.48"><named-content content-type="pre">e.g.</named-content><named-content content-type="post">Sect. 4.7</named-content></xref>. In passing, we note that these adjustments to standard errors, likelihood ratio tests and AIC can also be applied in the absence of inter-site dependence. In that case <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mi mathvariant="bold">G</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold">H</mml:mi></mml:mrow></mml:math></inline-formula>, so that all of them reduce to the “usual” definitions that apply when the log-likelihood is specified correctly: for the AIC, the trace term then equals the number of parameters and Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) is recovered.</p>
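      <p>The adjusted criterion can be computed in a few lines once estimates of the two matrices are available. The sketch below assumes NumPy and uses small invented matrices; it does not show how the matrices are estimated, and the inflation of the gradient covariance by a factor of two is purely hypothetical.</p>
<preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def adjusted_aic(log_likelihood, G, H):
    """Adjusted AIC: 2 * [-log L + trace(G H^{-1})]."""
    G = np.asarray(G, dtype=float)
    H = np.asarray(H, dtype=float)
    # trace(G H^{-1}) equals trace(H^{-1} G) by the cyclic property of the
    # trace, so solve H X = G instead of forming the inverse explicitly.
    penalty = np.trace(np.linalg.solve(H, G))
    return 2.0 * (-log_likelihood + penalty)


# Hypothetical two-parameter example.
H = np.array([[4.0, 1.0], [1.0, 3.0]])
G_dependent = 2.0 * H  # hypothetical: gradient covariance inflated by dependence

aic_indep = adjusted_aic(-1050.0, H, H)           # G = H, so the penalty is k = 2
aic_dep = adjusted_aic(-1050.0, G_dependent, H)   # the penalty doubles to 4
```
]]></preformat>
      <p>With equal matrices the result coincides with the unadjusted AIC for two parameters, whereas the inflated gradient covariance yields a larger penalty for the same log-likelihood.</p>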
      <p id="d2e1924">Having selected appropriate sets of covariates, estimated the coefficient vectors <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> and carried out diagnostic checks on the fitted model, synthetic precipitation sequences can be generated one day at a time, first by sampling the wet/dry status of each site according to the probabilities defined by the occurrence model Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) and then, for wet sites, sampling their intensities from the corresponding gamma distributions. In a multisite setting, the generation of realistic sequences requires that the sampling is done in such a way as to respect the inter-site dependence structure: for example, on days when one location experiences precipitation, it is likely that neighbouring locations will also be wet. Inter-site dependence in occurrence and intensity is usually handled separately. 
For occurrence, <xref ref-type="bibr" rid="bib1.bibx89" id="text.49"/> modelled the distribution of the number of wet sites (on the grounds that in a small region the sites are usually either all wet or all dry) in such a way as to be consistent with the wet-day probabilities determined by the GLM; while, for application to larger regions, <xref ref-type="bibr" rid="bib1.bibx2" id="text.50"/> used inter-site dependence models based on thresholded latent Gaussian fields, which allow for a decay in the strength of dependence with inter-site distance but can be challenging to estimate when the dependence is very strong. Other approaches also exist: for example, <xref ref-type="bibr" rid="bib1.bibx43" id="text.51"/> used a non-homogeneous hidden Markov model, accounting for spatial dependence directly in the model. Inter-site dependence in intensities is more straightforward and, for GLM-based approaches, is usually handled via a Gaussian geostatistical model for the <italic>Anscombe residuals</italic>, which are defined in such a way as to have a distribution that is very nearly Gaussian: this approach can also be regarded as corresponding to the use of (approximate) Gaussian copulas <xref ref-type="bibr" rid="bib1.bibx89" id="paren.52"/>. Approaches based on latent multivariate Gaussians or latent Gaussian fields have also been used, for example, in <xref ref-type="bibr" rid="bib1.bibx86 bib1.bibx51 bib1.bibx49" id="text.53"/>.</p>
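      <p>The day-by-day generation scheme can be sketched for a single site, where inter-site dependence does not arise. The sketch assumes that the only covariates are a constant term and an indicator of whether the previous day was wet (one hypothetical choice of transformation of previous days' precipitation); all coefficient values and the function name <monospace>simulate_site</monospace> are invented for illustration, and the multisite dependence models discussed above are deliberately omitted.</p>
<preformat preformat-type="code"><![CDATA[
```python
import math
import random


def simulate_site(n_days, beta_pi, beta_mu, psi, seed=0):
    """Simulate a single-site daily sequence: occurrence first, then intensity."""
    rng = random.Random(seed)
    sequence = []
    wet_yesterday = 0.0  # assume a dry day before the start of the simulation
    for _ in range(n_days):
        # Covariates: constant term and previous-day wet indicator.
        x = [1.0, wet_yesterday]
        eta_pi = sum(a * b for a, b in zip(x, beta_pi))
        pi = 1.0 / (1.0 + math.exp(-eta_pi))
        if rng.random() < pi:
            # Wet day: draw from a gamma distribution with mean mu and shape psi.
            mu = math.exp(sum(a * b for a, b in zip(x, beta_mu)))
            amount = rng.gammavariate(psi, mu / psi)
        else:
            amount = 0.0
        sequence.append(amount)
        wet_yesterday = 1.0 if amount > 0 else 0.0
    return sequence


# Hypothetical coefficients and shape parameter.
series = simulate_site(365, beta_pi=[-0.8, 1.5], beta_mu=[1.0, 0.3], psi=0.7, seed=42)
```
]]></preformat>
      <p>Because each day's covariates depend on the previous simulated value, the sequence must be generated sequentially rather than all at once.</p>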
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Extensions: GLMs to GAMLSS</title>
      <p id="d2e1988">GLM-based models for daily precipitation have two main limitations: first, the restriction to linear combinations of covariates in Eqs. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) and (<xref ref-type="disp-formula" rid="Ch1.E2"/>); and second, the assumption of a constant shape parameter <inline-formula><mml:math id="M74" display="inline"><mml:mi mathvariant="italic">ψ</mml:mi></mml:math></inline-formula> in the gamma distributions for precipitation intensity. The GAMLSS framework of <xref ref-type="bibr" rid="bib1.bibx70" id="text.54"/> can potentially address both of these issues. A summary is provided here: <xref ref-type="bibr" rid="bib1.bibx75" id="text.55"/> and <xref ref-type="bibr" rid="bib1.bibx76" id="text.56"/> give further details.</p>
      <p id="d2e2012">In GAMLSS, the linearity assumption is relaxed by allowing smooth functions of covariates as well as linear combinations. Thus the respective equivalents of Eqs. (<xref ref-type="disp-formula" rid="Ch1.E1"/>) and (<xref ref-type="disp-formula" rid="Ch1.E2"/>) are, in their simplest forms,

            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M75" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi>log⁡</mml:mi><mml:mfenced close=")" open="("><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi mathvariant="normal">…</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi mathvariant="italic">π</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi 
mathvariant="italic">π</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi mathvariant="italic">π</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          and

            <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M76" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi>log⁡</mml:mi><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mspace width="0.25em" linebreak="nobreak"/><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced open="(" close=")"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi mathvariant="normal">…</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi mathvariant="italic">μ</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi 
mathvariant="italic">μ</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d2e2284">In each of these expressions, <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> is a row vector of covariates with effects that are represented linearly as before, with coefficient vector <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (the asterisk “<sup>∗</sup>” denoting either “<inline-formula><mml:math id="M80" display="inline"><mml:mi mathvariant="italic">π</mml:mi></mml:math></inline-formula>” or “<inline-formula><mml:math id="M81" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>” as appropriate). 
The terms <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msubsup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are smooth functions of the covariates <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mo>∗</mml:mo></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, which are typically represented semiparametrically using flexible collections of spline basis functions that allow for the data-driven representation of nonlinear effects. Various families of spline basis are available, although our experience is consistent with the received wisdom that the precise choice is relatively unimportant. 
With one exception, the results reported below use cubic splines <xref ref-type="bibr" rid="bib1.bibx87" id="text.57"><named-content content-type="post">Chapter 5</named-content></xref> as implemented in the <monospace>gamlss</monospace> <xref ref-type="bibr" rid="bib1.bibx75" id="paren.58"/> and <monospace>bamlss</monospace>
<xref ref-type="bibr" rid="bib1.bibx79" id="paren.59"/> packages in the <sans-serif>R</sans-serif> programming environment <xref ref-type="bibr" rid="bib1.bibx65" id="paren.60"/>. The exception is the representation of seasonal effects, for which cyclic splines are used to represent the repeating seasonal cycle – again, as implemented in the cited software.</p>
      <p id="d2e2503">As written, Eqs. (<xref ref-type="disp-formula" rid="Ch1.E5"/>) and (<xref ref-type="disp-formula" rid="Ch1.E6"/>) do not allow explicitly for smooth functions of more than one covariate, or for situations in which the linear coefficient of one covariate (i.e. the corresponding element of <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) varies smoothly with the value of another so that (e.g.) <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>:=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in an obvious notation. The representation of systematic regional variation is a canonical situation in which one might wish to consider two covariates (i.e. latitude and longitude) simultaneously: in this case, a bivariate spline basis of the two covariates can be specified, either as a tensor product or as a thin-plate spline <xref ref-type="bibr" rid="bib1.bibx87" id="text.61"><named-content content-type="post">Chapter 5</named-content></xref>, the latter of which is used in this work. 
These constructions are all implemented in the <monospace>gamlss</monospace> and <monospace>bamlss</monospace> software packages, as are the mixed linear-spline terms required for situations where <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> varies smoothly with <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e2609">To relax the assumption of a constant shape parameter, GAMLSS allow both the mean and standard deviation (as well as, in principle, the skewness and kurtosis) of a distribution to depend explicitly on covariates. Thus, instead of the simple relationship <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msqrt><mml:mi mathvariant="italic">ψ</mml:mi></mml:msqrt></mml:mrow></mml:math></inline-formula> in a gamma GLM for daily precipitation intensity, GAMLSS represent the standard deviations as

            <disp-formula id="Ch1.E7" content-type="numbered"><label>7</label><mml:math id="M89" display="block"><mml:mrow><mml:mi>log⁡</mml:mi><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold-italic">β</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>+</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mi mathvariant="italic">σ</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi>x</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mi mathvariant="italic">σ</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e2753">In the work reported here, the gamma distributional assumption for wet-day rainfall intensities is retained, so that the implied shape parameter for the <inline-formula><mml:math id="M90" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th case in the dataset is <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ψ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">μ</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>/</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>. The complete structure of the daily precipitation sequence at a single location is therefore determined by Eqs. (<xref ref-type="disp-formula" rid="Ch1.E5"/>)–(<xref ref-type="disp-formula" rid="Ch1.E7"/>).</p>
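      <p>As a minimal numerical sketch of this reparameterisation (not the code used in this work), the following Python fragment converts log-scale predictors for the mean and standard deviation into a gamma shape and scale. The function name <monospace>gamma_parameters</monospace> and the input values are assumptions chosen purely for illustration.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def gamma_parameters(log_mu, log_sigma):
    """Map log-scale predictors for the mean and standard deviation
    to the shape and scale of a gamma distribution."""
    mu = math.exp(log_mu)        # mean, from the log link
    sigma = math.exp(log_sigma)  # standard deviation, from the log link
    shape = (mu / sigma) ** 2    # psi_i = mu_i^2 / sigma_i^2
    scale = sigma ** 2 / mu      # chosen so that shape * scale = mu
    return shape, scale


# Hypothetical predictor values for one wet day: mean 4 mm, s.d. 2 mm.
shape, scale = gamma_parameters(math.log(4.0), math.log(2.0))
mean = shape * scale           # gamma mean, recovers mu
variance = shape * scale ** 2  # gamma variance, recovers sigma^2
```
]]></preformat>
      <p>The shape and scale returned here reproduce the requested mean and variance, which is the sense in which the shape parameter is implied by the two predictors.</p>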
      <p id="d2e2796">The increased flexibility of GAMLSS compared with GLMs comes at a price: given sufficiently rich collections of splines, almost any model can be made to fit a dataset almost perfectly, simply by specifying very “wiggly” functions <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> that (almost) interpolate the observations. Techniques such as maximum likelihood estimation therefore tend to yield model fits that are over-optimised to the data at hand and, consequently, are unsuitable for exploring potential variation beyond what has been observed (which, fundamentally, is the purpose of a precipitation generator). To overcome this, GAMLSS are usually fitted by maximising a penalised log-likelihood of the form

            <disp-formula id="Ch1.E8" content-type="numbered"><label>8</label><mml:math id="M93" display="block"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi>P</mml:mi><mml:mo>:=</mml:mo><mml:mi>log⁡</mml:mi><mml:msub><mml:mi>L</mml:mi><mml:mtext>PEN</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mtext> say</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          in which the “penalty” <inline-formula><mml:math id="M94" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula> is typically constructed using the integrated squared second derivatives of the functions <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>. The rationale for this is that the second derivatives measure curvature: wiggly functions have large integrated squared second derivatives, so that the penalty discourages wiggly model fits.</p>
      <p id="d2e2924">Equation (<xref ref-type="disp-formula" rid="Ch1.E8"/>) represents a compromise in model fitting: the <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> term encourages fidelity to the available observations, while the penalty <inline-formula><mml:math id="M97" display="inline"><mml:mi>P</mml:mi></mml:math></inline-formula> rewards models in which the functions <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> are smooth. The tradeoff between these two criteria is determined via <italic>smoothing parameters</italic> for each of the <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msubsup><mml:mi>s</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mo>∗</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> which, in principle, must be chosen by the analyst. 
In practice, however, most modern software packages (including the <monospace>gamlss</monospace> and <monospace>bamlss</monospace> packages used here) incorporate automated, data-driven smoothing parameter selection, essentially by treating the smoothing parameters as auxiliary quantities to be estimated as described in <xref ref-type="bibr" rid="bib1.bibx71" id="text.62"/>.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Model selection with multisite GAMLSS</title>
      <p id="d2e3039">GAMLSS do not have to include semiparametric terms: parametric GAMLSS for precipitation intensity would retain just the linear components of Eqs. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) and (<xref ref-type="disp-formula" rid="Ch1.E7"/>). In this case, as for GLMs, model selection can be done using likelihood ratio tests and AIC-type comparisons, with <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">AIC</mml:mi></mml:mrow><mml:mtext>adj</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (defined in Eq. <xref ref-type="disp-formula" rid="Ch1.E4"/>) used to account for inter-site dependence where necessary.</p>
      <p id="d2e3060">For semiparametric models, however, the situation is more complicated. This is partly because AIC requires that models are fitted by maximising a log-likelihood rather than a penalised log-likelihood – although, in principle (see Supplement), a criterion such as <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">AIC</mml:mi></mml:mrow><mml:mtext>adj</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is applicable more broadly. In practice, however, the matrices <inline-formula><mml:math id="M102" display="inline"><mml:mi mathvariant="bold">G</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M103" display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>) must be estimated from the data and, if they are large, sampling errors in their estimation can compromise the accuracy of any associated inference: <xref ref-type="bibr" rid="bib1.bibx47" id="text.63"/> give an example of this effect. Semiparametric GAMLSS usually employ large numbers of spline basis functions for each smooth term. For these models, therefore, the corresponding matrices <inline-formula><mml:math id="M104" display="inline"><mml:mi mathvariant="bold">G</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M105" display="inline"><mml:mi mathvariant="bold">H</mml:mi></mml:math></inline-formula> are themselves large, and the criterion in Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>) cannot be estimated accurately.</p>
      <p id="d2e3111">For the semiparametric GAMLSS we therefore use an alternative model selection criterion, which avoids this difficulty: the WIC <xref ref-type="bibr" rid="bib1.bibx46 bib1.bibx74" id="paren.64"/>. The underlying motivation is the same as that for AIC and <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mrow class="chem"><mml:mi mathvariant="normal">AIC</mml:mi></mml:mrow><mml:mtext>adj</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, but WIC uses resampling and does not depend on poorly estimated quantities. Specifically, let <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>∗</mml:mo></mml:msup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> be a collection of resampled datasets that can each be regarded as drawn from the same joint distribution as the observations <inline-formula><mml:math id="M108" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> (see below). Then, in the current context, WIC is defined as

            <disp-formula id="Ch1.E9" content-type="numbered"><label>9</label><mml:math id="M109" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mrow class="chem"><mml:mi mathvariant="normal">WIC</mml:mi></mml:mrow><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>log⁡</mml:mi><mml:msub><mml:mi>L</mml:mi><mml:mtext>PEN</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mfenced close="]" open="["><mml:mrow><mml:msub><mml:mover accent="true"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>PEN</mml:mtext></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>PEN</mml:mtext></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" 
stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover><mml:mtext>PEN</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>;</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> denotes an average of penalised log-likelihoods computed over the resampled datasets.</p>
      <p id="d2e3297">In Eq. (<xref ref-type="disp-formula" rid="Ch1.E9"/>), the “adjustment” in square brackets is a difference of two terms. The first is an average of maximised penalised log-likelihoods for each of the resampled datasets; the second, however, is an average of penalised log-likelihoods for the observations <inline-formula><mml:math id="M111" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>, but evaluated at each of the resample-based estimates <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. The WIC thus aims to compensate for the extent to which the penalised log-likelihood <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:msub><mml:mi>L</mml:mi><mml:mtext>PEN</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>y</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is likely to be overoptimistic as a performance measure in alternative datasets generated from the same distribution as the observations <inline-formula><mml:math id="M114" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. The Supplement gives further details.</p>
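      <p>Once the penalised log-likelihoods are available, the arithmetic of Eq. (<xref ref-type="disp-formula" rid="Ch1.E9"/>) is straightforward. The following Python sketch assumes that the model has already been refitted to each resampled dataset by some external routine; the function name <monospace>wic</monospace>, its argument names and the numerical values are hypothetical and serve only to show how the terms combine.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from statistics import fmean


def wic(pen_loglik_obs, pen_loglik_own, pen_loglik_on_obs):
    """Evaluate Eq. (9) from penalised log-likelihood values.

    pen_loglik_obs    : value for the fit to the observations y
    pen_loglik_own    : one value per resample, each fit scored on its own y*
    pen_loglik_on_obs : the same fits, scored on the observations y
    """
    if len(pen_loglik_own) != len(pen_loglik_on_obs):
        raise ValueError("need one pair of values per resampled dataset")
    # Bracketed adjustment: average optimism of the resample-based fits.
    adjustment = fmean(pen_loglik_own) - fmean(pen_loglik_on_obs)
    return -2.0 * pen_loglik_obs + 2.0 * adjustment


# Made-up values for three resamples, for illustration only.
value = wic(-1000.0, [-990.0, -995.0, -985.0], [-1010.0, -1012.0, -1008.0])
```
]]></preformat>
      <p>In this made-up example the adjustment is 20 log-likelihood units, so the criterion rises from 2000 to 2040; as with AIC, smaller values indicate a preferred model.</p>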
      <p id="d2e3364">The WIC requires that the resampled datasets <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>∗</mml:mo></mml:msup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> can be regarded as drawn from the same distribution as the observations <inline-formula><mml:math id="M116" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. Standard bootstrap resampling – sampling individual observations with replacement – is not appropriate for multisite precipitation data, because it destroys the inter-site and temporal dependence structure present in each day's observations. Thus, alternative resampling approaches are required: Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/> describes the approach used here. Broadly, the idea is to resample individual days instead of individual observations, whilst using additional stratification and adjustments to account for covariate dependence, to handle missing observations, and to ensure that resampled datasets have the same structure as the original one. Specifically, the procedure ensures that both the full datasets and their dry and wet subsets contain the same number of observations and of days with recorded values as the original dataset.</p>
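      <p>The core idea of resampling whole days can be sketched as follows. This is a deliberately simplified illustration: it omits the stratification and adjustments described in the Appendix, so it does not by itself guarantee matching numbers of observations. The function name <monospace>resample_days</monospace> and the small record are hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import random


def resample_days(observations, rng):
    """Draw whole days with replacement; each draw keeps all sites together."""
    days = sorted(observations)
    drawn = [rng.choice(days) for _ in days]
    return [(day, list(observations[day])) for day in drawn]


# Hypothetical record: three sites, four days (values in mm; None = missing).
record = {
    "2001-01-01": [0.0, 0.0, 1.2],
    "2001-01-02": [3.4, 2.8, 5.1],
    "2001-01-03": [0.0, None, 0.0],
    "2001-01-04": [7.9, 6.0, 8.3],
}
resampled = resample_days(record, random.Random(42))
```
]]></preformat>
      <p>Because each draw copies a complete day, the values at all sites on that day stay together, which is what preserves the inter-site dependence within each day.</p>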
      <p id="d2e3391">Finally, other resampling-based model selection criteria have been proposed, for example, by <xref ref-type="bibr" rid="bib1.bibx15" id="text.65"/> whose criterion is asymptotically equivalent to the WIC <xref ref-type="bibr" rid="bib1.bibx74" id="paren.66"/>. The work reported below uses WIC; we have also examined the Cavanaugh-Shumway criterion and found that it yields very similar conclusions.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Generating synthetic sequences: revisiting inter-site dependence</title>
      <p id="d2e3409">GAMLSS can be used to produce synthetic multisite precipitation sequences in the same way as was described for GLMs in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>, using Eqs. (<xref ref-type="disp-formula" rid="Ch1.E5"/>)–(<xref ref-type="disp-formula" rid="Ch1.E7"/>) to determine the daily occurrence probabilities and intensity distributions, and with similar adjustments for capturing inter-site dependence. However, as outlined in Sect. <xref ref-type="sec" rid="Ch1.S1"/>, there is potential to improve on these options by considering dependence in occurrence and intensities simultaneously when simulating multisite precipitation.</p>
      <p id="d2e3420">To address this issue, in a similar spirit to <xref ref-type="bibr" rid="bib1.bibx7" id="text.67"/> we use transformed Gaussian fields or, equivalently, Gaussian copulas <xref ref-type="bibr" rid="bib1.bibx62 bib1.bibx72" id="paren.68"/>. Denoting by <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>:=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>Y</mml:mi><mml:mtext>St</mml:mtext></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mi mathvariant="normal">T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> the random vector representing precipitation at <inline-formula><mml:math id="M118" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> sites on day <inline-formula><mml:math id="M119" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> of a simulation, the elements of <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are constructed from a corresponding vector <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>:=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>Q</mml:mi><mml:mtext>St</mml:mtext></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of standard normal random variables with covariance matrix <inline-formula><mml:math id="M122" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula>, as

            <disp-formula id="Ch1.E10" content-type="numbered"><label>10</label><mml:math id="M123" display="block"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable rowspacing="0.2ex" class="cases" columnspacing="1em" columnalign="left left" framespacing="0em"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mtext>if </mml:mtext><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msubsup><mml:mi>F</mml:mi><mml:mtext>st</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>otherwise</mml:mtext><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e3629">Here, <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is the probability of non-zero precipitation as obtained from the occurrence model Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>); <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msubsup><mml:mi>F</mml:mi><mml:mtext>st</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the inverse cumulative distribution function (CDF) of the gamma distribution for wet-day amounts defined by Eqs. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) and (<xref ref-type="disp-formula" rid="Ch1.E7"/>); and <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the CDF of the standard normal distribution. It is easily verified that under Eq. 
(<xref ref-type="disp-formula" rid="Ch1.E10"/>), <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>≤</mml:mo><mml:mi>y</mml:mi><mml:mo>|</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> so that the procedure yields precipitation with the correct marginal (i.e. site-specific) distributions at each site.</p>
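      <p>The mapping in Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) can be illustrated with a minimal Python sketch, which is not the implementation used in this study. The names <monospace>q_to_precip</monospace> and <monospace>exponential_inv_cdf</monospace> are hypothetical and the parameter values are illustrative. As a simplifying assumption, wet-day amounts follow an exponential distribution (a gamma distribution with unit shape) so that the inverse CDF has a closed form; any other inverse CDF, such as that of a fitted gamma distribution, could be passed in its place.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal; cdf() plays the role of Phi


def q_to_precip(q, pi_wet, inv_cdf_wet):
    """Map a standard normal value q to precipitation as in Eq. (10).

    pi_wet      -- probability of a wet day (pi_st), in (0, 1]
    inv_cdf_wet -- inverse CDF of wet-day amounts (F_st^{-1}), a callable
    """
    u = STD_NORMAL.cdf(q)
    if u < 1.0 - pi_wet:
        return 0.0  # censored case: simulated dry day
    # Rescale the remaining probability mass to the unit interval
    return inv_cdf_wet((u - (1.0 - pi_wet)) / pi_wet)


def exponential_inv_cdf(mean):
    """Inverse CDF of an exponential distribution (assumption: gamma, shape 1)."""
    return lambda p: -mean * math.log1p(-p)


if __name__ == "__main__":
    rng = random.Random(1)
    pi_wet, mean_wet = 0.4, 5.0  # illustrative values, not fitted
    inv_cdf = exponential_inv_cdf(mean_wet)
    y = [q_to_precip(rng.gauss(0.0, 1.0), pi_wet, inv_cdf)
         for _ in range(100_000)]
    wet = [v for v in y if v > 0.0]
    print("wet-day proportion:", len(wet) / len(y))
    print("wet-day mean:", sum(wet) / len(wet))
```
]]></preformat>
      <p>With a large number of draws, the simulated wet-day proportion and wet-day mean should be close to the values supplied, which is the marginal property stated above.</p>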
      <p id="d2e3762">The first line of Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) – yielding a simulated dry day – corresponds to a situation in which <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mtext>st</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is <italic>censored</italic>, meaning that its precise value is unknown: it is only known not to exceed <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. The <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> can be regarded as normalised quantile residuals <xref ref-type="bibr" rid="bib1.bibx28" id="paren.69"/>, censored on dry days. There is, moreover, a connection with existing representations of inter-site dependence in precipitation intensities: if <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> so that the first condition in Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) is never met, then the transformation in the second line is similar in spirit to the use of transformed Anscombe residuals for this purpose by <xref ref-type="bibr" rid="bib1.bibx89" id="text.70"/>. 
The current proposal can therefore be seen as a natural extension of, and a way of linking, apparently distinct approaches from the literature.</p>
      <p id="d2e3848">Under Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>), inter-site dependence in the precipitation <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is induced by the covariance matrix <inline-formula><mml:math id="M134" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> of the Gaussian random vector <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. In particular, if <inline-formula><mml:math id="M136" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> is the identity matrix, then <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> contains independent standard normal random variables, and Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) produces independent precipitation values from the combined occurrence/intensity model at each site. To generate realistic multi-site precipitation sequences, however, non-zero off-diagonal elements of the matrix <inline-formula><mml:math id="M138" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> must usually be estimated from the data <inline-formula><mml:math id="M139" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. Following standard practice for GLM-based precipitation generators, this is done after fitting the marginal occurrence and intensity models Eqs. 
(<xref ref-type="disp-formula" rid="Ch1.E5"/>)–(<xref ref-type="disp-formula" rid="Ch1.E7"/>): thus, when estimating <inline-formula><mml:math id="M140" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula>, the quantities <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">μ</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> are considered known.</p>
      <p id="d2e3974">In most situations involving more than a few sites, estimation of <inline-formula><mml:math id="M144" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> is complicated by the fact that there are typically few, if any, periods during which data from all sites are simultaneously available. This can be handled <xref ref-type="bibr" rid="bib1.bibx86" id="paren.71"><named-content content-type="pre">e.g.</named-content></xref> by estimating correlations separately for each pair of sites using all dates for which both are operational. The resulting matrix of inter-site correlations is not, however, guaranteed to be positive definite. To address this, we subsequently fit a spatial correlation model to the pairwise estimates. This ensures a valid correlation matrix, and also provides correlations for pairs of locations that have non-overlapping records or are ungauged. We use a Matérn correlation function for this purpose, due to its flexibility in capturing a wide range of correlation behaviours <xref ref-type="bibr" rid="bib1.bibx64" id="paren.72"/>. According to this model, the residual correlation between two locations separated by a distance <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:mi>d</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> is

            <disp-formula id="Ch1.E11" content-type="numbered"><label>11</label><mml:math id="M146" display="block"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="italic">ν</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>d</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mn mathvariant="normal">2</mml:mn><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">ν</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msqrt><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">ν</mml:mi></mml:mrow></mml:msqrt><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>d</mml:mi><mml:mi mathvariant="italic">ρ</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mi mathvariant="italic">ν</mml:mi></mml:msup><mml:msub><mml:mi>K</mml:mi><mml:mi mathvariant="italic">ν</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:msqrt><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="italic">ν</mml:mi></mml:mrow></mml:msqrt><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi>d</mml:mi><mml:mi mathvariant="italic">ρ</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the gamma function and <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mi mathvariant="italic">ν</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> the modified Bessel function of the second kind. The form of the correlations is determined by the three parameters <inline-formula><mml:math id="M149" display="inline"><mml:mi mathvariant="italic">ν</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M150" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M151" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>, which respectively control the smoothness of the residual fields, the distance-dependent rate of correlation decay, and the “nugget” effect representing the limiting correlation at small non-zero distances <xref ref-type="bibr" rid="bib1.bibx4" id="paren.73"/>. In the work reported below, the parameters are estimated from the pairwise inter-site correlations using weighted least squares, weighting the correlation between each pair of sites according to the number of days' data from which it was calculated. This weighting ensures that the correlation model fit is not influenced unduly by pairwise correlations that are estimated imprecisely.</p>
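      <p>A weighted least-squares fit of this kind can be sketched in Python as follows. This is an illustration under stated assumptions rather than the procedure used in this study: the smoothness <italic>ν</italic> is restricted to the values 0.5, 1.5 and 2.5, for which the standard Matérn correlation has a closed form that avoids the Bessel function; the range <italic>ρ</italic> is chosen by grid search; and the function names, distances, weights and grid are all hypothetical.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def matern(d, rho, alpha, nu):
    """Standard Matern correlation for nu in {0.5, 1.5, 2.5}.

    For these half-integer values the Bessel-function form reduces to a
    polynomial times an exponential, so no special functions are needed.
    """
    z = math.sqrt(2.0 * nu) * d / rho
    if nu == 0.5:
        poly = 1.0
    elif nu == 1.5:
        poly = 1.0 + z
    elif nu == 2.5:
        poly = 1.0 + z + z * z / 3.0
    else:
        raise ValueError("closed form implemented only for nu = 0.5, 1.5, 2.5")
    return alpha * poly * math.exp(-z)


def fit_matern_wls(dists, corrs, weights, rho_grid, nus=(0.5, 1.5, 2.5)):
    """Weighted least-squares fit by grid search over (nu, rho).

    For fixed (nu, rho) the model is linear in alpha, so the best alpha has
    a closed form; it is clipped to (0, 1] so that it remains a correlation.
    Returns (weighted_sse, nu, rho, alpha) for the best combination found.
    """
    best = None
    for nu in nus:
        for rho in rho_grid:
            base = [matern(d, rho, 1.0, nu) for d in dists]
            num = sum(w * r * b for w, r, b in zip(weights, corrs, base))
            den = sum(w * b * b for w, b in zip(weights, base))
            alpha = min(max(num / den, 1e-6), 1.0)
            sse = sum(w * (r - alpha * b) ** 2
                      for w, r, b in zip(weights, corrs, base))
            if best is None or sse < best[0]:
                best = (sse, nu, rho, alpha)
    return best


if __name__ == "__main__":
    # Synthetic pairwise correlations generated from a known model
    dists = [2.0, 5.0, 10.0, 20.0, 40.0]        # hypothetical distances
    corrs = [matern(d, 30.0, 0.9, 1.5) for d in dists]
    weights = [9000, 12000, 4000, 15000, 7000]  # days of overlapping data
    rho_grid = [5.0 * k for k in range(1, 21)]
    sse, nu, rho, alpha = fit_matern_wls(dists, corrs, weights, rho_grid)
    print(nu, rho, round(alpha, 3))
```
]]></preformat>
      <p>Because the synthetic correlations are generated from the model itself, the search should recover the generating values of <italic>ν</italic> and <italic>ρ</italic> when they lie on the grid. In practice a continuous optimiser over all three parameters would normally replace the grid.</p>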
      <p id="d2e4141">For schemes that treat dependence in occurrence and intensities separately, separate correlation models are needed for each component. For intensities, the pairwise correlations to which Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>) is fitted can be calculated directly from the relevant (transformed) residuals <xref ref-type="bibr" rid="bib1.bibx89" id="paren.74"><named-content content-type="pre">e.g.</named-content></xref>; while those for occurrence can be inferred from the proportion of days for which both sites are simultaneously wet <xref ref-type="bibr" rid="bib1.bibx3" id="paren.75"><named-content content-type="pre">e.g.</named-content></xref>. For the combined occurrence/intensity scheme Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>), however, a single correlation model is needed, and neither of these approaches is feasible, because there are invariably days for which one site in a pair is wet while the other is dry. We thus estimate the pairwise correlations using maximum likelihood as described in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>: although this requires numerical optimisation for each pair of sites, it is a principled approach that is generally applicable and can in theory extend beyond bivariate estimation.</p>
      <p id="d2e4160">Having estimated the correlation matrix <inline-formula><mml:math id="M152" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula>, synthetic multisite precipitation sequences can be generated one day at a time: on day <inline-formula><mml:math id="M153" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> of simulation a vector <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Q</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of standard normal variates with correlation matrix <inline-formula><mml:math id="M155" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> is simulated, and then converted to a corresponding precipitation vector <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> via the transformation Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>).</p>
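      <p>The first step of this procedure, drawing a vector of correlated standard normal variates, can be sketched as follows. The Cholesky-based approach shown is one common way of doing this and is an assumption here, since the text does not specify the sampling method; the function names and the 3 × 3 correlation matrix are illustrative only.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math
import random


def cholesky(sigma):
    """Lower-triangular factor L with L L^T = sigma (positive definite)."""
    n = len(sigma)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                pivot = sigma[i][i] - s
                if pivot <= 0.0:
                    raise ValueError("matrix is not positive definite")
                low[i][j] = math.sqrt(pivot)
            else:
                low[i][j] = (sigma[i][j] - s) / low[j][j]
    return low


def simulate_q(low, rng):
    """One draw of Q_t ~ N(0, sigma), given the Cholesky factor of sigma."""
    z = [rng.gauss(0.0, 1.0) for _ in low]
    return [sum(low[i][k] * z[k] for k in range(i + 1))
            for i in range(len(low))]


if __name__ == "__main__":
    sigma = [[1.0, 0.8, 0.5],
             [0.8, 1.0, 0.7],
             [0.5, 0.7, 1.0]]  # illustrative correlation matrix
    low = cholesky(sigma)
    rng = random.Random(42)
    draws = [simulate_q(low, rng) for _ in range(50_000)]
    # Empirical correlation of sites 0 and 1 (zero means, unit variances)
    r01 = sum(q[0] * q[1] for q in draws) / len(draws)
    print(round(r01, 2))
```
]]></preformat>
      <p>Each simulated vector would then be converted to precipitation, element by element, via Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>).</p>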
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Application to the Blackwater catchment</title>
      <p id="d2e4218">It is of interest to determine whether GAMLSS offer improved performance compared with GLMs when generating synthetic precipitation sequences; and, if so, whether the improvement is due to the use of semiparametric rather than parametric covariate effects or to the use of a non-constant shape parameter in the intensity model (or both). To explore these questions, we compare the performance of four precipitation generators fitted to the Blackwater data from Sect. <xref ref-type="sec" rid="Ch1.S2"/>. The first of these uses GLMs, with a parametric representation of covariate effects in both occurrence and intensity models and a constant shape parameter in the latter; while the second is a GAM (see Sect. <xref ref-type="sec" rid="Ch1.S1"/>) that allows a semiparametric representation of covariate effects and that, for present purposes, can be regarded as a special case of a GAMLSS in which the covariates influence only the location parameter of the intensity distribution. These models are then extended, respectively, to parametric and semiparametric GAMLSS, allowing both the location and scale parameters of the intensity distribution to depend on covariates. This approach, starting with and then extending “location-only” models, follows the “parameter-hierarchy” recommended in <xref ref-type="bibr" rid="bib1.bibx75" id="text.76"/>.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e4230">Monthly summary statistics for 16-site average daily precipitation series. The black line shows the observed values, whilst the blue shaded areas correspond to intervals containing, respectively 30 %, 50 %, 70 % and 90 % of the distribution of simulation-derived values from the parametric GLM. Statistics: monthly mean, monthly standard deviation, proportion of wet days, conditional mean (mean on rainy days), conditional standard deviation (standard deviation on rainy days), monthly maximum, autocorrelation at lag 1, autocorrelation at lag 2.</p></caption>
        <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f02.png"/>

      </fig>

<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Model-building strategy</title>
      <p id="d2e4246">Each model is constructed using a structured approach <xref ref-type="bibr" rid="bib1.bibx16" id="paren.77"/>. The parametric GLM and semiparametric GAM are constructed independently of each other, in each case starting with a simple baseline model and successively adding terms in perceived order of importance; see Appendix <xref ref-type="sec" rid="App1.Ch1.S4"/>. The final GLM and GAM for precipitation intensity are then taken as starting points for the modelling of intensity standard deviations in the respective GAMLSS. Throughout, multiple effect forms (i.e. parametric forms and/or spline specifications) for potential covariates as well as interactions (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>) are tested; and alternative formulations are compared using the adjusted AIC (<xref ref-type="disp-formula" rid="Ch1.E4"/>) and WIC (<xref ref-type="disp-formula" rid="Ch1.E9"/>) for the parametric and semiparametric models, respectively. For the semiparametric models, both parametric and spline-based effects are always tested. To represent seasonal variation, sine and cosine functions of the day of the year are used in the parametric models, and cyclic splines in the semiparametric ones. Systematic regional variation is captured using functions of latitude, longitude, and altitude. For the parametric models, latitude–longitude effects are represented by Legendre polynomials; for the semiparametric models, a thin-plate bivariate spline in latitude and longitude is used instead. Following <xref ref-type="bibr" rid="bib1.bibx90" id="text.78"/>, the change in measurement resolution (see Sect.
<xref ref-type="sec" rid="Ch1.S2"/>) is accounted for via a binary covariate taking the value 0 for all observations before 1970 and 1 for all subsequent observations: the effect is to provide an adjustment within the models for any systematic changes arising from the coarser resolution of the data in the early part of the record, thus avoiding the need for data preprocessing that may have unintended consequences (see Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>).</p>
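      <p>As a minimal sketch of how two of these covariates might be constructed for the parametric models, the Python function below returns first-order seasonal harmonics and the binary resolution indicator for a given date. The function name, the dictionary keys and the period of 365.25 days are assumptions made for illustration; the fitted models may use additional harmonics or a different period.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math
from datetime import date


def seasonal_and_resolution_covariates(day, change_year=1970, period=365.25):
    """Return illustrative covariates for one calendar date.

    sin_doy, cos_doy -- first-order harmonics of the day of the year
    post_change      -- 0 before change_year, 1 from change_year onwards
    """
    doy = day.timetuple().tm_yday
    angle = 2.0 * math.pi * doy / period
    return {
        "sin_doy": math.sin(angle),
        "cos_doy": math.cos(angle),
        "post_change": 1 if day.year >= change_year else 0,
    }


if __name__ == "__main__":
    for d in (date(1969, 12, 31), date(1970, 1, 1), date(2022, 7, 1)):
        print(d.isoformat(), seasonal_and_resolution_covariates(d))
```
]]></preformat>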
      <p id="d2e4268">After model selection, standard checks on residuals <xref ref-type="bibr" rid="bib1.bibx75" id="text.79"><named-content content-type="post">Chapter 12</named-content></xref> are carried out as an initial assessment that the models appear to capture most of the systematic structure in the data, before carrying out more extensive tests via simulation (see next section). Finally, for each of the four options (GLM, GAM, parametric and semiparametric GAMLSS) a residual correlation matrix <inline-formula><mml:math id="M157" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> is estimated as described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS4"/>, for use in the inter-site dependence model Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>).</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Simulation settings</title>
      <p id="d2e4296">The performance of each precipitation generator is assessed by comparing statistics of the observed data over the period 1959–2022 with those for each of 19 simulated sequences over the same period. The number of simulations is limited by the computation time for the GAMLSS as implemented in the <monospace>bamlss</monospace> package: this is discussed further in Sect. <xref ref-type="sec" rid="Ch1.S5"/> below.  Each statistic is calculated separately for every simulated sequence, thus yielding a distribution of 19 values: if the precipitation generator is realistic, then the corresponding statistic computed from observations should lie within the range of this distribution 90 % of the time on average, since it has a <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> chance of being less (resp. greater) than the minimum (resp. maximum) from the simulations.</p>
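      <p>The 90 % figure follows from a rank argument: if the observed statistic and the 19 simulated values are exchangeable, the observed value is equally likely to occupy any of the 20 ranks, and only the lowest and highest ranks place it outside the simulated range. The Python sketch below, with hypothetical function names, computes this probability and checks it by Monte Carlo simulation, using a standard normal distribution as an arbitrary stand-in for the distribution of the statistic.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import random


def range_coverage(n_sims):
    """Probability that one exchangeable value lies inside the range of
    n_sims others: 1 - 2 / (n_sims + 1)."""
    return 1.0 - 2.0 / (n_sims + 1)


def monte_carlo_coverage(n_sims, n_rep, rng):
    """Estimate the same probability by repeated sampling."""
    hits = 0
    for _ in range(n_rep):
        sims = [rng.gauss(0.0, 1.0) for _ in range(n_sims)]
        obs = rng.gauss(0.0, 1.0)
        if min(sims) < obs < max(sims):
            hits += 1
    return hits / n_rep


if __name__ == "__main__":
    print(range_coverage(19))  # exact value for 19 simulations
    print(monte_carlo_coverage(19, 20_000, random.Random(0)))
```
]]></preformat>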
      <p id="d2e4316">Each simulation is initialised with the first three observed days at each station. Simulations are carried out only on days and at stations where observations are available. When several consecutive observations are missing for a station, the simulation is reinitialised once new observed values become available, using these as the new lagged inputs. This guarantees a like-for-like comparison of simulations and observations. Much of the subsequent analysis will focus on the subset of 16 stations that report through most of the study period. The observations used for initialisation are not used in the comparison exercise, as the simulations contain no corresponding values.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e4321">As Fig. <xref ref-type="fig" rid="F2"/> but for the semiparametric GAMLSS. </p></caption>
          <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f03.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Reproduction of key precipitation statistics</title>
      <p id="d2e4340">For each month of the year, Figs. <xref ref-type="fig" rid="F2"/> and <xref ref-type="fig" rid="F3"/> compare the observed values of several key summary statistics with those derived from simulations of the parametric GLM and semiparametric GAMLSS (similar figures for the parametric GAMLSS and semi-parametric GAM can be found in the Supplement). These statistics all relate to the daily precipitation series averaged over the 16 stations reporting through most of the 1959–2022 period.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e4349">Quantile-quantile plots comparing observed and simulated precipitation distributions for both the parametric <bold>(a)</bold> and semiparametric <bold>(b)</bold> location (GLM/GAM, red) and location-scale (GAMLSS, blue) models. Distributions are pooled across all stations and across the entire study period 1959–2022. Vertical lines and points indicate ranges and median values, respectively, from 19 simulations.</p></caption>
          <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f04.png"/>

        </fig>

      <p id="d2e4364">In both figures, most of the observed statistics fall within the 90 % range of the simulations: this indicates good performance of both the occurrence and intensity models in the parametric and semiparametric cases. Results for the parametric GAMLSS and semiparametric GAM are similar (see Supplement). The overall mean is well captured across all models; there is, however, some indication that the conditional (i.e. wet-day) mean might be underestimated. The proportion of wet days and the standard deviation statistics (both overall and conditional) seem to be better captured by the semiparametric GAMLSS than by the parametric models. For the proportion of wet days, this is most likely due to the more flexible representation of covariate effects in the GAMLSS framework, since the GAMLSS-based modelling of standard deviations is not used in the occurrence model. By contrast, the improvement in representing the standard deviations is also seen for the parametric GAMLSS (see Supplement), which suggests that it comes from the ability to model variation in the shape parameter of the gamma distributions. In Sect. S4 of the Supplement we further analyse the persistence of dry and wet spells, which we find to be well reproduced by all four precipitation generators.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e4370">Quantile-quantile plots comparing observed and simulated precipitation distributions for both the parametric location model (GLM, red) and the parametric location–scale model (GAMLSS, blue) for all four seasons. Distributions are pooled across all stations and across the entire study period 1959–2022. Vertical lines and points indicate ranges and median values, respectively, from 19 simulations.</p></caption>
          <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS4">
  <label>4.4</label><title>Daily precipitation distributions</title>
      <p id="d2e4387">Next, we focus on the generators' ability to reproduce the distribution of daily precipitation values, both overall and by season. This is done via quantile-quantile plots, in which the ordered values from each simulation (from smallest to largest) are plotted against those from the observations as in Fig. <xref ref-type="fig" rid="F4"/>. Here, the daily values from all sites throughout the study period are pooled, and each panel shows the results from a location-only model (i.e. a GLM or a GAM) and a location–scale GAMLSS: covariate effects are represented parametrically in Fig. <xref ref-type="fig" rid="F4"/>a but semiparametrically in Fig. <xref ref-type="fig" rid="F4"/>b, and each vertical line indicates the range of values across the 19 simulations.</p>
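      <p>The construction of such a plot can be sketched in Python as follows. The function name <monospace>qq_envelope</monospace> and the toy data are hypothetical, and the sketch assumes that every simulated sample has the same size as the observed sample, as is the case when simulations are produced only where observations are available.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from statistics import median


def qq_envelope(observed, simulations):
    """Pair each observed order statistic with the minimum, median and
    maximum of the corresponding order statistics across simulations."""
    obs_sorted = sorted(observed)
    sims_sorted = [sorted(s) for s in simulations]
    if any(len(s) != len(obs_sorted) for s in sims_sorted):
        raise ValueError("each simulation must match the observed sample size")
    envelope = []
    for rank, obs_value in enumerate(obs_sorted):
        at_rank = [s[rank] for s in sims_sorted]
        envelope.append(
            (obs_value, min(at_rank), median(at_rank), max(at_rank))
        )
    return envelope


if __name__ == "__main__":
    observed = [3.0, 0.0, 1.0]          # toy daily values
    simulations = [[0.0, 2.0, 1.0],
                   [4.0, 0.0, 0.5],
                   [0.0, 1.5, 3.0]]     # three toy simulations
    for row in qq_envelope(observed, simulations):
        print(row)  # (observed, sim_min, sim_median, sim_max)
```
]]></preformat>
      <p>Plotting the simulated median against the observed value at each rank, with a vertical line from the simulated minimum to the simulated maximum, gives a display of the kind described above.</p>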
      <p id="d2e4396">Figure <xref ref-type="fig" rid="F4"/> shows that overall, all four generators capture the rainfall distribution well: the ranges of ordered values from the simulations encompass the line of equality, and the medians of the 19 simulated values at each point fall close to this line. The ranges of simulated values are broadly similar for precipitation amounts up to around 50 <inline-formula><mml:math id="M159" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula> (this is around the 99.96th percentile of the observed distribution); but there are larger differences in the upper tail of the distribution, where the GAMLSS yield less between-run variability, whilst still encompassing the line of equality. These results are also consistent when aggregating precipitation on a weekly scale (not shown).</p>
      <p id="d2e4409">The good reproduction of the overall precipitation distribution in Fig. <xref ref-type="fig" rid="F4"/> is consistent with the GLM-based results of <xref ref-type="bibr" rid="bib1.bibx89" id="text.80"/>, who nonetheless found systematic differences between observed and simulated distributions in individual seasons. To investigate this further, Fig. <xref ref-type="fig" rid="F5"/> compares the observed and simulated precipitation distributions from the two parametric precipitation generators (GLM and parametric location–scale GAMLSS) by season. Here, both sets of simulations tend systematically to overestimate the upper tail of the precipitation distribution in winter and, to a lesser extent, to underestimate it in summer. The GLM-based result here is similar to that reported by <xref ref-type="bibr" rid="bib1.bibx89" id="text.81"/>. Some improvement can be seen for the GAMLSS: in winter (DJF) in particular, the amount of overestimation is reduced in some simulations. Nonetheless, the improvement appears relatively small, since the medians of the simulated quantiles are similar in the GLM and the GAMLSS.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e4425">As Fig. <xref ref-type="fig" rid="F5"/> but for the semiparametric location model (GAM, red) and location–scale model (GAMLSS, blue).</p></caption>
          <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f06.png"/>

        </fig>

      <p id="d2e4436">Figure <xref ref-type="fig" rid="F6"/> shows the corresponding plots for the semiparametric models. The results are broadly comparable with those in Fig. <xref ref-type="fig" rid="F5"/>. The semiparametric GAMLSS simulations again show some improvement in summer and winter, but the overall biases are still present. This overall pattern also persists when focusing on individual months rather than entire seasons: see, for example, Fig. S3 in the Supplement, which shows results for January. As with the winter comparisons above, the simulations tend to overestimate the upper tail of the distribution, although this is slightly less pronounced for the semiparametric GAMLSS than for the other three generators.</p>
      <p id="d2e4443">Taken together, these results suggest that modest improvements in the representation of seasonal precipitation tails can be gained by relaxing the assumption of a constant shape parameter although this does not fully resolve the issue. The more flexible representation of potentially nonlinear covariate effects compared with a standard GLM does not seem to improve tail behaviour much. We note, however, that whilst seasonal tail deviations are apparent in the pooled distribution of single-site rainfall studied above, no such deviation is seen in the distribution of catchment-averaged rainfall for any of the models (see Figs. S4 and S5 in the Supplement). This indicates that for most hydrological applications all weather generator models studied here will have satisfactory performance in terms of extremes, and stands in contrast to the result in <xref ref-type="bibr" rid="bib1.bibx89" id="text.82"/> who found seasonal tail deviations in the time series of average rainfall for GLM-based models. This is probably due either to different treatment of recording resolution (see Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>) or to the use of different atmospheric covariates: this work uses ERA5-derived variables, whilst <xref ref-type="bibr" rid="bib1.bibx89" id="text.83"/> used an index of the North Atlantic Oscillation (NAO).</p>
</sec>
<sec id="Ch1.S4.SS5">
  <label>4.5</label><title>Distributions of annual maxima</title>
      <p id="d2e4462">An alternative way to characterise the upper tail behaviour is via extreme value theory <xref ref-type="bibr" rid="bib1.bibx25" id="paren.84"/>, which is often used to study the expected frequency of rare events for purposes such as flood risk assessment. In particular, a common summary of potential extremal behaviour is the shape parameter, often denoted as <inline-formula><mml:math id="M160" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>, of a generalized extreme value (GEV) distribution fitted to a collection of annual maxima. If <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, as often reported for daily precipitation <xref ref-type="bibr" rid="bib1.bibx48" id="paren.85"><named-content content-type="pre">e.g.</named-content></xref>, then the distribution of extremes is heavy-tailed which implies a potential for as-yet-unobserved extreme events to greatly exceed historical values, whereas if <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> then the distribution is light-tailed, and if <inline-formula><mml:math id="M163" display="inline"><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> then the distribution has a finite upper bound.</p>
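      <p>These three regimes can be made concrete using the GEV quantile function, sketched below in Python. The location and scale values are illustrative rather than fitted, and the function names are hypothetical; the quantile of probability 1 − 1/<italic>T</italic> is used as the <italic>T</italic>-year return level.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import math


def gev_quantile(p, mu, sigma, xi):
    """Quantile of the GEV distribution (location mu, scale sigma, shape xi)."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    y = -math.log(p)
    if xi == 0.0:
        return mu - sigma * math.log(y)  # Gumbel limit
    return mu + sigma * (y ** (-xi) - 1.0) / xi


def gev_upper_bound(mu, sigma, xi):
    """Finite upper end point when xi < 0; infinite otherwise."""
    return mu - sigma / xi if xi < 0.0 else math.inf


if __name__ == "__main__":
    mu, sigma = 30.0, 8.0  # illustrative values, not fitted
    for xi in (-0.2, 0.0, 0.2):
        levels = [gev_quantile(1.0 - 1.0 / T, mu, sigma, xi)
                  for T in (10, 100, 1000)]
        print(xi, [round(v, 1) for v in levels],
              gev_upper_bound(mu, sigma, xi))
```
]]></preformat>
      <p>For the negative shape parameter the return levels approach the finite bound, whereas for the positive shape parameter they grow much more quickly with the return period.</p>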
      <p id="d2e4517">To investigate the extremal behaviour of each precipitation generator over the period 1959–2022, the <monospace>extRemes</monospace> package <xref ref-type="bibr" rid="bib1.bibx34" id="paren.86"/> is used to fit GEV distributions to the annual maxima of selected time series derived from each simulation: for each series, the mean and standard deviation of the 19 resulting estimates of <inline-formula><mml:math id="M164" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> are compared with the estimate and standard error derived from the corresponding observations. This standard error is based on large-sample theory and aims to approximate the standard deviation of estimates obtained under repeated sampling: thus it should be comparable with the standard deviation of the simulation-based estimates. We follow the advice of <xref ref-type="bibr" rid="bib1.bibx11" id="text.87"/> and standardize the annual maxima prior to fitting: this does not affect the shape of the distribution.</p>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e4539">Shape parameters of GEV distributions fitted to observed and simulated annual maxima, for selected series of daily precipitation over the period 1959–2022. Simulation-derived estimates are from the semiparametric GAM and GAMLSS. Values in parentheses are standard errors: those for the observation-based estimates are based on large-sample approximations, while those for the simulation-based estimates are the empirical standard deviations across simulations.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left" colsep="1"/>
     <oasis:colspec colnum="3" colname="col3" align="right" colsep="1"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right" colsep="1"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Observed</oasis:entry>
         <oasis:entry rowsep="1" namest="col4" nameend="col5" align="center" colsep="1">GAM </oasis:entry>
         <oasis:entry rowsep="1" namest="col6" nameend="col7" align="center">GAMLSS </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">Parametric</oasis:entry>
         <oasis:entry colname="col5">Semi-parametric</oasis:entry>
         <oasis:entry colname="col6">Parametric</oasis:entry>
         <oasis:entry colname="col7">Semi-parametric</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Individual sites</oasis:entry>
         <oasis:entry colname="col2">S003</oasis:entry>
         <oasis:entry colname="col3">0.171</oasis:entry>
         <oasis:entry colname="col4">0.058</oasis:entry>
         <oasis:entry colname="col5">0.110</oasis:entry>
         <oasis:entry colname="col6">0.081</oasis:entry>
         <oasis:entry colname="col7">0.092</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.089)</oasis:entry>
         <oasis:entry colname="col4">(0.108)</oasis:entry>
         <oasis:entry colname="col5">(0.111)</oasis:entry>
         <oasis:entry colname="col6">(0.087)</oasis:entry>
         <oasis:entry colname="col7">(0.090)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">S005</oasis:entry>
         <oasis:entry colname="col3">0.193</oasis:entry>
         <oasis:entry colname="col4">0.092</oasis:entry>
         <oasis:entry colname="col5">0.028</oasis:entry>
         <oasis:entry colname="col6">0.081</oasis:entry>
         <oasis:entry colname="col7">0.095</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.154)</oasis:entry>
         <oasis:entry colname="col4">(0.107)</oasis:entry>
         <oasis:entry colname="col5">(0.139)</oasis:entry>
         <oasis:entry colname="col6">(0.170)</oasis:entry>
         <oasis:entry colname="col7">(0.128)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">S041</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M165" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.083</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M166" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.098</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M167" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.083</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M168" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.059</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M169" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.066</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.062)</oasis:entry>
         <oasis:entry colname="col4">(0.122)</oasis:entry>
         <oasis:entry colname="col5">(0.064)</oasis:entry>
         <oasis:entry colname="col6">(0.131)</oasis:entry>
         <oasis:entry colname="col7">(0.081)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">S102</oasis:entry>
         <oasis:entry colname="col3">0.028</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M170" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col5">0.051</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M171" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.016</oasis:entry>
         <oasis:entry colname="col7">0.070</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.114)</oasis:entry>
         <oasis:entry colname="col4">(0.142)</oasis:entry>
         <oasis:entry colname="col5">(0.103)</oasis:entry>
         <oasis:entry colname="col6">(0.133)</oasis:entry>
         <oasis:entry colname="col7">(0.156)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" colsep="1">Mean of sites with </oasis:entry>
         <oasis:entry colname="col3">0.094</oasis:entry>
         <oasis:entry colname="col4">0.071</oasis:entry>
         <oasis:entry colname="col5">0.084</oasis:entry>
         <oasis:entry colname="col6">0.076</oasis:entry>
         <oasis:entry colname="col7">0.091</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" colsep="1">over 30 years of reporting </oasis:entry>
         <oasis:entry colname="col3">(0.083)</oasis:entry>
         <oasis:entry colname="col4">(0.097)</oasis:entry>
         <oasis:entry colname="col5">(0.089)</oasis:entry>
         <oasis:entry colname="col6">(0.116)</oasis:entry>
         <oasis:entry colname="col7">(0.096)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e4885">Histograms of the proportion of rainy stations on a given day for the observations <bold>(a)</bold> and the simulations from the semiparametric GAMLSS <bold>(b)</bold>.</p></caption>
          <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f07.png"/>

        </fig>

      <p id="d2e4900">Table <xref ref-type="table" rid="T2"/> reports the results of this exercise for a representative set of time series: for four individual stations, and for the 16-site average. The observation- and simulation-derived estimates are all in good agreement: in particular, the point estimates of <inline-formula><mml:math id="M172" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> are (slightly) positive for most of the considered series. This confirms that the extremal behaviours of the simulations and observations are qualitatively similar, despite the generators not being calibrated directly to reproduce such behaviour.</p>
</sec>
<sec id="Ch1.S4.SS6">
  <label>4.6</label><title>Inter-site dependence</title>
      <p id="d2e4920">In their original study of precipitation in this catchment, <xref ref-type="bibr" rid="bib1.bibx89" id="text.88"/> found that inter-site dependence was extremely strong due to the relatively small size of the area. It is challenging to develop simulation strategies that can capture this, particularly when separate dependence models are used for precipitation occurrence and intensity: correlation-based dependence structures for occurrence are hard to identify in this case, because sites are typically either all wet or all dry on most days. To address this, <xref ref-type="bibr" rid="bib1.bibx89" id="text.89"/> introduced an indirect approach for capturing dependence in precipitation occurrence, based on the distribution of the number of wet sites on each day. It is possible, however, that correlation-based dependence structures <italic>can</italic> be used for small catchments by modelling dependence in occurrence and intensity simultaneously (e.g. via Eq. <xref ref-type="disp-formula" rid="Ch1.E10"/>) because, in this case, correlations learned predominantly from precipitation intensity are implicitly shared with the occurrence model.</p>
      <p id="d2e4934">To investigate this, we here evaluate the inter-site dependence structure in our simulations. The results presented are produced using the semiparametric GAMLSS; results obtained from the other models are not shown but are similar.</p>
<sec id="Ch1.S4.SS6.SSS1">
  <label>4.6.1</label><title>Occurrence</title>
      <p id="d2e4944">Following <xref ref-type="bibr" rid="bib1.bibx89" id="text.90"/>, we investigate dependence in precipitation occurrence by studying the distribution of the proportion of sites that are wet on each day. Figure <xref ref-type="fig" rid="F7"/> shows histograms of this distribution for the observations (Fig. <xref ref-type="fig" rid="F7"/>a) and simulations (Fig. <xref ref-type="fig" rid="F7"/>b). Here, the data from all 19 simulations have been pooled to produce a single distribution.</p>
      <p id="d2e4956">Figure <xref ref-type="fig" rid="F7"/>a shows that on around 80 % of days in the observations, the proportion of wet sites is either above 0.9 or below 0.1. This corresponds to very strong inter-site dependence, as noted above. The simulations reproduce the shape of the histogram well (Fig. <xref ref-type="fig" rid="F7"/>b), albeit with slightly fewer days (around 76 %) in these outer histogram bins. The difference between the histograms is more pronounced at the right-hand end, where the proportion of wet sites exceeds 0.9 on 27.5 % of days in the observations but on 24.6 % of days in the simulations. This discrepancy is unlikely to be important in many applications: in situations where it <italic>is</italic> important, however, it would be preferable to use an inter-site dependence model, such as that of <xref ref-type="bibr" rid="bib1.bibx89" id="text.91"/>, that is specifically designed for this purpose.</p>
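      <p>The summary statistics quoted above can be computed along the following lines. This is a minimal Python sketch under assumed conventions (a days-by-sites array of wet/dry indicators with missing values coded as NaN, and a hypothetical function name <monospace>outer_bin_fractions</monospace>); it is not the code used for the analysis.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np


def outer_bin_fractions(wet, lower=0.1, upper=0.9):
    """Fractions of days on which the proportion of wet sites is extreme.

    `wet` is a (days x sites) array: 1 = wet, 0 = dry, NaN = not reporting.
    Returns the fraction of days with a wet proportion below `lower` and
    the fraction with a wet proportion above `upper`. Days on which no
    site reports are ignored.
    """
    wet = np.asarray(wet, dtype=float)
    n_reporting = np.sum(~np.isnan(wet), axis=1)
    has_data = n_reporting > 0
    prop_wet = np.nansum(wet[has_data], axis=1) / n_reporting[has_data]
    return float(np.mean(prop_wet < lower)), float(np.mean(prop_wet > upper))


# Toy example with five days and four sites (hypothetical values).
toy = np.array(
    [
        [1, 1, 1, 1],
        [0, 0, 0, 0],
        [1, 0, np.nan, 1],
        [1, 1, 1, np.nan],
        [0, 0, 0, 1],
    ],
    dtype=float,
)
frac_dry, frac_wet = outer_bin_fractions(toy)
print(frac_dry, frac_wet)
```
]]></preformat>
      <p>For pooled simulations, the same function can be applied to the simulated arrays stacked along the day axis.</p>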
</sec>
<sec id="Ch1.S4.SS6.SSS2">
  <label>4.6.2</label><title>Intensity</title>
      <p id="d2e4977">We next examine inter-site dependence in intensities. The mean Pearson correlation between pairs of stations when both are wet is 0.823 in the observations and 0.815 in the simulations (standard deviation across simulation runs: 0.0072). Across all pairs of stations, the mean absolute difference between observed and simulated correlations is 0.055. This deviation most likely arises because the Matérn assumption of homogeneous correlation decay with distance is not fully realistic for such a small area (see Fig. 7 in the Supplement, which plots estimated bivariate correlations against the Matérn-fitted ones). Overall, the dependence is slightly underestimated but generally well captured.</p>
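      <p>A statistic of this kind can be computed as in the following minimal Python sketch. The function name <monospace>mean_wet_pair_correlation</monospace>, the wet-day threshold and the minimum number of jointly wet days are hypothetical choices made for illustration, and the data are synthetic; the sketch is not the code used for the analysis.</p>
      <preformat preformat-type="code"><![CDATA[
```python
from itertools import combinations

import numpy as np


def mean_wet_pair_correlation(precip, wet_threshold=0.0, min_days=10):
    """Mean Pearson correlation over site pairs, using jointly wet days only.

    `precip` is a (days x sites) array of daily amounts; NaN marks missing
    values, which never count as wet. Pairs with fewer than `min_days`
    jointly wet days are skipped.
    """
    precip = np.asarray(precip, dtype=float)
    correlations = []
    for i, j in combinations(range(precip.shape[1]), 2):
        both_wet = (precip[:, i] > wet_threshold) & (precip[:, j] > wet_threshold)
        if both_wet.sum() >= min_days:
            r = np.corrcoef(precip[both_wet, i], precip[both_wet, j])[0, 1]
            correlations.append(r)
    return float(np.mean(correlations))


# Synthetic data: a shared regional signal scaled by site-specific noise.
rng = np.random.default_rng(0)
regional = rng.gamma(1.0, 5.0, size=(500, 1))
local = rng.gamma(4.0, 0.25, size=(500, 4))
amounts = regional * local
amounts[rng.random((500, 4)) < 0.3] = 0.0  # make some site-days dry
print(mean_wet_pair_correlation(amounts))
```
]]></preformat>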
      <p id="d2e4980">As noted in Sect. <xref ref-type="sec" rid="Ch1.S2"/>, precipitation in the study area is mostly associated with large-scale synoptic systems, although localised convective events occur in summer. This suggests the potential for seasonal variation in the strength of inter-site dependence, with localised summer precipitation resulting in weaker dependence. Analysis of the observations confirms this, with average pairwise intensity correlations ranging from 0.75 in summer to 0.87 in winter. This variation is not fully captured in the simulations, although the simulated inter-site correlations are indeed weakest in summer (Table <xref ref-type="table" rid="T3"/>).</p>

<table-wrap id="T3"><label>Table 3</label><caption><p id="d2e4990">Average correlations between pairs of sites for days when both are wet, 1959–2022. Simulation-based results are derived from the semiparametric GAMLSS; the “original” column uses a constant inter-site correlation matrix, while the “revised” column uses a separate matrix for each season. Figures in parentheses are standard deviations across simulations.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right" colsep="1"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry rowsep="1" namest="col3" nameend="col4" align="center">Simulated </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Season</oasis:entry>
         <oasis:entry colname="col2">Observed</oasis:entry>
         <oasis:entry colname="col3">Original</oasis:entry>
         <oasis:entry colname="col4">Revised</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Spring (MAM)</oasis:entry>
         <oasis:entry colname="col2">0.80</oasis:entry>
         <oasis:entry colname="col3">0.80 (0.011)</oasis:entry>
         <oasis:entry colname="col4">0.77 (0.023)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Summer (JJA)</oasis:entry>
         <oasis:entry colname="col2">0.75</oasis:entry>
         <oasis:entry colname="col3">0.77 (0.002)</oasis:entry>
         <oasis:entry colname="col4">0.71 (0.022)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Autumn (SON)</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.82 (0.014)</oasis:entry>
         <oasis:entry colname="col4">0.85 (0.016)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Winter (DJF)</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.84 (0.015)</oasis:entry>
         <oasis:entry colname="col4">0.87 (0.010)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Overall</oasis:entry>
         <oasis:entry colname="col2">0.82</oasis:entry>
         <oasis:entry colname="col3">0.81 (0.007)</oasis:entry>
         <oasis:entry colname="col4">0.81 (0.011)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e5114">To address this, we experiment with the use of separate Matérn correlation parameter sets for each season. As shown in the final column of Table <xref ref-type="table" rid="T3"/>, this produces a small improvement in capturing the winter and autumn correlations, but none in spring and summer. Further investigation is needed to understand this. One possibility is that there is unmodelled seasonality in the systematic regional variation of precipitation over the catchment: simple interactions between seasonal and spatial variation were not chosen in the AIC- and WIC-based model selection, but there may be potential for more complex representations to capture the effects. In addition, when correlations are stratified by season, the data records for computing bivariate correlations can become quite short, leading to larger estimation uncertainty. Finally, as indicated above, correlations might not always decay homogeneously with distance due to the size of the area, which raises questions about the use of the Matérn covariance. In particular, we also find that the variance between the estimated latent bivariate correlations is larger in summer than in winter. This will not be captured by the chosen covariance structure.</p>
</sec>
<sec id="Ch1.S4.SS6.SSS3">
  <label>4.6.3</label><title>Intensity-occurrence cross-dependence</title>
      <p id="d2e5128">“Spatial intermittence” refers to the dependence between dry sites and sites nearby that might be expected to only record small precipitation amounts. In the Supplement we analyse the distribution of precipitation amounts in the area, for days when at least one station recorded rainfall but the proportion of wet sites did not exceed 50 %. The remaining wet stations usually only record small amounts of rain in that case. We find that spatial intermittence is captured by the dependence template, meaning that it reproduces the shifted distribution of precipitation amounts, conditional on only a small proportion of the catchment being wet.</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Discussion and conclusions</title>
      <p id="d2e5142">This work has examined the use of generalized additive models for location, scale and shape (GAMLSS) to extend the class of precipitation generators based on generalised linear models (GLMs). GAMLSS allow greater flexibility in modelling than GLMs, first via spline-based representations of covariate effects and second by relaxing the assumption of a constant shape parameter in the precipitation intensity distribution, instead allowing the shape parameter itself to depend on covariates. This potentially allows for the development of improved generators in applications including flood risk assessment and water resource management: the inclusion of large-scale atmospheric predictors as covariates also renders the models suitable for downscaling climate model outputs <xref ref-type="bibr" rid="bib1.bibx56" id="paren.92"/>. In the latter context, however, a potential caveat is that semiparametric models provide data-driven estimates of covariate effects on the parameters of the precipitation distribution – hence, almost by definition, they tend to perform poorly when extrapolating beyond the range of covariates in the training data. This means that the models may not be suitable for downscaling (say) future scenarios in which the large-scale atmospheric predictor configurations differ substantially in terms of data range from those in the historical record. To some extent, this is a caveat of any statistical downscaling method <xref ref-type="bibr" rid="bib1.bibx55" id="paren.93"/>: it is perhaps more serious, however, with a semiparametric model that imposes minimal constraints on the estimated covariate effects. Further work is needed to determine how serious a problem this is likely to be in applications.</p>
      <p id="d2e5151">Although GAMLSS are reasonably well established in the statistical literature, their use as multisite precipitation generators requires the use of estimation and model selection criteria that account for inter-site dependence: we suggest that the adjusted AIC and the WIC can be used for this purpose. A further contribution is the development of an approach, within the GLM/GAMLSS framework, that accounts simultaneously for inter-site dependence in precipitation occurrence and intensity.</p>
      <p id="d2e5154">In a case study focusing on a small catchment in southern England, we find that GAMLSS-based precipitation generators help to address some of the known deficiencies of GLMs. Introducing a semiparametric representation of covariate effects allows some precipitation statistics to be captured better. By relaxing the assumption of a constant shape parameter, GAMLSS slightly improve on GLMs in representing seasonal variation in the upper tail of the precipitation distribution without adversely affecting other aspects of performance – but also without fully addressing the problems. These results suggest that, whilst adjusting the scale parameter and relaxing linearity of covariate effects – our initial hypothesis for the source of the problem – does yield modest improvements, it does not resolve the issue; adequately capturing seasonal tail behaviour may therefore require going beyond the widely-used gamma family itself, rather than adding further flexibility within it. Future work will consider distributions other than the gamma as the basis for an intensity model, perhaps considering three-parameter distributions to provide additional control over tail behaviour. In this work we briefly experimented with the three-parameter generalised gamma distribution, which provides an additional shape parameter allowing independent control over tail behaviour. We did not, however, find consistent improvements over the classical gamma distribution, and fitting proved less stable. Alternative three- or four-parameter distributions might nevertheless yield benefits: for example, one might consider the Box-Cox power exponential distribution <xref ref-type="bibr" rid="bib1.bibx69" id="paren.94"/>. 
Approaches based on extreme value theory could also be explored <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx44" id="paren.95"><named-content content-type="pre">see for example</named-content></xref>, possibly combined with a gamma model for the main body of the distribution as in <xref ref-type="bibr" rid="bib1.bibx83" id="text.96"/>. However, whilst the incorrectly captured seasonal tails of the intensity distribution are undesirable from a statistical perspective, the practical consequences of this may be less serious given the fact that the seasonal tails of the areal average intensity distribution are well represented (see Supplement).</p>
      <p id="d2e5168">In the same case study, inter-site dependence is strong due to the relatively small size of the study area. It is hard to replicate this with the techniques usually used in conjunction with GLMs and related precipitation generators: the approach proposed here builds on ideas from other types of generators. It captures the dependence well overall, albeit slightly underestimating the proportion of days for which all, or almost all, of the area is wet: for applications in which this is problematic, it would be preferable to use a model that is specifically designed to reproduce the distribution of the proportion of wet sites.</p>
      <p id="d2e5172">A further deficiency of the proposed model – along with other common inter-site dependence models – is that it does not fully capture seasonal variation in the strength of dependence arising, for example, due to the differing prevalence of convective and frontal systems in summer and winter. It is slightly surprising that this problem is not fully resolved via the use of season-specific correlation structures: further investigation of this is needed. Possibilities include expressing parameters of the Matérn covariances (or any other covariance function) as a function of seasonality, or exploring spatial-seasonal interactions within the GLM/GAMLSS itself.</p>
      <p id="d2e5175">A small but potentially important contribution of this work relates to the treatment of changes in recording resolution over time. Previously, where the issue has been considered at all, it has often been handled by preprocessing the data prior to analysis, for example, by rounding all values to the nearest 0.5 <inline-formula><mml:math id="M173" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula>. Having discovered that the properties of simulated sequences can be sensitive to the choices made in such a preprocessing step (see Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>), we propose instead to address the issue within the models themselves. This is done via the inclusion of binary covariates taking the value 0 for all observations measured to a “reference” resolution (typically the finest available) and 1 for all others: the effect is to allow within the model for any systematic changes that arise from the differences in resolution.</p>
      <p id="d2e5188">We have not so far discussed the computational cost of the GAMLSS framework. Fitting the parametric GAMLSS is fast, with speed comparable to that of fitting GLM-based weather generator models as in <monospace>Rglimclim</monospace> <xref ref-type="bibr" rid="bib1.bibx17" id="paren.97"/>. Fitting the semi-parametric models is slower: it can take several minutes on a standard laptop (Intel Quad Core i5-1145G7 @ 2.60 <inline-formula><mml:math id="M174" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">GHz</mml:mi></mml:mrow></mml:math></inline-formula> CPU, 16 <inline-formula><mml:math id="M175" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">GB</mml:mi></mml:mrow></mml:math></inline-formula> RAM), and on the order of 15–30 <inline-formula><mml:math id="M176" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">min</mml:mi></mml:mrow></mml:math></inline-formula> for the most complex models. Model selection using the WIC can also be slow for large models, due to the need to refit the model to every bootstrap sample in order to calculate the <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E9"/>). It would be helpful to investigate or develop alternative model selection criteria that can be used for semiparametric models in the presence of unresolved spatial dependence (or other forms of model mis-specification). 
Adjusted AIC-based model selection is fast, as the required terms are usually obtained as a by-product of the maximum likelihood optimization. Similarly, fitting of the spatial dependence template is efficient and highly parallelizable, although it might become restrictive for large station datasets, in which case other optimizations might be necessary. Finally, simulation from the parametric and semi-parametric GAMLSS is slow: generating 50 years of simulated data for all stations can take on the order of several hours. This is due to inefficiencies in the <monospace>gamlss</monospace> and <monospace>bamlss</monospace> packages, which are not designed for autoregressive simulation. Using specialised software, as in <xref ref-type="bibr" rid="bib1.bibx36" id="text.98"/>, simulation can be sped up considerably, to the order of seconds to minutes; however, this comes at the cost of ease of implementation for spline-based models.</p>
      <p id="d2e5255">An open question is to determine whether the results from our Blackwater case study hold more generally. Experience with the use of GLM-based precipitation generators suggests that the conclusions of <xref ref-type="bibr" rid="bib1.bibx89" id="text.99"/>, based on data for the same catchment, are indeed generally applicable across a range of climatologies and catchment sizes; and Gaussian copulas have been shown to work well for capturing dependence in precipitation intensity in larger areas <xref ref-type="bibr" rid="bib1.bibx51 bib1.bibx3" id="paren.100"><named-content content-type="pre">e.g.</named-content></xref>. Nonetheless, further experience is needed to verify that the more complex GAMLSS methodology is similarly transferable.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Influence of rounding on gamma distribution fits</title>
      <p id="d2e5278">While carrying out this research we found, as noted in Sect. <xref ref-type="sec" rid="Ch1.S1"/>, that some properties of simulated precipitation sequences were sensitive to decisions made during data preprocessing – notably the handling of inconsistencies in the recording resolution of data used to fit the models. This appendix illustrates the issue.</p>
      <p id="d2e5283">Inconsistencies in recording resolution are often handled by rounding all records to a common resolution, such as 0.5 or 0.1 <inline-formula><mml:math id="M178" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">mm</mml:mi></mml:mrow></mml:math></inline-formula>, prior to model fitting <xref ref-type="bibr" rid="bib1.bibx3" id="paren.101"/>. It is known that maximum likelihood fits of gamma distributions are sensitive to rounding errors in small values <xref ref-type="bibr" rid="bib1.bibx59" id="text.102"><named-content content-type="post">Sect. 8.3</named-content></xref>; however, the effect on the <italic>upper</italic> tail of the distribution is perhaps less widely appreciated.</p>
      <p id="d2e5305">We demonstrate this effect by simulating 100 000 values from a gamma distribution with mean 5 and dispersion 1 (these parameter choices are the nearest integers to estimates obtained from the data in our case study). Using maximum likelihood, we then fit a gamma distribution to the simulated data after applying a rounding strategy. Finally, we simulate a further 100 000 values from this fitted distribution, apply the same rounding strategy and plot the ordered values of the two rounded datasets against each other in a quantile-quantile plot. This final step provides a like-for-like comparison in determining whether the fitted distribution provides a good match to the data used to fit it, although the results reported below do not change materially if the second dataset is not rounded.</p>
      <p id="d2e5308">Two rounding strategies are considered, as follows:</p>
      <p id="d2e5312"><italic>Rounding only:</italic> round all observations to the nearest 0.5 and remove all values rounded to zero, meaning that the smallest remaining value is 0.5.</p>

      <fig id="FA1"><label>Figure A1</label><caption><p id="d2e5319">Quantile-quantile plots comparing the distributions of rounded samples from a <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> distribution with those from gamma distributions fitted to them using maximum likelihood estimation. Panel <bold>(a)</bold> uses rounding only (estimated parameters: 5.244, 0.888); panel <bold>(b)</bold> uses soft thresholding (estimated parameters: 4.778, 1.212).</p></caption>
        
        <graphic xlink:href="https://ascmo.copernicus.org/articles/12/149/2026/ascmo-12-149-2026-f08.png"/>

      </fig>

      <p id="d2e5354"><italic>Soft thresholding:</italic> round all observations to the nearest 0.5, remove all values rounded to zero, and subtract 0.49 so that the smallest remaining value is 0.01.</p>
      <p id="d2e5359">Figure <xref ref-type="fig" rid="FA1"/> shows the results. There are substantial mismatches between the upper tails of the “original” and “fitted” distributions under both rounding strategies, although in different directions. The reason is presumably that the shape parameter of a gamma distribution is strongly linked to the behaviour of its density in the lower tail where, by definition, rounding will substantially change the shape of the data distribution and where there are often many observations. This leads to biased estimates of the shape parameter when fitting the distribution, which has a knock-on effect in the upper tail where fewer observations are available to constrain the model fit.</p>
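      <p>The experiment described in this appendix can be sketched in Python as follows. This is a minimal illustration rather than the code used to produce Fig. <xref ref-type="fig" rid="FA1"/>: the function names are hypothetical, the dispersion is assumed to be the reciprocal of the gamma shape parameter (so that mean 5 and dispersion 1 correspond to shape 1 and scale 5), and upper quantiles are printed in place of a quantile-quantile plot. The numerical values depend on the random seed and on implementation details, and need not match those in the figure caption.</p>
      <preformat preformat-type="code"><![CDATA[
```python
import numpy as np
from scipy.stats import gamma

STEP = 0.5  # common recording resolution (mm)


def round_only(x):
    """Round to the nearest 0.5 and remove values rounded to zero."""
    rounded = np.round(np.asarray(x) / STEP) * STEP
    return rounded[rounded > 0]


def soft_threshold(x):
    """As round_only, then subtract 0.49 so the smallest value is 0.01."""
    return round_only(x) - 0.49


def fit_gamma(data):
    """ML fit with location fixed at zero; returns (mean, dispersion).

    Assumes dispersion = 1 / shape, as in a gamma GLM.
    """
    shape, _loc, scale = gamma.fit(data, floc=0)
    return shape * scale, 1.0 / shape


def experiment(strategy, n=100_000, mean=5.0, dispersion=1.0, seed=0):
    """Simulate, round, fit, re-simulate from the fit and round again."""
    rng = np.random.default_rng(seed)
    shape = 1.0 / dispersion
    original = strategy(rng.gamma(shape, mean / shape, size=n))
    fit_mean, fit_disp = fit_gamma(original)
    fit_shape = 1.0 / fit_disp
    refit = strategy(rng.gamma(fit_shape, fit_mean / fit_shape, size=n))
    probs = [0.5, 0.9, 0.99, 0.999]
    return fit_mean, fit_disp, np.quantile(original, probs), np.quantile(refit, probs)


for name, strategy in [("rounding only", round_only), ("soft thresholding", soft_threshold)]:
    m, d, q_orig, q_fit = experiment(strategy)
    print(f"{name}: mean = {m:.3f}, dispersion = {d:.3f}")
    print("  quantiles of rounded original:", q_orig)
    print("  quantiles of rounded refit:   ", q_fit)
```
]]></preformat>
      <p>Comparing the two sets of quantiles for each strategy gives a numerical counterpart to the quantile-quantile plots.</p>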
      <p id="d2e5364">There are several ways to address this problem. In the work above we chose to work with data at their recording resolution and to account for changes in measurement accuracy using indicators as covariates within the model. As an alternative, we note that estimation using the continuous ranked probability score (CRPS) <xref ref-type="bibr" rid="bib1.bibx35" id="paren.103"/> leads to the correct upper tail behaviour, regardless of the rounding strategy, in the synthetic examples studied (not shown). This is in line with other literature reporting that CRPS-based estimation is more robust than estimation based on log-likelihoods <xref ref-type="bibr" rid="bib1.bibx32" id="paren.104"><named-content content-type="pre">e.g.</named-content></xref>. This should be explored further in future work.</p>
</app>

<app id="App1.Ch1.S2">
  <label>Appendix B</label><title>Resampling approach for the WIC</title>
      <p id="d2e5384">The WIC requires that the resampled datasets <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>∗</mml:mo></mml:msup><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> can be regarded as drawn from the same distribution as the observations <inline-formula><mml:math id="M181" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. In simple settings, this is achieved by sampling from the observations with replacement – appealing to the standard bootstrap argument that the empirical distribution of a large number of observations will be close to the underlying data-generating distribution. For multisite precipitation, however, more care is needed to mimic sampling from the “real” data-generating process, which involves (i) dependence on covariates; (ii) residual inter-site dependence; and (iii) observations that are missing because only a subset of stations is reporting on any given day.</p>
      <p id="d2e5409">To account for dependence on covariates and residual inter-site dependence, the resampling is done by selecting full days, with replacement, from the period covered by the observations <inline-formula><mml:math id="M182" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. For each sampled day, all available precipitation values are added to the resampled dataset together with the corresponding covariates (including lagged precipitation values – which are added as covariates, rather than as additional rows in the dataset). However, this needs to be done in such a way that each resampled dataset has both the same number of days' data and the same number of individual observations as the original. Furthermore, if model selection is required for the combined occurrence/intensity model, then the same criteria should apply to the subset of resampled data containing only non-zero precipitation values.</p>
      <p id="d2e5419">To meet all of these requirements in the presence of many missing observations, we use stratified sampling in which days are split into groups (“strata”) containing the same number of non-missing observations and the same number of wet sites: days are then sampled from each stratum with replacement, such that the final resampled dataset contains the same number of days in each stratum as the original. This strategy requires that each stratum contains enough days to allow the creation of a large number of possible samples: this is the case for the dataset considered here.</p>
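      <p>The stratified scheme described above might be implemented along the following lines. The sketch is a simplified illustration in Python, not the authors' implementation: the function name, the days-by-sites array layout with NaN marking missing values, and the toy data are hypothetical. Each resampled day is placed in a position belonging to its own stratum, so that the numbers of days, of non-missing observations and of wet observations are preserved.</p>
      <preformat><![CDATA[
```python
import numpy as np


def stratified_day_resample(precip, rng):
    """Resample whole days with replacement within strata.

    precip: 2-D array (days x sites); NaN marks a missing observation.
    Days sharing the same (number of non-missing sites, number of wet sites)
    form one stratum, and each stratum keeps its original number of days.
    Returns the indices of the sampled days.
    """
    n_obs = np.sum(~np.isnan(precip), axis=1)
    n_wet = np.sum(np.nan_to_num(precip) > 0, axis=1)
    keys = np.stack([n_obs, n_wet], axis=1)
    _, stratum = np.unique(keys, axis=0, return_inverse=True)
    stratum = np.ravel(stratum)  # guard against shape differences across NumPy versions
    sampled = np.empty(len(precip), dtype=int)
    for s in np.unique(stratum):
        members = np.flatnonzero(stratum == s)
        sampled[members] = rng.choice(members, size=members.size, replace=True)
    return sampled


# Toy data (hypothetical): 200 days, 4 sites, with dry days and missing values.
rng = np.random.default_rng(0)
precip = rng.gamma(2.0, 2.0, size=(200, 4)) * (rng.random((200, 4)) < 0.5)
precip[rng.random((200, 4)) < 0.2] = np.nan

idx = stratified_day_resample(precip, rng)
resampled = precip[idx]  # covariates would be carried along with the same indices
```
]]></preformat>
      <p>In this sketch a stratum containing a single day always returns that day, which illustrates why each stratum needs to contain enough days for the resampling to be informative.</p>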
      <p id="d2e5422">With one exception, this resampling strategy allows separate model comparisons for the intensity and occurrence components of the integrated model. The exception relates to comparisons of occurrence models with and without an intercept, because all resampled datasets have the same total number of observations and the same number of wet days. Such comparisons are rarely of substantive interest, however.</p>
</app>

<app id="App1.Ch1.S3">
  <label>Appendix C</label><title>Estimation of inter-site dependence structure</title>
      <p id="d2e5433">When simulating multisite precipitation sequences, inter-site dependence is controlled via the correlation matrix <inline-formula><mml:math id="M183" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> of multivariate standard Gaussian random variates, which are subsequently transformed using Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) to yield simulated precipitation. As described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS4"/>, the elements of <inline-formula><mml:math id="M184" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> are estimated separately for each pair of sites using maximum likelihood. Here we set out the calculations needed to evaluate the corresponding log-likelihood function, which must then be optimised numerically.</p>
      <p id="d2e5454">Suppose that for a given pair of sites, the available data consist of <inline-formula><mml:math id="M185" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> pairs of observations <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">11</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">21</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>T</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>T</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. If both sites are always wet, the inverse of Eq. (<xref ref-type="disp-formula" rid="Ch1.E10"/>) yields a corresponding set of pairs <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">11</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mn mathvariant="normal">21</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>T</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>T</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn 
mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:msub><mml:mi>F</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. When <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, however, the corresponding value of <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mtext>st</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is not known exactly: all that is known is that <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>≤</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e5689">Under the proposed dependence model, the pairs <inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> are all realisations of a bivariate normal random variable <inline-formula><mml:math id="M193" display="inline"><mml:mi mathvariant="bold-italic">Q</mml:mi></mml:math></inline-formula> say, with mean vector <inline-formula><mml:math id="M194" display="inline"><mml:mn mathvariant="bold">0</mml:mn></mml:math></inline-formula> and covariance matrix <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:mfenced close=")" open="("><mml:mtable class="matrix" columnalign="center center" framespacing="0em"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd><mml:mtd><mml:mi mathvariant="italic">ρ</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="italic">ρ</mml:mi></mml:mtd><mml:mtd><mml:mn mathvariant="normal">1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M196" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> is the pairwise correlation to be estimated. Moreover, these realisations can be considered as independent if the marginal occurrence and intensity models contain an adequate representation of temporal dependence: see <xref ref-type="bibr" rid="bib1.bibx20" id="text.105"/> for more discussion of this point. 
The log-likelihood for <inline-formula><mml:math id="M197" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> is thus a sum of contributions from each of the <inline-formula><mml:math id="M198" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> pairs, as follows:</p>
      <p id="d2e5785"><italic>Both sites wet:</italic> the contribution is the log of the bivariate normal density for <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e5819"><italic>Site 1 wet, site 2 dry:</italic> in this case, <inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is known exactly but <inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is known only to be less than <inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. The log-likelihood contribution is obtained by factorising the joint density of <inline-formula><mml:math id="M203" display="inline"><mml:mi mathvariant="bold-italic">Q</mml:mi></mml:math></inline-formula> as <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, in an obvious notation. 
Under the proposed model, the marginal distribution of <inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is standard normal; while the conditional distribution of <inline-formula><mml:math id="M206" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> given <inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is normal with mean <inline-formula><mml:math id="M208" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and variance <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (this is a standard result for the bivariate normal distribution). The log-likelihood contribution is thus

          <disp-formula id="App1.Ch1.S3.Ex1"><mml:math id="M210" display="block"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>log⁡</mml:mi><mml:mi mathvariant="normal">Φ</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="italic">ρ</mml:mi><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:msqrt><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="italic">ρ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math id="M211" display="inline"><mml:mrow><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M212" display="inline"><mml:mrow><mml:mi mathvariant="normal">Φ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are, respectively, the density and distribution function of the standard normal distribution. The first term does not involve <inline-formula><mml:math id="M213" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> and can be dropped from the calculations.</p>
      <p id="d2e6140"><italic>Site 1 dry, site 2 wet:</italic> the contribution is analogous to that above, replacing <inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M216" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, respectively.</p>
      <p id="d2e6201"><italic>Both sites dry:</italic> here the contribution is <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>≤</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>≤</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. For <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> this must be evaluated numerically: we use the <monospace>mvtnorm</monospace> package for this <xref ref-type="bibr" rid="bib1.bibx33" id="paren.106"/>.</p>
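      <p>The four contributions above can be combined into a function of <italic>ρ</italic> and optimised numerically. The following Python sketch is one possible arrangement, not the implementation used in the study (which relies on the <monospace>mvtnorm</monospace> package): the function name, argument layout and toy data are hypothetical, and <monospace>scipy.stats.multivariate_normal.cdf</monospace> is used here for the bivariate normal probability. As noted above, the term that does not involve <italic>ρ</italic> is dropped in the mixed wet/dry cases.</p>
      <preformat><![CDATA[
```python
import numpy as np
from scipy import stats
from scipy.optimize import minimize_scalar


def pair_loglik(rho, wet1, wet2, q1, q2, pi1, pi2):
    """Log-likelihood for rho from one pair of sites.

    wet1, wet2: boolean arrays, True where the site is wet.
    q1, q2: transformed values; only entries at wet observations are used.
    pi1, pi2: modelled wet probabilities, giving thresholds Phi^{-1}(1 - pi).
    """
    wet1, wet2 = np.asarray(wet1, dtype=bool), np.asarray(wet2, dtype=bool)
    q1, q2 = np.asarray(q1, dtype=float), np.asarray(q2, dtype=float)
    c1 = stats.norm.ppf(1.0 - np.asarray(pi1, dtype=float))
    c2 = stats.norm.ppf(1.0 - np.asarray(pi2, dtype=float))
    s = np.sqrt(1.0 - rho**2)
    total = 0.0

    m = wet1 & wet2  # both wet: log bivariate normal density
    quad = q1[m]**2 - 2.0 * rho * q1[m] * q2[m] + q2[m]**2
    total += np.sum(-np.log(2.0 * np.pi) - np.log(s) - quad / (2.0 * s**2))

    m = wet1 & ~wet2  # site 1 wet, site 2 dry (log phi(q1) dropped)
    total += np.sum(stats.norm.logcdf((c2[m] - rho * q1[m]) / s))

    m = ~wet1 & wet2  # site 1 dry, site 2 wet (log phi(q2) dropped)
    total += np.sum(stats.norm.logcdf((c1[m] - rho * q2[m]) / s))

    m = ~wet1 & ~wet2  # both dry: bivariate normal probability
    cov = np.array([[1.0, rho], [rho, 1.0]])
    for a, b in zip(c1[m], c2[m]):
        total += np.log(stats.multivariate_normal.cdf([a, b], mean=[0.0, 0.0], cov=cov))
    return total


# Toy data (hypothetical): latent Gaussian pairs with correlation 0.6,
# censored below the thresholds implied by a constant wet probability of 0.4.
rng = np.random.default_rng(42)
T, true_rho = 300, 0.6
z = rng.multivariate_normal([0.0, 0.0], [[1.0, true_rho], [true_rho, 1.0]], size=T)
pi = np.full(T, 0.4)
threshold = stats.norm.ppf(1.0 - pi)
wet1, wet2 = z[:, 0] > threshold, z[:, 1] > threshold

fit = minimize_scalar(
    lambda r: -pair_loglik(r, wet1, wet2, z[:, 0], z[:, 1], pi, pi),
    bounds=(-0.99, 0.99), method="bounded",
)
rho_hat = fit.x
print(rho_hat)
```
]]></preformat>
      <p>The search interval is kept strictly inside (−1, 1) because the conditional standard deviation in the mixed cases vanishes at the boundaries.</p>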
      <p id="d2e6309">If required, the likelihood of the latent correlation matrix can also be derived in the general case of <inline-formula><mml:math id="M220" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> sites, using standard results for the density of a censored normal random variable <xref ref-type="bibr" rid="bib1.bibx41" id="paren.107"/>. Assume without loss of generality that among the <inline-formula><mml:math id="M221" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> stations, stations <inline-formula><mml:math id="M222" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula> are the dry stations and <inline-formula><mml:math id="M223" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>S</mml:mi></mml:mrow></mml:math></inline-formula> the wet stations. Then the log-likelihood contribution of one day <inline-formula><mml:math id="M224" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>:=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>Y</mml:mi><mml:mtext>St</mml:mtext></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mi mathvariant="normal">T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> is proportional to:

              <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M225" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.S3.E12"><mml:mtd><mml:mtext>C1</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>log⁡</mml:mi><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold">Σ</mml:mi><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∝</mml:mo><mml:mi>log⁡</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">ϕ</mml:mi><mml:mi mathvariant="normal">Σ</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mtext>St</mml:mtext></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.S3.E13"><mml:mtd><mml:mtext>C2</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi>log⁡</mml:mi><mml:msub><mml:mi mathvariant="bold">Φ</mml:mi><mml:mi mathvariant="normal">Σ</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced open="(" close=")"><mml:mrow><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi 
mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">π</mml:mi><mml:mrow><mml:mi>r</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mtext>St</mml:mtext></mml:msub></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d2e6586">Here <inline-formula><mml:math id="M226" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>r</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mtext>St</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> can be calculated as <inline-formula><mml:math id="M227" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="normal">Φ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>(</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mtext>st</mml:mtext></mml:msub><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M228" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">Φ</mml:mi><mml:mi mathvariant="normal">Σ</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">ϕ</mml:mi><mml:mi mathvariant="normal">Σ</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the CDF and density of the latent standard multivariate Gaussian. This expression, however, can be cumbersome to compute due to the high cost of evaluating conditional normal distributions.</p>
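      <p>A minimal Python sketch of Eqs. (<xref ref-type="disp-formula" rid="App1.Ch1.S3.E12"/>) and (<xref ref-type="disp-formula" rid="App1.Ch1.S3.E13"/>) for a single day is given below, under the assumption that the correlation matrix has already been permuted so that the dry stations come first. The conditional mean and covariance of the dry components given the wet ones follow from the usual partitioned-Gaussian formulae, written out in the code comments. The function name and arguments are hypothetical, and the sketch is not the authors' implementation.</p>
      <preformat><![CDATA[
```python
import numpy as np
from scipy import stats


def day_loglik(sigma, q_wet, c_dry):
    """Log-likelihood contribution of one day under a censored Gaussian model.

    sigma: (S x S) latent correlation matrix, ordered with dry stations first.
    q_wet: transformed values at the S - r wet stations.
    c_dry: thresholds Phi^{-1}(1 - pi) at the r dry stations.
    """
    sigma = np.asarray(sigma, dtype=float)
    q_wet = np.atleast_1d(np.asarray(q_wet, dtype=float))
    c_dry = np.atleast_1d(np.asarray(c_dry, dtype=float))
    r = c_dry.size
    s_dd, s_dw, s_ww = sigma[:r, :r], sigma[:r, r:], sigma[r:, r:]

    loglik = 0.0
    if q_wet.size > 0:
        # First term: joint density of the wet stations.
        loglik += stats.multivariate_normal.logpdf(
            q_wet, mean=np.zeros(q_wet.size), cov=s_ww)
    if r > 0:
        if q_wet.size > 0:
            w = np.linalg.solve(s_ww, s_dw.T).T   # Sigma_dw Sigma_ww^{-1}
            cond_mean = w @ q_wet                 # conditional mean of dry given wet
            cond_cov = s_dd - w @ s_dw.T          # conditional covariance
            cond_cov = (cond_cov + cond_cov.T) / 2.0  # enforce symmetry numerically
        else:
            cond_mean, cond_cov = np.zeros(r), s_dd
        # Second term: probability that every dry station lies below its threshold.
        if r == 1:
            prob = stats.norm.cdf(c_dry[0], loc=cond_mean[0],
                                  scale=np.sqrt(cond_cov[0, 0]))
        else:
            prob = stats.multivariate_normal.cdf(c_dry, mean=cond_mean, cov=cond_cov)
        loglik += np.log(prob)
    return loglik


# Hypothetical example: three stations, the first dry and the other two wet.
sigma = np.array([[1.0, 0.5, 0.3],
                  [0.5, 1.0, 0.4],
                  [0.3, 0.4, 1.0]])
example = day_loglik(sigma, q_wet=[0.7, -0.1], c_dry=[stats.norm.ppf(1 - 0.4)])
print(example)
```
]]></preformat>
      <p>Because the set of dry stations changes from day to day, a full implementation would need to reorder the matrix and recompute the conditional distribution for every day, which is where much of the computational cost arises.</p>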
</app>

<app id="App1.Ch1.S4">
  <label>Appendix D</label><title>Model building strategy</title>
      <p id="d2e6687">We construct the occurrence and intensity models by sequentially adding covariates to the location parameter in the following order:</p>
      <p id="d2e6690"><list list-type="custom">
          <list-item><label>1.</label>

      <p id="d2e6695"><italic>Base model.</italic> We begin with a model that includes lag-1 autocorrelation, data resolution, seasonality, and a spatial effect.</p>
          </list-item>
          <list-item><label>2.</label>

      <p id="d2e6703"><italic>Model selection for autocorrelation, seasonal, and spatial effects.</italic> We refine the base model, selecting the most appropriate forms for representing seasonal and spatial variation as well as temporal dependence. <list list-type="custom"><list-item><label>a.</label>
      <p id="d2e6710"><italic>Spatial effect.</italic> For the parametric specification, we consider Legendre polynomials (up to degree three) and their interactions. For the semi-parametric specification, both spline-based (latitude–longitude) and Legendre polynomial representations are assessed, with the former ultimately being selected.</p></list-item><list-item><label>b.</label>
      <p id="d2e6716"><italic>Seasonality.</italic> Competing formulations based on cyclic splines and parametric effects are compared for the semi-parametric models.</p></list-item><list-item><label>c.</label>
      <p id="d2e6722"><italic>Autocorrelation.</italic> Beyond the lag-1 term, additional lags up to lag 3 are evaluated. For each lag, we assess: <list list-type="bullet"><list-item>
      <p id="d2e6729">the inclusion of main effects and their interactions with seasonal components,</p></list-item><list-item>
      <p id="d2e6733">whether to include lagged precipitation overall, indicator variables for lagged occurrences, or both,</p></list-item><list-item>
      <p id="d2e6737">whether to employ spline-based or parametric formulations in the semi-parametric models, and</p></list-item><list-item>
      <p id="d2e6741">whether to include interactions between occurrence autocorrelation effects.</p></list-item></list></p></list-item><list-item><label>d.</label>
      <p id="d2e6745"><italic>Additional interactions.</italic> We consider interactions between seasonal and spatial variation, and add site-specific effects such as altitude.</p></list-item></list></p>
          </list-item>
          <list-item><label>3.</label>

      <p id="d2e6753"><italic>Large-scale atmospheric covariates.</italic> Finally, large-scale predictors and their interactions with seasonality are added in the following order: <list list-type="alpha-lower"><list-item>
      <p id="d2e6760">Temperature and possible interactions with seasonality.</p></list-item><list-item>
      <p id="d2e6764">Dewpoint temperature and possible interactions with seasonality.</p></list-item><list-item>
      <p id="d2e6768">Mean sea level pressure and possible interactions with seasonality.</p></list-item><list-item>
      <p id="d2e6772">Wind speed, computed from the 10 <inline-formula><mml:math id="M229" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M230" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula>- and <inline-formula><mml:math id="M231" display="inline"><mml:mi>v</mml:mi></mml:math></inline-formula>-wind components.</p></list-item></list></p>
          </list-item>
        </list></p>
      <p id="d2e6799">For the GAMLSS, we start with a full location model for intensity and build a model for the standard deviation by adding, in order: (1) seasonal variation, autocorrelation with resolution and their interactions; (2) systematic spatial variation; (3) wind speed, temperature, dewpoint temperature, and mean sea level pressure in interaction with seasonality. In total, around 40 models are compared for the parametric amounts model, and a similar number for the parametric occurrence model as well as for each of the semi-parametric models.</p>
</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e6806">The code used in the study can be found at: <uri>https://github.com/jakobwes/Improving-multisite-precipitation-generators</uri> (last access: 1 May 2026). Unfortunately, the authors are unable to share the data; however, they can be requested from the CEDA archive (<uri>http://catalogue.ceda.ac.uk/uuid/c732716511d3442f05cdeccbe99b8f90</uri>, <xref ref-type="bibr" rid="bib1.bibx60" id="altparen.108"/>).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e6818">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/ascmo-12-149-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/ascmo-12-149-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e6827">JBW: Conceptualization, Methodology, Software, Validation, Formal analysis, Visualization, Investigation, Writing – Original Draft, Writing – Review and Editing. REC: Conceptualization, Methodology, Data Curation, Resources, Writing – Original Draft, Writing – Review and Editing, Supervision, Project administration.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e6833">The contact author has declared that neither of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e6839">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e6845">Jakob Benjamin Wessel acknowledges support from the Klaus Murmann fellowship program of the Stiftung der deutschen Wirtschaft/Foundation of German Business and from The Alan Turing Institute’s Enrichment Scheme. This work used JASMIN, the UK’s collaborative data analysis environment (<uri>https://www.jasmin.ac.uk</uri>, last access: 1 May 2026). Jakob Benjamin Wessel also acknowledges useful discussions and feedback from Fiona Spuler. Both authors thank reviewers Oliver Stoner and Mamunur Rashid as well as associate editor Christopher Paciorek for comments that have significantly improved the manuscript.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e6853">This research has been supported by the Engineering and Physical Sciences Research Council (grant no. 2696930).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e6859">This paper was edited by Christopher Paciorek and reviewed by Oliver Stoner and Mamunur Rashid.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Ailliot et al.(2009)Ailliot, Thompson, and Thomson</label><mixed-citation>Ailliot, P., Thompson, C., and Thomson, P.: Space-time modelling of precipitation using a hidden Markov model and censored Gaussian distributions, Appl. Statist., 58, 405–426, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Ambrosino et al.(2011)Ambrosino, Chandler, and Todd</label><mixed-citation>Ambrosino, C., Chandler, R. E., and Todd, M. C.: Southern African monthly rainfall variability: An analysis based on generalized linear models, J. Climate, 24, <ext-link xlink:href="https://doi.org/10.1175/2010JCLI3924.1" ext-link-type="DOI">10.1175/2010JCLI3924.1</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Ambrosino et al.(2014)Ambrosino, Chandler, and Todd</label><mixed-citation>Ambrosino, C., Chandler, R. E., and Todd, M. C.: Rainfall-derived growing season characteristics for agricultural impact assessments in South Africa, Theor. Appl. Climatol., 115, 411–426, <ext-link xlink:href="https://doi.org/10.1007/s00704-013-0896-y" ext-link-type="DOI">10.1007/s00704-013-0896-y</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Andrianakis and Challenor(2012)</label><mixed-citation>Andrianakis, I. and Challenor, P. G.: The effect of the nugget on Gaussian process emulators of computer models, Comput. Stat. Data An., 56, 4215–4228, <ext-link xlink:href="https://doi.org/10.1016/j.csda.2012.04.020" ext-link-type="DOI">10.1016/j.csda.2012.04.020</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Asong et al.(2016)Asong, Khaliq, and Wheater</label><mixed-citation>Asong, Z. E., Khaliq, M. N., and Wheater, H. S.: Multisite multivariate modeling of daily precipitation and temperature in the Canadian Prairie Provinces using generalized linear models, Clim. Dynam., 47, 2901–2921, <ext-link xlink:href="https://doi.org/10.1007/s00382-016-3004-z" ext-link-type="DOI">10.1007/s00382-016-3004-z</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Ayar et al.(2016)Ayar, Vrac, Bastin, Carreau, Deque, and Gallardo</label><mixed-citation>Ayar, P. V., Vrac, M., Bastin, S., Carreau, J., Deque, M., and Gallardo, C.: Intercomparison of statistical and dynamical downscaling models under the EURO- and MED-CORDEX initiative framework: present climate evaluations, Clim. Dynam., 46, 1301–1329, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Bárdossy and Plate(1992)</label><mixed-citation>Bárdossy, A. and Plate, E.: Space-time model for daily rainfall using atmospheric circulation patterns, Water Resour. Res., 28, 1247–1259, 1992.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Beck et al.(2018)Beck, Zimmermann, McVicar, Vergopolan, Berg, and Wood</label><mixed-citation>Beck, H. E., Zimmermann, N. E., McVicar, T. R., Vergopolan, N., Berg, A., and Wood, E. F.: Present and future Köppen-Geiger climate classification maps at 1-km resolution, Scientific Data, 5, 180214, <ext-link xlink:href="https://doi.org/10.1038/sdata.2018.214" ext-link-type="DOI">10.1038/sdata.2018.214</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Beckmann and Buishand(2002)</label><mixed-citation>Beckmann, B.-R. and Buishand, T. A.: Statistical downscaling relationships for precipitation in the Netherlands and North Germany, Int. J. Climatol., 22, 15–32, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Beersma and Buishand(2003)</label><mixed-citation>Beersma, J. J. and Buishand, T. A.: Multi-site simulation of daily precipitation and temperature conditional on the atmospheric circulation, Clim. Res., 25, 121–133, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Belzile et al.(2023)Belzile, Dutang, Northrop, and Opitz</label><mixed-citation>Belzile, L. R., Dutang, C., Northrop, P. J., and Opitz, T.: A modeler's guide to extreme value software, Extremes, 26, 595–638, <ext-link xlink:href="https://doi.org/10.1007/s10687-023-00475-9" ext-link-type="DOI">10.1007/s10687-023-00475-9</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Beven(2021)</label><mixed-citation>Beven, K.: Issues in generating stochastic observables for hydrological models, Hydrol. Process., 35, e14203, <ext-link xlink:href="https://doi.org/10.1002/hyp.14203" ext-link-type="DOI">10.1002/hyp.14203</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Buishand and Brandsma(2001)</label><mixed-citation>Buishand, T. and Brandsma, T.: Multisite simulation of daily precipitation and temperature in the Rhine basin by nearest-neighbor resampling, Water Resour. Res., 37, 2761–2776, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Cameron et al.(2001)Cameron, Beven, and Tawn</label><mixed-citation>Cameron, D., Beven, K., and Tawn, J.: Modelling extreme rainfalls using a modified random pulse Bartlett-Lewis stochastic rainfall model (with uncertainty), Adv. Water Resour., 24, 203–211, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Cavanaugh and Shumway(1997)</label><mixed-citation>Cavanaugh, J. E. and Shumway, R. H.: A Bootstrap Variant of AIC for State-Space Model Selection, Stat. Sinica, 7, 473–496, <uri>https://www.jstor.org/stable/24306089</uri> (last access: 1 May 2026), 1997.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Chandler(2005)</label><mixed-citation>Chandler, R. E.: On the use of generalized linear models for interpreting climate variability, Environmetrics, 16, <ext-link xlink:href="https://doi.org/10.1002/env.731" ext-link-type="DOI">10.1002/env.731</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Chandler(2020)</label><mixed-citation>Chandler, R. E.: Multisite, multivariate weather generation based on generalised linear models, Environ. Modell. Softw., 134, <ext-link xlink:href="https://doi.org/10.1016/j.envsoft.2020.104867" ext-link-type="DOI">10.1016/j.envsoft.2020.104867</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Chandler and Bate(2007)</label><mixed-citation>Chandler, R. E. and Bate, S.: Inference for clustered data using the independence loglikelihood, Biometrika, 94, 167–183, <ext-link xlink:href="https://doi.org/10.1093/biomet/asm015" ext-link-type="DOI">10.1093/biomet/asm015</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Chandler and Wheater(2002)</label><mixed-citation>Chandler, R. E. and Wheater, H. S.: Analysis of rainfall variability using Generalized Linear Models: a case study from the West of Ireland, Water Resour. Res., 38, No.10, <ext-link xlink:href="https://doi.org/10.1029/2001WR000906" ext-link-type="DOI">10.1029/2001WR000906</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Chandler et al.(2007)Chandler, Isham, Bellone, Yang, and Northrop</label><mixed-citation>Chandler, R. E., Isham, V., Bellone, E., Yang, C., and Northrop, P.: Space-Time Modeling of Rainfall for Continuous Simulation, in: Statistical Methods for Spatio-Temporal Systems, no. 107 in Monographs on Statistics and Applied Probability, 1st edn., CRC Press, pp. 177–215, <ext-link xlink:href="https://doi.org/10.1201/9781420011050" ext-link-type="DOI">10.1201/9781420011050</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Chandler et al.(2011)Chandler, Bates, and Charles</label><mixed-citation>Chandler, R. E., Bates, B. C., and Charles, S. P.: Rainfall trends in southwest Western Australia, in: Statistical Methods for Trend Detection and Analysis in the Environmental Sciences, edited by: Chandler, R. E. and Scott, E. M., John Wiley and Sons, Chichester, pp. 283–306, <ext-link xlink:href="https://doi.org/10.1002/9781119991571.ch8" ext-link-type="DOI">10.1002/9781119991571.ch8</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Chandler et al.(2014)Chandler, Isham, Northrop, Wheater, Onof, and Leith</label><mixed-citation>Chandler, R. E., Isham, V., Northrop, P., Wheater, H., Onof, C., and Leith, N.: Uncertainty in rainfall inputs, in: Applied Uncertainty Analysis for Flood Risk Management, edited by: Beven, K. and Hall, J., Imperial College Press, London, pp. 101–152, ISBN 1848162707, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Charles et al.(1999)Charles, Bates, and Hughes</label><mixed-citation>Charles, S., Bates, B., and Hughes, J.: A spatiotemporal model for downscaling precipitation occurrence and amounts, J. Geophys. Res.-Atmos., 104, 31657–31669, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Chun et al.(2017)Chun, Mamet, Metsaranta, Barr, Johnstone, and Wheater</label><mixed-citation>Chun, K. P., Mamet, S. D., Metsaranta, J., Barr, A., Johnstone, J., and Wheater, H.: A novel stochastic method for reconstructing daily precipitation time-series using tree-ring data from the western Canadian Boreal Forest, Dendrochronologia, 44, 9–18, <ext-link xlink:href="https://doi.org/10.1016/j.dendro.2017.01.003" ext-link-type="DOI">10.1016/j.dendro.2017.01.003</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Coles(2001)</label><mixed-citation>Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer Series in Statistics, Springer, London, <ext-link xlink:href="https://doi.org/10.1007/978-1-4471-3675-0" ext-link-type="DOI">10.1007/978-1-4471-3675-0</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Davison(2003)</label><mixed-citation>Davison, A. C.: Statistical Models, Cambridge University Press, Cambridge, ISBN 0-521-77339-3, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Dawkins et al.(2022)Dawkins, Osborne, Economou, Darch, and Stoner</label><mixed-citation>Dawkins, L. C., Osborne, J. M., Economou, T., Darch, G. J., and Stoner, O. R.: The Advanced Meteorology Explorer: a novel stochastic, gridded daily rainfall generator, J. Hydrol., 607, 127478, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2022.127478" ext-link-type="DOI">10.1016/j.jhydrol.2022.127478</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Dunn and Smyth(1996)</label><mixed-citation>Dunn, P. K. and Smyth, G. K.: Randomized Quantile Residuals, J. Comput. Graph. Stat., 5, <ext-link xlink:href="https://doi.org/10.1080/10618600.1996.10474708" ext-link-type="DOI">10.1080/10618600.1996.10474708</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Friederichs(2010)</label><mixed-citation>Friederichs, P.: Statistical downscaling of extreme precipitation events using extreme value theory, Extremes, 13, 109–132, <ext-link xlink:href="https://doi.org/10.1007/s10687-010-0107-5" ext-link-type="DOI">10.1007/s10687-010-0107-5</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Frost et al.(2011)Frost, Charles, Timbal, Chiew, Mehrotra, Nguyen, Chandler, McGregor, Fu, Kirono, Fernandez, and Kent</label><mixed-citation>Frost, A. J., Charles, S. P., Timbal, B., Chiew, F. H. S., Mehrotra, R., Nguyen, K. C., Chandler, R. E., McGregor, J. L., Fu, G., Kirono, D. G. C., Fernandez, E., and Kent, D. M.: A comparison of multi-site daily rainfall downscaling techniques under Australian conditions, J. Hydrol., 408, 1–18, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2011.06.021" ext-link-type="DOI">10.1016/j.jhydrol.2011.06.021</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Furrer and Katz(2008)</label><mixed-citation>Furrer, E. M. and Katz, R. W.: Improving the simulation of extreme precipitation events by stochastic weather generators, Water Resour. Res., 44, W12439, <ext-link xlink:href="https://doi.org/10.1029/2008WR007316" ext-link-type="DOI">10.1029/2008WR007316</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Gebetsberger et al.(2018)Gebetsberger, Messner, Mayr, and Zeileis</label><mixed-citation>Gebetsberger, M., Messner, J. W., Mayr, G. J., and Zeileis, A.: Estimation Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked Probability Score versus Maximum Likelihood, Mon. Weather Rev., 146, 4323–4338, <ext-link xlink:href="https://doi.org/10.1175/MWR-D-17-0364.1" ext-link-type="DOI">10.1175/MWR-D-17-0364.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Genz and Bretz(2009)</label><mixed-citation>Genz, A. and Bretz, F.: Computation of Multivariate Normal and <inline-formula><mml:math id="M232" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> Probabilities, Lecture Notes in Statistics, Springer-Verlag, Heidelberg, <ext-link xlink:href="https://doi.org/10.1007/978-3-642-01689-9" ext-link-type="DOI">10.1007/978-3-642-01689-9</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Gilleland and Katz(2016)</label><mixed-citation>Gilleland, E. and Katz, R. W.: extRemes 2.0: An Extreme Value Analysis Package in R, J. Stat. Softw., 72, 1–39, <ext-link xlink:href="https://doi.org/10.18637/jss.v072.i08" ext-link-type="DOI">10.18637/jss.v072.i08</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Gneiting and Raftery(2007)</label><mixed-citation>Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules, Prediction, and Estimation, J. Am. Stat. Assoc., 102, 359–378, <ext-link xlink:href="https://doi.org/10.1198/016214506000001437" ext-link-type="DOI">10.1198/016214506000001437</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Groenke et al.(2026)Groenke, Wessel, Miersch, Klein, and Zscheischler</label><mixed-citation>Groenke, B., Wessel, J., Miersch, P., Klein, N., and Zscheischler, J.: Stochastic Weather Generation for Scenario-Neutral Impact Assessments Using Simulation-Based Inference, J. Geophys. Res.-Machine Learning and Computation, 3, e2025JH000902, <ext-link xlink:href="https://doi.org/10.1029/2025JH000902" ext-link-type="DOI">10.1029/2025JH000902</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Grunwald and Jones(2000)</label><mixed-citation>Grunwald, G. K. and Jones, R. H.: Markov models for time series with mixed distribution, Environmetrics, 11, 327–339, <ext-link xlink:href="https://doi.org/10.1002/(SICI)1099-095X(200005/06)11:3&lt;327::AID-ENV412&gt;3.0.CO;2-R" ext-link-type="DOI">10.1002/(SICI)1099-095X(200005/06)11:3&lt;327::AID-ENV412&gt;3.0.CO;2-R</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Gu et al.(2019)Gu, Zhang, Li, Singh, and Sun</label><mixed-citation>Gu, X., Zhang, Q., Li, J., Singh, V. P., and Sun, P.: Impact of urbanization on nonstationarity of annual and seasonal precipitation extremes in China, J. Hydrol., 575, 638–655, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2019.05.070" ext-link-type="DOI">10.1016/j.jhydrol.2019.05.070</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Gutiérrez et al.(2019)Gutiérrez, Maraun, Widmann, Huth, Hertig, Benestad, Roessler, Wibig, Wilcke, Kotlarski, San Martín, Herrera, Bedia, Casanueva, Manzanas, Iturbide, Vrac, Dubrovsky, Ribalaygua, Pórtoles, Räty, Räisänen, Hingray, Raynaud, Casado, Ramos, Zerenner, Turco, Bosshard, Štěpánek, Bartholy, Pongracz, Keller, Fischer, Cardoso, Soares, Czernecki, and Pagé</label><mixed-citation>Gutiérrez, J. M., Maraun, D., Widmann, M., Huth, R., Hertig, E., Benestad, R., Roessler, O., Wibig, J., Wilcke, R., Kotlarski, S., San Martín, D., Herrera, S., Bedia, J., Casanueva, A., Manzanas, R., Iturbide, M., Vrac, M., Dubrovsky, M., Ribalaygua, J., Pórtoles, J., Räty, O., Räisänen, J., Hingray, B., Raynaud, D., Casado, M. J., Ramos, P., Zerenner, T., Turco, M., Bosshard, T., Štěpánek, P., Bartholy, J., Pongracz, R., Keller, D. E., Fischer, A. M., Cardoso, R. M., Soares, P. M., Czernecki, B., and Pagé, C.: An intercomparison of a large ensemble of statistical downscaling methods over Europe: Results from the VALUE perfect predictor cross-validation experiment, Int. J. Climatol., 39, <ext-link xlink:href="https://doi.org/10.1002/joc.5462" ext-link-type="DOI">10.1002/joc.5462</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Hersbach et al.(2020)Hersbach, Bell, Berrisford, Hirahara, Horányi, Muñoz-Sabater, Nicolas, Peubey, Radu, Schepers, Simmons, Soci, Abdalla, Abellan, Balsamo, Bechtold, Biavati, Bidlot, Bonavita, De Chiara, Dahlgren, Dee, Diamantakis, Dragani, Flemming, Forbes, Fuentes, Geer, Haimberger, Healy, Hogan, Hólm, Janisková, Keeley, Laloyaux, Lopez, Lupu, Radnoti, de Rosnay, Rozum, Vamborg, Villaume, and Thépaut</label><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, <ext-link xlink:href="https://doi.org/10.1002/qj.3803" ext-link-type="DOI">10.1002/qj.3803</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Hoffman and Johnson(2011)</label><mixed-citation>Hoffman, H. J. and Johnson, R. E.: Estimation of Multiple Trace Metal Water Contaminants In the Presence of Left-Censored and Missing Data, Journal of Environmental Statistics, 2, 1–16, <uri>http://www.jenvstat.org/v02/i02/paper</uri> (last access: 1 May 2026), 2011.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Holsclaw et al.(2016)Holsclaw, Greene, Robertson, and Smyth</label><mixed-citation>Holsclaw, T., Greene, A. M., Robertson, A. W., and Smyth, P.: A Bayesian Hidden Markov Model of Daily Precipitation over South and East Asia, J. Hydrometeorol., 17, 3–25, <ext-link xlink:href="https://doi.org/10.1175/JHM-D-14-0142.1" ext-link-type="DOI">10.1175/JHM-D-14-0142.1</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Hughes et al.(1999)Hughes, Guttorp, and Charles</label><mixed-citation>Hughes, J. P., Guttorp, P., and Charles, S. P.: A non-homogeneous hidden Markov model for precipitation occurrence, J. R. Stat. Soc. C-Appl., 48, <ext-link xlink:href="https://doi.org/10.1111/1467-9876.00136" ext-link-type="DOI">10.1111/1467-9876.00136</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Huser and Davison(2014)</label><mixed-citation>Huser, R. and Davison, A. C.: Space–Time Modelling of Extreme Events, J. Roy. Stat. Soc. B, 76, 439–461, <ext-link xlink:href="https://doi.org/10.1111/rssb.12035" ext-link-type="DOI">10.1111/rssb.12035</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Hyndman and Grunwald(2000)</label><mixed-citation>Hyndman, R. J. and Grunwald, G. K.: Generalized additive modelling of mixed distribution Markov models with application to Melbourne's rainfall, Aust. N. Z. J. Stat., 42, 145–158, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Ishiguro et al.(1991)Ishiguro, Morita, and Ishiguro</label><mixed-citation>Ishiguro, M., Morita, K. I., and Ishiguro, M.: Application of an estimator-free information criterion (WIC) to aperture synthesis imaging, International Astronomical Union Colloquium, 131, 243–248, <ext-link xlink:href="https://doi.org/10.1017/S0252921100013403" ext-link-type="DOI">10.1017/S0252921100013403</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Jesus and Chandler(2011)</label><mixed-citation>Jesus, J. and Chandler, R. E.: Estimating functions and the generalized method of moments, Interface Focus, 1, 871–885, <ext-link xlink:href="https://doi.org/10.1098/rsfs.2011.0057" ext-link-type="DOI">10.1098/rsfs.2011.0057</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Katz et al.(2002)Katz, Parlange, and Naveau</label><mixed-citation>Katz, R. W., Parlange, M. B., and Naveau, P.: Statistics of extremes in hydrology, Adv. Water Resour., 25, 1287–1304, <ext-link xlink:href="https://doi.org/10.1016/S0309-1708(02)00056-8" ext-link-type="DOI">10.1016/S0309-1708(02)00056-8</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Keller et al.(2015)Keller, Fischer, Frei, Liniger, Appenzeller, and Knutti</label><mixed-citation>Keller, D. E., Fischer, A. M., Frei, C., Liniger, M. A., Appenzeller, C., and Knutti, R.: Implementation and validation of a Wilks-type multi-site daily precipitation generator over a typical Alpine river catchment, Hydrol. Earth Syst. Sci., 19, 2163–2177, <ext-link xlink:href="https://doi.org/10.5194/hess-19-2163-2015" ext-link-type="DOI">10.5194/hess-19-2163-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Kenabatho et al.(2012)Kenabatho, McIntyre, Chandler, and Wheater</label><mixed-citation>Kenabatho, P. K., McIntyre, N. R., Chandler, R. E., and Wheater, H. S.: Stochastic simulation of rainfall in the semi-arid Limpopo basin, Botswana, Int. J. Climatol., 32, 1113–1127, <ext-link xlink:href="https://doi.org/10.1002/joc.2323" ext-link-type="DOI">10.1002/joc.2323</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Kleiber et al.(2012)Kleiber, Katz, and Rajagopalan</label><mixed-citation>Kleiber, W., Katz, R. W., and Rajagopalan, B.: Daily spatiotemporal precipitation simulation using latent and transformed Gaussian processes, Water Resour. Res., 48, <ext-link xlink:href="https://doi.org/10.1029/2011WR011105" ext-link-type="DOI">10.1029/2011WR011105</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Liu et al.(2022)Liu, Zou, Xia, Chen, and Wang</label><mixed-citation>Liu, H., Zou, L., Xia, J., Chen, T., and Wang, F.: Impact assessment of climate change and urbanization on the nonstationarity of extreme precipitation: A case study in an urban agglomeration in the middle reaches of the Yangtze river, Sustain. Cities Soc., 85, 104038, <ext-link xlink:href="https://doi.org/10.1016/j.scs.2022.104038" ext-link-type="DOI">10.1016/j.scs.2022.104038</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>López and Francés(2013)</label><mixed-citation>López, J. and Francés, F.: Non-stationary flood frequency analysis in continental Spanish rivers, using climate and reservoir indices as external covariates, Hydrol. Earth Syst. Sci., 17, 3189–3203, <ext-link xlink:href="https://doi.org/10.5194/hess-17-3189-2013" ext-link-type="DOI">10.5194/hess-17-3189-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Machado et al.(2015)Machado, Botero, López, Francés, Díez-Herrero, and Benito</label><mixed-citation>Machado, M. J., Botero, B. A., López, J., Francés, F., Díez-Herrero, A., and Benito, G.: Flood frequency analysis of historical flood data under stationary and non-stationary modelling, Hydrol. Earth Syst. Sci., 19, 2561–2576, <ext-link xlink:href="https://doi.org/10.5194/hess-19-2561-2015" ext-link-type="DOI">10.5194/hess-19-2561-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Maraun and Widmann(2018)</label><mixed-citation>Maraun, D. and Widmann, M.: Statistical Downscaling and Bias Correction for Climate Research, Cambridge University Press, <ext-link xlink:href="https://doi.org/10.1017/9781107588783" ext-link-type="DOI">10.1017/9781107588783</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Maraun et al.(2010)Maraun, Wetterhall, Ireson, Chandler, Kendon, Widmann, Brienen, Rust, Sauter, Themeßl, Venema, Chun, Goodess, Jones, Onof, Vrac, and Thiele-Eich</label><mixed-citation>Maraun, D., Wetterhall, F., Ireson, A. M., Chandler, R. E., Kendon, E. J., Widmann, M., Brienen, S., Rust, H. W., Sauter, T., Themeßl, M., Venema, V., Chun, K., Goodess, C., Jones, R., Onof, C., Vrac, M., and Thiele-Eich, I.: Precipitation downscaling under climate change – recent developments to bridge the gap between dynamical models and the end user, Rev. Geophys., 48, RG3003, <ext-link xlink:href="https://doi.org/10.1029/2009RG000314" ext-link-type="DOI">10.1029/2009RG000314</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Maraun et al.(2015)Maraun, Widmann, Gutiérrez, Kotlarski, Chandler, Hertig, Wibig, Huth, and Wilcke</label><mixed-citation>Maraun, D., Widmann, M., Gutiérrez, J., Kotlarski, S., Chandler, R. E., Hertig, E., Wibig, J., Huth, R., and Wilcke, R.: VALUE: A framework to validate downscaling approaches for climate change studies, Earths Future, 3, 1–14, <ext-link xlink:href="https://doi.org/10.1002/2014EF000259" ext-link-type="DOI">10.1002/2014EF000259</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Mayes(2013)</label><mixed-citation>Mayes, J.: Regional weather and climates of the British Isles — Part 2: South East England and East Anglia, Weather, 68, 59–65, <ext-link xlink:href="https://doi.org/10.1002/wea.2073" ext-link-type="DOI">10.1002/wea.2073</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>McCullagh and Nelder(1989)</label><mixed-citation>McCullagh, P. and Nelder, J.: Generalized Linear Models (second edition), Chapman and Hall, London, <ext-link xlink:href="https://doi.org/10.1201/9780203753736" ext-link-type="DOI">10.1201/9780203753736</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Met Office(2006)</label><mixed-citation>Met Office: MIDAS: UK Daily Rainfall Data, Met Office [data set], <uri>http://catalogue.ceda.ac.uk/uuid/c732716511d3442f05cdeccbe99b8f90</uri> (last access: 21 June 2022), 2006.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Mockler et al.(2016)Mockler, Chun, Sapriza-Azuri, Bruen, and Wheater</label><mixed-citation>Mockler, E. M., Chun, K. P., Sapriza-Azuri, G., Bruen, M., and Wheater, H. S.: Assessing the relative importance of parameter and forcing uncertainty and their interactions in conceptual hydrological model simulations, Adv. Water Resour., 97, 299–313, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2016.10.008" ext-link-type="DOI">10.1016/j.advwatres.2016.10.008</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Nelsen(2006)</label><mixed-citation>Nelsen, R. B.: An Introduction to Copulas, Springer Series in Statistics, 2nd edn., Springer, New York, NY, ISBN 978-0-387-28678-5, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Northrop(2024)</label><mixed-citation>Northrop, P. J.: Stochastic models of rainfall, Annu. Rev. Stat. Appl., 11, 51–74, <ext-link xlink:href="https://doi.org/10.1146/annurev-statistics-040622-023838" ext-link-type="DOI">10.1146/annurev-statistics-040622-023838</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Porcu et al.(2024)Porcu, Bevilacqua, Schaback, and Oates</label><mixed-citation>Porcu, E., Bevilacqua, M., Schaback, R., and Oates, C. J.: The Matérn Model: A Journey Through Statistics, Numerical Analysis and Machine Learning, Stat. Sci., 39, 469–492, <ext-link xlink:href="https://doi.org/10.1214/24-STS923" ext-link-type="DOI">10.1214/24-STS923</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>R Core Team(2025)</label><mixed-citation>R Core Team: R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, <uri>https://www.R-project.org/</uri> (last access: 1 May 2026), 2025.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Rashid and Beecham(2019)</label><mixed-citation>Rashid, M. M. and Beecham, S.: Development of a non-stationary Standardized Precipitation Index and its application to a South Australian climate, Sci. Total Environ., 657, 882–892, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2018.12.052" ext-link-type="DOI">10.1016/j.scitotenv.2018.12.052</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx67"><label>Rashid et al.(2016)Rashid, Beecham, and Chowdhury</label><mixed-citation>Rashid, M. M., Beecham, S., and Chowdhury, R. K.: Statistical downscaling of rainfall: a non-stationary and multi-resolution approach, Theor. Appl. Climatol., 124, 919–933, <ext-link xlink:href="https://doi.org/10.1007/s00704-015-1465-3" ext-link-type="DOI">10.1007/s00704-015-1465-3</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx68"><label>Richardson(1981)</label><mixed-citation>Richardson, C. W.: Stochastic simulation of daily precipitation, temperature, and solar radiation, Water Resour. Res., 17, 182–190, 1981.</mixed-citation></ref>
      <ref id="bib1.bibx69"><label>Rigby and Stasinopoulos(2004)</label><mixed-citation>Rigby, R. A. and Stasinopoulos, D. M.: Smooth centile curves for skew and kurtotic data modelled using the Box–Cox power exponential distribution, Stat. Med., 23, 3053–3076, <ext-link xlink:href="https://doi.org/10.1002/sim.1861" ext-link-type="DOI">10.1002/sim.1861</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx70"><label>Rigby and Stasinopoulos(2005)</label><mixed-citation>Rigby, R. A. and Stasinopoulos, D. M.: Generalized additive models for location, scale and shape, J. R. Stat. Soc. C-Appl., 54, 507–554, <ext-link xlink:href="https://doi.org/10.1111/j.1467-9876.2005.00510.x" ext-link-type="DOI">10.1111/j.1467-9876.2005.00510.x</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx71"><label>Rigby and Stasinopoulos(2014)</label><mixed-citation>Rigby, R. A. and Stasinopoulos, D. M.: Automatic smoothing parameter selection in GAMLSS with an application to centile estimation, Stat. Methods Med. Res., 23, 318–332, <ext-link xlink:href="https://doi.org/10.1177/0962280212473302" ext-link-type="DOI">10.1177/0962280212473302</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx72"><label>Schölzel and Friederichs(2008)</label><mixed-citation>Schölzel, C. and Friederichs, P.: Multivariate non-normally distributed random variables in climate research – introduction to the copula approach, Nonlin. Processes Geophys., 15, 761–772, <ext-link xlink:href="https://doi.org/10.5194/npg-15-761-2008" ext-link-type="DOI">10.5194/npg-15-761-2008</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx73"><label>Semenov et al.(1998)Semenov, Brooks, Barrow, and Richardson</label><mixed-citation>Semenov, M., Brooks, R., Barrow, E., and Richardson, C.: Comparison of the WGEN and LARS-WG stochastic weather generators for diverse climates, Clim. Res., 10, 95–107, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx74"><label>Shibata(1997)</label><mixed-citation>Shibata, R.: Bootstrap Estimate of Kullback-Leibler Information for Model Selection, Stat. Sinica, 7, 375–394, <uri>https://www.jstor.org/stable/24306084</uri> (last access: 1 May 2026), 1997.</mixed-citation></ref>
      <ref id="bib1.bibx75"><label>Stasinopoulos et al.(2017)Stasinopoulos, Rigby, Heller, Voudouris, and De Bastiani</label><mixed-citation>Stasinopoulos, M. D., Rigby, R. A., Heller, G. Z., Voudouris, V., and De Bastiani, F.: Flexible regression and smoothing: Using GAMLSS in R, Chapman and Hall/CRC, <ext-link xlink:href="https://doi.org/10.1201/b21973" ext-link-type="DOI">10.1201/b21973</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx76"><label>Stasinopoulos et al.(2024)Stasinopoulos, Kneib, Klein, Mayr, and Heller</label><mixed-citation>Stasinopoulos, M. D., Kneib, T., Klein, N., Mayr, A., and Heller, G. Z.: Generalized Additive Models for Location, Scale and Shape: A Distributional Regression Approach, with Applications, Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge University Press, Cambridge, <ext-link xlink:href="https://doi.org/10.1017/9781009410076" ext-link-type="DOI">10.1017/9781009410076</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx77"><label>Stehlík and Bárdossy(2002)</label><mixed-citation>Stehlík, J. and Bárdossy, A.: Multivariate stochastic downscaling model for generating daily precipitation series based on atmospheric circulation, J. Hydrol., 256, 120–141, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx78"><label>Tosonoğlu and Onof(2017)</label><mixed-citation>Tosonoğlu, F. and Onof, C.: Joint modelling of drought characteristics derived from historical and synthetic rainfalls: application of Generalized Linear Models and Copulas, Journal of Hydrology: Regional Studies, 14, 167–181, <ext-link xlink:href="https://doi.org/10.1016/j.ejrh.2017.11.001" ext-link-type="DOI">10.1016/j.ejrh.2017.11.001</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx79"><label>Umlauf et al.(2018)Umlauf, Klein, and Zeileis</label><mixed-citation>Umlauf, N., Klein, N., and Zeileis, A.: BAMLSS: Bayesian Additive Models for Location, Scale, and Shape (and Beyond), J. Comput. Graph. Stat., 27, 612–627, <ext-link xlink:href="https://doi.org/10.1080/10618600.2017.1407325" ext-link-type="DOI">10.1080/10618600.2017.1407325</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx80"><label>Underwood(2008)</label><mixed-citation>Underwood, F. M.: Describing long-term trends in precipitation using generalized additive models, J. Hydrol., 364, 285–297, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx81"><label>Villarini et al.(2009)Villarini, Serinaldi, Smith, and Krajewski</label><mixed-citation>Villarini, G., Serinaldi, F., Smith, J. A., and Krajewski, W. F.: On the stationarity of annual flood peaks in the continental United States during the 20th century, Water Resour. Res., 45, W08417, <ext-link xlink:href="https://doi.org/10.1029/2008WR007645" ext-link-type="DOI">10.1029/2008WR007645</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx82"><label>Villarini et al.(2010)Villarini, Smith, and Napolitano</label><mixed-citation>Villarini, G., Smith, J. A., and Napolitano, F.: Nonstationary modeling of a long record of rainfall and temperature over Rome, Adv. Water Resour., 33, 1256–1267, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2010.03.013" ext-link-type="DOI">10.1016/j.advwatres.2010.03.013</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx83"><label>Vrac and Naveau(2007)</label><mixed-citation>Vrac, M. and Naveau, P.: Stochastic downscaling of precipitation: From dry events to heavy rainfalls, Water Resour. Res., 43, <ext-link xlink:href="https://doi.org/10.1029/2006WR005308" ext-link-type="DOI">10.1029/2006WR005308</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx84"><label>Wang et al.(2015)Wang, Li, Feng, and Hu</label><mixed-citation>Wang, Y., Li, J., Feng, P., and Hu, R.: A Time-Dependent Drought Index for Non-Stationary Precipitation Series, Water Resour. Manag., 29, 5631–5647, <ext-link xlink:href="https://doi.org/10.1007/s11269-015-1138-0" ext-link-type="DOI">10.1007/s11269-015-1138-0</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx85"><label>Wilby and Wigley(2000)</label><mixed-citation>Wilby, R. and Wigley, T.: Precipitation predictors for downscaling: Observed and general circulation model relationships, Int. J. Climatol., 20, 641–661, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx86"><label>Wilks(1998)</label><mixed-citation>Wilks, D. S.: Multisite generalization of a daily stochastic precipitation generation model, J. Hydrol., 210, 178–191, <ext-link xlink:href="https://doi.org/10.1016/S0022-1694(98)00186-3" ext-link-type="DOI">10.1016/S0022-1694(98)00186-3</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx87"><label>Wood(2017)</label><mixed-citation>Wood, S. N.: Generalized Additive Models: An Introduction with R, Second Edition, Chapman and Hall/CRC Press, New York, <ext-link xlink:href="https://doi.org/10.1201/9781315370279" ext-link-type="DOI">10.1201/9781315370279</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx88"><label>Yang et al.(2005a)Yang, Chandler, Isham, Annoni, and Wheater</label><mixed-citation>Yang, C., Chandler, R. E., Isham, V. S., Annoni, C., and Wheater, H. S.: Simulation and downscaling models for potential evaporation, J. Hydrol., 302, 239–254, 2005a.</mixed-citation></ref>
      <ref id="bib1.bibx89"><label>Yang et al.(2005b)Yang, Chandler, Isham, and Wheater</label><mixed-citation>Yang, C., Chandler, R. E., Isham, V. S., and Wheater, H. S.: Spatial-temporal rainfall simulation using generalized linear models, Water Resour. Res., 41, <ext-link xlink:href="https://doi.org/10.1029/2004WR003739" ext-link-type="DOI">10.1029/2004WR003739</ext-link>, 2005b.</mixed-citation></ref>
      <ref id="bib1.bibx90"><label>Yang et al.(2006)Yang, Chandler, Isham, and Wheater</label><mixed-citation>Yang, C., Chandler, R. E., Isham, V. S., and Wheater, H. S.: Quality control for daily observational rainfall series in the UK, Water Environ. J., 20, 185–193, <ext-link xlink:href="https://doi.org/10.1111/j.1747-6593.2006.00035.x" ext-link-type="DOI">10.1111/j.1747-6593.2006.00035.x</ext-link>, 2006.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Improving multisite precipitation generators based on generalised linear models</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Ailliot et al.(2009)Ailliot, Thompson, and Thomson</label><mixed-citation>
      
Ailliot, P., Thompson, C., and Thomson, P.:
Space-time modelling of precipitation using a hidden Markov model and censored Gaussian distributions, Appl. Statist., 58, 405–426, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Ambrosino et al.(2011)Ambrosino, Chandler, and Todd</label><mixed-citation>
      
Ambrosino, C., Chandler, R. E., and Todd, M. C.:
Southern African monthly rainfall variability: An analysis based on generalized linear models, J. Climate, 24, <a href="https://doi.org/10.1175/2010JCLI3924.1" target="_blank">https://doi.org/10.1175/2010JCLI3924.1</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Ambrosino et al.(2014)Ambrosino, Chandler, and Todd</label><mixed-citation>
      
Ambrosino, C., Chandler, R. E., and Todd, M. C.:
Rainfall-derived growing season characteristics for agricultural impact assessments in South Africa, Theor. Appl. Climatol., 115, 411–426, <a href="https://doi.org/10.1007/s00704-013-0896-y" target="_blank">https://doi.org/10.1007/s00704-013-0896-y</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Andrianakis and Challenor(2012)</label><mixed-citation>
      
Andrianakis, I. and Challenor, P. G.:
The effect of the nugget on Gaussian process emulators of computer models, Comput. Stat. Data An., 56, 4215–4228, <a href="https://doi.org/10.1016/j.csda.2012.04.020" target="_blank">https://doi.org/10.1016/j.csda.2012.04.020</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Asong et al.(2016)Asong, Khaliq, and Wheater</label><mixed-citation>
      
Asong, Z. E., Khaliq, M. N., and Wheater, H. S.:
Multisite multivariate modeling of daily precipitation and temperature in the Canadian Prairie Provinces using generalized linear models, Clim. Dynam., 47, 2901–2921, <a href="https://doi.org/10.1007/s00382-016-3004-z" target="_blank">https://doi.org/10.1007/s00382-016-3004-z</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Ayar et al.(2016)Ayar, Vrac, Bastin, Carreau, Deque, and Gallardo</label><mixed-citation>
      
Ayar, P. V., Vrac, M., Bastin, S., Carreau, J., Deque, M., and Gallardo, C.:
Intercomparison of statistical and dynamical downscaling models under the EURO- and MED-CORDEX initiative framework: present climate evaluations, Clim. Dynam., 46, 1301–1329, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Bárdossy and Plate(1992)</label><mixed-citation>
      
Bárdossy, A. and Plate, E.:
Space-time model for daily rainfall using atmospheric circulation patterns, Water Resour. Res., 28, 1247–1259, 1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Beck et al.(2018)Beck, Zimmermann, McVicar, Vergopolan, Berg, and Wood</label><mixed-citation>
      
Beck, H. E., Zimmermann, N. E., McVicar, T. R., Vergopolan, N., Berg, A., and Wood, E. F.:
Present and future Köppen-Geiger climate classification maps at 1-km resolution, Scientific Data, 5, 180214, <a href="https://doi.org/10.1038/sdata.2018.214" target="_blank">https://doi.org/10.1038/sdata.2018.214</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Beckmann and Buishand(2002)</label><mixed-citation>
      
Beckmann, B.-R. and Buishand, T. A.:
Statistical downscaling relationships for precipitation in the Netherlands and North Germany, Int. J. Climatol., 22, 15–32, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Beersma and Buishand(2003)</label><mixed-citation>
      
Beersma, J. J. and Buishand, T. A.:
Multi-site simulation of daily precipitation and temperature conditional on the atmospheric circulation, Clim. Res., 25, 121–133, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Belzile et al.(2023)Belzile, Dutang, Northrop, and Opitz</label><mixed-citation>
      
Belzile, L. R., Dutang, C., Northrop, P. J., and Opitz, T.:
A modeler's guide to extreme value software, Extremes, 26, 595–638, <a href="https://doi.org/10.1007/s10687-023-00475-9" target="_blank">https://doi.org/10.1007/s10687-023-00475-9</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Beven(2021)</label><mixed-citation>
      
Beven, K.:
Issues in generating stochastic observables for hydrological models, Hydrol. Process., 35, e14203, <a href="https://doi.org/10.1002/hyp.14203" target="_blank">https://doi.org/10.1002/hyp.14203</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Buishand and Brandsma(2001)</label><mixed-citation>
      
Buishand, T. and Brandsma, T.:
Multisite simulation of daily precipitation and temperature in the Rhine basin by nearest-neighbor resampling, Water Resour. Res., 37, 2761–2776, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Cameron et al.(2001)Cameron, Beven, and Tawn</label><mixed-citation>
      
Cameron, D., Beven, K., and Tawn, J.:
Modelling extreme rainfalls using a modified random pulse Bartlett-Lewis stochastic rainfall model (with uncertainty), Adv. Water Resour., 24, 203–211, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Cavanaugh and Shumway(1997)</label><mixed-citation>
      
Cavanaugh, J. E. and Shumway, R. H.:
A Bootstrap Variant of AIC for State-Space Model Selection, Stat. Sinica, 7, 473–496, <a href="https://www.jstor.org/stable/24306089" target="_blank">https://www.jstor.org/stable/24306089</a> (last access: 1 May 2026), 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Chandler(2005)</label><mixed-citation>
      
Chandler, R. E.:
On the use of generalized linear models for interpreting climate variability, Environmetrics, 16, <a href="https://doi.org/10.1002/env.731" target="_blank">https://doi.org/10.1002/env.731</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Chandler(2020)</label><mixed-citation>
      
Chandler, R. E.:
Multisite, multivariate weather generation based on generalised linear models, Environ. Modell. Softw., 134, <a href="https://doi.org/10.1016/j.envsoft.2020.104867" target="_blank">https://doi.org/10.1016/j.envsoft.2020.104867</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Chandler and Bate(2007)</label><mixed-citation>
      
Chandler, R. E. and Bate, S.:
Inference for clustered data using the independence loglikelihood, Biometrika, 94, 167–183, <a href="https://doi.org/10.1093/biomet/asm015" target="_blank">https://doi.org/10.1093/biomet/asm015</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Chandler and Wheater(2002)</label><mixed-citation>
      
Chandler, R. E. and Wheater, H. S.:
Analysis of rainfall variability using Generalized Linear Models: a case study from the West of Ireland, Water Resour. Res., 38, No.10, <a href="https://doi.org/10.1029/2001WR000906" target="_blank">https://doi.org/10.1029/2001WR000906</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Chandler et al.(2007)Chandler, Isham, Bellone, Yang, and Northrop</label><mixed-citation>
      
Chandler, R. E., Isham, V., Bellone, E., Yang, C., and Northrop, P.:
Space-Time Modeling of Rainfall for Continuous Simulation, in: Statistical Methods for Spatio-Temporal Systems, no. 107 in Monographs on Statistics and Applied Probability, 1st edn., CRC Press, pp. 177–215, <a href="https://doi.org/10.1201/9781420011050" target="_blank">https://doi.org/10.1201/9781420011050</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Chandler et al.(2011)Chandler, Bates, and Charles</label><mixed-citation>
      
Chandler, R. E., Bates, B. C., and Charles, S. P.:
Rainfall trends in southwest Western Australia, in: Statistical Methods for Trend Detection and Analysis in the Environmental Sciences, edited by: Chandler, R. E. and Scott, E. M., John Wiley and Sons, Chichester, pp. 283–306, <a href="https://doi.org/10.1002/9781119991571.ch8" target="_blank">https://doi.org/10.1002/9781119991571.ch8</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Chandler et al.(2014)Chandler, Isham, Northrop, Wheater, Onof, and Leith</label><mixed-citation>
      
Chandler, R. E., Isham, V., Northrop, P., Wheater, H., Onof, C., and Leith, N.:
Uncertainty in rainfall inputs, in: Applied Uncertainty Analysis for Flood Risk Management, edited by: Beven, K. and Hall, J., Imperial College Press, London, pp. 101–152, ISBN 1848162707, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Charles et al.(1999)Charles, Bates, and Hughes</label><mixed-citation>
      
Charles, S., Bates, B., and Hughes, J.:
A spatiotemporal model for downscaling precipitation occurrence and amounts, J. Geophys. Res.-Atmos., 104, 31657–31669, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Chun et al.(2017)Chun, Mamet, Metsaranta, Barr, Johnstone, and Wheater</label><mixed-citation>
      
Chun, K. P., Mamet, S. D., Metsaranta, J., Barr, A., Johnstone, J., and Wheater, H.:
A novel stochastic method for reconstructing daily precipitation time-series using tree-ring data from the western Canadian Boreal Forest, Dendrochronologia, 44, 9–18, <a href="https://doi.org/10.1016/j.dendro.2017.01.003" target="_blank">https://doi.org/10.1016/j.dendro.2017.01.003</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Coles(2001)</label><mixed-citation>
      
Coles, S.:
An Introduction to Statistical Modeling of Extreme Values, Springer Series in Statistics, Springer, London, <a href="https://doi.org/10.1007/978-1-4471-3675-0" target="_blank">https://doi.org/10.1007/978-1-4471-3675-0</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Davison(2003)</label><mixed-citation>
      
Davison, A. C.:
Statistical Models, Cambridge University Press, Cambridge, ISBN 0-521-77339-3, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Dawkins et al.(2022)Dawkins, Osborne, Economou, Darch, and Stoner</label><mixed-citation>
      
Dawkins, L. C., Osborne, J. M., Economou, T., Darch, G. J., and Stoner, O. R.:
The Advanced Meteorology Explorer: a novel stochastic, gridded daily rainfall generator, J. Hydrol., 607, 127478, <a href="https://doi.org/10.1016/j.jhydrol.2022.127478" target="_blank">https://doi.org/10.1016/j.jhydrol.2022.127478</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Dunn and Smyth(1996)</label><mixed-citation>
      
Dunn, P. K. and Smyth, G. K.:
Randomized Quantile Residuals, J. Comput. Graph. Stat., 5, <a href="https://doi.org/10.1080/10618600.1996.10474708" target="_blank">https://doi.org/10.1080/10618600.1996.10474708</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Friederichs(2010)</label><mixed-citation>
      
Friederichs, P.:
Statistical downscaling of extreme precipitation events using extreme value theory, Extremes, 13, 109–132, <a href="https://doi.org/10.1007/s10687-010-0107-5" target="_blank">https://doi.org/10.1007/s10687-010-0107-5</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Frost et al.(2011)Frost, Charles, Timbal, Chiew, Mehrotra, Nguyen, Chandler, McGregor, Fu, Kirono, Fernandez, and Kent</label><mixed-citation>
      
Frost, A. J., Charles, S. P., Timbal, B., Chiew, F. H. S., Mehrotra, R., Nguyen, K. C., Chandler, R. E., McGregor, J. L., Fu, G., Kirono, D. G. C., Fernandez, E., and Kent, D. M.:
A comparison of multi-site daily rainfall downscaling techniques under Australian conditions, J. Hydrol., 408, 1–18, <a href="https://doi.org/10.1016/j.jhydrol.2011.06.021" target="_blank">https://doi.org/10.1016/j.jhydrol.2011.06.021</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Furrer and Katz(2008)</label><mixed-citation>
      
Furrer, E. M. and Katz, R. W.:
Improving the simulation of extreme precipitation events by stochastic weather generators, Water Resour. Res., 44, W12439, <a href="https://doi.org/10.1029/2008WR007316" target="_blank">https://doi.org/10.1029/2008WR007316</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Gebetsberger et al.(2018)Gebetsberger, Messner, Mayr, and Zeileis</label><mixed-citation>
      
Gebetsberger, M., Messner, J. W., Mayr, G. J., and Zeileis, A.:
Estimation Methods for Nonhomogeneous Regression Models: Minimum Continuous Ranked Probability Score versus Maximum Likelihood, Mon. Weather Rev., 146, 4323–4338, <a href="https://doi.org/10.1175/MWR-D-17-0364.1" target="_blank">https://doi.org/10.1175/MWR-D-17-0364.1</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Genz and Bretz(2009)</label><mixed-citation>
      
Genz, A. and Bretz, F.:
Computation of Multivariate Normal and <i>t</i> Probabilities, Lecture Notes in Statistics, Springer-Verlag, Heidelberg, <a href="https://doi.org/10.1007/978-3-642-01689-9" target="_blank">https://doi.org/10.1007/978-3-642-01689-9</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Gilleland and Katz(2016)</label><mixed-citation>
      
Gilleland, E. and Katz, R. W.:
extRemes 2.0: An Extreme Value Analysis Package in R, J. Stat. Softw., 72, 1–39, <a href="https://doi.org/10.18637/jss.v072.i08" target="_blank">https://doi.org/10.18637/jss.v072.i08</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Gneiting and Raftery(2007)</label><mixed-citation>
      
Gneiting, T. and Raftery, A. E.:
Strictly Proper Scoring Rules, Prediction, and Estimation, J. Am. Stat. Assoc., 102, 359–378, <a href="https://doi.org/10.1198/016214506000001437" target="_blank">https://doi.org/10.1198/016214506000001437</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Groenke et al.(2026)Groenke, Wessel, Miersch, Klein, and Zscheischler</label><mixed-citation>
      
Groenke, B., Wessel, J., Miersch, P., Klein, N., and Zscheischler, J.:
Stochastic Weather Generation for Scenario-Neutral Impact Assessments Using Simulation-Based Inference, J. Geophys. Res.-Machine Learning and Computation, 3, e2025JH000902, <a href="https://doi.org/10.1029/2025JH000902" target="_blank">https://doi.org/10.1029/2025JH000902</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Grunwald and Jones(2000)</label><mixed-citation>
      
Grunwald, G. K. and Jones, R. H.:
Markov models for time series with mixed distribution, Environmetrics, 11, 327–339, <a href="https://doi.org/10.1002/(SICI)1099-095X(200005/06)11:3&lt;327::AID-ENV412&gt;3.0.CO;2-R" target="_blank">https://doi.org/10.1002/(SICI)1099-095X(200005/06)11:3&lt;327::AID-ENV412&gt;3.0.CO;2-R</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Gu et al.(2019)Gu, Zhang, Li, Singh, and Sun</label><mixed-citation>
      
Gu, X., Zhang, Q., Li, J., Singh, V. P., and Sun, P.:
Impact of urbanization on nonstationarity of annual and seasonal precipitation extremes in China, J. Hydrol., 575, 638–655, <a href="https://doi.org/10.1016/j.jhydrol.2019.05.070" target="_blank">https://doi.org/10.1016/j.jhydrol.2019.05.070</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Gutiérrez et al.(2019)Gutiérrez, Maraun, Widmann, Huth, Hertig, Benestad, Roessler, Wibig, Wilcke, Kotlarski, San Martín, Herrera, Bedia, Casanueva, Manzanas, Iturbide, Vrac, Dubrovsky, Ribalaygua, Pórtoles, Räty, Räisänen, Hingray, Raynaud, Casado, Ramos, Zerenner, Turco, Bosshard, Štěpánek, Bartholy, Pongracz, Keller, Fischer, Cardoso, Soares, Czernecki, and Pagé</label><mixed-citation>
      
Gutiérrez, J. M., Maraun, D., Widmann, M., Huth, R., Hertig, E., Benestad, R., Roessler, O., Wibig, J., Wilcke, R., Kotlarski, S., San Martín, D., Herrera, S., Bedia, J., Casanueva, A., Manzanas, R., Iturbide, M., Vrac, M., Dubrovsky, M., Ribalaygua, J., Pórtoles, J., Räty, O., Räisänen, J., Hingray, B., Raynaud, D., Casado, M. J., Ramos, P., Zerenner, T., Turco, M., Bosshard, T., Štěpánek, P., Bartholy, J., Pongracz, R., Keller, D. E., Fischer, A. M., Cardoso, R. M., Soares, P. M., Czernecki, B., and Pagé, C.:
An intercomparison of a large ensemble of statistical downscaling methods over Europe: Results from the VALUE perfect predictor cross-validation experiment, Int. J. Climatol., 39, <a href="https://doi.org/10.1002/joc.5462" target="_blank">https://doi.org/10.1002/joc.5462</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Hersbach et al.(2020)Hersbach, Bell, Berrisford, Hirahara, Horányi, Muñoz-Sabater, Nicolas, Peubey, Radu, Schepers, Simmons, Soci, Abdalla, Abellan, Balsamo, Bechtold, Biavati, Bidlot, Bonavita, De Chiara, Dahlgren, Dee, Diamantakis, Dragani, Flemming, Forbes, Fuentes, Geer, Haimberger, Healy, Hogan, Hólm, Janisková, Keeley, Laloyaux, Lopez, Lupu, Radnoti, de Rosnay, Rozum, Vamborg, Villaume, and Thépaut</label><mixed-citation>
      
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.-N.:
The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, <a href="https://doi.org/10.1002/qj.3803" target="_blank">https://doi.org/10.1002/qj.3803</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Hoffman and Johnson(2011)</label><mixed-citation>
      
Hoffman, H. J. and Johnson, R. E.:
Estimation of Multiple Trace Metal Water Contaminants In the Presence of Left-Censored and Missing Data, Journal of Environmental Statistics, 2, 1–16, <a href="http://www.jenvstat.org/v02/i02/paper" target="_blank">http://www.jenvstat.org/v02/i02/paper</a> (last access: 1 May 2026), 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Holsclaw et al.(2016)Holsclaw, Greene, Robertson, and Smyth</label><mixed-citation>
      
Holsclaw, T., Greene, A. M., Robertson, A. W., and Smyth, P.:
A Bayesian Hidden Markov Model of Daily Precipitation over South and East Asia, J. Hydrometeorol., 17, 3–25, <a href="https://doi.org/10.1175/JHM-D-14-0142.1" target="_blank">https://doi.org/10.1175/JHM-D-14-0142.1</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Hughes et al.(1999)Hughes, Guttorp, and Charles</label><mixed-citation>
      
Hughes, J. P., Guttorp, P., and Charles, S. P.:
A non-homogeneous hidden Markov model for precipitation occurrence, J. R. Stat. Soc. C-Appl., 48, <a href="https://doi.org/10.1111/1467-9876.00136" target="_blank">https://doi.org/10.1111/1467-9876.00136</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Huser and Davison(2014)</label><mixed-citation>
      
Huser, R. and Davison, A. C.:
Space–Time Modelling of Extreme Events, J. Roy. Stat. Soc. B, 76, 439–461, <a href="https://doi.org/10.1111/rssb.12035" target="_blank">https://doi.org/10.1111/rssb.12035</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Hyndman and Grunwald(2000)</label><mixed-citation>
      
Hyndman, R. J. and Grunwald, G. K.:
Generalized additive modelling of mixed distribution Markov models with application to Melbourne's rainfall, Aust. N. Z. J. Stat., 42, 145–158, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Ishiguro et al.(1991)Ishiguro, Morita, and Ishiguro</label><mixed-citation>
      
Ishiguro, M., Morita, K. I., and Ishiguro, M.:
Application of an estimator-free information criterion (WIC) to aperture synthesis imaging, International Astronomical Union Colloquium, 131, 243–248, <a href="https://doi.org/10.1017/S0252921100013403" target="_blank">https://doi.org/10.1017/S0252921100013403</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Jesus and Chandler(2011)</label><mixed-citation>
      
Jesus, J. and Chandler, R. E.:
Estimating functions and the generalized method of moments, Interface Focus, 1, 871–885, <a href="https://doi.org/10.1098/rsfs.2011.0057" target="_blank">https://doi.org/10.1098/rsfs.2011.0057</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Katz et al.(2002)Katz, Parlange, and Naveau</label><mixed-citation>
      
Katz, R. W., Parlange, M. B., and Naveau, P.:
Statistics of extremes in hydrology, Adv. Water Resour., 25, 1287–1304, <a href="https://doi.org/10.1016/S0309-1708(02)00056-8" target="_blank">https://doi.org/10.1016/S0309-1708(02)00056-8</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Keller et al.(2015)Keller, Fischer, Frei, Liniger, Appenzeller, and Knutti</label><mixed-citation>
      
Keller, D. E., Fischer, A. M., Frei, C., Liniger, M. A., Appenzeller, C., and Knutti, R.:
Implementation and validation of a Wilks-type multi-site daily precipitation generator over a typical Alpine river catchment, Hydrol. Earth Syst. Sci., 19, 2163–2177, <a href="https://doi.org/10.5194/hess-19-2163-2015" target="_blank">https://doi.org/10.5194/hess-19-2163-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Kenabatho et al.(2012)Kenabatho, McIntyre, Chandler, and Wheater</label><mixed-citation>
      
Kenabatho, P. K., McIntyre, N. R., Chandler, R. E., and Wheater, H. S.:
Stochastic simulation of rainfall in the semi-arid Limpopo basin, Botswana, Int. J. Climatol., 32(7), 1113–1127, <a href="https://doi.org/10.1002/joc.2323" target="_blank">https://doi.org/10.1002/joc.2323</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Kleiber et al.(2012)Kleiber, Katz, and Rajagopalan</label><mixed-citation>
      
Kleiber, W., Katz, R. W., and Rajagopalan, B.:
Daily spatiotemporal precipitation simulation using latent and transformed Gaussian processes, Water Resour. Res., 48, <a href="https://doi.org/10.1029/2011WR011105" target="_blank">https://doi.org/10.1029/2011WR011105</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Liu et al.(2022)Liu, Zou, Xia, Chen, and Wang</label><mixed-citation>
      
Liu, H., Zou, L., Xia, J., Chen, T., and Wang, F.:
Impact assessment of climate change and urbanization on the nonstationarity of extreme precipitation: A case study in an urban agglomeration in the middle reaches of the Yangtze river, Sustain. Cities Soc., 85, 104038, <a href="https://doi.org/10.1016/j.scs.2022.104038" target="_blank">https://doi.org/10.1016/j.scs.2022.104038</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>López and Francés(2013)</label><mixed-citation>
      
López, J. and Francés, F.:
Non-stationary flood frequency analysis in continental Spanish rivers, using climate and reservoir indices as external covariates, Hydrol. Earth Syst. Sci., 17, 3189–3203, <a href="https://doi.org/10.5194/hess-17-3189-2013" target="_blank">https://doi.org/10.5194/hess-17-3189-2013</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Machado et al.(2015)Machado, Botero, López, Francés, Díez-Herrero, and Benito</label><mixed-citation>
      
Machado, M. J., Botero, B. A., López, J., Francés, F., Díez-Herrero, A., and Benito, G.:
Flood frequency analysis of historical flood data under stationary and non-stationary modelling, Hydrol. Earth Syst. Sci., 19, 2561–2576, <a href="https://doi.org/10.5194/hess-19-2561-2015" target="_blank">https://doi.org/10.5194/hess-19-2561-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Maraun and Widmann(2018)</label><mixed-citation>
      
Maraun, D. and Widmann, M.:
Statistical Downscaling and Bias Correction for Climate Research, Cambridge University Press, <a href="https://doi.org/10.1017/9781107588783" target="_blank">https://doi.org/10.1017/9781107588783</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Maraun et al.(2010)Maraun, Wetterhall, Ireson, Chandler, Kendon, Widmann, Brienen, Rust, Sauter, Themeßl, Venema, Chun, Goodess, Jones, Onof, Vrac, and Thiele-Eich</label><mixed-citation>
      
Maraun, D., Wetterhall, F., Ireson, A. M., Chandler, R. E., Kendon, E. J., Widmann, M., Brienen, S., Rust, H. W., Sauter, T., Themeßl, M., Venema, V., Chun, K., Goodess, C., Jones, R., Onof, C., Vrac, M., and Thiele-Eich, I.:
Precipitation downscaling under climate change – recent developments to bridge the gap between dynamical models and the end user, Rev. Geophys., 48, RG3003, <a href="https://doi.org/10.1029/2009RG000314" target="_blank">https://doi.org/10.1029/2009RG000314</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Maraun et al.(2015)Maraun, Widmann, Gutiérrez, Kotlarski, Chandler, Hertig, Wibig, Huth, and Wilcke</label><mixed-citation>
      
Maraun, D., Widmann, M., Gutiérrez, J., Kotlarski, S., Chandler, R. E., Hertig, E., Wibig, J., Huth, R., and Wilcke, R.:
VALUE: A framework to validate downscaling approaches for climate change studies, Earths Future, 3, 1–14, <a href="https://doi.org/10.1002/2014EF000259" target="_blank">https://doi.org/10.1002/2014EF000259</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Mayes(2013)</label><mixed-citation>
      
Mayes, J.:
Regional weather and climates of the British Isles — Part 2: South East England and East Anglia, Weather, 68, 59–65, <a href="https://doi.org/10.1002/wea.2073" target="_blank">https://doi.org/10.1002/wea.2073</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>McCullagh and Nelder(1989)</label><mixed-citation>
      
McCullagh, P. and Nelder, J.:
Generalized Linear Models (second edition), Chapman and Hall, London, <a href="https://doi.org/10.1201/9780203753736" target="_blank">https://doi.org/10.1201/9780203753736</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Met Office(2006)</label><mixed-citation>
      
Met Office:
MIDAS: UK Daily Rainfall Data, Met Office [data set], <a href="http://catalogue.ceda.ac.uk/uuid/c732716511d3442f05cdeccbe99b8f90" target="_blank">http://catalogue.ceda.ac.uk/uuid/c732716511d3442f05cdeccbe99b8f90</a> (last access: 21 June 2022), 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Mockler et al.(2016)Mockler, Chun, Sapriza-Azuri, Bruen, and Wheater</label><mixed-citation>
      
Mockler, E. M., Chun, K. P., Sapriza-Azuri, G., Bruen, M., and Wheater, H. S.:
Assessing the relative importance of parameter and forcing uncertainty and their interactions in conceptual hydrological model simulations, Adv. Water Resour., 97, 299–313, <a href="https://doi.org/10.1016/j.advwatres.2016.10.008" target="_blank">https://doi.org/10.1016/j.advwatres.2016.10.008</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Nelsen(2006)</label><mixed-citation>
      
Nelsen, R. B.:
An Introduction to Copulas, Springer Series in Statistics, 2nd edn., Springer, New York, NY, ISBN 978-0-387-28678-5, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Northrop(2024)</label><mixed-citation>
      
Northrop, P. J.:
Stochastic models of rainfall, Annu. Rev. Stat. Appl., 11, 51–74, <a href="https://doi.org/10.1146/annurev-statistics-040622-023838" target="_blank">https://doi.org/10.1146/annurev-statistics-040622-023838</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Porcu et al.(2024)Porcu, Bevilacqua, Schaback, and Oates</label><mixed-citation>
      
Porcu, E., Bevilacqua, M., Schaback, R., and Oates, C. J.:
The Matérn Model: A Journey Through Statistics, Numerical Analysis and Machine Learning, Stat. Sci., 39, 469–492, <a href="https://doi.org/10.1214/24-STS923" target="_blank">https://doi.org/10.1214/24-STS923</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>R Core Team(2025)</label><mixed-citation>
      
R Core Team:
R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, <a href="https://www.R-project.org/" target="_blank">https://www.R-project.org/</a> (last access: 1 May 2026), 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Rashid and Beecham(2019)</label><mixed-citation>
      
Rashid, M. M. and Beecham, S.:
Development of a non-stationary Standardized Precipitation Index and its application to a South Australian climate, Sci. Total Environ., 657, 882–892, <a href="https://doi.org/10.1016/j.scitotenv.2018.12.052" target="_blank">https://doi.org/10.1016/j.scitotenv.2018.12.052</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>Rashid et al.(2016)Rashid, Beecham, and Chowdhury</label><mixed-citation>
      
Rashid, M. M., Beecham, S., and Chowdhury, R. K.:
Statistical downscaling of rainfall: a non-stationary and multi-resolution approach, Theor. Appl. Climatol., 124, 919–933, <a href="https://doi.org/10.1007/s00704-015-1465-3" target="_blank">https://doi.org/10.1007/s00704-015-1465-3</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>Richardson(1981)</label><mixed-citation>
      
Richardson, C. W.:
Stochastic simulation of daily precipitation, temperature, and solar radiation, Water Resour. Res., 17, 182–190, 1981.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>Rigby and Stasinopoulos(2004)</label><mixed-citation>
      
Rigby, R. A. and Stasinopoulos, D. M.:
Smooth centile curves for skew and kurtotic data modelled using the Box–Cox power exponential distribution, Stat. Med., 23, 3053–3076, <a href="https://doi.org/10.1002/sim.1861" target="_blank">https://doi.org/10.1002/sim.1861</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>Rigby and Stasinopoulos(2005)</label><mixed-citation>
      
Rigby, R. A. and Stasinopoulos, D. M.:
Generalized additive models for location, scale and shape, J. R. Stat. Soc. C-Appl., 54, 507–554, <a href="https://doi.org/10.1111/j.1467-9876.2005.00510.x" target="_blank">https://doi.org/10.1111/j.1467-9876.2005.00510.x</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>Rigby and Stasinopoulos(2014)</label><mixed-citation>
      
Rigby, R. A. and Stasinopoulos, D. M.:
Automatic smoothing parameter selection in GAMLSS with an application to centile estimation, Stat. Methods Med. Res., 23, 318–332, <a href="https://doi.org/10.1177/0962280212473302" target="_blank">https://doi.org/10.1177/0962280212473302</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>Schölzel and Friederichs(2008)</label><mixed-citation>
      
Schölzel, C. and Friederichs, P.:
Multivariate non-normally distributed random variables in climate research – introduction to the copula approach, Nonlin. Processes Geophys., 15, 761–772, <a href="https://doi.org/10.5194/npg-15-761-2008" target="_blank">https://doi.org/10.5194/npg-15-761-2008</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>Semenov et al.(1998)Semenov, Brooks, Barrow, and Richardson</label><mixed-citation>
      
Semenov, M., Brooks, R., Barrow, E., and Richardson, C.:
Comparison of the WGEN and LARS-WG stochastic weather generators for diverse climates, Clim. Res., 10, 95–107, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>Shibata(1997)</label><mixed-citation>
      
Shibata, R.:
Bootstrap Estimate of Kullback-Leibler Information for Model Selection, Stat. Sinica, 7, 375–394, <a href="https://www.jstor.org/stable/24306084" target="_blank">https://www.jstor.org/stable/24306084</a> (last access: 1 May 2026), 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>Stasinopoulos et al.(2017)Stasinopoulos, Rigby, Heller, Voudouris, and De Bastiani</label><mixed-citation>
      
Stasinopoulos, M. D., Rigby, R. A., Heller, G. Z., Voudouris, V., and De Bastiani, F.:
Flexible regression and smoothing: Using GAMLSS in R, Chapman and Hall/CRC, <a href="https://doi.org/10.1201/b21973" target="_blank">https://doi.org/10.1201/b21973</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>Stasinopoulos et al.(2024)Stasinopoulos, Kneib, Klein, Mayr, and Heller</label><mixed-citation>
      
Stasinopoulos, M. D., Kneib, T., Klein, N., Mayr, A., and Heller, G. Z.:
Generalized Additive Models for Location, Scale and Shape: A Distributional Regression Approach, with Applications, Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge University Press, Cambridge, <a href="https://doi.org/10.1017/9781009410076" target="_blank">https://doi.org/10.1017/9781009410076</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>Stehlík and Bárdossy(2002)</label><mixed-citation>
      
Stehlík, J. and Bárdossy, A.:
Multivariate stochastic downscaling model for generating daily precipitation series based on atmospheric circulation, J. Hydrol., 256, 120–141, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>Tosonoğlu and Onof(2017)</label><mixed-citation>
      
Tosonoğlu, F. and Onof, C.:
Joint modelling of drought characteristics derived from historical and synthetic rainfalls: application of Generalized Linear Models and Copulas, J. Hydrol. Reg. Stud., 14, 167–181, <a href="https://doi.org/10.1016/j.ejrh.2017.11.001" target="_blank">https://doi.org/10.1016/j.ejrh.2017.11.001</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>Umlauf et al.(2018)Umlauf, Klein, and Zeileis</label><mixed-citation>
      
Umlauf, N., Klein, N., and Zeileis, A.:
BAMLSS: Bayesian Additive Models for Location, Scale, and Shape (and Beyond), J. Comput. Graph. Stat., 27, 612–627, <a href="https://doi.org/10.1080/10618600.2017.1407325" target="_blank">https://doi.org/10.1080/10618600.2017.1407325</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib80"><label>Underwood(2008)</label><mixed-citation>
      
Underwood, F. M.:
Describing long-term trends in precipitation using generalized additive models, J. Hydrol., 364, 285–297, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib81"><label>Villarini et al.(2009)Villarini, Serinaldi, Smith, and Krajewski</label><mixed-citation>
      
Villarini, G., Serinaldi, F., Smith, J. A., and Krajewski, W. F.:
On the stationarity of annual flood peaks in the continental United States during the 20th century, Water Resour. Res., 45, W08417, <a href="https://doi.org/10.1029/2008WR007645" target="_blank">https://doi.org/10.1029/2008WR007645</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib82"><label>Villarini et al.(2010)Villarini, Smith, and Napolitano</label><mixed-citation>
      
Villarini, G., Smith, J. A., and Napolitano, F.:
Nonstationary modeling of a long record of rainfall and temperature over Rome, Adv. Water Resour., 33, 1256–1267, <a href="https://doi.org/10.1016/j.advwatres.2010.03.013" target="_blank">https://doi.org/10.1016/j.advwatres.2010.03.013</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib83"><label>Vrac and Naveau(2007)</label><mixed-citation>
      
Vrac, M. and Naveau, P.:
Stochastic downscaling of precipitation: From dry events to heavy rainfalls, Water Resour. Res., 43, <a href="https://doi.org/10.1029/2006WR005308" target="_blank">https://doi.org/10.1029/2006WR005308</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib84"><label>Wang et al.(2015)Wang, Li, Feng, and Hu</label><mixed-citation>
      
Wang, Y., Li, J., Feng, P., and Hu, R.:
A Time-Dependent Drought Index for Non-Stationary Precipitation Series, Water Resour. Manag., 29, 5631–5647, <a href="https://doi.org/10.1007/s11269-015-1138-0" target="_blank">https://doi.org/10.1007/s11269-015-1138-0</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib85"><label>Wilby and Wigley(2000)</label><mixed-citation>
      
Wilby, R. and Wigley, T.:
Precipitation predictors for downscaling: Observed and general circulation model relationships, Int. J. Climatol., 20, 641–661, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib86"><label>Wilks(1998)</label><mixed-citation>
      
Wilks, D. S.:
Multisite generalization of a daily stochastic precipitation generation model, J. Hydrol., 210, 178–191, <a href="https://doi.org/10.1016/S0022-1694(98)00186-3" target="_blank">https://doi.org/10.1016/S0022-1694(98)00186-3</a>, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib87"><label>Wood(2017)</label><mixed-citation>
      
Wood, S. N.:
Generalized Additive Models: An Introduction with R, Second Edition, Chapman and Hall/CRC Press, New York, <a href="https://doi.org/10.1201/9781315370279" target="_blank">https://doi.org/10.1201/9781315370279</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib88"><label>Yang et al.(2005a)Yang, Chandler, Isham, Annoni, and Wheater</label><mixed-citation>
      
Yang, C., Chandler, R. E., Isham, V. S., Annoni, C., and Wheater, H. S.:
Simulation and downscaling models for potential evaporation, J. Hydrol., 302, 239–254, 2005a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib89"><label>Yang et al.(2005b)Yang, Chandler, Isham, and Wheater</label><mixed-citation>
      
Yang, C., Chandler, R. E., Isham, V. S., and Wheater, H. S.:
Spatial-temporal rainfall simulation using generalized linear models, Water Resour. Res., 41, <a href="https://doi.org/10.1029/2004WR003739" target="_blank">https://doi.org/10.1029/2004WR003739</a>, 2005b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib90"><label>Yang et al.(2006)Yang, Chandler, Isham, and Wheater</label><mixed-citation>
      
Yang, C., Chandler, R. E., Isham, V. S., and Wheater, H. S.:
Quality control for daily observational rainfall series in the UK, Water Environ. J., 20, 185–193, <a href="https://doi.org/10.1111/j.1747-6593.2006.00035.x" target="_blank">https://doi.org/10.1111/j.1747-6593.2006.00035.x</a>, 2006.

    </mixed-citation></ref-html>--></article>
