SWAT Model Use Calibration and Validation
SWAT Model Use Calibration and Validation
AND VALIDATION
J. G. Arnold, D. N. Moriasi, P. W. Gassman, K. C. Abbaspour, M. J. White,
R. Srinivasan, C. Santhi, R. D. Harmel, A. van Griensven,
M. W. Van Liew, N. Kannan, M. K. Jha
ABSTRACT. SWAT (Soil and Water Assessment Tool) is a comprehensive, semi-distributed river basin model that requires a
large number of input parameters, which complicates model parameterization and calibration. Several calibration tech-
niques have been developed for SWAT, including manual calibration procedures and automated procedures using the shuf-
fled complex evolution method and other common methods. In addition, SWAT-CUP was recently developed and provides
a decision-making framework that incorporates a semi-automated approach (SUFI2) using both manual and automated
calibration and incorporating sensitivity and uncertainty analysis. In SWAT-CUP, users can manually adjust parameters
and ranges iteratively between autocalibration runs. Parameter sensitivity analysis helps focus the calibration and uncer-
tainty analysis and is used to provide statistics for goodness-of-fit. The user interaction or manual component of the
SWAT-CUP calibration forces the user to obtain a better understanding of the overall hydrologic processes (e.g., baseflow
ratios, ET, sediment sources and sinks, crop yields, and nutrient balances) and of parameter sensitivity. It is important for
future calibration developments to spatially account for hydrologic processes; improve model run time efficiency; include
the impact of uncertainty in the conceptual model, model parameters, and measured variables used in calibration; and as-
sist users in checking for model errors. When calibrating a physically based model like SWAT, it is important to remember
that all model input parameters must be kept within a realistic uncertainty range and that no automatic procedure can
substitute for actual physical knowledge of the watershed.
Keywords. Autocalibration, Hydrologic model, SWAT, Validation.
T
he SWAT (Soil and Water Assessment Tool) mod- decisions on water resources and nonpoint-source pollution
el is a continuous-time, semi-distributed, process- in large river basins. The first version of SWAT was devel-
based river basin model. It was developed to oped in the early 1990s and released as version 94.2. Engel
evaluate the effects of alternative management et al. (1993) reported the first application of SWAT in the
peer-reviewed literature; Srinivasan and Arnold (1994) and
Arnold et al. (1998) later published the first peer-reviewed
Submitted for review in November 2011 as manuscript number description of a geographic information system (GIS) inter-
SW 9517; approved for publication by the Soil & Water Division of face for SWAT and overview describing the key compo-
ASABE in May 2012.
The authors are Jeffrey G. Arnold, ASABE Fellow, Agricultural En- nents of SWAT, respectively. Arnold and Forher (2005) de-
gineer, USDA-ARS Grassland, Soil and Water Research Laboratory, Tem- scribed the expanding global use of SWAT as well as
ple, Texas; Daniel N. Moriasi, ASABE Member, Research Hydrologist, several subsequent releases of the model: versions 96.1,
USDA-ARS Great Plains Agroclimate and Natural Resources Research
Unit, El Reno, Oklahoma; Philip W. Gassman, ASABE Member, Asso-
98.2, 99.2, and 2000. Gassman et al. (2007) provided fur-
ciate Scientist, Center for Agricultural and Rural Development, Iowa State ther description of SWAT, including SWAT version 2005,
University, Ames, Iowa; Karim C. Abbaspour, Senior Scientist, Eawag: and also presented an in-depth overview of over 250
Swiss Federal Institute for Aquatic Science and Technology, Dübendorf, SWAT-related applications that were performed worldwide.
Switzerland; Michael J. White, ASABE Member, Agricultural Engineer,
USDA-ARS Grassland, Soil and Water Research Laboratory, Temple, Krysanova and Arnold (2008), Douglas-Mankin et al.
Texas; Raghavan Srinivasan, ASABE Member, Director, Spatial Sci- (2010), and Tuppad et al. (2011) provide further updates on
ences Laboratory, Texas A&M University, College Station, Texas; Chin- SWAT application and development trends, and the latter
nasamy Santhi, Associate Research Scientist, Texas Research and Exten- two articles provide further description of SWAT version
sion Center, Temple, Texas; Daren Harmel, ASABE Member, Research
Leader, USDA-ARS Grassland, Soil and Water Research Laboratory, 2009, the latest release of the model.
Temple, Texas; Ann van Griensven, Associate Professor, UNESCO-IHE The development of SWAT is a continuation of USDA
Institute for Water Education, Delft, The Netherlands; Michael W. Van Agricultural Research Service (ARS) modeling experience
Liew, Hydrologic Computer Simulation Modeler, Department of Biologi-
cal Systems Engineering, University of Nebraska, Lincoln, Nebraska; Na-
that spans a period of over 30 years (Gassman et al., 2007;
rayanan Kannan, ASABE Member, Assistant Research Scientist, Texas Williams et al., 2008). The current SWAT model includes
Research and Extension Center, Temple, Texas; and Manoj K. Jha, key components contributed from USDA-ARS models as
ASABE Member, Assistant Professor, College of Engineering, North well as from other models (fig. 1). Core pesticide transport,
Carolina A&T, Greensboro, North Carolina. Corresponding author: Jef-
frey G. Arnold, USDA-ARS Grassland, Soil and Water Research Labora- hydrology, and crop growth models that have been incorpo-
tory, 808 East Blackland Road, Temple, TX 76502; phone: 254-770-6502; rated into SWAT can be traced to earlier USDA-ARS field-
e-mail: [email protected].
scale models (fig. 1): the Groundwater Loading Effects of components shown in figure 1 as well as other routines, and
Agricultural Management Systems (GLEAMS) model also features several pre- and post-processing software
(Leonard et al., 1987), the Chemicals, Runoff, and Erosion tools, including the widely used ArcGIS SWAT (ArcSWAT)
from Agricultural Management Systems (CREAMS) model GIS interface (Olivera et al., 2006). Extensive SWAT2009
(Knisel, 1980), and the Environmental Policy Integrated documentation can also be accessed at the SWAT website
Climate model (Williams et al., 2008; Wang et al., 2011). (http://swatmodel.tamu.edu), including theoretical docu-
These components were first grafted into the Simulator for mentation describing all equations, a user’s manual describ-
Water Resources in Rural Basins (SWRRB) model (fig. 1; ing model inputs and outputs, ArcSWAT and Map Window
Arnold and Williams, 1987), along with other key compo- interface manuals, and a developer’s manual. In addition to
nents including a weather generator, sediment routing rou- the model documentation, access is also provided at the
tine, and groundwater submodel (Arnold and Allen, 1999). website to all supporting software, selected journal articles
The initial version of SWAT was then created by interfac- and other publications, a SWAT literature database, previ-
ing SWRRB with the routing structure in the Routing Out- ous and forthcoming conference information, forthcoming
puts to Outlet (ROTO) model (fig. 1; Arnold et al., 1995b). workshops, SWAT-related job openings, and an email
Expanded routing and pollutant transport capabilities have newsletter called SWATbytes. The core SWAT development
since been incorporated into the model (fig. 1), including and user support team is located at the USDA-ARS Grass-
reservoir, pond, wetland, point source, and septic tank ef- land, Soil and Water Research Laboratory and the Texas
fects as well as enhanced sediment routing routines (Arnold AgriLife Blackland Research Center in Temple, Texas.
et al., 2010b) and in-stream kinetic routines from the SWAT development is also occurring at other research sites
QUAL2E model (Brown and Barnwell, 1987). Additional in North America and in other regions (Gassman et al.,
modifications that have been incorporated into SWAT 2010), and multiple user groups have developed world-
(fig. 1) include an improved carbon cycling routine based wide, including SWAT, ArcSWAT, VizSWAT, SWAT-CUP,
on the CFARM model (Kemanian, 2011), alternative daily Latin American, southeast Asia, Africa, Iran, and Brazil us-
and subdaily hydrology routines including the Green-Ampt er groups.
infiltration method (Green and Ampt, 1911), temporal ac- Many of the previous studies published in the extensive
counting of management practice and land use changes and body of peer-reviewed and other SWAT literature describe
enhanced subsurface tile drainage, filter strips, grassed wa- calibration and validation approaches used for verifying the
terways, irrigation, and other improved representations of accuracy of the model for the simulated conditions. These
conservation and management practices (fig. 1; Arnold et testing procedures have been reported at varying levels of
al., 2010b). The temporal accounting routine allows users detail for a wide range of watershed scales, environmental
to introduce the adoption of different selected management conditions, and application goals worldwide (e.g., Gassman
practices or account for changes in land use part way et al., 2007, 2010). More in-depth procedures have also
through a SWAT simulation run, such as the hydrologic and been reported for specific aspects of the calibration and val-
pollutant impacts simulated by Chiang et al. (2010) in re- idation process, such as the guidelines proposed by Moriasi
sponse to temporal changes in pasture use for a 32 km2 wa- et al. (2007) regarding specific statistical criteria to judge
tershed in northwest Arkansas. the success of SWAT (and other model) testing results.
The current SWAT2009 code incorporates all of the However, a comprehensive overview of all key facets re-
ametric tests, t-test, objective functions, autocorrelation, served data is indicated by an NSE value of 1. NSE values
and cross-correlation. By far, the most widely used statis- ≤0 indicate that the observed data mean is a more accurate
tics reported for calibration and validation are r2 and NSE. predictor than the simulated output. Both NSE and r2 are
The r2 statistic can range from 0 to 1, where 0 indicates no biased toward high flows. To minimize this bias, some re-
correlation and 1 represents perfect correlation, and it pro- searchers have taken the log of flows for statistical compar-
vides an estimate of how well the variance of observed val- ison or have developed statistics for low and high flow sea-
ues are replicated by the model predictions(Krause et al., sons (Krause et al., 2005). Krause et al. (2005) provide
2005). A perfect fit also requires that the regression slope further discussion regarding the strengths and weaknesses
and intercept are equal to 1 and 0, respectively; however, of using r2, NSE, and other efficiency criteria measures.
the slope and intercept have typically not been reported in An extensive list of r2 and NSE calibration and/or vali-
published SWAT studies. If r2 is the primary statistical dation statistics is provided by Gassman et al. (2007) for
measure, it should always be used with slope and intercept 115 SWAT studies that reported hydrologic results as well
to ensure that means are reasonable (slope = 1) and bias is as 37 SWAT studies that reported pollutant results. Similar
low. NSE values can range between -∞ to 1 and provide a r2 and NSE statistical compilations for an additional 20
measure how well the simulated output matches the ob- SWAT studies are reported by Douglas-Mankin et al.
served data along a 1:1 line (regression line with slope (2010), and 23 SWAT studies are reviewed by Tuppad et al.
equal to 1). A perfect fit between the simulated and ob- (2011). These statistics provides valuable insight regarding
b R 2
theoretical boundaries for each specific input parameter. (1)
−1
The input values are usually determined over the course of if b > 1
iterative SWAT simulations, which sometimes number in
where b is the slope of the regression line between meas-
the thousands. Several optimization schemes have been
ured and simulated signals. A major advantage of this effi-
used in SWAT autocalibration applications, including gen-
ciency criterion is that it ranges from 0 to 1, which com-
eralized likelihood uncertainty estimation (GLUE), shuffled
complex evolution (SCE), and the Parameter Estimation pared to NSE with a range of -∞ to 1, ensures that in a
(PEST) program (Doherty, 2004). Govender and Everson multisite calibration the objective function is not governed
(2005) and Wang and Melesse (2005) used PEST to cali- by a single or a few badly simulated stations.
brate key hydrology-related parameters for SWAT applica-
tions in South Africa and northwest Minnesota, respective- CALIBRATION PARAMETERS
ly. Wang and Melesse (2005) also found that manual Numerous studies have reported input parameters used
calibration resulted in more accurate predictions than the in SWAT model calibration. Table 1 summarizes the pa-
automated PEST approach. Ng et al. (2010b) described ad- rameters used in 64 studies and in studies previously sum-
vantages and disadvantages of using PEST versus the marized by Douglas-Mankin et al. (2010) and Tuppad et al.
GLUE method for calibrating SWAT for a watershed in (2011). All of these studies include detailed reporting of
central Illinois. Setegn et al. (2009) and Razavi et al. (2010) model parameterization and calibration procedures, includ-
described additional SWAT calibration studies that relied ing tables with parameter ranges and/or final values. Many
partially on GLUE methodology for watersheds located in publications in the literature (https://www.card.iastate.edu/
Ethiopia and south-central New York. To determine opti- swat_articles) do not adequately report changes in parame-
mum input parameters based on the global objective criteria ters. Model parameters used in calibration studies and even
for a simulation of a river basin in Belgium, van Griensven in the selected publications exhibited gaps. Tuppad et al.
and Bauwens (2003, 2005) incorporated an SCE module di- (2011) re-emphasizes an important point made by Douglas-
rectly into the SWAT code. Calibration parameters, corre- Mankin et al. (2010): “Improved reporting of calibration
sponding parameter ranges, and measured daily streamflow and validation procedures and results, perhaps guided by a
and pollutant data were input for the application, which re- set of standard reporting guidelines, is essential for ade-
quired several thousand SWAT simulations for completion. quate interpretation of each study and comparison among
Eckhardt and Arnold (2001) and Eckhardt et al. (2005) used studies in the future. This increased information would also
similar SCE-based automatic calibration methods for form the basis for assigning typical parameters and ranges
SWAT simulations of German watersheds. Other SCE- for use in both manual or automatic calibration and uncer-
based SWAT automatic calibration applications are reported tainty processes.”
by Di Luzio and Arnold (2004) and Van Liew et al. (2005, Table 1 categorizes parameters by process. Since SWAT
2007). is a comprehensive model that simulates process interac-
Automatic calibration and uncertainty analysis capabil- tions, many parameters will impact multiple processes. For
ity is now directly incorporated in SWAT2009 (Gassman et example, CN directly impacts surface runoff; however, as
al., 2010) via the SWAT-CUP software developed by Ea- surface runoff changes, all components of hydrology bal-
wag (2009). A number of previous SWAT application pro- ance change. Soil erosion and nutrient transport are also di-
jects report automated calibration/validation and uncertain- rectly impacted by surface runoff, as are plant growth and
ty analysis using SWAT-CUP. Abbaspour et al. (2007) nutrient cycling. This is the primary reason why most man-
performed a multi-objective calibration and validation of ual calibration methods start with the hydrology balance
the Thur watershed in Switzerland using discharge, sedi- and streamflow, then move to sediment, and finally cali-
ment, nitrate, and phosphate in the objective function with brate nutrients and pesticides, as shown in figure 2 (Santhi
uncertainty analysis. Schuol et al. (2008a, 2008b) calibrated et al., 2001). It is evident from table 1 that hydrology is cal-
with uncertainty analysis and validated models of west Af- ibrated in most studies, with CN2, AWC, ESCO, and
rica and the entire continent of Africa. Yang et al. (2008) SURLAG used routinely. The baseflow process is also of-
compared five different optimization algorithms in SWAT- ten calibrated with the baseflow recession parameters used
CUP and calibrated a watershed in China (2007) using the in many studies.
MCMC algorithm. Faramarzi et al. (2009) used SWAT to Parameters for sediment calibration are used less often
build a hydrologic model of Iran and calibrated and vali- due to inadequate reporting or studies that did not focus on
dated it with the SUFI2 algorithm accounting for prediction sediment. It is interesting to note that of the studies in
uncertainty. Akhavan et al. (2010) calibrated a model of ni- which sediment was calibrated, channel parameters were
trate leaching for a watershed in Iran, and Andersson et al. used more often than parameters affecting sediment
Figure 4. Location of the Little Washita River Experimental Watershed in Oklahoma (from Van Liew et al., 2005).
calibration of model parameters based on a single objective sary after autocalibration. Caution must also be exercised in
function. The SCE-UA has been widely used in watershed using the autocalibration tool so that the selection of initial
model calibration and other areas of hydrology, such as soil lower and upper ranges in the parameters results in cali-
erosion, subsurface hydrology, remote sensing, and land brated values that are representative of watershed condi-
surface modeling, and has generally been found to be ro- tions (Van Liew et al., 2005).
bust, effective, and efficient (Duan, 2003).
Parameters in SWAT were calibrated at the daily time SEMI-AUTOMATED SUFI2
scale in a distributed fashion using the automated calibra- For the second case study, an example calibration of the
tion procedure, in which observed and simulated outputs Danube project (Rouholahnejad et al., 2012b) was selected
were compared at the same outlet points as the manual cal- using SWAT-CUP. Rouholahnejad et al. (2012a) referred to
ibration. With the completion of a given optimization, two the process of parameter assignment as parameterization.
sets of calibrated parameters were computed for the Little Correct parameterization is an important step in model cal-
River that corresponded to subwatersheds F and B, and two ibration and must be based on the knowledge of the hydro-
sets were computed for the Little Washita that correspond- logic processes and variability in soil, land use, slope, and
ed to subwatersheds 526 and 550. Default values suggested location as defined by the subbasin number. Parameteriza-
by van Griensven (2002) were selected as initial upper and tion, therefore, could be defined as “the process of impart-
lower ranges for the respective model parameters. Minimiz- ing the analyst’s knowledge of the physical processes of the
ing the sum of squares residuals was used as the objective watershed to the model.” No automatic calibration proce-
function in the autocalibration procedure. dure can substitute for actual physical knowledge of the
The Van Liew et al. (2005) study highlighted an im- watershed, which can translate into correct parameter rang-
portant difference that must be realized in comparing the es for different parts of the watershed. These ranges can ef-
manual versus autocalibration approaches. The autocalibra- fectively guide the optimization routine. Hence, correct pa-
tion approach was strictly a quantitative comparison that rameterization can result in faster and more accurate model
involved minimizing the difference between measured and calibration with smaller prediction uncertainty. SWAT-CUP
simulated values. The manual approach involved both includes automated as well as semi-automatic procedures
quantitative and qualitative comparisons, since it involved for model calibration. The following steps are suggested in
using the total mass controller in conjunction with graph- a calibration exercise with the semi-automated program
ical comparisons of monthly and daily hydrographs and du- SUFI2:
ration of daily flow curves to calibrate the model against 1. Develop initial or default SWAT input parameters
measured data. Use of the manual calibration accentuates (as created by ArcSWAT or other GIS interfaces)
the tradeoffs that exist in achieving total mass balance, rea- and prepare the input files for SWAT-CUP.
sonable hydrograph responses, and adequate representation 2. Run the model with initial parameters and plot the
of the range in flows. Van Liew et al. (2005) suggest that simulated and observed variables at each gauging
the strengths of both the manual and autocalibration ap- station for the entire period of record.
proaches can be used to facilitate the calibration process. 3. Based on step 2, divide the entire period into cali-
With proper selection of the upper and lower ranges for bration and validation periods while attempting to
model parameter values, autocalibration can provide an ini- ensure that both periods have a similar number of
tial parameter set with minimal labor on the part of the user. wet and dry years and similar average water bal-
Depending on the particular modeling needs, a manual ap- ances.
proach can then be taken to refine the calibration, so that an 4. Determine the most sensitive parameters for the
appropriate balance is achieved regarding the amount, tim- observed values of interest. This information can
ing, and distribution of the output variable. usually be deduced from the literature (see ta-
Results of the Van Liew et al. (2005) suggest that the au- ble 1).
tocalibration option in SWAT provides a powerful, labor- 5. Assign an initial uncertain range (typically 20% to
saving tool that can be used to substantially reduce the frus- 30%) to each parameter globally, meaning scaling
tration and subjectivity that often characterize manual cali- the parameters identically for each HRU.
brations. If used in combination with a manual approach, 6. Run the SWAT-CUP-SUFI2 model 300 to
the autocalibration tool shows promising results in provid- 500 times and view the results for each gauged
ing initial estimates for model parameters. To maintain outlet, as shown in figure 5.
mass balance and adequately represent the range in magni- 7. Perform the global sensitivity analysis and view
tude of output variables, manual adjustments may be neces- the results. At this stage, the P-factor and p-value
t-statistic can be used to eliminate non-sensitive realistic ranges, as influenced by the uncertainty in defining
parameters from the calibration process. the parameter.
8. After observing model performance in step 6, re- In the manner described above, the parameters of each
gionalize the respective parameters. For example, observational gauge can be used to spatially calibrate the
as shown in figure 5, the model systematically un- model in the drainage area between the gauges. At this
derestimated baseflow at outlet q_209 (in subbasin point, the analyst’s knowledge of processes in the water-
209), and there is an early shift in the flow peak. shed could also be implemented in the optimization.
To increase baseflow, decrease deep percolation Figure 6 shows the results after implementing the above
(GWQMN), decrease the groundwater revap coef- changes in the parameters and running the model, where
ficient (GW_REVAP), and increase the threshold NS increased from -1.5 to 0.2. Additional iterations can fur-
depth of water in shallow aquifer (REVAPMN). To ther improve the results. Details on parameterization and
correct the early shift, decrease the slope results can be found in Rouholahnejad et al. (2012a,
(HRU_SLP), increase Manning’s roughness coeffi- 2012b). After calibration, the model should be run for the
cient (OV_N), increase the value of overland flow validation period to assess its performance.
rate (SLSUBBSN), and increase snow melt param-
eters (SMTMP).
To increase baseflow and delay peaks, identify the sub- DISCUSSION
basins that contribute to the outlet at subbasin 209 and im-
Gassman et al. (2010) discussed trends in SWAT use and
plement the changes in the respective parameters. For ex-
the technical and networking factors that are regarded as
ample, make changes in the parameters only in subbasins
strengths of the model, which include: web-based docu-
draining into 209, and set new ranges for the parameters us-
mentation, user support groups, SWAT literature database,
ing one-at-a-time sensitivity analysis implemented in
GIS interface tools, pre- and post-processing tools, open
SWAT-CUP. The parameters must always be kept within
source code, regional and international conferences, and
Figure 6. Observed flow, 95% model uncertainty, and best estimation at gauging station 209 after calibration.