3
3
105-115
Abstract
Cognitive behavior therapy (CBT) is by far the most examined type of
psychological treatment for depression and is recommended in most
treatment guidelines. However, no recent meta-analysis has
integrated the results of randomized trials examining its e"ects, and
its e#cacy in comparison with other psychotherapies,
pharmacotherapies and combined treatment for depression remains
uncertain. We searched PubMed, PsycINFO, Embase and the
Cochrane Library to identify studies on CBT, and separated included
trials into several subsets to conduct random-e"ects meta-analyses.
We included 409 trials (518 comparisons) with 52,702 patients, thus
conducting the largest meta-analysis ever of a speci$c type of
psychotherapy for a mental disorder. The quality of the trials was
found to have increased signi$cantly over time (with increasing
numbers of trials with low risk of bias, less waitlist control groups,
and larger sample sizes). CBT had moderate to large e"ects
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 1 of 32
:
compared to control conditions such as care as usual and waitlist
(g=0.79; 95% CI: 0.70-0.89), which remained similar in sensitivity
analyses and were still signi$cant at 6-12 month follow-up. There was
no reduction of the e"ect size of CBT according to the publication
year (<2001 vs. 2001-2010 vs. >2011). CBT was signi$cantly more
e"ective than other psychotherapies, but the di"erence was small
(g=0.06; 95% CI: 0-0.12) and became non-signi$cant in most
sensitivity analyses. The e"ects of CBT did not di"er signi$cantly from
those of pharmacotherapies at the short term, but were signi$cantly
larger at 6-12 month follow-up (g=0.34; 95% CI: 0.09-0.58), although
the number of trials was small, and the di"erence was not signi$cant
in all sensitivity analyses. Combined treatment was more e"ective
than pharmacotherapies alone at the short (g=0.51; 95% CI: 0.19-
0.84) and long term (g=0.32; 95% CI: 0.09-0.55), but it was not more
e"ective than CBT alone at either time point. CBT was also e"ective
as unguided self-help intervention (g=0.45; 95% CI: 0.31-0.60), in
institutional settings (g=0.65; 95% CI: 0.21-1.08), and in children and
adolescents (g=0.41; 95% CI: 0.25-0.57). We can conclude that the
e#cacy of CBT in depression is documented across di"erent formats,
ages, target groups, and settings. However, the superiority of CBT
over other psychotherapies for depression does not emerge clearly
from this meta-analysis. CBT appears to be as e"ective as
pharmacotherapies at the short term, but more e"ective at the
longer term.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 2 of 32
:
psychotherapies6.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 3 of 32
:
METHODS
Identi$cation and selection of trials
This study is part of a larger meta-analytic project on psychological
treatments for depression10. The protocol for the current meta-analysis
has been published in the Open Science Framework
(http://osf.io/a6p3w).
Two independent researchers screened all records, and all papers that
could meet inclusion criteria according to one of them were retrieved as
full text. The two independent researchers also decided to include or
exclude a study in the database, and disagreements were resolved
through discussion.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 4 of 32
:
(cognitive restructuring). Cognitive restructuring could be combined with
other mood management skills, such as behavioral activation, problem-
solving, social skills training, or mindfulness. This de$nition was derived
from an extensive study in which di"erent types of psychotherapies
were examined by multiple researchers, resulting in a consensus on the
de$nition of each therapy11.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 5 of 32
:
Collaboration14, 15. The RoB tool assesses possible sources of bias in
randomized trials, including the adequate generation of allocation
sequence; the concealment of allocation to conditions; the prevention of
knowledge of the allocated intervention (masking of assessors); and
dealing with incomplete outcome data (this was assessed as positive
when intention-to-treat analyses were conducted, meaning that all
randomized patients were included in the analyses). Two independent
researchers evaluated the validity of the included studies, and
disagreements were solved through discussion.
Outcome measures
For each comparison between a psychological treatment and a control
condition, the e"ect size indicating the di"erence between the two
groups at post-test was calculated (Hedges’ g)16. E"ect sizes were
calculated by subtracting (at post-test) the average score of the
psychotherapy group from the average score of the control group and
dividing the result by the pooled standard deviation. Because some
studies were expected to have relatively small sample sizes, we
corrected the e"ect size for small sample bias.
When the means and standard deviations were not reported in a study,
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 6 of 32
:
we used change scores. If these were not reported, we converted binary
outcomes to Hedges’ g. If these were also not reported, we used other
statistics (e.g., p value, t value) to calculate the e"ect size.
Meta-analyses
To make a historical overview of trials on CBT over time, we conducted
bivariable linear regression analyses examining if the characteristics of
the trials have changed over time. We limited these analyses to the
subset comparing CBT with control conditions, because this was the
largest and most homogeneous subset.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 7 of 32
:
when the 95% con$dence interval (CI) of the e"ect size does not overlap
with the 95% CI of the pooled e"ect size20. Third, we pooled e"ects while
excluding in%uential cases, de$ned by the diagnostics proposed by
Viechtbauer and Cheung22. Fourth, we calculated the e"ect when the
smallest or largest e"ect in each study was considered. Fifth, we
estimated the pooled e"ect using only studies with a low risk of bias. We
also used three di"erent methods to assess and adjust for potential
publication bias20, 23: Duval and Tweedie's trim and $ll procedure24,
Rücker's “limit meta-analysis method”25, and the selection model26, 27.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 8 of 32
:
extracted the rate of response (i.e., a 50% reduction of depressive
symptoms compared to baseline). If the response rate was not reported,
we estimated it using a method based on the baseline means, the post-
test means, the post-test standard deviations and the number of
subjects35. For studies using the Hamilton Rating Scale for Depression
(HAM-D), we also calculated the rate of remission, de$ned as a score of
≤7 on the 17-item version of that scale36. We also calculated the relative
risk (RR) for response and remission of CBT compared with the control
groups, as well as the NNT (as 1 divided by the risk di"erence).
RESULTS
Selection and inclusion of studies
After examining a total of 30,889 records (21,563 after removal of
duplicates), we retrieved 3,584 full-text papers for further consideration.
A total of 409 trials met the inclusion criteria for this meta-analysis (see
Figure 1). Selected characteristics of included studies and comparisons
are presented in the supplementary information.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 9 of 32
:
Figure 1 Open in !gure viewer PowerPoint
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 10 of 32
:
Included studies (n=409)
Clinical 106
(25.9)
Other 122
(29.6)
Adolescents 25 (6.1)
Adults 160
(39.1)
Elderly 26 (6.4)
Perinatal 41 (10.0)
Other 75 (18.3)
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 11 of 32
:
In the majority of studies (n=226, 55.3%), depression was de$ned as
meeting the criteria for a depressive disorder according to a diagnostic
interview, while in 162 studies (39.3%) it was de$ned as a score above
the cut-o" on a self-report depression measure. The mean age of
participants in the studies was 40.1±14.98 years; the average proportion
of women was 69%. Most studies were conducted in the US (n=141,
34.5%) or in the UK or other European countries (n=141, 34.5%). Most
studies (n=249, 60.8%) were published since 2011.
Historical overview
The historical overview was limited to the subset comparing CBT with
control conditions (241 studies with 271 comparisons, including 12,907
patients in CBT arms and 12,199 in control conditions). The cumulative
number of studies over time is shown in Figure 2.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 12 of 32
:
Figure 2 Open in !gure viewer PowerPoint
The bivariable linear regression analyses found that the number of trials
examining depressed patients with general medical disorders and
women with perinatal depression increased signi$cantly over time
(p=0.007 and p=0.012, respectively). The use of waitlist as the control
condition decreased signi$cantly over time (p=0.001), while the number
of studies with low risk of bias increased signi$cantly (p<0.001), as well
as the number of trials in non-Western countries (p=0.005). The number
of participants in each comparison also increased signi$cantly (p<0.001),
while the number of sessions of CBT decreased signi$cantly over time
(p=0.03). All the other characteristics of CBT trials did not change over
time (see also supplementary information).
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 13 of 32
:
control conditions after treatment was g=0.79 (95% CI: 0.70-0.89),
corresponding to an NNT of 3.8 (see Table 2). Heterogeneity was very
high (I2=85; 95% CI: 83-86), and the prediction interval ranged from –0.45
to 2.04.
Post-test
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 14 of 32
:
Only low risk of bias 29 0.91 (0.46- 94 (92- –1.46 to 3.2
PI – prediction interval, NNT – number needed to treat. The reported publication bias
The sensitivity analyses supported the main $ndings (see Table 2 and
supplementary information). Heterogeneity was considerably lower after
excluding outliers (I2=26; 95% CI: 11-39), but the number of outliers that
had to be removed was large (n=77). The e"ect size was smaller for
studies with low risk of bias (g=0.60; 95% CI: 0.49-0.71) and after
adjusting for publication bias (g=0.47, 95% CI: 0.35-0.59 using the trim
and $ll procedure).
The subgroup analyses indicated that the e"ect size in studies with low
risk of bias was signi$cantly lower than in other studies (p<0.001), and
that the e"ect size di"ered across countries (higher in non-Western
countries; p=0.003) and treatment formats (higher for group formats;
p=0.02). There was no reduction of the e"ect size of CBT according to
the publication year (<2001 vs. 2001-2010 vs. >2011) (p=0.43). We
entered all variables in a multivariable meta-regression analysis and
found that, after adjustment for all variables, only the use of a waitlist
control condition (p=0.02) and whether the trial was conducted in an
“other” country (not the US, Europe, East Asia, Canada or Australia;
p=0.001) had a signi$cant impact on the e"ect size (see supplementary
information).
CBT was still e"ective at 6 to 9 month follow-up (g=0.74, 95% CI: 0.36-
1.11) and at 10 to 12 month follow-up (g=0.49, 95% CI: 0.01-0.98), and
this was con$rmed in most sensitivity analyses (see Table 2 and
supplementary information). Heterogeneity was high in most analyses.
At 13 to 24 month follow-up, the main e"ect size was no longer
signi$cant (g=0.22, 95% CI: –0.12 to 0.56), although this may be related to
the small number of studies (n=8).
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 15 of 32
:
The response rate was 0.42 (95% CI: 0.39-0.45) in CBT and 0.19 (95% CI:
0.18-0.21) in the control conditions, which resulted in a RR of 2.13 (95%
CI: 1.96-2.32) and a NNT of 4.7 (95% CI: 4.0-5.5) in favor of CBT (see
Table 3). Most sensitivity analyses indicated similar outcomes, except
that there was signi$cant publication bias, and the RR was lower in
studies with low risk of bias. The response rates di"ered signi$cantly
across control conditions, with the lowest rate for waitlist (see Table 3
and supplementary information).
Response
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 16 of 32
:
Publication bias 259 0.39 84 1.66 59 8.0
correction (0.36- (82- (1.48- (54- (6.2-
0.42) 85) 1.85) 64) 11.0)
The remission rate was 0.36 (95% CI: 0.31-0.42) for CBT and 0.15 (0.12-
0.18) for control conditions, which resulted in a RR of 2.45 (95% CI: 2.06-
2.92), and a NNT of 3.6 (95% CI: 2.7-5.0). This rate remained very similar
in the sensitivity analyses, although it was somewhat lower (but still
signi$cant) after adjustment for publication bias. These $ndings should
be considered with caution, because the di"erence between reported
and estimated remission rates was signi$cant (p=0.02) (see Table 3 and
supplementary information).
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 17 of 32
:
psychotherapies
When limiting the studies to those with low risk of bias, or excluding
outliers, or after adjustment for publication bias, the di"erence between
CBT and other psychotherapies was no longer signi$cant. In the
subgroup analyses in which we examined the di"erent psychotherapies
that were compared with CBT, we found no indication that one of these
therapies was more or less e"ective than CBT (see Table 4 and
supplementary information).
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 18 of 32
:
The number of studies reporting longer-term outcomes was small, and
no signi$cant di"erences between CBT and other psychotherapies were
found at 6-9 months, 9-12 months, or 13-24 months (see Table 4 and
supplementary information).
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 19 of 32
:
di"erences were found in subgroup analyses (see Table 4 and
supplementary information).
Combined treatment was not signi$cantly more e"ective than CBT alone
(g=0.19; 95% CI: –0.11 to 0.50) in the 15 relevant studies (14
comparisons; 644 participants, including 325 in the combined and 319 in
the CBT only conditions). Only one of three analyses in which we
adjusted for publication bias resulted in a signi$cant e"ect size in favor
of combined treatment. Because of the limited number of trials, we
could only conduct a limited number of subgroup analyses, and none of
them resulted in signi$cant di"erences between subgroups (see Table 4
and supplementary information).
Other comparisons
Unguided self-help CBT (Internet-based or not) had a small to moderate
e"ect on depression (g=0.45; 95% CI: 0.31-0.60), based on 36 studies (39
comparisons; 11,720 participants, including 6,206 in the CBT and 5,514 in
the control conditions). The e"ects of unguided CBT were signi$cant in
all sensitivity analyses, although they were somewhat smaller in two of
three analyses adjusting for publication bias. Subgroup analyses
indicated that waitlist-controlled trials resulted in larger e"ect sizes
(p=0.03), and studies in Europe resulted in smaller e"ects (p=0.01). We
also found that studies conducted after 2011 had signi$cantly larger
e"ects than earlier studies (p=0.01), suggesting that the e"ects may have
improved over time (see Table 5 and supplementary information).
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 20 of 32
:
Table 5. Other comparisons between cognitive behavior therapy (CBT)
and control conditions
NNT – number needed to treat. The reported publication bias correction is that using the
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 21 of 32
:
We could compare CBT in institutional settings to control conditions in
10 studies (11 comparisons; 448 participants, including 275 in CBT and
173 in the control conditions). Five studies (six comparisons) were
conducted in psychiatric inpatient settings, four in nursing homes, and
one in another institutional setting. None of the trials was rated as at low
risk of bias. We found a moderate to large e"ect (g=0.65; 95% CI: 0.21-
1.08) with high heterogeneity, which remained signi$cant in most
sensitivity analyses, but was no longer signi$cant in two of the three
analyses adjusting for publication bias (see Table 5 and supplementary
information). Because of the small number of trials and the low quality,
we did not conduct subgroup analyses.
DISCUSSION
This is the largest meta-analysis ever of a speci$c type of psychotherapy
for a mental disorder, including 409 RCTs (518 comparisons) with 52,702
patients. CBT was found to be e"ective in depression when compared to
control conditions such as usual care and waitlist, with a moderate to
large e"ect size (g=0.79). This e"ect was robust in several sensitivity
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 22 of 32
:
analyses, although it was somewhat smaller for studies with low risk of
bias (g=0.60) and after adjustment for publication bias (g=0.47). CBT was
still signi$cantly e"ective at 6-9 month (g=0.74) and 10-12 month
(g=0.49) follow-up, and this was con$rmed in most sensitivity analyses.
Research on CBT has evolved over time. The quality of studies has
improved, which can be seen from the increasing number of trials with
low risk of bias, the decrease in the use of waitlist control groups, and
the increase in sample sizes of included studies. The number of
treatment sessions has signi$cantly decreased over the years. In a meta-
regression analysis, we could not con$rm that the e"ect size of CBT has
37
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 23 of 32
:
decreased over time, as was suggested in an earlier study37.
ACKNOWLEDGEMENTS
Supplementary information on this study is available at
http://osf.io/a6p3w.
REFERENCES
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 24 of 32
:
mental health for all. Geneva: World Health Organization, 2022.
Google Scholar
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 25 of 32
:
7 Cuijpers P, Karyotaki E, de Wit L et al. The e"ects of $fteen evidence-
supported therapies for adult depression: a meta-analytic review.
Psychother Res 2020; 30: 279-93.
Google Scholar
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 26 of 32
:
PubMed Web of Science® Google Scholar
15 Sterne JAC, Savovic J, Page MJ et al. RoB 2: a revised tool for assessing
risk of bias in randomised trials. BMJ 2019; 366:l4898.
Google Scholar
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 27 of 32
:
PubMed Web of Science® Google Scholar
Google Scholar
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 28 of 32
:
CAS PubMed Web of Science® Google Scholar
Google Scholar
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 29 of 32
:
analyses: a structural equation modeling approach. Psychol Methods 2014;
19: 211-29.
33 Furukawa TA. From e"ect size into number needed to treat. Lancet
1999; 353: 1680.
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 30 of 32
:
consensus de$nitions of terms in major depressive disorder: remission,
recovery, relapse, and recurrence. Arch Gen Psychiatry 1991; 48: 851-5.
Citing Literature
Download PDF
Privacy Policy
Terms of Use
About Cookies
Manage Cookies
Accessibility
Wiley Research DE&I Statement and Publishing Policies
Developing World Access
Contact Us
Training and Support
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 31 of 32
:
DMCA & Reporting Piracy
OPPORTUNITIES
Subscription Agents
Advertisers & Corporate Partners
https://onlinelibrary.wiley.com/doi/full/10.1002/wps.21069 07/06/2025, 13 22
Page 32 of 32
: