Generative Artificial Intelligence, Human Creativity, and
https://doi.org/10.1093/pnasnexus/pgae052
Advance access publication 5 March 2024
Research Report
Abstract
Recent artificial intelligence (AI) tools have demonstrated the ability to produce outputs traditionally considered creative. One such
system is text-to-image generative AI (e.g. Midjourney, Stable Diffusion, DALL-E), which automates humans’ artistic execution to
generate digital artworks. Utilizing a dataset of over 4 million artworks from more than 50,000 unique users, our research shows that
over time, text-to-image AI significantly enhances human creative productivity by 25% and increases the value as measured by the
likelihood of receiving a favorite per view by 50%. While peak artwork Content Novelty, defined as focal subject matter and relations,
increases over time, average Content Novelty declines, suggesting an expanding but inefficient idea space. Additionally, there is a
consistent reduction in both peak and average Visual Novelty, captured by pixel-level stylistic elements. Importantly, AI-assisted
artists who can successfully explore more novel ideas, regardless of their prior originality, may produce artworks that their peers
evaluate more favorably. Lastly, AI adoption decreased value capture (favorites earned) concentration among adopters. The results
suggest that ideation and filtering are likely necessary skills in the text-to-image process, thus giving rise to “generative synesthesia”—
the harmonious blending of human exploration and AI exploitation to discover new creative workflows.
Keywords: generative AI, human–AI collaboration, creative workflow, impact of AI, art
Significance Statement
We investigate the implications of incorporating text-to-image generative artificial intelligence (AI) into the human creative workflow.
We find that generative AI significantly boosts artists’ productivity and leads to more favorable evaluations from their peers. While
average novelty in artwork content and visual elements declines, peak Content Novelty increases, indicating a propensity for idea
exploration. The artists who successfully explore novel ideas and filter model outputs for coherence benefit the most from AI tools,
underscoring the pivotal role of human ideation and artistic filtering in determining an artist’s success with generative AI tools.
artifacts, there remains a significant knowledge gap regarding its impact on productivity in artistic endeavors which lack
well-defined objectives, and the long-run implications on human creativity more broadly. In particular, if humans increasingly rely
on generative AI for content creation, creative fields may become saturated with generic content, potentially stifling exploration of
new creative frontiers. Given that generative algorithms will remain a mainstay in creative domains as they continue to mature,
it is critical to understand how generative AI is affecting creative production, the evaluation of creative artifacts, and human
creativity more broadly. To this end, our research questions are 3-fold:

Creative productivity
We define creative productivity as the log of the number of artifacts that a user posts in a month. Figure 1a reveals that upon
adoption, artists experience a 50% increase in productivity on average, which then doubles in the subsequent month. For the
average user, this translates to approximately 7 additional artifacts published in the adoption month and 15 artifacts in the
following month. Beyond the adoption month, user productivity gradually stabilizes to a level that still exceeds preadoption
volume. By automating the execution stage of the creative process, adopters can experience prolonged productivity gains compared
to their nonadopter counterparts.
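As an illustration of the productivity measure defined above, the following minimal sketch computes the log of monthly post counts per user. The function names, the `(user, month)` input layout, and the sample data are hypothetical and not taken from the paper; user-months with no posts are simply absent here, which is an assumption, since the log of zero is undefined and the text does not say how such months are handled. The helper that converts a difference in logs into a relative change is standard arithmetic, and whether the reported percentages were obtained this way is not stated in the text.

```python
import math
from collections import Counter


def log_monthly_posts(posts):
    """Return {(user, month): log(number of artifacts posted)}.

    `posts` is an iterable of (user, month) pairs, one pair per artifact.
    User-months without posts do not appear in the result (assumption).
    """
    counts = Counter(posts)
    return {key: math.log(n) for key, n in counts.items()}


def log_diff_to_relative_change(delta):
    """Convert a difference in log counts into a relative change.

    For example, a difference of log(1.5) corresponds to a 50% increase.
    """
    return math.expm1(delta)


# Hypothetical data: user "u1" posts twice in August and once in September.
posts = [("u1", "2022-08"), ("u1", "2022-08"), ("u1", "2022-09"), ("u2", "2022-08")]
productivity = log_monthly_posts(posts)
```

Working on the log scale means that differences between months read as proportional rather than absolute changes, which keeps very prolific users from dominating an average.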
Fig. 1. Causal effect of adopting generative AI on a) creative productivity as the log of monthly posts; b) creative value as number of favorites per view; c)
mean Content Novelty; d) maximum Content Novelty; e) mean Visual Novelty; f) maximum Visual Novelty. The error bars represent 95% CI.
may be designed to capture specific visual elements from which users can sample to maintain a particular and consistent
visual style. In effect, AI may be pushing artists toward visual homogeneity.

Role of human creativity in AI-assisted value capture
Although aggregate trends suggest novelty of ideas and aesthetic features is sharply declining over time with generative AI, are
there individual-level differences that enable certain artists to successfully produce more creative artworks? Specifically, how
does humans’ baseline novelty, in the absence of AI tools, correlate with their ability to successfully explore novel ideas with
generative AI to produce valuable artifacts? To delve into this heterogeneity, we categorize each user into quartiles based on
their average Content and Visual Novelty without AI assistance to capture each user’s baseline novelty. We then employ a
two-way fixed effects model to examine the interaction between adoption, pretreatment novelty quartiles, and posttreatment
4 | PNAS Nexus, 2024, Vol. 3, No. 3
adjustments in novelty. Each point in Fig. 2a and b represents the estimated impact of increasing mean Content (left) or Visual
(right) Novelty on Value based on artists’ prior novelty denoted along the horizontal axis. Intuitively, these estimates quantify
the degree to which artists can successfully navigate the creative space based on prior originality in both ideation and visuals to
earn more favorable evaluations from peers. Refer to SI Appendix, Section 2B for estimation details.
Figure 2a presents correlational evidence that users, regardless of their proficiency in generating novel ideas, might be able to
realize significant gains in Value if they can successfully produce more novel content with generative AI. The lowest quartile of
content creators may also experience marginally significant gains. However, those same users who benefit from expressing more
novel ideas may also face penalties for producing more divergent visuals.
Next, Fig. 2b suggests that users who were proficient in creating exceedingly novel visual features before adopting generative AI
may garner the most Value gains from successfully introducing more novel ideas. While marginally significant, less proficient
users can also experience weak Value gains. In general, more novel ideas are linked to improved Value capture. Conversely, users
capable of producing the most novel visual features may face penalties for pushing the boundaries of pixel-level aesthetics with
generative AI. This finding might be attributed to the contextual nature of Value, implying an “acceptable range” of novelty.
Artists already skilled at producing highly novel pixel-level features may exceed the limit of what can be considered coherent.
Despite penalties for pushing visual boundaries, the gains from exploring creative ideas with AI outweigh the losses from visual
divergence. Unique concepts take priority over novel aesthetics, as shown by the larger Value gains for artists who were already
adept at Visual Novelty before using AI. This suggests users who naturally lean toward visual exploration may benefit more from
generative AI tools to explore the idea space.
Lastly, we estimate Generalized Random Forests (10) configured to optimize the splitting criteria that maximize heterogeneity
in Value gains among adopters for each postadoption period. With each trained model, we extract feature importance
weights quantified by the SHAP (SHapley Additive exPlanations) method (11). This method utilizes ideas from cooperative game
theory to approximate the predictive signal of covariates, accounting for linear and nonlinear interactions through the
Markov chain Monte Carlo method. Intuitively, a feature of greater importance indicates potentially greater impacts on treatment
effect heterogeneity among adopters.
Figure 3 offers correlational evidence that Content Novelty significantly increases model performance within several months of
adoption, whereas Visual Novelty remains marginally impactful until the last observation period. This suggests that Content
Novelty plays a more significant role in predicting posttreatment variations in Value gains compared to Visual Novelty. In
summary, these findings illustrate that content is king in the text-to-image creative paradigm.

Platform-level value capture
One question remains: do individual-level differences within adopters result in greater concentrations of value among fewer
users at the platform-level? Specifically, are more favorites being captured by fewer users, or is generative AI promoting less
concentrated value capture? To address these questions, we calculate the Gini coefficients with respect to favorites received of
never-treated units, not-yet-treated units, and treated units and conduct permutation tests with 10,000 iterations to evaluate if
adoption of generative AI may lead to a less concentrated distribution of favorites among users. The Gini coefficient is a common
measure of aggregate inequality where a coefficient of 0 indicates that all users make up an equal proportion of favorites earned,
and a coefficient of 1 indicates that a single user captures all favorites. Thus, higher values of the Gini coefficient indicate a
greater concentration of favorites captured by fewer users. Figure 4 depicts the differences in cumulative distributions as well as
Gini coefficients of both control groups and the treated group with respect to a state of perfect equality.
First, observe that platform-level favorites are predominantly captured by a small portion of users, reflecting an aggregate
concentration of favorites. Second, this concentration is more pronounced among not-yet-treated units than among never-treated
units. Third, despite the presence of aggregate concentration,
Zhou and Lee | 5
Fig. 4. Gini coefficients of treated units vs. never-treated and not-yet-treated units.
favorites captured among AI adopters are more evenly distributed compared to both never-treated and not-yet-treated control units.
The results from the permutation tests in Table 1, where column D shows the difference between the treated coefficient and the
control group coefficients, show that the differences in coefficients are statistically significant between never-treated and
not-yet-treated groups vs. the treated group. This suggests that generative AI may lead to a broader allocation of favorites earned
(value capture from peer feedback), particularly among control

Novelty diminish. This implies that the universe of creative possibilities is expanding but with some inefficiencies.
Our results hint that the widespread adoption of generative AI technologies in creative fields could lead to a long-run equilibrium
where in aggregate, many artifacts converge to the same types of content or visual features. Creative domains may be inundated
with generic content as exploration of the creative space diminishes. Without establishing new frontiers for creative exploration,
AI systems trained on outdated knowledge banks run the risk of perpetuating the generation of generic content at a mass scale
in a self-reinforcing cycle (17). Before we reach that point, technology firms and policy makers pioneering the future of generative AI

Table 1. Permutation tests for statistical significance.
                   Coefficient   D          P-value
Never-treated      0.807         −0.0128    0.0673
Not-yet-treated    0.824         −0.0298    0.0026
Treated            0.794
The column D denotes the difference in Gini coefficients relative to the treated population.
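The Gini coefficient and permutation test described in the platform-level analysis can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function names and sample favorites counts are hypothetical, the Gini estimator is the common rank-weighted form (which reaches (n − 1)/n rather than exactly 1 when a single user holds everything), and the two-sided comparison with pooled relabeling is one reasonable design that the text does not confirm.

```python
import random


def gini(values):
    """Gini coefficient of non-negative values.

    Returns 0.0 when all shares are equal and (n - 1) / n when a single
    unit holds the entire total, which approaches 1 as n grows.
    """
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Rank-weighted formula applied to ascending-sorted data.
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1) / n


def permutation_test(treated, control, iterations=10_000, seed=0):
    """Return (D, p) where D = gini(treated) - gini(control).

    p is the share of random relabelings whose absolute difference is at
    least as large as the observed one (two-sided; an assumption).
    """
    rng = random.Random(seed)
    observed = gini(treated) - gini(control)
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    extreme = 0
    for _ in range(iterations):
        rng.shuffle(pooled)
        diff = gini(pooled[:n_treated]) - gini(pooled[n_treated:])
        if abs(diff) >= abs(observed):
            extreme += 1
    return observed, extreme / iterations


# Hypothetical favorites counts, for illustration only.
treated_favorites = [3, 5, 8, 13, 21, 34]
control_favorites = [0, 0, 1, 2, 40, 90]
d, p_value = permutation_test(treated_favorites, control_favorites, iterations=1_000)
```

A negative D in this sketch means favorites are less concentrated in the treated group than in the control group, matching the sign convention of column D in Table 1.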
ideas. This finding hints at the importance of humans’ baseline ideation and filtering abilities as focal expressions of creativity
in a text-to-image paradigm. Finally, generative AI appears to promote a more even distribution of platform-level favorites among
adopters, signaling a potential step toward an increasingly democratized, inclusive creative domain for artists empowered by AI
tools.
In summary, our findings emphasize that humans’ ideation proficiency and a refined artistic filter rather than pure mechanical
skill may become the focal skills required in a future of human–AI cocreative process as generative AI becomes more mainstream
in creative endeavors. This phenomenon in which AI-assisted

and maximum distances for each artifact. This month’s artifacts are then added to the baseline set such that all future artworks are
compared to all prior artworks, effectively capturing the time-varying nature of novelty. Continue for all remaining months.
We apply this approach to all adopters’ artworks and a random sample of 10,000 control users due to computational feasibility.

Content feature extraction
To describe the focal objects and object relationships in an artifact, we utilize state-of-the-art multimodal model BLIP-2 (24)
which takes as input an image and produces a text description of the content. A key feature of this approach is the availability
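The month-by-month novelty procedure outlined above (score each artifact against all prior artworks, then fold the month into the baseline) can be sketched as follows. Several details are assumptions because the excerpt does not state them: artifacts are represented as non-zero numeric feature vectors, distance is cosine distance, and the statistic paired with the maximum is the mean. All names are hypothetical.

```python
import math


def cosine_distance(u, v):
    """1 - cosine similarity of two non-zero vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def rolling_novelty(artifacts_by_month, baseline):
    """Score each month's artifacts against all earlier artifacts.

    artifacts_by_month: list of months in chronological order, each a
        list of feature vectors.
    baseline: non-empty list of vectors that predate the first month.
    Returns one list per month of (mean_distance, max_distance) tuples.
    """
    baseline = list(baseline)  # copy so the caller's list is untouched
    results = []
    for month_vectors in artifacts_by_month:
        month_scores = []
        for vec in month_vectors:
            distances = [cosine_distance(vec, prior) for prior in baseline]
            month_scores.append((sum(distances) / len(distances), max(distances)))
        results.append(month_scores)
        # Add this month only after every artifact in it has been scored,
        # so artifacts are never compared with same-month peers.
        baseline.extend(month_vectors)
    return results


# Toy two-dimensional vectors, for illustration only.
scores = rolling_novelty(
    artifacts_by_month=[[[0.0, 1.0]], [[1.0, 0.0]]],
    baseline=[[1.0, 0.0]],
)
```

Because the baseline grows every month, the cost of scoring rises with the size of the archive, which is consistent with the text's note that control users were subsampled for computational feasibility.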
Supplementary Material
Supplementary material is available at PNAS Nexus online.

Funding
The authors declare no funding.

Author Contributions
D.L. and E.Z. designed the research and wrote the paper. E.Z. analyzed data and performed research with guidance from D.L.

9 Boden MA. 1998. Creativity and artificial intelligence. Artif Intell. 103(1–2):347–356.
10 Athey S, Tibshirani J, Wager S. 2019. Generalized random forests. Ann Statist. 47(2):1148–1178.
11 Lundberg S, Lee S-I. 2017. A unified approach to interpreting model predictions. In: Proceedings of the 31st International
Conference on Neural Information Processing Systems. p. 4768–4777.
12 Xu Y. 2017. Generalized synthetic control method: causal inference with interactive fixed effects models. Polit Anal. 25(1):57–76.
13 Peng S, Kalliamvakou E, Cihon P, Demirer M. 2023. The impact of AI on developer productivity: evidence from github copilot,