GNN-Computer aided Civil Eng - 2022 - Song - Elastic structural analysis based on graph neural network without labeled data
GNN-Computer aided Civil Eng - 2022 - Song - Elastic structural analysis based on graph neural network without labeled data
12944
RESEARCH ARTICLE
1 INTRODUCTION in the digital world (Lee et al., 2018; Salehi & Burgueño,
2018).
The emergence of new-generation artificial intelligence Currently, research on ML/DL-based computation in
technologies represented by machine learning (ML) and civil engineering has addressed all levels of analysis scenar-
deep learning (DL; Dong et al., 2021; Goodfellow et al., ios. At the construction material level, Rafiei et al. (2017)
2016; LeCun et al., 2015) is attracting an increasing number presented a deep restricted Boltzmann machine to esti-
of scholars. These technologies have been applied in civil mate concrete properties based on mixture proportions.
engineering, particularly in the fields of computational Nguyen et al. (2018) adopted artificial neural networks
structural analysis and computational material analysis (ANNs) for predicting the strength of foamed concrete.
(Amezquita-Sancheza et al., 2020; Sun et al., 2021). It is C. Wang et al. (2022) first introduced the Seq2Seq frame-
anticipated that the performance of these technologies will work and then the attention mechanism (Wang et al.,
surpass the performance of traditional numerical methods 2022) to simulate the cyclic behavior of low-yield-point
to realize the efficient simulation of engineering structures steel. Kang et al. (2021) predicted the strength of steel
fiber-reinforced concrete using 11 ML algorithms and ana- cannot interpret the analysis results (Naser, 2021) and eval-
lyzed the influential factors. At the structural member uate the correctness of the models based on the test set; this
level, Olalusi and Awoyera (2021) focused on the shear is unacceptable for engineering applications where safety
capacity of slender RC structures with steel fibers and is the primary goal.
used Gaussian process regression for prediction. Wakjira To address these limitations, an innovative physics-
et al. (2022) employed ensemble ML to estimate the shear informed DL model named StructGNN-E (i.e., structural
capacity of fiber-reinforced-polymer (FRP) reinforced con- analysis based on graph neural network [GNN]–elastic),
crete. Naderpour et al. (2021) utilized a decision tree which is able to implement elastic analysis of structural
and ANNs to determine the failure mode of RC columns systems without labeled data, is proposed in this study. The
and achieved exceptional performance. At the structural remainder of this paper is organized as follows. In Sec-
system level, Rafiei and Adeli (2017) combined ML clas- tion 2, we analyze the data representations of structural
sification algorithms with an optimization process for systems and elaborate on how to utilize non-Euclidean
earthquake prediction. Zhu et al. (2021) attempted to use data—the graph—to organize their feature information.
ANNs and support vector regression to calculate the buck- On this basis, we develop a GNN architecture to handle
ling load of imperfect reticulated shells, avoiding costly the structural analysis of elastic structural systems in Sec-
computation in nonlinear analyses. Recently, the deep long tion 3, wherein the basic knowledge of GNNs is introduced
short-term memory (LSTM) proposed by Hochreiter and and a variant of the graph isomorphism network (GIN)
Schmidhuber (1997) model shows great potential in seis- adapting to the structural analysis scenario is illustrated. In
mic analysis. Zhang et al. (2019) developed an LSTM model addition, we propose a physics-informed paradigm, which
to compute the seismic responses of a specific building, integrates fundamental mechanical formulations into DL,
and the results agreed with the reference data well. Torky to resolve the paucity of data at the structural level. In Sec-
and Ohno (2021) combined a convolutional neural net- tion 4, we validate the StructGNN-E model with randomly
work (CNN) with LSTM to predict the seismic response of generated frame structures of different scales. In Section 5,
an industrial-level building. we carry out ablation studies, which further demonstrate
Valuable results have been acquired through relevant the unique effectiveness of the StructGNN-E model, and
studies and have helped engineers efficiently build compli- conceptually discuss the potential application in the field
cated quantitative relationships in structural computation of structural optimization. Finally, we conclude the cur-
(Feng et al., 2020; Guan et al., 2021; Graf et al., 2012; Kabir rent work and discuss the outlook of our future work in
et al., 2021; Oh et al., 2020) and optimization (Hung & Jan, Section 6.
2002; Messner et al., 1994; Parvin & Serpen, 1999; Yin &
Zhu, 2019). However, at the structural system level, exist-
ing studies have the following limitations. (1) In the current 2 DATA REPRESENTATION USING
literature, the data representation of structural systems has GRAPH STRUCTURES
rarely been a point of focus, which can possess various
topologies due to different structural member configu- The mechanism of structural analysis can be abstracted
rations and their connectivity, resulting in exponentially as predicting structural responses = (𝐘𝑡0 , … , 𝐘𝑡𝑛 ) that
increasing complexity, compared to that of materials and conform to physical laws given structural properties =
structural members. The linear data structure used in cur- (𝐂1 , 𝐂2 , …) and external stimuli = (𝐗𝑡0 , … , 𝐗𝑡𝑛 ) (C. Wang
rent studies cannot comprehensively describe the feature et al., 2020). At the structural system level, such as a
information of structural systems. (2) Applications at this steel frame structure, the structural properties Ci corre-
level face severe data scarcity. Most ML/DL models adopt spond to the overall spatial topologies, section geometries
a data-driven paradigm, which relies heavily on big data of underlying steel members, material properties of struc-
for learning underlying patterns. Nevertheless, there is lit- tural steel, and so forth. The stimulus-response pair < Xtj ,
tle experimental data concerning structural systems, and Ytj > represents structural analysis results such as the
data generation using classic finite element (FE) analysis is load‒displacement relationship. The subscript tj indicates
inordinately time-consuming and cannot cover the entire the stimulus/response at location j at time t.
parameter space. It is well-known that common data- To address the problem, traditional methods, including
driven models suffer from data sparsity (Martins et al., numerical approaches operating at the level of structural
2020) and are not directly applicable at this level. (3) The matrices and mechanical approaches operating at the
theoretical correctness of the analysis results is difficult level of nodes or FE elements, follow the idea of solving
to guarantee. The inference process of data-driven mod- a large problem by decomposing it into a number of
els often neglects distinctive mechanical backgrounds of small sub-problems and have been proven useful in civil
structural analysis. Accordingly, researchers and engineers engineering. Methods belonging to the former category
14678667, 2023, 10, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/mice.12944 by University Of Connecticut, Wiley Online Library on [07/06/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
SONG et al. 1309
FIGURE 2 Graph representation of a planar frame structure. (a) Schematic illustration of a planar frame structure and (b) graph
representation
denote an edge connecting vi and vj . The neighborhood of physical positions of each structural member and their spa-
a node v is defined as tial information, such as their lengths. As the subsequent
sections will show, node features of coordinates play a
𝑁 (𝑣) = {𝑢 ∈ 𝑉| (𝑣, 𝑢) ∈ 𝐸} (2) critical role in structural analysis.
Based on the concepts and notations of graph data, we
One of the most commonly used graph representations can properly digitalize arbitrary structural systems and
is an adjacency matrix A with Aij = 1 if 𝑒𝑖𝑗 ∈ 𝐸 and Aij = 0 if preserve their feature information with high fidelity, facil-
𝑒𝑖𝑗 ∉ 𝐸. To save storage space, we can extend the adjacency itating intelligent structural analysis by downstream DL
matrix A to record structural properties such as section models.
geometries and stiffness by utilizing the symmetry of undi-
rected graphs. For example, the extended adjacency matrix
of the planar structure in Figure 2 can be written as 3 STRUCTGNN-E MODEL
⎡ −1 𝐸𝐼0,1 0 0 0 0 0 0 0 0 0 ⎤ In practice, most engineering structures adopt elastic anal-
⎢ ⎥
⎢𝐸𝐴0,1 0 𝐸𝐼1,2 0 𝐸𝐼1,4 0 0 0 0 0 0 ⎥ ysis solutions under static loads during the design stage.
⎢ ⎥
⎢ 0 𝐸𝐴1,2 0 0 0 𝐸𝐼2,5 0 0 0 0 0 ⎥ Furthermore, elastic cases also serve as the foundations
⎢ ⎥
⎢ 0 0 0 −1 𝐸𝐼3,4 0 0 0 𝐸𝐼3,8 0 0 ⎥ of elasto-plastic cases, which provide key techniques and
⎢ ⎥
⎢ 0 𝐸𝐴 0 𝐸𝐴3,4 0 𝐸𝐼4,5 0 0 𝐸𝐼4,8 𝐸𝐼4,9 0 ⎥ help develop the general framework. Therefore, this study
1,4
⎢ ⎥
𝐴=⎢ 0 0 𝐸𝐴2,5 0 𝐸𝐴4,5 0 𝐸𝐼5,6 0 0 𝐸𝐼5,9 𝐸𝐼5,10 ⎥ takes elastic structural systems under static loads as the
⎢ ⎥
⎢ 0 0 0 0 0 𝐸𝐴5,6 0 0 0 0 𝐸𝐼6,10 ⎥⎥ research object.
⎢
⎢ 0 0 0 0 0 0 0 −1 𝐸𝐼7,8 0 0 ⎥⎥
⎢ Adapting to the graph data representation, we propose
⎢ 0 0 0 𝐸𝐴3,8 𝐸𝐴4,8 0 0 𝐸𝐴7,8 0 𝐸𝐼8,9 0 ⎥⎥
⎢ a DL structural analysis model named StructGNN-E (i.e.,
⎢ ⎥
⎢ 0 0 0 0 𝐸𝐴4,9 𝐸𝐴5,9 0 0 𝐸𝐴8,9 0 𝐸𝐼9,10 ⎥ structural analysis based on GNN–elastic) within the GNN
⎢ ⎥
⎣ 0 0 0 0 0 𝐸𝐴5,10 𝐸𝐴6,10 0 0 𝐸𝐴9,10 0 ⎦ framework. The model architecture is shown in Figure 3.
(3) The feature information of structural systems is trans-
where each diagonal element Aii indicates the boundary formed into graph data and sent to a variant of GIN,
condition of node vi (−1 for a confined node); each element which computes the internal force distribution using a
in the upper triangle Aij,i<j represents the bending rigid- message-passing mechanism. The whole model is driven
ity of the structural member between node vi and node vj ; by structural mechanics without labeled data, overcoming
and each element in the lower triangle Aij,i>j represents the data scarcity problem at the structural system level.
the axial rigidity.
Apart from the fundamental representation above, both
the nodes and edges can have attributes 𝐗𝑣 ∈ ℝ𝑛×𝑑 (n is 3.1 Basics of GNNs
the number of nodes and d is the dimension of node fea-
tures) and 𝐗𝑒 ∈ ℝ𝑚×𝑐 (m is the number of edges and c is GNNs are a kind of emerging DL technique that extends
the dimension of edge features) to enhance the expressive- classical neural networks such as CNNs and RNNs to pro-
ness of the graph. For structural systems, the coordinates cess non-Euclidean graph data. A fundamental assump-
of joints are important node features that determine the tion underlying GNNs is that the targets for prediction
14678667, 2023, 10, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/mice.12944 by University Of Connecticut, Wiley Online Library on [07/06/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
SONG et al. 1311
F I G U R E 3 Architecture of the StructGNN-E (structural analysis based on graph neural network [GNN]–elastic) model. GIN, graph
isomorphism network
(0)
𝒉𝑣 = 𝒙𝑣 , ∀𝑣 ∈ 𝑉 (4)
( )
(𝑙) (𝑙) (𝑙−1) (𝑙−1)
𝒎𝑢𝑣 = 𝑀𝑆𝐺 𝒉𝑣 , 𝒉𝑢 , ∀ (𝑢, 𝑣) ∈ 𝐸 (5)
({ })
(𝑙) (𝑙) (𝑙)
𝒂𝑣 = AGG 𝒎uv |𝑢 ∈ 𝑁 (𝑣) , ∀𝑣 ∈ 𝑉 (6)
( )
(𝑙) (𝑙) (𝑙−1) (𝑙)
𝒉𝑣 = UPT 𝒉𝑣 , 𝒂𝑣 , ∀𝑣 ∈ 𝑉 (7)
(𝑙) In this manner, nodes that have the same local structure,
Typically, 𝑀𝑆𝐺 is embedded into the aggregation func-
(𝑙) such as node i and node j in Figure 5, can be distinguished
tion 𝐴𝐺𝐺 or simply preserves the representation from
(𝑙) (𝑙) easily because the messages (i.e., the coordinates) passed
the previous layer. The choices of 𝐴𝐺𝐺 and 𝑈𝑃𝑇 are from their neighborhood nodes differ and thus can be
crucial and derive different GNN models. For instance, projected to discriminative representations through sim-
the graph convolutional network (Kipf & Welling, 2017) ple mathematical operations. This method corresponds to
selects mean pooling as the aggregation function and a a feature augmentation strategy in which node IDs are
single-layer neural network as the updater: used to enhance the expressiveness of graph data, which
( { }) is popular in recommender systems (J. Wang et al., 2018).
(𝑙) (𝑙−1)
𝒉𝑣 = ReLU 𝑾 (𝑙) ⋅ MEAN 𝒉𝑢 |∀𝑢 ∈ 𝑁 (𝑣) ∪ {𝑣} For nodes with different degrees, such as nodes s and
(8) j, we resort to carefully designing the message aggrega-
Alternatively, in GraphSAGE (Hamilton et al., 2017), a tor. We revisit classical aggregators—sum pooling, average
max pooling aggregator is used, and the updater adds a pooling, and max pooling—to analyze their effectiveness.
concatenation operation before the linear mapping: On the right side of Figure 5, we illustrate the “messages”
from the neighborhood of nodes s and j. We calculate the
( [ ]) aggregation results using classical aggregators as shown in
(𝑙) (𝑙) (𝑙−1) (𝑙)
𝒉𝑣 = ReLU 𝑾𝑈𝑃𝑇 ⋅ 𝒉𝑣 , 𝒂𝑣 (9)
Table 1. By comparing the results at two nodes, we can
determine the cases in which sum pooling and mean pool-
({ ( ) }) ing fail to distinguish the local structures. For example,
(𝑙) (𝑙) (𝑙−1)
𝒂𝑣 = MAX ReLU 𝑾AGG ⋅ 𝒉𝑢 , ∀𝑢 ∈ 𝑁 (𝑣) when a = 7d, b = 7 h, x = 9d, and y = 9 h, the sum pooling
(10) aggregator yields the same message information of (36d,
36 h). Max pooling seems the most powerful in this situ-
The lReLU operation in Equations (8) to (10) is the
ation. However, in Figure 5, nodes s and r, which share
rectified linear unit:
the same maximal neighborhood coordinate (a + d, b +
h), will be confused. In summary, classical aggregators are
ReLU (𝑥) = 𝑥 ⊙ 1𝑥≥0 (11)
incapable of adapting to all cases in structural systems.
Distinguishing different nodes with similar local struc-
where 𝟏𝑐𝑜𝑛𝑑𝑖𝑡𝑖𝑜𝑛 is the indicator function and ⊙ denotes the
tures corresponds to the well-known graph isomorphism
Hadamard product.
problem (Weisfeiler & Leman, 1968). An eligible GNN is
expected to map different multisets of node features to dif-
ferent representations, requiring the aggregation function
3.2 Modification of the GIN to be injective. Considering that the neighborhood nodes
in structural systems are countable and finite, we adopt the
In the structural analysis scenario, an effective GNN philosophy of GIN (Xu et al., 2019), which states that for a
should be able to distinguish different nodes by mapping countable multiset, there exists a function ∶ → ℝ𝑛 so
them to different representations in the embedding space. that
Figure 5 displays an asymmetric frame structure, wherein
∑
nodes i and j have the same degree (i.e., the number of (𝑣, 𝑋) = (1 + 𝜖) ⋅ (𝑣) + (𝑥) (12)
edges connecting to the node). However, despite shar- 𝑥∈𝑋
ing the same local structure, the internal forces at these
two nodes react differently in most load cases. Moreover, is unique, where 𝑣 ∈ and 𝑋 ⊂ is a multiset of bounded
since these two nodes are located in the middle of the size. In addition, any function that acts on such pairs can
entire structure, very deep layers are required to reach the be decomposed as follows:
boundary so that the model can discriminate the subtrees [ ]
rooted in them, which is infeasible in practice. There- ∑
(𝑣, 𝑋) = 𝜙 (1 + 𝜖) ⋅ (𝑣) + (𝑥) (13)
fore, how GNNs can identify different nodes with similar 𝑥∈𝑋
local structures is a challenging problem in structural
analysis and imposes high demands on selecting an appro- Based on this statement, GIN uses multilayer percep-
priate aggregation function within the message-passing trons (MLPs) to model the composite function (𝑙) ◦𝜙(𝑙−1)
framework. by leveraging the universal approximation theorem
To address this problem, we first augment the graph data (Hornik et al., 1989, 1990). The original GIN thus updates
by introducing physical coordinates into the node features. node representations as follows:
14678667, 2023, 10, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/mice.12944 by University Of Connecticut, Wiley Online Library on [07/06/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
SONG et al. 1313
( )
(𝑙) ( (𝑙)) (𝑙−1) ∑ (𝑙−1) existing ML/DL studies on structural analysis rely heavily
𝒉𝑣 = MLP 1+𝜖 (𝑙) ⋅ 𝒉𝑣 + 𝒉𝑢 (14) on data from experiments or generated using FE analy-
𝑢∈𝑁(𝑣) sis to train the model parameters, which corresponds to
a supervised learning scheme. However, in contrast to
where 𝜖(𝑙) can be either learnable or fixed. Equation (12) construction materials and structural members such as
is applicable when node features are organized as one- beams and columns, experimental data are very scarce
hot encodings in the first iteration, which makes their at the structural system level. Meanwhile, data genera-
summation injective. To accommodate structural system tion through FE analysis will consume a large amount
scenarios, we modify the first layer to: of time and, more importantly, cannot cover the entire
( ) parameter space due to the high complexity of feature
(1) (1) ( ) (0)
∑ (0) information of structural systems. In addition to the data
𝒉𝑣 = MLP 1 + 𝜖(1) ⋅ MLP 𝒙𝑣 + MLP 𝒙𝑢
𝑢∈𝑁(𝑣) paucity problem, data-driven models are criticized for their
(15) poor interpretability and neglect the physical background
Then, the GIN model can preserve injectiveness while of structural engineering. As a result, researchers and engi-
recursively updating each node’s feature embedding to neers cannot evaluate the correctness of the prediction
capture the graph structure. results, which impedes the promotion of DL methods in
engineering projects.
3.3 Physics-informed modeling Structural analysis has a solid theory in mechanics.
Specifically, elastic cases are governed by three mechan-
The mainstream DL models are driven by big data, min- ical equations: the equilibrium equation, the deforma-
ing the latent patterns underlying the dataset. Accordingly, tion compatibility equation, and the elastic constitutive
14678667, 2023, 10, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/mice.12944 by University Of Connecticut, Wiley Online Library on [07/06/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
1314 SONG et al.
FIGURE 9 Comparison of bending moment diagrams. (a) Results using the StructGNN-E model and (b) results using SAP2000
FIGURE 10 Comparison of virtual work diagrams. (a) Results using the StructGNN-E model and (b) Results using SAP2000
efficiency, making our proposed model applicable to engi- cuss its potential application in the field of structural
neering practice. optimization.
N nodes . .
.
. . .
.
. . .
.
.
FIGURE 16 Structural optimization procedures with traditional methods and deep learning methods. FE, finite element
In contrast, the StructGNN-E model can modify the tra- superb performance in both computational accuracy and
ditional unidirectional paradigm to a new bidirectional efficiency.
paradigm. As shown in Figure 16, not only can the input The main conclusions are summarized as follows:
parameters be transformed into the final results, but the
gradients of the results can also be propagated back to 1. We innovatively utilize the graph data structure to orga-
update the input parameters. The backward flow is imple- nize the feature information with nonsequential and
mented by assimilating the input parameters as a part of translation-variant properties, which realizes the data
learnable parameters in DL, which can be co-optimized representation of structural systems with high fidelity.
by the gradient descent method. Due to the excellent 2. We propose the StructGNN-E model within the GNN
parameterization of DL, there is no need to rebuild the architecture to adapt to the graph representation,
model when the input parameters are changed. This new wherein we modify GIN to distinguish the nodes with
paradigm is much more efficient because, on the one hand, similar local topologies.
the information from the results is fully utilized to guide 3. We propose a novel physics-informed paradigm to
the optimization direction of the input parameters, and resolve the data scarcity problem at the structural
on the other hand, the optimizer integrated into the DL system level. The paradigm incorporates structural
framework, such as the Adam algorithm, is more advanced mechanics into DL and converts the training process
than traditional algorithms. Moreover, since the optimiza- to solving the physical equations, implementing the
tion and the structural analysis share the same training structural analysis without labeled data and ensuring
process, we can accomplish them simultaneously and not the theoretical correctness.
avoid implementing two separate algorithms. Therefore, 4. The numerical experiment verifies that StructGNN-E
the StructGNN-E model possesses great potential to inno- converges to accurate results of structures with different
vate the field of structural optimization, deriving a new scales and exhibits excellent computational efficiency.
end-to-end optimization paradigm with high efficiency 5. Ablation studies demonstrate the unique effectiveness
and automaticity. of StructGNN-E at the structural system level compared
with classical DL models and the data-driven paradigm.
6 CONCLUSION 6. The StructGNN-E model can derive a new end-to-end
structural optimization method with high efficiency
To fill the research gap of ML/DL-based analysis at the and automaticity, which is anticipated to have great
structural system level, we propose a physics-informed potential to reform the field of structural optimization.
DL model named StructGNN-E based on the graph data
structure. A numerical experiment and ablation studies Nevertheless, the current framework is applicable for
demonstrate that our proposed model is uniquely effec- elastic analysis and frame-like structures. The implemen-
tive, compared with classical DL models and exhibits tation of the physics-informed paradigm can be complex
14678667, 2023, 10, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/mice.12944 by University Of Connecticut, Wiley Online Library on [07/06/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
1322 SONG et al.
for elasto-plastic scenarios when the memory effect is Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feed-
taken into account. Structures with high-dimensional forward networks are universal approximators. Neural Networks,
members (such as shear walls) or nonlinear deformation 2(5), 359–366.
Hornik, K., Stinchcombe, M., & White, H. (1990). Universal approx-
are difficult to fit into the current framework. Despite these
imation of an unknown mapping and its derivatives using
limitations, StructGNN-E provides a valuable technical
multilayer feedforward networks. Neural Networks, 3(5), 551–560.
framework for intelligent computation at the structural Huang, P., & Chen, Z. (2021). Deep learning for nonlinear seismic
system level, including a graph data representation to responses prediction of subway station. Engineering Structures,
describe the structural topologies, a GNN architecture to 244, 112735.
gather the information from neighborhood nodes, and a Hung, S., & Jan, J. (2002). Machine learning in engineering analysis
physics-informed paradigm to alleviate the dependence on and design: An integrated fuzzy neural network learning mode.
big data, which paves an inspiring avenue for our future Computer-Aided Civil and Infrastructure Engineering, 14(3), 207–
219.
work on extending it to elasto-plastic structural analysis.
Hwang, S. H., Mangalathu, S., Shin, J., & Jeon, J. S. (2021). Machine
learning-based approaches for seismic demand and collapse of
AC K N OW L E D G M E N T S ductile reinforced concrete building frames. Journal of Building
We gratefully acknowledge the financial support provided Engineering, 34, 101905.
by the National Natural Science Foundation of China Jenkins, W. M. (1991). Towards structural optimization via the genetic
(Grant No. 52121005), China National Postdoctoral Pro- algorithm. Computers & Structures, 40(5), 1321–1327.
gram for Innovative Talents (Award No. BX20220177), Kabir, M. A. B., Hasan, A. S., & Billah, A. H M. (2021). Failure mode
identification of column base plate connection using data-driven
and China Postdoctoral Science Foundation (Grant No.
machine learning techniques. Engineering Structures, 240, 112389.
2022M711864).
Kang, M. C., Yoo, D. Y., & Gupta, R. (2021). Machine learning-based
prediction for compressive and flexural strengths of steel fiber-
REFERENCES reinforced concrete. Construction and Building Materials, 266,
Amezquita-Sancheza, J. P., Valtierra-Rodriguez, M., & Adeli, H. 121117.
(2020). Machine learning in structural engineering. Scientia Iran- Kangwai, R. D., & Guest, S. D. (2000). Symmetry-adapted equilibrium
ica, 27(6), 2645–2656. matrices. International Journal of Solids and Structures, 37(11),
Dong, S., Wang, P., & Abbas, K. (2021). A survey on deep learning and 1525–1548.
its applications. Computer Science Review, 240, 100379. Kim, T., Kwon, O. S., & Song, J. (2019). Response prediction of nonlin-
Esteghamati, M. Z., & Flint, M. M. (2021). Developing data-driven ear hysteretic systems by deep neural networks. Neural Networks,
surrogate models for holistic performance-based assessment of 111, 1–10.
mid-rise RC frame buildings at early design. Engineering Struc- Kingma, D. P., & Ba, J. (2014). Adam: A method for stochas-
tures, 245, 112971. tic optimization. The 3rd International Conference on Learning
Feng, D. C., Liu, Z. T., Wang, X. D., Chen, Y., Chang, J. Q., Wei, D. Representations, San Diego, CA.
F., & Jiang, Z. M. (2020). Machine learning-based compressive Kipf, T. N., & Welling, M. (2017). Semi-supervised classification
strength prediction for concrete: An adaptive boosting approach. with graph convolutional networks. International Conference on
Construction and Building Materials, 230, 117000. Learning Representations, Toulon, France.
Gholizadeh, S., Salajegheh, E., & Torkzadeh, P. (2008). Structural Lagaros, N. D., & Papadrakakis, M. (2012). Neural network based
optimization with frequency constraints by genetic algorithm prediction schemes of the non-linear seismic response of 3D
using wavelet radial basis function neural network. Journal of buildings. Advances in Engineering Software, 44(1), 92–115.
Sound and Vibration, 312(1-2), 316–331. LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature,
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. The 521(7553), 436.
MIT Press. Lee, S., Ha, J., Zokhirova, M., Moon, H., & Lee, J. (2018). Background
Graf, W., Freitag, S., Sickert, J. U., & Kaliske, M. (2012). Struc- information of deep learning for structural engineering. Archives
tural analysis with fuzzy data and neural network based material of Computational Methods in Engineering, 25(1), 121–129.
description. Computer-Aided Civil and Infrastructure Engineering, Liu, X., Yi, W. J., Li, Q. S., & Shen, P. S. (2008). Genetic evolutionary
27(9), 640–654. structural optimization. Journal of Constructional Steel Research,
Guan, X., Burton, H., Shokrabadi, M., & Yi, Z. (2021). Seismic 64(3), 305–311.
drift demand estimation for steel moment frame buildings: From Lu, J., Luo, Y., & Li, N. (2009). An incremental algorithm to trace
mechanics-based to data-driven models. journal of structural the non-linear equilibrium paths of pin-jointed structures using
engineering, 147(6), 04021058. the singular value decomposition of the equilibrium matrix.
Hamilton, W., Ying, Z., & Leskovec, J. (2017). Inductive represen- Proceedings of the Institution of Mechanical Engineers, 223(7),
tation learning on large graphs. Advances in Neural Information 881–890.
Processing Systems, 30, 1025. Martins, G. B., Papa, J. P., & Adeli, H. (2020). Deep learning tech-
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. niques for recommender systems based on collaborative filtering.
Neural Computation, 9(8), 1735–1780. Expert Systems, 37(6), e12647.
14678667, 2023, 10, Downloaded from https://onlinelibrary.wiley.com/doi/10.1111/mice.12944 by University Of Connecticut, Wiley Online Library on [07/06/2024]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
SONG et al. 1323
Messner, J. I., Sanvido, V. E., & Kumara, S. R. (1994). StructNet: a neu- Wang, C., Song, L., & Fan, J. (2022). End-to-end structural analy-
ral network for structural system selection. Computer-Aided Civil sis in civil engineering based on deep learning. Automation in
and Infrastructure Engineering, 9(2), 109–118. Construction, 138, 104255.
Morfidis, K., & Kostinakis, K. (2017). Seismic parameters’ com- Wang, C., Xu, L., & Fan, J. (2020). A general deep learning framework
binations for the optimum prediction of the damage state of for history-dependent response prediction based on UA-Seq2Seq
RC buildings using neural networks. Advances in Engineering model. Computer Methods in Applied Mechanics and Engineering,
Software, 106, 1–16. 372, 113357.
Naderpour, H., Mirrashid, M., & Parsa, P. (2021). Failure mode pre- Wang, J., Huang, P., Zhao, H., Zhang, Z., Zhao, B., & Lee, D. L.
diction of reinforced concrete columns using machine learning (2018). Billion-scale commodity embedding for e-commerce rec-
methods. Engineering Structures, 248, 113263. ommendation in Alibaba. Proceedings of the 24th ACM SIGKDD
Naser, M. Z. (2021). An engineer’s guide to eXplainable Artifi- International Conference on Knowledge Discovery & Data Mining,
cial Intelligence and Interpretable Machine Learning: Navigating London, UK (pp. 839–848).
causality, forced goodness, and the false perception of inference. Wang, Y., Wang, J., Cao, Z., & Farimani, A. B. (2022). Molecular
Automation in Construction, 129, 103821. contrastive learning of representations via graph neural networks.
Nguyen, T., Kashani, A., Ngo, T., & Bordas, S. (2018). Deep neu- Nature Machine Intelligence, 4, 279–287.
ral network with high-order neuron for the prediction of foamed Weisfeiler, B., & Leman, A. (1968). A reduction of a graph to a
concrete strength. Computer-Aided Civil and Infrastructure Engi- canonical form and an algebra arising during this reduction.
neering, 34(4), 316–332. Nauchno-Technicheskaya Informatsiya, 2(9), 12–16.
Oh, B. K., Park, Y., & Park, H. S. (2020). Seismic response predic- Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., & Yu, P. S. (2020). A
tion method for building structures using convolutional neural comprehensive survey on graph neural networks. IEEE Transac-
network. Structural Control and Health Monitoring, 27(5), e2519. tions on Neural Networks and Learning Systems, 32(1), 4–24.
Olalusi, O. B., & Awoyera, P. O. (2021). Shear capacity prediction Xu, K., Hu, W., Leskovec, J., & Jegelka, S. (2019). How powerful
of slender reinforced concrete structures with steel fibers using are graph neural networks? International Conference on Learning
machine learning. Engineering Structures, 227, 111470. Representations, New Orleans, LA.
Parvin, A., & Serpen, G. (1999). Recurrent neural networks for Yin, T., & Zhu, H. P. (2019). An efficient algorithm for architecture
structural optimization. Computer-Aided Civil and Infrastructure design of Bayesian neural network in structural model updating.
Engineering, 14(6), 445–451. Computer-Aided Civil and Infrastructure Engineering, 35(4), 354–
Pellegrino, S. (1993). Structural computations with the singular value 372.
decomposition of the equilibrium matrix. International Journal of Ying, R., He, R., Chen, K., Eksombatchai, P., Hamilton, W. L., &
Solids and Structures, 30(21), 3025–3035. Leskovec, J. (2018). Graph convolutional neural networks for web-
Pellegrino, S., & Calladine, C. R. (1986). Matrix analysis of stati- scale recommender systems. Proceedings of the 24th ACM SIGKDD
cally and kinematically indeterminate frameworks. International International Conference on Knowledge Discovery & Data Mining,
Journal of Solids and Structures, 22(4), 409–428. London, UK (pp. 974–983).
Rafiei, M. H., & Adeli, H. (2017). NEEWS: A novel earthquake early Zafeiriou, S., Bronstein, M., Cohen, T., Vinyals, O., Leskovec, J., Liò,
warning system using neural dynamic classification and neu- P., Bruna, J., & Gori, M. (2022). Guest editorial: Non-euclidean
ral dynamic optimization model. Soil Dynamics and Earthquake machine learning. IEEE Transactions on Pattern Analysis &
Engineering, 100, 417–427. Machine Intelligence, 44(2), 723–726.
Rafiei, M. H., Khushefati, W. H., Demirboga, R., & Adeli, H. (2017). Zhang, R., Chen, Z., Chen, S., Zheng, J., Büyüköztürk, O., & Sun,
Supervised deep restricted Boltzmann machine for estimation of H. (2019). Deep long short-term memory networks for nonlinear
concrete compressive strength. ACI Materials Journal, 114(2), 237– structural seismic response prediction. Computers & Structures,
244. 220, 55–68.
Reddy, J. N. (2019). Introduction to the finite element method (4th ed.). Zhou, J., Cui, G., Hu, S., Zhang, Z., Yang, C., Liu, Z., Wang, L., Li, C.,
McGraw-Hill Education. & Sun, M. (2020). Graph neural networks: A review of methods
Salehi, H., & Burgueño, R. (2018). Emerging artificial intelligence and applications. AI Open, 1, 57–81.
methods in structural engineering. Engineering Structures, 171, Zhu, S., Ohsaki, M., & Guo, X. (2021). Prediction of non-linear buck-
170–189. ling load of imperfect reticulated shell using modified consistent
Sun, H., Burton, H. V., & Huang, H. (2021). Machine learning applica- imperfection and machine learning. Engineering Structures, 226,
tions for building structural design and performance assessment, 111374.
state-of-the-art review. Journal of Building Engineering, 33, 101816. Zienkiewicz, O. C., & Robert, L. T. (2005). The finite element method
Torky, A. A., & Ohno, S. (2021). Deep learning techniques for predict- for solid and structural mechanics. Elsevier.
ing nonlinear multi-component seismic responses of structural
buildings. Computers & Structures, 252, 106570.
Tsubaki, M., Tomii, K., & Sese, J. (2019). Compound–protein inter- How to cite this article: Song, L.-H., Wang, C.,
action prediction with end-to-end learning of neural networks for Fan, J.-S., & Lu, H.-M. (2023). Elastic structural
graphs and sequences. Bioinformatics, 35(2), 309–318. analysis based on graph neural network without
Wakjira, T. G., Al-Hamrani, A., Ebead, U., & Alnahhal, W. (2022). labeled data. Computer-Aided Civil and
Shear capacity prediction of FRP-RC beams using single and Infrastructure Engineering, 38, 1307–1323.
ensenble ExPlainable machine learning models. Composite Struc- https://doi.org/10.1111/mice.12944
tures, 287, 115381.