License: CC BY-NC-SA 4.0
arXiv:2307.00385v2 [q-bio.NC] 23 Apr 2026

Sulcal Pattern Matching with the Wasserstein Distance

Abstract

We present the unified computational framework for modeling the sulcal patterns of human brain obtained from the magnetic resonance images. The Wasserstein distance is used to align the sulcal patterns nonlinearly. These patterns are topologically different across subjects making the pattern matching a challenge. We work out the mathematical details and develop the gradient descent algorithms for estimating the deformation field. We further quantify the image registration performance. This method is applied in identifying the differences between male and female sulcal patterns.

1 Introduction

The concave regions in the highly convoluted cerebral cortex of the human brain are referred to as the sulci (Fig. 1). These complex tree-shaped sulcal curves are highly variable in length, area, depth, curvature and topology across different subjects [2]. There have been extensive studies that connect the variabilities of such biomarkers with the differences in cognitive or pathological characteristics between populations [10]. However, since each subject have different topological patterns, it is difficult to match the sulcal patterns across subjects [9]. One approach of reducing the difficulty of matching is to smooth the sulcal patterns and then match the smoothed patterns. Such smoothed representations enable pattern matching in a continuous probabilistic fashion. Thus, we propose to use the Wasserstein distance, which minimizes the optimal transport cost of moving between probability distributions.

The Wasserstein distances have been previously applied in various imaging applications. [21] computed the Wasserstein distance using the hyperbolic metric for cortical brain morphometry. [15] proposed the sliced Wasserstein distance in speeding up the computation in pattern recognition. [19] derived a cycle-consistent generative adversarial network based on the optimal transport for MRI reconstruction. Despite the advance of various algorithms for Wasserstein distance based pattern matching [3, 15], the rational of involving smoothing in such algorithms is unclear. In this paper, we work out the mathematical and implementation details on the Wasserstein distance on heat kernel smoothing, the kernel version of diffusion. In the experiment, we demonstrated the Wasserstein distance based registration reduces the variability of sulcal patterns across subjects.

Refer to caption
Fig. 1: Top: sulcal curves (red) are displayed on top of cortical surfaces. Bottom: sulcal curves are projected and smoothed in the rectangular domain [π,π]×[0,π]2[-\pi,\pi]\times[0,\pi]\subset\mathbb{R}^{2}.

2 Methods

2.1 Sulcal pattern data

We used the processed T1-weighted MRI of 456 subjects (age-matched 274 females and 182 males) in the Human Connectome Project (HCP) [23]. The MRI were obtained using a Siemens 3T Connectome Skyra scanner with a 32-channel head coil [8, 22]. The MRI were registered to the MNI space with a FLIRT affine and FNIRT nonlinear registration [12]. FreeSurfer’s recon-all pipeline was used on the distortion- and bias-corrected MRI [6] that includes the segmentation of white matter and pial surfaces as well as FreeSurfer’s folding-based surface registration to the template. TRACE algorithm was used for automatic sulcal curve extraction from surface meshes [9, 17] (Fig. 1) and then projected on the unit sphere, which is further parameterized by spherical angles p=(x,y)[π,π]×[0,π]2p=(x,y)\in[-\pi,\pi]\times[0,\pi]\subset\mathbb{R}^{2}. Since the data is spherical, it is periodic over xx. Fig. 1 displays the sulcal curves of two different subjects on 2\mathbb{R}^{2}, where sulcal curves are assigned value 11 (colored red) while all other parts of the brain are assigned value 0.

2.2 Heat kernel smoothing of sulcal patterns

The sulcal pattern f(p)f(p) is smoothed with heat kernel to reduce high frequency noise. The smoothing increases the signal-to-noise ratio (SNR) and increases the statistical power in the population study [9]. We model the sulcal pattern as

f(p)=μ(p)+ε(p),p=(x,y)[π,π]×[0,π]2,f(p)=\mu(p)+\varepsilon(p),\quad p=(x,y)\in[-\pi,\pi]\times[0,\pi]\subset\mathbb{R}^{2},

where μ(p)\mu(p) is the underlying true signal and ε(p)\varepsilon(p) is the noise. For observed data, we assign value 1 to the sulcal curves and 0 otherwise (Fig. 1). We estimate μ(p)\mu(p) by smoothing with isotropic heat kernel. The smoothed estimate μ^\widehat{\mu} is given by the solution of the isotropic diffusion

σu(x,y,σ)=22xu(x,y,σ)+22yu(x,y,σ)\frac{\partial}{\partial\sigma}u(x,y,\sigma)=\frac{\partial^{2}}{\partial^{2}x}u(x,y,\sigma)+\frac{\partial^{2}}{\partial^{2}y}u(x,y,\sigma)

with the initial condition u(x,y,σ=0)=f(x,y)u(x,y,\sigma=0)=f(x,y) and the boundary conditions

u(π,y,σ)=u(π,y,σ),u(π,y,σ)=u(π,y,σ),\displaystyle u\left(-\pi,y,\sigma\right)=u\left(\pi,y,\sigma\right),u^{\prime}\left(-\pi,y,\sigma\right)=u^{\prime}\left(\pi,y,\sigma\right),
u(x,0,σ)=u(x,π,σ)=0.\displaystyle u(x,0,\sigma)=u\left(x,\pi,\sigma\right)=0.

The amount of smoothing σ\sigma is determined empirically. The solution is given as the weighted Fourier series [4]:

μ^(x,y)=j=0mk=0neλjkσ[Ajkϕjk1+Bjkϕjk2].\widehat{\mu}(x,y)=\sum_{j=0}^{m}\sum_{k=0}^{n}e^{-\lambda_{jk}\sigma}\bigg[A_{jk}\phi_{jk}^{1}+B_{jk}\phi_{jk}^{2}\bigg]. (1)

The eigenvalues λjk=j2+k2\lambda_{jk}=j^{2}+k^{2} and the eigenfunctions corresponding to Laplacian 22x+22y\frac{\partial^{2}}{\partial^{2}x}+\frac{\partial^{2}}{\partial^{2}y} are

ϕjk1=2cos(jx)sin(ky)π2(1+δj0),ϕjk2=2sin(jx)sin(ky)π2,\displaystyle\phi_{jk}^{1}=\frac{2\cos(jx)\sin(ky)}{\pi^{2}(1+{\delta_{j0}})},\phi_{jk}^{2}=\frac{2\sin(jx)\sin(ky)}{\pi^{2}},

where δj0=1\delta_{j0}=1 if j=0j=0 and 0 otherwise. The coefficients AjkA_{jk} and BjkB_{jk} are estimated in the least square fashion using the initial data f(p)f(p) [4]. The smoothing results are shown in Fig. 1 with diffusion time σ=0.001\sigma=0.001.

The smooth representation enables alignment of different sulcal patterns on a common grid. For each subject, we first obtain the Fourier coefficients AjkA_{jk} and BjkB_{jk} via smoothing. The value on the grid point (xi,yj)(x_{i},y_{j}) is then given by μ^(xi,yj)\widehat{\mu}(x_{i},y_{j}) in the expansion (1).

2.3 Wasserstein distance on heat kernel smoothing

Let f1f_{1} and f2f_{2} be the empirical distribution on the two sets of scatter points that define the vertices of sulcal curves 𝐏1={p1,,pn}\mathbf{P}_{1}=\{p_{1},\cdots,p_{n}\} and 𝐏2={q1,,qn}\mathbf{P}_{2}=\{q_{1},\cdots,q_{n}\}:

f1(p)=1ni=1nδ(ppi),f2(q)=1ni=1nδ(qqi).f_{1}(p)=\frac{1}{n}\sum_{i=1}^{n}\delta\left(p-p_{i}\right),\;f_{2}(q)=\frac{1}{n}\sum_{i=1}^{n}\delta\left(q-q_{i}\right). (2)

After algebraic derivations involving Choquest’s and Birkhoff’s theorems [20], the 2-Wasserstein distance DWD_{W} between the empirical distributions is given as the Monge formulation

DW(f1,f2)=infτSn(1ni=1npipτ(i)2)1/2,\displaystyle D_{W}(f_{1},f_{2})=\inf_{\tau\in S_{n}}\Big(\frac{1}{n}\sum_{i=1}^{n}\|p_{i}-p_{\tau(i)}\|^{2}\Big)^{1/2}, (3)

where SnS_{n} is the permutation group of order nn.

The diffused sulcal pattern can be written as the kernel convolution on initial condition f(p)f(p) as μ^(p)=Kσf(p)\widehat{\mu}(p)=K_{\sigma}*f(p). with heat kernel KσK_{\sigma} [4]. In 2\mathbb{R}^{2}, the heat kernel is simply Gaussian kernel

Kσ(p,pi)=12πσ2e12σ2(ppi)(ppi).K_{\sigma}(p,p_{i})=\frac{1}{2\pi\sigma^{2}}e^{-\frac{1}{2\sigma^{2}}(p-p_{i})^{\top}(p-p_{i})}.

The heat kernel smoothing on sulcal patterns 𝐏1{\bf P}_{1} and 𝐏2{\bf P}_{2} is given by (Fig. 1 Bottom)

f~1(p)=1ni=1nKσ(p,pi),f~2(q)=1ni=1nKσ(q,qi).\widetilde{f}_{1}(p)=\frac{1}{n}\sum_{i=1}^{n}K_{\sigma}(p,p_{i}),\;\widetilde{f}_{2}(q)=\frac{1}{n}\sum_{i=1}^{n}K_{\sigma}(q,q_{i}). (4)

Let 𝒜(f~1,f~2)\mathcal{A}(\widetilde{f}_{1},\widetilde{f}_{2}) be the set of all possible joint density functions corresponding to f~1\widetilde{f}_{1} and f~2\widetilde{f}_{2}. The Wasserstein distance between diffused sulcal pattern can be written as [7]

DW(f~1,f~2)=infπ𝒜(f~1,f~2)(pq2π(p,q)dpdq)12.D_{W}(\widetilde{f}_{1},\widetilde{f}_{2})=\inf_{\pi\in\mathcal{A}(\widetilde{f}_{1},\widetilde{f}_{2})}\Big(\int\|p-q\|^{2}\pi(p,q)\mathrm{d}p\mathrm{d}q\Big)^{\frac{1}{2}}. (5)

If we restrict the joint distribution to be a linear combination of multivariate normals, we can compute the expression (5) exactly. We will denote the Wasserstein distance with such a restriction as DWD_{W^{\prime}}. Then we have the following equivalence.

Refer to caption
Fig. 2: Subject 1 (lower transparency) superimposed on top of subject 2 (higher transparency). The arrows are the displacement field of matching subject 1 to subject 2.
Theorem 1

For heat kernel smoothing f~1\widetilde{f}_{1} and f~2\widetilde{f}_{2}, we have

DW(f~1,f~2)=DW(f1,f2).D_{W^{\prime}}(\widetilde{f}_{1},\widetilde{f}_{2})=D_{W}(f_{1},f_{2}). (6)

Proof. We provide the sketch of proof. The joint density π\pi in (5) can be expressed as the mixture of multivariate normals as π(p,q)=i,j=1nπijϕij(p,q),\pi(p,q)=\sum_{i,j=1}^{n}\pi_{ij}\phi_{ij}(p,q), where the mixing proportions πij\pi_{ij} are doubly stochastic and satisfies iπij=jπij=1\sum_{i}\pi_{ij}=\sum_{j}\pi_{ij}=1. The multivariate normals ϕij\phi_{ij} follow

ϕijN([piqj],[σ2I2HHσ2I2]),\phi_{ij}\sim{N}\left(\begin{bmatrix}p_{i}\\ q_{j}\end{bmatrix},\begin{bmatrix}\sigma^{2}I_{2}&H\\ H^{\top}&\sigma^{2}I_{2}\end{bmatrix}\right),

where pi𝐏1p_{i}\in\mathbf{P}_{1}, qi𝐏2q_{i}\in\mathbf{P}_{2}, I2I_{2} is the identity matrix and HH is the 2×22\times 2 positive semi-definite covariance matrix. Then after lengthy derivations, we can obtain

DW(f~1,f~2)2=infπij,Hi,j=1nπijpiqj2+2tr(σ2I2H).D_{W^{\prime}}(\widetilde{f}_{1},\widetilde{f}_{2})^{2}=\inf_{\pi_{ij},H}\sum_{i,j=1}^{n}\pi_{ij}\left\|p_{i}-q_{j}\right\|^{2}+2\operatorname{tr}(\sigma^{2}I_{2}-H).

Subsequently, it suffices to minimize each term in DWD_{W^{\prime}} separately. Minimizing the first term leads to the same result as (3) while the second term results in zero [7]. \square

Since DW(f~1,f~2)DW(f~1,f~2)D_{W}(\widetilde{f}_{1},\widetilde{f}_{2})\leq D_{W^{\prime}}(\widetilde{f}_{1},\widetilde{f}_{2}), we have

DW(f~1,f~2)DW(f1,f2)D_{W}(\widetilde{f}_{1},\widetilde{f}_{2})\leq D_{W}(f_{1},f_{2})

indicating that heat kernel smoothing reduces the Wasserstein distance between the sulcal patterns and their variabilities.

2.4 Gradient descent on the dual formulation

Given two smooth patterns f~1\widetilde{f}_{1} and f~2\widetilde{f}_{2}, the computation of the Wasserstein distance involves mapping a point pp from the first pattern to the corresponding point U(p)U(p) in the second pattern (Fig. 2). The deformation pU(p)p\to U(p) is required to have the smallest possible total displacement and satisfies

U1(A)f~1(p)dp=Af~2(q)dq,A2.\int_{U^{-1}(A)}\widetilde{f}_{1}(p)\mathrm{d}p=\int_{A}\widetilde{f}_{2}(q)\mathrm{d}q,\quad\forall A\subset\mathbb{R}^{2}.

This leads to solving the optimization problem

infUU(p)p2f~1(p)dp,\inf_{U}\int\|U(p)-p\|^{2}\widetilde{f}_{1}(p)\mathrm{d}p, (7)

which can be further reduced to minimizing

L(φ,ψ)=φ(p)f~1(p)dp+ψ(q)f~1(q)dqL(\varphi,\psi)=\int\varphi(p)\widetilde{f}_{1}(p)\mathrm{d}p+\int\psi(q)\widetilde{f}_{1}(q)\mathrm{d}q (8)

with constraint φ(p)+ψ(q)pq\varphi(p)+\psi(q)\geq{p^{\top}q} [3].

Theorem 2

[13] The unique solution (φ,ψ)(\varphi,\psi) to the problem (8) with constraint φ(p)+ψ(q)pq\varphi(p)+\psi(q)\geq p^{\top}q are convex conjugates, i.e., ψ=φ\psi=\varphi^{*} and φ=ψ\varphi=\psi^{*} where φ\varphi^{*} and ψ\psi^{*} are defined as

{φ(q)=maxp(pqφ(p))ψ(p)=maxq(qpψ(q)).\begin{cases}\varphi^{*}(q)=\max_{p}(p^{\top}q-\varphi(p))\\ \psi^{*}(p)=\max_{q}(q^{\top}p-\psi(q))\end{cases}.

The solution to (7) is then given by U=φ.U=\nabla\varphi.

The solution is given implicitly. However, the derivative of (8) can be explicitly found.

Theorem 3

[3] Suppose φ\varphi is the convex conjugate of ψ\psi for some ψ\psi and 2φ\nabla^{2}\varphi is Lipschitz. Then it follows LL is convex, Lipschitz and

L(φ)=f~1(f~2φ)det(Hφ),L^{\prime}(\varphi)=\widetilde{f}_{1}-(\widetilde{f}_{2}\circ\nabla\varphi)\det(H_{\varphi}),

where HφH_{\varphi} is he Hessian matrix of φ\varphi.

Subsequently, we solve for gradient descent on φ\varphi as [3]:

φn+1=φnL(φn),\varphi_{n+1}=\varphi_{n}-L^{\prime}(\varphi_{n}), (9)

The overall computational complexity for the algorithm is O(N)O(N) [14], which is more scalable than the Hungarian algorithm with O(N3)O(N^{3}) [18] and fluid mechanics with O(N3/2)O(N^{3/2}) [1]. In this study, 800×400800\times 400 rectangular grid with N=320000N=320000 vertices were used. The grid values μ^(xi,yj)\widehat{\mu}(x_{i},y_{j}) serve as the input for the gradient descent. We display the result of the gradient descent on estimating the displacement field φ(p)p\nabla\varphi(p)-p as arrows in Fig. 2. Due to the convexity of φ\varphi, the algorithm is expected to recover the global optimum (Remark 3.3 in [3]).

Refer to caption
Fig. 3: Left: the Hungarian Algorithm is used to match two sets of three points (colored red and blue). Right: the gradient descent is used to match the heat kernel smoothing on these points. Arrows are the estimated displacement U(p)pU(p)-p.

3 Experiments

3.1 Validation against the Hungarian Algorithm

We matched the two sets of nn random generated scatter points in [0,1]2[0,1]^{2}. Fig.3 displays the result of one realization with n=3n=3. The Wasserstein distance DW(f1,f2)D_{W}(f_{1},f_{2}) can be computed exactly through (3) using the Hungarian Algorithm [18]. We then applied heat kernel kernel smoothing with bandwidth σ=0.01\sigma=0.01 on the scatter points and compute the Wasserstein distance DW(f~1,f~2)D_{W}^{\prime}(\tilde{f}_{1},\tilde{f}_{2}) using the gradient descent (9). The average percentage reduction of distance for 100 independent simulations for n=2,3,4n=2,3,4 are 0.86±0.050.86\pm 0.05, 0.84±0.060.84\pm 0.06 and 0.77±0.040.77\pm 0.04 respectively. This is consistent with our theoretical result in section 2.3. The computer code for performing Wasserstein distance based matching is provided in https://github.com/laplcebeltrami/sulcaltree.

Refer to caption
Fig. 4: The intersubject variability of the smoothed original sulcal patterns (left) and deformed patterns (right).

3.2 Reduction of image registration variability

The sulcal pattern has been already nonlinearly aligned using FreeSurfer’s standard folding-based surface registration to the surface atlas [6]. Wasserstein distance based matching significantly reduces the sulcal pattern variability further from the FreeSurfer alignment. We first compute the average of smoothed sulcal pattern maps T=1ni=1nf~i(p)T=\frac{1}{n}\sum_{i=1}^{n}\widetilde{f}_{i}(p) on smoothed estimates (1). This serves as the reference template pattern for subsequent registration. We then align each subject to the average pattern TT using gradient descent (9). We initialize φ\varphi with 12p2\frac{1}{2}\|p\|^{2} so that initial mapping φ\nabla\varphi becomes the identity. Such initialization is data-free and has demonstrated efficacy in various applications including facial [14] and brain images [3]. We set as the maximum number of iterations at 200. The average runtime for one subject is approximately 8 seconds on a desktop computer.

For deformation Ui(x)U_{i}(x) for subject ii, the deformed sulcal patterns are obtained as f~iT(p)=f~i(Ui(p))\widetilde{f}_{i}^{T}(p)=\widetilde{f}_{i}(U_{i}(p)). The statistical variability before registration is computed as the sample variance of sulcal patterns f~i(p)\widetilde{f}_{i}(p) at each point across subjects (Fig. 4). The variability is significantly reduced after deformation. The mean variability over all the points in the brain is 0.0021 for the smoothed original pattern while it is 7.8×1057.8\times 10^{-5}, the significant reduction of variability by 96.29%96.29\%.

3.3 Sexual dimorphism

The method is subsequently used to determine the sulcal pattern differences between 240 females and 172 males. The two sample tt-statistics were constructed on the deformed smoothed sulcal patterns f~iT(p)=f~i(Ui(p))\widetilde{f}_{i}^{T}(p)=\widetilde{f}_{i}(U_{i}(p)) (Fig. 5). The threshold tt-statistic value greater than 4.124.12 or smaller than 2.84-2.84 correspond to the pp-value of 0.05 after multiple comparisons correction through the permutation test (half millions) for the deformed pattern [5]. We obtained numerous significant differences all over the brain regions including the temporal lobe, which is responsible for sensory processing and known for sex difference [9]. The negative tt-statistics in most of the brain regions indicates the presence of less sulci for females. This seems to be related to findings in [16], where females have larger gray matter volumes in a number of areas including the left temporal gyrus. In [11], cortical thickening is found extensively in the left superior parietal gyrus and postcentral gyrus, which is consistent with less sulci in our study.

Refer to caption
Fig. 5: tt-statistics map of the localized sulcal pattern differences (female-male). Values are thresholded in [2.84,4.12][-2.84,4.12] corresponding to the corrected pp-value of 0.05. Most of female brain regions show less sulci, which corresponds to more gray matter and thicker cortical thickness observed in other studies.

4 Discussion

In this paper, we presented the new framework of matching sulcal patterns of human brain across subjects. We provided the theoretical justification for performing heat kernel smoothing before computing the Wasserstein distance. Smoothing reduces the Wasserstein distance between the sulcal patterns and spatial pattern variabilities. It is also possible to further refine the registration performance via the multi-resolution on heat kernel smoothing [3] or iteratively using the average of the deformed sulcal patterns 1ni=1nf~iT(p)\frac{1}{n}\sum_{i=1}^{n}\widetilde{f}_{i}^{T}(p) as the template in the next iteration. These are left as future studies.

The FreeSurfer uses the irregular triangle meshes while our smoothing and the gradient descent uses the regular pixel grid. This may introduce potential interpolation artifacts. A better approach would be to perform smoothing and the gradient descent on triangle meshes. This is left as a future study.

5 Acknowledgement

We would like to thank Soheil Kolouri of Vanderbilt University and Tahmineh Azizi of University of Wisconsin-Madison for discussion on the Wasserstein distance, and Ilwoo Lyu of Ulsan National Institute of Science and Technology for assistant with the TRACE algorithm.

References

  • [1] J. Benamou and Y. Brenier (2000) A computational fluid mechanics solution to the monge-kantorovich mass transfer problem. Numerische Mathematik 84 (3), pp. 375–393. External Links: Document, ISBN 0945-3245, Link Cited by: §2.4.
  • [2] A. Cachia, J.F. Mangin, D. Riviere, F. Kherif, N. Boddaert, A. Andrade, D. Papadopoulos-Orfanos, J.B. Poline, I. Bloch, M. Zilbovicius, P. Sonigo, F. Brunelle, and J. Régis (2003) A primal sketch of the cortex mean curvature: a morphogenesis based approach to study the variability of the folding patterns. IEEE Trans. Med. Imag. 22 (6), pp. 754–765. Cited by: §1.
  • [3] R. Chartrand, B. Wohlberg, K. Vixie, and E. Bollt (2009-01) A gradient descent solution to the monge-kantorovich problem. Applied Mathematical Sciences 3, pp. 1071–1080. Cited by: §1, §2.4, §2.4, §2.4, §3.2, §4, Theorem 3.
  • [4] M.K. Chung, K.M. Dalton, L. Shen, A.C. Evans, and R.J. Davidson (2007) Weighted fourier series representation and its application to quantifying the amount of gray matter. IEEE Transactions on Medical Imaging 26 (4), pp. 566–581. External Links: Document Cited by: §2.2, §2.2, §2.3.
  • [5] M.K. Chung, L. Xie, S.-G. Huang, Y. Wang, J. Yan, and L. Shen (2019) Rapid acceleration of the permutation test via transpositions. International Workshop on Connectomics in Neuroimaging 11848, pp. 42–53. Cited by: §3.3.
  • [6] B. Fischl, N. Rajendran, E. Busa, J. Augustinack, O. Hinds, B. Yeo, H. Mohlberg, K. Amunts, and K. Zilles (2007) Cortical folding patterns and predicting cytoarchitecture. Cerebral Cortex 18 (8), pp. 1973–1980. Cited by: §2.1, §3.2.
  • [7] C.R. Givens and R.M. Shortt (1984) A class of wasserstein metrics for probability distributions.. Michigan Mathematical Journal 31 (2), pp. 231–240. Cited by: §2.3, §2.3.
  • [8] M.F. Glasser, S.N. Sotiropoulos, J.A. Wilson, T.S. Coalson, B. Fischl, J.L. Andersson, J. Xu, S. Jbabdi, M. Webster, J.R. Polimeni, D.C. Van Essen, and M. Jenkinson (2013) The minimal preprocessing pipelines for the Human Connectome Project. NeuroImage 80, pp. 105–124. Cited by: §2.1.
  • [9] S.-G. Huang, I. Lyu, A. Qiu, and M.K. Chung (2020) Fast polynomial approximation of heat kernel convolution on manifolds and its application to brain sulcal and gyral graph pattern analysis. IEEE Transactions on Medical Imaging 39, pp. 2201–2212. Cited by: §1, §2.1, §2.2, §3.3.
  • [10] K. Im and P.E. Grant (2019) Sulcal pits and patterns in developing human brains. NeuroImage 185, pp. 881–890. Cited by: §1.
  • [11] K. Im, J.-M. Lee, J. Lee, Y.-W. Shin, I. Y. Kim, J. S. Kwon, and S.I. Kim (2006) Gender difference analysis of cortical thickness in healthy young adults with surface-based methods. NeuroImage 31, pp. 31–38. Cited by: §3.3.
  • [12] M. Jenkinson, P. Bannister, and S. Brady (2002) Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage 17 (2), pp. 825–841. Cited by: §2.1.
  • [13] M. Knott and C. S. Smith (1984) On the optimal mapping of distributions. Journal of Optimization Theory and Applications 43 (1), pp. 39–49. External Links: ISBN 1573-2878 Cited by: Theorem 2.
  • [14] S. Kolouri, S.R. Park, M. Thorpe, D. Slepcev, and G.K. Rohde (2017) Optimal mass transport: signal processing and machine-learning applications. IEEE Signal Processing Magazine 34 (4), pp. 43–59. External Links: Document Cited by: §2.4, §3.2.
  • [15] S. Kolouri, G.K. Rohde, and H. Hoffmann (2018) Sliced wasserstein distance for learning gaussian mixture models. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vol. , pp. 3427–3436. External Links: Document Cited by: §1.
  • [16] E. Luders, C. Gaser, K.L. Narr, and A.W. Toga (2009) Why sex matters: brain size independent differences in gray matter distributions between men and women. Journal of Neuroscience 29, pp. 14265–14270. Cited by: §3.3.
  • [17] I. Lyu, S.H. Kim, N.D. Woodward, M.A. Styner, and B.A. Landman (2018) TRACE: a topological graph representation for automatic sulcal curve extraction. IEEE Trans. Med. Imag. 37 (7), pp. 1653–1663. Cited by: §2.1.
  • [18] J. Munkres (1957) Algorithms for the assignment and transportation problems. Journal of the Society for Industrial and Applied Mathematics 5 (1), pp. 32–38. External Links: ISSN 03684245, Link Cited by: §2.4, §3.1.
  • [19] G. Oh, B. Sim, H. Chung, L. Sunwoo, and J.C. Ye (2020) Unpaired deep learning for accelerated mri using optimal transport driven cyclegan. IEEE Transactions on Computational Imaging 6 (), pp. 1285–1296. External Links: Document Cited by: §1.
  • [20] G. Peyré and M. Cuturi (2019) Computational optimal transport. Foundations and Trends in Machine Learning 11 (5-6), pp. 355–607. Cited by: §2.3.
  • [21] J. Shi, W. Zhang, and Y. Wang (2016) Shape analysis with hyperbolic wasserstein distance. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vol. , pp. 5051–5061. External Links: Document Cited by: §1.
  • [22] S.M. Smith, C.F. Beckmann, J. Andersson, E.J. Auerbach, J. Bijsterbosch, E. Douaud, D.A. Feinberg, L. Griffanti, M.P. Harms, et al. (2013) Resting-state fMRI in the Human Connectome Project. NeuroImage 80, pp. 144–168. Cited by: §2.1.
  • [23] D.C. Van Essen, K. Ugurbil, E. Auerbach, D. Barch, T.E.J. Behrens, R. Bucholz, A. Chang, L. Chen, M. Corbetta, and S.W. Curtiss (2012) The human connectome project: a data acquisition perspective. NeuroImage 62, pp. 2222–2231. Cited by: §2.1.