0% found this document useful (0 votes)

17 views

Dymchenko and Raffin - 2023 - Loss-Driven Sampling Within Hard-To-Learn Areas Fo

The paper proposes a new adaptive sampling method called Breed for training neural networks using simulation-based data. Breed concentrates new samples in areas showing high loss values during training based on a balance ratio that evolves over iterations. Compared to state-of-the-art R3 sampling, Breed converges faster and achieves a lower validation loss on two benchmark problems.

Uploaded by

dahelihabeddine

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Dymchenko and Raffin - 2023 - Loss-Driven Sampling Within Hard-To-Learn Areas Fo

Uploaded by

dahelihabeddine

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Loss-driven sampling within hard-to-learn areas for

simulation-based neural network training

Sofya Dymchenko, Bruno Raﬀin

To cite this version:

Sofya Dymchenko, Bruno Raﬀin. Loss-driven sampling within hard-to-learn areas for simulation-
based neural network training. MLPS 2023 - Machine Learning and the Physical Sciences Workshop
at NeurIPS 2023 - 37th conference on Neural Information Processing Systems, Dec 2023, New Orleans,
United States. pp.1-5. �hal-04305233�

HAL Id: hal-04305233

https://hal.science/hal-04305233
Submitted on 24 Nov 2023

HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est

archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents
entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non,
lished or not. The documents may come from émanant des établissements d’enseignement et de
teaching and research institutions in France or recherche français ou étrangers, des laboratoires
abroad, or from public or private research centers. publics ou privés.

Distributed under a Creative Commons Attribution 4.0 International License

Loss-driven sampling within hard-to-learn areas for
simulation-based neural network training

Sofya Dymchenko Bruno Raffin

Univ. Grenoble Alpes, Inria, Univ. Grenoble Alpes, Inria,
CNRS, Grenoble INP, LIG CNRS, Grenoble INP, LIG
38000 Grenoble, France 38000 Grenoble, France
[email protected] [email protected]

Abstract
This paper focuses on active learning methods for training neural networks from
synthetic input samples that can be generated on-demand. This includes Physics In-
formed Neural Networks (PINNs), simulation-based inference, deep surrogates and
deep reinforcement learning. An adaptive process observes the training progress
and steers the data generation with the goal of speeding up and increasing the
quality of training. We propose a novel adaptive sampling method that concentrates
samples close to the areas showing high loss values. Compared to the state-of-
the-art R3 sampling our algorithm converges to a validation loss of 0.5 in 6000
iterations, while it takes 25000 iterations to reach a loss of 0.7 for the R3 algorithm
when training a PINN with the Allen Cahn equation.

1 Introduction
The machine learning community has recently shown a growing interest in applying deep neu-
ral networks (DNN) to physical science [Lavin et al., 2021]. The novel approaches take different
perspectives on the subject [Das and Tesfamariam, 2022]. Some design specific types of neural
models and loss functions, such as graph-based networks [Meyer et al., 2021], physics-informed
neural networks (PINNs) [Raissi et al., 2019]. Others use DNNs to learn probability distributions
for experimental design [Foster et al., 2021], probabilistic programming [Baydin et al., 2019], and
simulation-based inference [Cranmer et al., 2022]. Deep surrogates are combined with traditional nu-
merical solvers and trained through different modalities on supercomputers as in [Meyer et al., 2022],
[Brace et al., 2021] and [Ward et al., 2021]. In this paper we focus on adaptive sampling methods to
optimize DNN training in the case where samples can be generated on-demand, usually through a
solver code.
Classical DNN training methods work with fixed datasets that are repeatedly presented during training
across multiple epochs. The ability to perform training using synthetic data, which can be generated
on-demand, opens the way to different strategies to improve the training process. Synthetic data is
generated from a set of input parameters sampled within a given bounded domain. The inputs can
be directly used for DNN training without transformation as in data-free PINNs training, or be the
initial conditions for autoregressive solvers, which produce data series. Adaptive sampling monitors
training progress to guide the sampling process towards selecting the inputs that provide the most
effective data. The associated computing cost should also be reduced so as not to slow down training.
The question of data selection and sample similarity originated in the context of training DNN with a
finite dataset. It is tackled in various ways: measuring samples uncertainty by approximating training
dynamics [Kye et al., 2022, Wang et al., 2022], calculating samples influence [K and Søgaard, 2021],
selecting representative subset with use of gradients [Killamsetty et al., 2022, Fayyaz et al., 2022,
Katharopoulos and Fleuret, 2019].

Machine Learning and the Physical Sciences Workshop, NeurIPS 2023.

For PINN’s collocation set [Yang et al., 2022, Nabian et al., 2021] proposes re-weighting samples
importance, [Wu et al., 2022] creates a training subset based on a distribution calculated as a nor-
malized loss. However, these methods usually require additional computations or are not applicable
to non-finite data. The recently published R3 sampling [Daw et al., 2023] advances the baseline by
retaining points whose loss is higher than average and resampling the remaining ones uniformly for
each iteration. This way the DNN is able to learn better the points with high loss while exploring new
ones. Relying on the per-sample loss to quantify the data training effectiveness has the benefit of
being a lightweight single value, which requires no extra computation or excessive memory.
In this paper, we propose a method where the main objective is to increase the sampling density in
areas with high loss over the lifetime of the training process. Instead of retaining high-loss points,
we sample new points in their Gaussian neighbourhoods according to loss. To avoid overfitting in
these areas, we introduce a balance control value that evolves across iterations. It defines a ratio
between the proposed loss-driven sampling and uniform sampling, which enables the exploration of
new high-informative areas and the repetition of already learned examples. The main contributions of
the paper are:

• a loss-driven sampling method called Breed (Balance Ratio and EnhancE Density);
• a concept of exploration-concentration balance control with ratio value;
• a novel benchmark designed for evaluating sampling strategies for simulation-based training;
• a comparison with the state-of-the-art R3 sampling on two tasks, which shows a significant
performance improvement on both, the convergence speed and validation loss.

2 Proposed method
Let’s first introduce notations. The sampling selects input points x ∈ X ⊆ Rd . x can be used directly
as a collocation point as for PINNs training, or go through a function f (x) = y, where y ∈ Y ⊆ Rd
is the output, the tuple (x, y) being used for training. The trained neural network is denoted fθ (x),
where θ are neural network parameters. In the case of surrogate training, the goal is to have fθ
approximate f .

2.1 The exploration-concentration balance control value

To balance a training set, we introduce a concentrate-explore value r(i) ∈ [0, 1] as a ratio of points
to sample non-uniformly by concentrating in areas of high loss. The remaining points are sampled
uniformly in the domain for (1) having examples for the network to remember what was learned and
(2) exploring areas with samples that might show high losses. The value r(i) changes over iterations
i of the neural network, growing linearly from the starting value sr to the ending value er for cr
iterations and then is constant at value er . The configuration of r value is called scenario and denoted
as a triplet (sr , er , cr ).

2.2 Gaussian neighbourhood sampling method

The sampling strategy is based on loss statistics from the neural network similar to [Wu et al., 2022].
The loss is normalized to construct a distribution that will provide a number of points to sample
within sample neighbourhoods.
The algorithm is repeated for N iterations. i.e. i = [N ], where we denote [N ] = 0, . . . , N − 1.
The covariance Σ = Id · σ defines the radius of the spherical neighbourhood (fixed). The values
r(i) ∈ [0, 1] to control concentrate-explore trade-off ratio are predefined (subsection 2.1). The initial
training set S (0) is sampled uniformly and the number of samples |S (i) | = Ns is fixed.
(i) (i)
At each iteration i, the neural network fθ(i) (x) provides a loss value L(xj ; θ(i) ) = lj ≥ 0 per sam-
(i)
ple xj ∈ S (i) for j = [Ns ]. The values of vector l(i) then are sum normalized to have distribution
properties. The categorical distribution over the samples of S (i) is constructed proportionally to the
(i) ′(i)
loss, i.e. P (xj ) = lj . This distribution trialed n times is a multinomial distribution P (n, l′(i) ) .
It models the number of points, called children, in the next training set to be sampled around each
point, called parent, of the current training set. The sampling is run with replacements, so a parent

2
with high loss will have several children while a parent with low loss might not have any. It allows
us to adaptively refine sampling density in areas with high loss. The name of the algorithm, Breed,
self-explains this mechanic of breeding the most interesting points from the point of view of the
training process.
(i+1)
The next training set S (i+1) consists of two sets. The concentration set Sc is a set of points
(i+1) (i)
sampled in a loss-driven manner. Its size depends on r(i) value, i.e. |Sc | = Nc = ⌊Ns × r(i) ⌋.
(i)
To construct it, the number of children for each parent is sampled as {mj }j=[Ns ] ∼ P (Nc , l′(i) ).
P (i)
Note that j=[Ns ] mj = Nc . A parent point acts as the centre of a Gaussian with Σ width, which
we call a neighbourhood. We sample the children set from each neighbourhood, i.e.:
[ (i)
[ n (i+1) (i)
o
Sc(i+1) = CΣ (xj ) = xk ∼ N (xj , Σ), k = [mj ] . (1)
j=[Ns ] j=[Ns ]

(i+1) (i+1) (i) (i)

The uniform set Su is a remaining portion of points, i.e. |Su | = Nu = Ns − Nc , sampled
uniformly to explore new data and to keep the presence of points with low loss as well, i.e.:
n o
(i+1)
Su(i+1) = xk ∼ U(X ), k = [Nu(i) ] . (2)

(i+1) (i+1)
Finally, the composed training set is S (i+1) := Sc ∪ Su .

3 Experiments

We compare1 Breed sampling to the baseline uniform dynamic sampling, which creates a train-
ing set for each iteration by selecting Ns uniformly distributed points in X , and the R3 sam-
pling [Daw et al., 2023] on two benchmark problems.

Benchmarks. The first problem is a new simulation-based benchmark

called pits gradient descent (PGD). The simulation is a gradient de-
scent on a surface made of "bell pits" configured by the number, centres,
weights, and widths of the bells (negative Gaussians curves). The in-
put is a point x0 ∈ X ⊂ R2 and the output is the local minimum
xf ∈ Y = X ⊂ R2 corresponding to the starting point of the gradient
descent. The task can be extended to a time-dependent variant and/or
3-dimensional variant with boundary conditions being a surface equation.
The motivation to create this benchmark was to have flexibility in config-
uring different cases and an obvious visual clue about hard-to-learn areas, Figure 1: IPGD surface
which are those with near-zero gradients.
The case we consider in experiments is a surface with two pits located at points (−0.5, −0.5) and
(1.5, 1.5) in the domain X × Y = [−2, 2]2 , with corresponding weights (0.895, 0.005), and widths
(0.4, 0.008). Because the widths and weights are extremely different, this configuration is referred
to as two imbalanced pits (IPGD), see Figure 1. The validation is done on two sets: the first one
consists of 10k points uniformly sampled in the domain, and the second, hard validation set, consists
of 10k points whose gradients are less than 4 · 10−3 . For a fair comparison, we designed a neural
network with hyperparameters optimized for the best results with uniform sampling. The model is a
4-layered perceptron, Adam optimizer with a learning rate 10−3 scheduled by plateau scheduler with
patience 5, weight decay 5 · 10−5 , and a Huber loss function. The maximum number of iterations is
N = 100 and the number of samples is Ns = 1000. The r scenario is (0.15, 0.7, 25), the width of
neighborhoods is σ = 0.005.
The second benchmark is a PINN trained to solve the classical Allen Cahn partial differential
equation. This benchmark is used in [Daw et al., 2023]. The model we use here has the exact same
architecture and hyperparameters, and others are as follows: N = 60k, Ns = 1000, r scenario is
(0.15, 0.7, 5000), σ = 0.001.
1
https://gitlab.inria.fr/breed/breed/

3
Validation set Hard validation set
Uniform Uniform
R3 0.6 R3

Huber Loss (Log)

10 1
Breed Breed
0.5

Huber Loss
0.4
0.3
10 2
0.2
0.1
0 20 40 60 80 100 0 20 40 60 80 100
Iteration Iteration
(a) Imbalanced pits GD

1.1
Validation set Validation set
Uniform 1.0
1.0 R3
Relative L 2 error

Relative L 2 error
Breed 0.9
0.9
0.8 0.8 Uniform
R3
0.7 0.7 Breed
0.6
0.6
0.5
0.4 0.5
0 10000 20000 30000 40000 0 10000 20000 30000 40000
Iteration Iteration
(b) Allen Cahn equation (averaged) (c) Allen Cahn equation (not averaged)
Figure 2: Comparison of validation errors over training iterations for Breed, R3 and uniform sampling
for (a) Imbalanced Pits Gradient Descent and (b-c) Allen Cahn PINN. The plots (a-b) are presented
as mean and standard deviation computed over 5 random seeds, whereas in the plot (c) each line
represents one run.

Results analysis. Results are presented in Figure 2 as errors over training iterations. For all
experiments, Breed shows both faster convergence and lower errors compared to Uniform and
R3 sampling. Notice that for Breed the error decrease happens near the cr iteration, showing the
importance of the exploration-concentration r scheme. In Figure 2a, R3 sampling performs worse
than the baseline for both validation sets, while Breed reaches twice lower error than the baseline even
for the hard validation set. In Figure 2b, the variation of error for R3 is explained by the instability of
the method visible in Figure 2c. R3 converged for 3 out of the 5 trainings to the same error value. In
opposite, Breed shows a stable convergence for all runs, while the uniform sampling converges only
for 1 run.

4 Conclusion

We presented Breed, a novel adaptive sampling algorithm for training DNNs with synthetic data.
Breed relies on the per-sample loss value to identify hard-to-train areas and combines a dual
exploration-concentration scheme with Uniform sampling to discover potential hard areas and
to remember trivial ones and Gaussian multinomial sampling to focus on hard areas. The experiments
demonstrate overall better performance in quality, convergence speed, and stability compared to the
baseline uniform and the state-of-the-art R3 sampling. Future work will focus on validating Breed
with more benchmarks, including higher dimension problems and simulation-based scenarios with
functions generating time series.

5 Acknowledgements

This work has been supported by the ENGAGE Inria-DFKI project.

4
References
[Baydin et al., 2019] Baydin, A. G., Shao, L., Bhimji, W., Heinrich, L., Meadows, L., Liu, J., Munk, A.,
Naderiparizi, S., Gram-Hansen, B., Louppe, G., Ma, M., Zhao, X., Torr, P., Lee, V., Cranmer, K., Prabhat, and
Wood, F. (2019). Etalumis: Bringing probabilistic programming to scientific simulators at scale. Publisher:
IEEE Computer Society.
[Brace et al., 2021] Brace, A., Yakushin, I., Ma, H., Trifan, A., Munson, T., Foster, I., Ramanathan, A., Lee,
H., Turilli, M., and Jha, S. (2021). Coupling streaming AI and HPC ensembles to achieve 100-1000x faster
biomolecular simulations.
[Cranmer et al., 2022] Cranmer, K., Brehmer, J., and Louppe, G. (2022). The frontier of simulation-based
inference.
[Das and Tesfamariam, 2022] Das, S. and Tesfamariam, S. (2022). State-of-the-art review of design of experi-
ments for physics-informed deep learning. Number: arXiv:2202.06416.
[Daw et al., 2023] Daw, A., Bu, J., Wang, S., Perdikaris, P., and Karpatne, A. (2023). Mitigating propagation
failures in physics-informed neural networks using retain-resample-release (r3) sampling. In Proceedings of
the 40th International Conference on Machine Learning, pages 7264–7302. PMLR. ISSN: 2640-3498.
[Fayyaz et al., 2022] Fayyaz, M., Aghazadeh, E., Modarressi, A., Pilehvar, M. T., Yaghoobzadeh, Y., and
Kahou, S. E. (2022). BERT on a data diet: Finding important examples by gradient-based pruning. In
NeurIPS.
[Foster et al., 2021] Foster, A., Ivanova, D. R., Malik, I., and Rainforth, T. (2021). Deep adaptive design:
Amortizing sequential bayesian experimental design.
[K and Søgaard, 2021] K, K. and Søgaard, A. (2021). Revisiting methods for finding influential examples.
[Katharopoulos and Fleuret, 2019] Katharopoulos, A. and Fleuret, F. (2019). Not all samples are created equal:
Deep learning with importance sampling.
[Killamsetty et al., 2022] Killamsetty, K., Abhishek, G. S., Ramakrishnan, G., Evfimievski, A. V., Popa, L., and
Iyer, R. (2022). AUTOMATA : Gradient based data subset selection for compute-efficient hyper-parameter
tuning. In Advances in Neural Information Processing Systems.
[Kye et al., 2022] Kye, S. M., Choi, K., and Chang, B. (2022). TiDAL: Learning training dynamics for active
learning. Publisher: arXiv Version Number: 1.
[Lavin et al., 2021] Lavin, A., Zenil, H., Paige, B., Krakauer, D., Gottschlich, J., Mattson, T., Anandkumar, A.,
Choudry, S., Rocki, K., Baydin, A. G., Prunkl, C., Paige, B., Isayev, O., Peterson, E., McMahon, P. L., Macke,
J., Cranmer, K., Zhang, J., Wainwright, H., Hanuka, A., Veloso, M., Assefa, S., Zheng, S., and Pfeffer, A.
(2021). Simulation intelligence: Towards a new generation of scientific methods.
[Meyer et al., 2021] Meyer, L., Pottier, L., Ribes, A., and Raffin, B. (2021). Deep surrogate for direct time fluid
dynamics. pages 1–7.
[Meyer et al., 2022] Meyer, L., Ribés, A., and Raffin, B. (2022). Simulation-based parallel training.
[Nabian et al., 2021] Nabian, M. A., Gladstone, R. J., and Meidani, H. (2021). Efficient training of physics-
informed neural networks via importance sampling. 36(8):962–977.
[Raissi et al., 2019] Raissi, M., Perdikaris, P., and Karniadakis, G. E. (2019). Physics-informed neural networks:
A deep learning framework for solving forward and inverse problems involving nonlinear partial differential
equations. 378:686–707.
[Wang et al., 2022] Wang, H., Huang, W., Wu, Z., Tong, H., Margenot, A. J., and He, J. (2022). Deep active
learning by leveraging training dynamics.
[Ward et al., 2021] Ward, L., Sivaraman, G., Pauloski, J. G., Babuji, Y., Chard, R., Dandu, N., Redfern, P. C.,
Assary, R. S., Chard, K., Curtiss, L. A., Thakur, R., and Foster, I. (2021). Colmena: Scalable machine-
learning-based steering of ensemble simulations for high performance computing. pages 9–20.
[Wu et al., 2022] Wu, C., Zhu, M., Tan, Q., Kartha, Y., and Lu, L. (2022). A comprehensive study of non-
adaptive and residual-based adaptive sampling for physics-informed neural networks.
[Yang et al., 2022] Yang, Z., Qiu, Z., and Fu, D. (2022). DMIS: Dynamic mesh-based importance sampling for
training physics-informed neural networks.

Karpatne A. Knowledge Guided Machine Learning... 2022
No ratings yet
Karpatne A. Knowledge Guided Machine Learning... 2022
442 pages
#Loyola, Pedergnana & Garcia - Smart Sampling and Incremental Function Learning For Very Large High Dimensional Data
No ratings yet
#Loyola, Pedergnana & Garcia - Smart Sampling and Incremental Function Learning For Very Large High Dimensional Data
13 pages
Not All Samples Are Created Equal
No ratings yet
Not All Samples Are Created Equal
13 pages
Self-adaptive physics-informed neural networks
No ratings yet
Self-adaptive physics-informed neural networks
23 pages
PINN
No ratings yet
PINN
27 pages
Solving Parabolic Periodic P-Laplacian by Deep Learning
No ratings yet
Solving Parabolic Periodic P-Laplacian by Deep Learning
15 pages
Influence-Balanced_Loss_for_Imbalanced_Visual_Classification
No ratings yet
Influence-Balanced_Loss_for_Imbalanced_Visual_Classification
10 pages
weighted-ensemble-2003.02316v3
No ratings yet
weighted-ensemble-2003.02316v3
41 pages
AI60201_module3_4_problems (1)
No ratings yet
AI60201_module3_4_problems (1)
4 pages
Answer Key
No ratings yet
Answer Key
12 pages
Kim2024_ADAM Optimization With Adaptive Batch Selection
No ratings yet
Kim2024_ADAM Optimization With Adaptive Batch Selection
42 pages
Zhang Learning Fast Sample Re-Weighting Without Reward Data ICCV 2021 Paper
No ratings yet
Zhang Learning Fast Sample Re-Weighting Without Reward Data ICCV 2021 Paper
10 pages
2023-Physics-Informed Radial Basis Network (PIRBN) A Local
No ratings yet
2023-Physics-Informed Radial Basis Network (PIRBN) A Local
41 pages
Coeurdoux etal23-PnPGibbs
No ratings yet
Coeurdoux etal23-PnPGibbs
15 pages
Linear Regression Notes
No ratings yet
Linear Regression Notes
15 pages
Aula 4 (L) - Oggi La Tua Lezione È in Presenza
No ratings yet
Aula 4 (L) - Oggi La Tua Lezione È in Presenza
11 pages
s10107-012-0572-5
No ratings yet
s10107-012-0572-5
29 pages
(Gin, Craig, Et Al.), Deep Learning Models For Global Coordinate Transformations That Linearize Pdes., Arxiv Preprint Arxiv-1911.02710 (2019) .
No ratings yet
(Gin, Craig, Et Al.), Deep Learning Models For Global Coordinate Transformations That Linearize Pdes., Arxiv Preprint Arxiv-1911.02710 (2019) .
27 pages
1710 11573 PDF
No ratings yet
1710 11573 PDF
14 pages
A Proposal On Machine Learning Via Dynamical Systems
No ratings yet
A Proposal On Machine Learning Via Dynamical Systems
11 pages
Cifar 10
No ratings yet
Cifar 10
31 pages
1-s2.0-S0950584923001866-main
No ratings yet
1-s2.0-S0950584923001866-main
12 pages
cs229 Notes1 PDF
No ratings yet
cs229 Notes1 PDF
28 pages
DL 3
No ratings yet
DL 3
72 pages
Machine Learning Notes by Standard Andrew Ng
No ratings yet
Machine Learning Notes by Standard Andrew Ng
142 pages
Machine Learning Notes AndrewNg
No ratings yet
Machine Learning Notes AndrewNg
141 pages
Linearna Regresija - NG
No ratings yet
Linearna Regresija - NG
7 pages
A Biased Graph Neural Network Sampler With Near Optimal Regret
No ratings yet
A Biased Graph Neural Network Sampler With Near Optimal Regret
25 pages
Stanford ML CS229-Merged Notes
No ratings yet
Stanford ML CS229-Merged Notes
126 pages
CS229 Lecture Notes: Supervised Learning
No ratings yet
CS229 Lecture Notes: Supervised Learning
293 pages
Unit 2
No ratings yet
Unit 2
37 pages
Adaptive Linear Neuron Using Linear (Identity) Activation Function With Batch Gradient Method
No ratings yet
Adaptive Linear Neuron Using Linear (Identity) Activation Function With Batch Gradient Method
19 pages
An Overview of Overfitting and Its Solutions
No ratings yet
An Overview of Overfitting and Its Solutions
7 pages
10.1007@s00521 018 3712 X PDF
No ratings yet
10.1007@s00521 018 3712 X PDF
13 pages
How_to_measure_uncertainty_in_uncertainty_sampling
No ratings yet
How_to_measure_uncertainty_in_uncertainty_sampling
35 pages
Solving Flows of Dynamical Systems by Deep Neural Networks and A Novel Deep Learning Algorithm
No ratings yet
Solving Flows of Dynamical Systems by Deep Neural Networks and A Novel Deep Learning Algorithm
12 pages
A Rate of Convergence of Physics Informed Neural Networks For The Linear Second Order Elliptic PDEs
No ratings yet
A Rate of Convergence of Physics Informed Neural Networks For The Linear Second Order Elliptic PDEs
24 pages
L06 Slides.mlp3
No ratings yet
L06 Slides.mlp3
26 pages
2304.12824v2
No ratings yet
2304.12824v2
31 pages
CS229 Lecture Notes: Supervised Learning
No ratings yet
CS229 Lecture Notes: Supervised Learning
30 pages
CS229
No ratings yet
CS229
69 pages
cs229 2
No ratings yet
cs229 2
275 pages
CS229 Lecture Notes: Supervised Learning
No ratings yet
CS229 Lecture Notes: Supervised Learning
30 pages
An Overview of Overfitting and Its Solutions
No ratings yet
An Overview of Overfitting and Its Solutions
7 pages
UC Merced Previously Published Works
No ratings yet
UC Merced Previously Published Works
40 pages
cs188_sp16_f_sol
No ratings yet
cs188_sp16_f_sol
27 pages
Dynamic Neural Diversification Path To Computation
No ratings yet
Dynamic Neural Diversification Path To Computation
9 pages
An Overview of Overfitting and Its Solutions
No ratings yet
An Overview of Overfitting and Its Solutions
7 pages
High Precision Differentiation Techniques For Data-Driven Solution of Nonlinear Pdes by Physics-Informed Neural Networks
No ratings yet
High Precision Differentiation Techniques For Data-Driven Solution of Nonlinear Pdes by Physics-Informed Neural Networks
28 pages
Regression
No ratings yet
Regression
30 pages
A Unified Scalable Framework for Causal Sweeping Strategies有四个时间分片可视化的paper - 结合时间分片和XPINN
No ratings yet
A Unified Scalable Framework for Causal Sweeping Strategies有四个时间分片可视化的paper - 结合时间分片和XPINN
28 pages
1804.10306v1
No ratings yet
1804.10306v1
64 pages
F - P N N S L: Unction Space Arameterization of Eural Etworks For Equential Earning
No ratings yet
F - P N N S L: Unction Space Arameterization of Eural Etworks For Equential Earning
29 pages
Minsky y Papert
No ratings yet
Minsky y Papert
77 pages
21CS031 - QB
No ratings yet
21CS031 - QB
10 pages
Solution ToYegnRame2001
No ratings yet
Solution ToYegnRame2001
107 pages
16992-Article Text-20486-1-2-20210518
No ratings yet
16992-Article Text-20486-1-2-20210518
9 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
Competitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition
From Everand
Competitive Learning: Fundamentals and Applications for Reinforcement Learning through Competition
Fouad Sabry
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
1379756569Maths project 2021-22 original. (3)
No ratings yet
1379756569Maths project 2021-22 original. (3)
6 pages
thesis_com_2024_zamxaka nichume
No ratings yet
thesis_com_2024_zamxaka nichume
40 pages
Physics-Informed Neural Networks for Elastic Plate
No ratings yet
Physics-Informed Neural Networks for Elastic Plate
11 pages
Physics-Informed Neural Networks in Fluid Simulations
No ratings yet
Physics-Informed Neural Networks in Fluid Simulations
9 pages
2210.09546v1
No ratings yet
2210.09546v1
13 pages
2308.08010v1
No ratings yet
2308.08010v1
18 pages
Deep Finite Volume Method for Partial Differential
No ratings yet
Deep Finite Volume Method for Partial Differential
21 pages
Article 60
No ratings yet
Article 60
6 pages
s00366 023 01799 7
No ratings yet
s00366 023 01799 7
18 pages
MTK For Control
No ratings yet
MTK For Control
39 pages
Physics Informed NN
No ratings yet
Physics Informed NN
85 pages
chapter1_1
No ratings yet
chapter1_1
14 pages
A Thermal Machine Learning Solver for Chip Simulation-复制
No ratings yet
A Thermal Machine Learning Solver for Chip Simulation-复制
14 pages
A Hands-On Introduction To Physics-Informed Neural Networks For Solving PDE
No ratings yet
A Hands-On Introduction To Physics-Informed Neural Networks For Solving PDE
38 pages
1-s2.0-S0045782523005546-main
No ratings yet
1-s2.0-S0045782523005546-main
20 pages
PINNacle
No ratings yet
PINNacle
50 pages
2203 09346v2
No ratings yet
2203 09346v2
34 pages
Artificial Intelligence (AI) and Deep Learning For CFD: October 2022
No ratings yet
Artificial Intelligence (AI) and Deep Learning For CFD: October 2022
163 pages
Exact Dirichlet Boundary Physics-Informed Neural Network EPINN
No ratings yet
Exact Dirichlet Boundary Physics-Informed Neural Network EPINN
18 pages
DeepXDE A Deep Learning Library For Solving Differ
No ratings yet
DeepXDE A Deep Learning Library For Solving Differ
17 pages
Data Science_Modern_Template_CV
No ratings yet
Data Science_Modern_Template_CV
1 page
Full Text
No ratings yet
Full Text
18 pages
(2021), Cai Et Al., Physics-Informed Neural Networks For Heat Transfer Problems
No ratings yet
(2021), Cai Et Al., Physics-Informed Neural Networks For Heat Transfer Problems
15 pages
Dissertation EDxcHO9
No ratings yet
Dissertation EDxcHO9
186 pages
M C GPT: A GPT: Y Runch Chat Assisted Framework For Scientific Machine Learning
No ratings yet
M C GPT: A GPT: Y Runch Chat Assisted Framework For Scientific Machine Learning
22 pages
Deep XDE
No ratings yet
Deep XDE
21 pages
JGR Solid Earth - 2022 - Rasht‐Behesht - Physics‐Informed Neural Networks PINNs for Wave Propagation and Full Waveform
No ratings yet
JGR Solid Earth - 2022 - Rasht‐Behesht - Physics‐Informed Neural Networks PINNs for Wave Propagation and Full Waveform
21 pages
Bachelor Thesis Jort Bouma
No ratings yet
Bachelor Thesis Jort Bouma
25 pages

Dymchenko and Raffin - 2023 - Loss-Driven Sampling Within Hard-To-Learn Areas Fo

Uploaded by

Dymchenko and Raffin - 2023 - Loss-Driven Sampling Within Hard-To-Learn Areas Fo

Uploaded by

Loss-driven sampling within hard-to-learn areas for

simulation-based neural network training

To cite this version:

HAL Id: hal-04305233

HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est

Distributed under a Creative Commons Attribution 4.0 International License

Sofya Dymchenko Bruno Raffin

Machine Learning and the Physical Sciences Workshop, NeurIPS 2023.

2.1 The exploration-concentration balance control value

2.2 Gaussian neighbourhood sampling method

(i+1) (i+1) (i) (i)

Benchmarks. The first problem is a new simulation-based benchmark

Huber Loss (Log)

This work has been supported by the ENGAGE Inria-DFKI project.

You might also like