Physical Metallurgy II - 02 Diffusion
Physical Metallurgy II - 02 Diffusion
Diffusion
M. Khodaei
2
Contents
1. Diffusion
Atomic Mechanisms of Diffusion
Interstitial Diffusion
Interstitial Diffusion as a Random Jump Process
Effect of Temperature—Thermal Activation
Steady-State Diffusion
Nonsteady-State Diffusion
Solutions to the Diffusion Equation
Substitutional Diffusion
Self-Diffusion
Vacancy Diffusion
Diffusion in Substitutional Alloys
High-Diffusivity Paths
3
Introduction
Many reactions and processes that are important in the treatment of materials rely on
the transfer of mass either within a specific solid (ordinarily on a microscopic level)
or from a liquid, a gas, or another solid phase. This is necessarily accomplished by
diffusion.
Materials of all types are often heat-treated to improve their properties. The phenomena
that occur during a heat treatment almost always involve atomic diffusion.
The study of phase transformations concerns those mechanisms by which a system
attempts to reach equilibrium state and how long it takes. One of the most
fundamental processes that controls the rate at which many transformations occur is the
diffusion of atoms.
So, the study of diffusion in solid materials is very important.
4
Introduction
Diffusion in solids: the phenomenon of mass transport by atomic
motion within a specific solid.
Initial condition
After diffusion
5
Introduction
High temperature
For a long while
This result indicates that copper atoms have migrated or diffused into
the nickel, and that nickel has diffused into copper. The process by
which atoms of one metal diffuse into another is termed inter-
diffusion, or impurity diffusion.
In almost all cases, there is a net drift or transport of atoms from high-
to low-concentration regions.
Diffusion also occurs for pure metals, but all atoms exchanging
positions are of the same type; this is termed self-diffusion. Of course,
self-diffusion is not normally subject to observation by noting
compositional changes.
6
Diffusion (Thermodynamic point of view)
The reason why diffusion occurs is always so as to produce a decrease in Gibbs
free energy
Example: Two blocks of the same A-B solid solution, but with different
compositions, are welded together and held at a temperature high enough for long-
range diffusion to occur.
1st case
The molar free energy of each part of the alloy will be given by G1 and G2,
and initially the total free energy of the welded block will be G3.
However, if diffusion occurs, the free energy will decrease towards G4 (the free energy
of a homogeneous alloy)
A decrease in free energy is produced by A and B atoms diffusing away from the regions
of high concentration to that of low concentration, i.e. down the concentration gradients.
7
Diffusion (Thermodynamic point of view)
The reason why diffusion occurs is always so as to produce a decrease in
Gibbs free energy
Example: Two blocks of the same A-B solid solution, but with different
compositions, are welded together and held at a temperature high enough for
long-range diffusion to occur.
2nd case (In alloy systems that contain a miscibility gap)
the A and B atoms would diffuse towards the regions of high concentration, i.e. up
the concentration gradients. However, this is still the most natural process as it
reduces the free energy from G3 towards G4 again.
8
Diffusion (Thermodynamic point of view)
chemical potential gradient
The A and B atoms are diffusing from regions where the chemical potential is high to
regions where it is low, i.e. down the chemical potential gradient in both cases.
9
Diffusion (Thermodynamic point of view)
So
Gibbs free energy:
diffusion always occurs to produce a decrease in Gibbs free energy.
Chemical Concentration:
In some cases, solute atoms diffuse from high concentration regions toward the low
concentration regions (Down the concentration gradient).
In some cases, solute atoms diffuse from low concentration regions toward the high
concentration regions (Up the concentration gradient).
Chemical potential:
the solute atoms are diffusing from regions where the chemical potential is high to
regions where it is low, i.e. down the chemical potential gradient in both cases.
NOTES:
In practice the first case mentioned above is far more common than the second case, and it
is usually assumed that diffusion occurs down concentration gradients.
However, since case 1 above is mainly encountered in practice and because concentration
differences are much easier to measure than chemical potential differences, it is
nevertheless more convenient to relate diffusion to concentration gradients.
10
Atomic Mechanisms of Diffusion
From an atomic perspective, diffusion is just the stepwise migration of atoms from
lattice site to lattice site. In fact, the atoms in solid materials are in constant motion,
rapidly changing positions.
There are two common mechanisms for in crystalline solids
Substitutional atoms usually diffuse by a vacancy mechanism
Interstitial atoms migrate by forcing their way between the larger atoms, i.e. interstitially.
In most metal alloys, interstitial diffusion occurs much more rapidly than diffusion by
the vacancy mode, because the interstitial atoms are smaller and thus more mobile.
Furthermore, there are more empty interstitial positions than vacancies; hence, the
probability of interstitial atomic movement is greater than for vacancy diffusion.
18
Interstitial Diffusion
Interstitial Diffusion as a random Jump Process
Consider first a simple model: A dilute interstitial solid solution where the parent atoms
are arranged on a simple cubic lattice and the solute B atoms fit perfectly into the
interstices without causing any distortion of the parent lattice.
If plane (1) contains n1 B-atoms per m2 the number of atoms that will jump from plane (1) to
(2) in 1 s (J) will be given by:
Likewise
Therefore
21
Interstitial Diffusion
Interstitial Diffusion as a random Jump Process
It can be seen that
So
(a) in equilibrium position, (b) at the position of maximum lattice distortion, (c) Variation of the
free energy of the lattice as a function of the position of interstitial.
Diffusion increases as the size of the interstitial atom decreases. (The atomic diameters
decrease in the order C, N, H.)
27
Interstitial Diffusion
effect of Temperature—Thermal Activation
Graphical representation of D as a function of temperature
So,
28
Interstitial Diffusion
Steady-State Diffusion
The simplest type of diffusion to deal with is when a steady state exists, that is when
the concentration at every point does not change with time.
A steady state will eventually be reached when the concentration everywhere reaches
an unchanged value.
29
Interstitial Diffusion
Steady-State Diffusion
The diffusion of atoms of a gas through a thin metal plate for which the concentrations (or
pressures) of the diffusing species on both surfaces of the plate are held constant.
This diffusion process reaches a state wherein the diffusion flux does not change with
time—that is, the mass of diffusing species entering the plate on the high-pressure side
is equal to the mass exiting from the low-pressure surface—such that there is no net
accumulation of diffusing species in the plate. This is an example of what is termed
steady-state diffusion.
30
Interstitial Diffusion
Steady-State Diffusion
The concentration gradient is the slope at a particular point on this curve. In the present
treatment, the concentration profile is assumed to be linear.
t=0
t>0
32
Interstitial Diffusion
Nonsteady-State Diffusion (Fick’s 2nd law)
In order to calculate how the concentration of B at any point varies with time consider a
narrow slice of material with an area A and a thickness δx
Or
And finally
34
Interstitial Diffusion
Nonsteady-State Diffusion (Fick’s 2nd law)
Under conditions of nonsteady-state the partial differential equation
Solutions to this expression (concentration in terms of both position and time) are possible
when physically meaningful boundary conditions are specified.
For 3D problem in a same manner
35
Interstitial Diffusion
Nonsteady-State Diffusion (Fick’s 2nd law)
Graphical interpretation of is the curvature of the CB versus x curve.
If the concentration profile appears as shown in the below Fig., it has a positive curvature
everywhere and the concentration at all points on such a curve will increase with time.
Positive
When the curvature is negative then CB decreases with time.
Negative
36
Interstitial Diffusion
Solutions to the Diffusion equation – Homogenization
It is often of interest to be able to calculate the time taken for an inhomogeneous alloy to
reach complete homogeneity, as for example in the elimination of segregation in
castings.
Example: segregation of alloying elements on grain boundaries
X
X
37
Interstitial Diffusion
Solutions to the Diffusion equation – Homogenization
The simplest composition variation that can be solved mathematically is if CB varies
sinusoidally with distance in one dimension
where is the mean composition, and β0 is the amplitude of the initial concentration
profile.
38
Interstitial Diffusion
Solutions to the Diffusion equation – Homogenization
To predict the distribution of concentration of the alloying element during
homogenization, we must consider the governing equation, boundary
conditions and initial conditions
Amplitude β 0/ e β 0/ e 2 β 0/ e 3 β 0/ e n
40
Interstitial Diffusion
Solutions to the Diffusion equation – Semi-infinite solution x
Example: The Carburization of Steel t=0
The aim of carburization is to increase the carbon concentration in the surface layers of
a steel product in order to achieve a harder wear-resistant surface. This is usually done
by holding the steel in a gas mixture containing CH4 and/or CO at a temperature where
it is austenitic.
An analytical expression for these profiles can be obtained by solving Fick’s second law
using the boundary conditions. The specimen is considered to be infinitely long.
or
41
Interstitial Diffusion
Solutions to the Diffusion equation – Semi-infinite solution
The erf function is shown graphically as below
Carbon concentration
42
Interstitial Diffusion
Solutions to the Diffusion equation – Semi-infinite solution
Note that since erf (0.5) ≈ 0.5 then
if suppose
so
then
So, the depth at which the carbon concentration is midway between Cs and C0 is given by
Example:
47
Substitutional Diffusion
Self-Diffusion
Since A* and A atoms are chemically identical their jump frequencies are also almost
identical. Thus the diffusion coefficient can be related to the jump frequency by below
Equation
so
It can be seen that for a given crystal structure and bond type Q/RTm is roughly
constant; that is, the activation enthalpy for self-diffusion, Q, is roughly proportional to
the equilibrium melting temperature, Tm.
50
Substitutional Diffusion
Self-Diffusion
An immediate consequence of these correlations is that the diffusion coefficients of all
materials with a given crystal structure and bond type will be approximately the same at the
same fraction of their melting temperature, i.e. D(T/Tm) = constant.
BUT WHY?
Increasing the interatomic bond strength makes the process of melting more difficult; that is, T m is
raised.
It also makes diffusion more difficult by increasing ΔHv and ΔHm.
51
Substitutional Diffusion
Vacancy Diffusion
The jumping of atoms into vacant sites can equally well be considered as the jumping of vacancies
onto atom sites.
Jumping of
an atom into
a vacant site
Jumping of a
vacancy onto
an atom site
If excess vacancies are introduced into the lattice they will diffuse at a rate which depends on the
jump frequency.
However, a vacancy is always surrounded by sites to which it can jump and it is thus analogous to
an interstitial atom.
Therefore a vacancy can be considered to have its own diffusion coefficient given by
52
Substitutional Diffusion
Vacancy Diffusion
The jumping of atoms into vacant sites can equally well be considered as the jumping of vacancies
onto atom sites.
Jumping of
an atom into
a vacant site
Jumping of a
vacancy onto
an atom site
where JA and JB are the fluxes of A and B atoms across a given lattice plane.
In the case of interstitial diffusion because the lattice planes of the parent atoms were unaffected
by the diffusion process. It will be seen, however, that the situation is different in the case of
substitutional diffusion.
54
Substitutional Diffusion
Diffusion in Substitutional Alloys
Consider the inter diffusion of A and B atoms in a diffusion couple that is made by welding together
blocks of pure A and B. If the couple is annealed at a high enough temperature, a concentration
profile will develop
If we make the simplifying assumption that the total number of atoms per unit volume is a
constant, C0, independent of composition, then
so
55
Substitutional Diffusion
Diffusion in Substitutional Alloys
Hence at a given position the concentration gradients driving the diffusion of A and B atoms
are equal but opposite, and the fluxes of A and B relative to the lattice can be written as
The rate at which vacancies are created or destroyed at any point is given by
58
Substitutional Diffusion
Diffusion in Substitutional Alloys
How to create or destroy the vacancies?
Jogged edge dislocations can provide a convenient source or sink for vacancies
Destroying the vacancies
before After
Vacancies can be absorbed by the extra half-plane of the edge dislocation shrinking
59
Substitutional Diffusion
Diffusion in Substitutional Alloys
How to create or destroy the vacancies?
Jogged edge dislocations can provide a convenient source or sink for vacancies
Creating the vacancies
before After
Vacancies can be emitted from the extra half-plane of the edge dislocation growth
60
Substitutional Diffusion
Diffusion in Substitutional Alloys
This means that extra atomic planes will be introduced on the B-rich side while whole
planes of atoms will be ‘eaten’ away on the A-rich side.
Consequently the lattice planes in the middle of the couple will be shifted to the left.
61
Substitutional Diffusion
Diffusion in Substitutional Alloys - Kirkendall effect
In order to show the movement of the lattice at a given point in the couple, insoluble wires are
inserted at the interface before welding the two blocks together. These wires remain in effect
‘fixed’ to the lattice planes.
The displacement of inert wires during diffusion was first observed by Smigelskas and
Kirkendall in 1947 and is usually known as the Kirkendall effect.
In this experiment a block of α-brass (Cu-30wt% Zn) was wound with molybdenum wire and
encapsulated in a block of pure Cu.
After annealing at a high temperature it was found that the separation of the markers (w) had
decreased. This is because DZn > DCu and the zinc atoms diffuse out of the central block faster
than they are replaced by copper atoms diffusing in the opposite direction.
Similar effects have since been demonstrated in many other alloy systems. In general it is found
that in any given couple, atoms with the lower melting point possess a higher D
62
Substitutional Diffusion
Diffusion in Substitutional Alloys
Vacancies are not necessarily all annihilated at dislocations, but can also be absorbed by internal
boundaries and free surfaces.
However, those not absorbed at dislocations mainly agglomerate to form holes or voids in the
lattice. Void nucleation is difficult because it requires the creation of a new surface and it is generally
believed that voids are heterogeneously nucleated at impurity particles.
Nanoshell
Nanotube
64
Substitutional Diffusion
Diffusion in Substitutional Alloys
The velocity at which any given lattice plane moves, v, can be related to the flux of
vacancies crossing it.
If the plane has an area A, during a small time interval δt, the plane will sweep out a volume
of Avδt containing Av δt C0 atoms.
This number of atoms is removed by the total number of vacancies crossing the plane in the
same time interval, i.e. J A δt, giving
Thus
Using the same arguments as were used to derive the 2nd Fick’s law, we can show that
The total flux of A atoms across a stationary plane with respect to the specimen is the sum
of two contributions:
1. a diffusive flux − DA ∂CA/ ∂x due to diffusion relative to the lattice, and
2. a flux υCA due to the velocity of the lattice in which diffusion is occurring.
Therefore:
66
Substitutional Diffusion
Diffusion in Substitutional Alloys
By combining this equation we obtain the equivalent of Fick’s first law for the flux relative
to the specimen ends:
Likewise
i.e.
Fick’s Second law
for Substitutional
Diffusion
By substituting J’A into , we have
These equations were first derived by Darken and are usually known as Darken’s equations
67
Substitutional Diffusion
Diffusion in Substitutional Alloys
In concentrated alloys the experimentally determined values of inter-diffusion
coefficient, DA and DB are also found to show the same form of temperature dependence
as all other diffusivities, so that
68
Substitutional Diffusion
High-Diffusivity Paths
In this section diffusion along dislocations, interfaces, grain boundaries and free
surfaces will be discussed.
All of these defects are associated with a more open structure and it has been shown
experimentally that the jump frequency for atoms migrating along these defects is
higher than that for diffusion in the lattice.
It will become apparent that under certain circumstances diffusion along these defects
can be the dominant diffusion path.
It is found experimentally that diffusion along grain boundaries and free surfaces can be
described by
This mainly reflects the relative ease with which atoms can migrate along free
surfaces, interior boundaries and through the lattice.
69
Substitutional Diffusion
High-Diffusivity Paths
Surface diffusion can play an important role in many metallurgical phenomena, but in an
average metallic specimen the total grain boundary area is much greater than the
surface area so that grain boundary diffusion is usually most important.
The effect of grain boundary diffusion can be illustrated by considering a diffusion couple
made by welding together two metals, A and B.
A atoms diffusing along the boundary will be able to penetrate much deeper than atoms
which only diffuse through the lattice.
In addition, as the concentration of solute builds up in the boundaries atoms will also
diffuse from the boundary into the lattice.
Points in the lattice close to grain boundaries can receive solute via the high conductivity
path much more rapidly than if the boundaries were absent.
70
Substitutional Diffusion
High-Diffusivity Paths - Grain boundary diffusion
For simplicity let us take a case of steady-state diffusion through a sheet of material in which
the grain boundaries are perpendicular to the sheet
so
71
Substitutional Diffusion
High-Diffusivity Paths - Grain boundary diffusion
Then
Or
Or
It can be seen that the relative importance of lattice and grain boundary diffusion depends on
the ratio Dbδ/D1d. When Dbδ > D1d diffusion through the lattice can be ignored in
comparison to grain boundary diffusion.
The effective width of a grain boundary is ∼0.5 nm. Grain sizes on the other hand can vary
from ∼1 to 1000 µm and the effectiveness of the grain boundaries will vary accordingly.
72
Substitutional Diffusion
High-Diffusivity Paths - Grain boundary diffusion
The relative magnitudes of Dbδ and D1d are most sensitive to temperature.
Note that although Db > Dl, at all temperatures the difference increases as temperature
decreases.
This is because the activation energy for diffusion along grain boundaries (Qb) is lower
than that for lattice diffusion (Ql). BUT WHY?
For example, in fcc metals it is generally found that Qb ~ 0.5 Ql.
73
Substitutional Diffusion
High-Diffusivity Paths - Grain boundary diffusion
The relative magnitudes of Dbδ and D1d are most sensitive to temperature.
This means that when the grain boundary diffusivity is scaled by the factor δ/d, the grain
boundary contribution to the total, or apparent, diffusion coefficient is negligible in
comparison to the lattice diffusivity at high temperatures, but dominates at low temperatures.
In general it is found that grain boundary diffusion becomes important below about 0.75-0.8
Tm, where Tm is the equilibrium melting temperature in Kelvin.
74
Substitutional Diffusion
High-Diffusivity Paths - Diffusion along Dislocations
If grain boundary diffusion is compared to the conduction of heat through a material made
of sheets of aluminum in a plastic matrix, the analogy for diffusion along dislocations
would be aluminum wires in a plastic matrix. The dislocations effectively act as pipes
along which atoms can diffuse with a diffusion coefficient Dp.
The contribution of dislocations to the total diffusive flux through a metal will of course
depend on the relative cross-sectional areas of pipe and matrix.