Operator Theory Functional Analysis and Applications
Operator Theory Functional Analysis and Applications
M. Amélia Bastos
Luís Castro
Alexei Yu. Karlovich
Editors
Operator Theory,
Functional
Analysis and
Applications
Operator Theory: Advances and Applications
Volume 282
Founded in 1979 by Israel Gohberg
Editors:
Joseph A. Ball (Blacksburg, VA, USA)
Albrecht Böttcher (Chemnitz, Germany)
Harry Dym (Rehovot, Israel)
Heinz Langer (Wien, Austria)
Christiane Tretter (Bern, Switzerland)
Subseries
Linear Operators and Linear Systems
Subseries editors:
Daniel Alpay (Orange, CA, USA)
Birgit Jacob (Wuppertal, Germany)
André C.M. Ran (Amsterdam, The Netherlands)
Subseries
Advances in Partial Differential Equations
Subseries editors:
Bert-Wolfgang Schulze (Potsdam, Germany)
Michael Demuth (Clausthal, Germany)
Jerome A. Goldstein (Memphis, TN, USA)
Nobuyuki Tose (Yokohama, Japan)
Ingo Witt (Göttingen, Germany)
This book is published under the imprint Birkhäuser, www.birkhauser-science.com, by the registered
company Springer Nature Switzerland AG.
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface
v
vi Preface
• Toeplitz plus Hankel operators and their close relatives. Fourier convolution
operators with slowly oscillating symbols. Singular integral operators with
Cauchy and Mellin kernels. Noncommutative C ∗ -algebras generated by Toeplitz
operators on the unit sphere.
• Groups of orthogonal matrices, all orbits of which generate lattices. The inverse
characteristic polynomial problem for trees.
IWOTA 2019 was held from 22nd to 26th of July 2019 at Instituto Superior
Técnico, University of Lisbon, Portugal. It was focused on the latest developments
in Functional Analysis, Operator Theory, and related fields and was organized by M.
Amélia Bastos (IST, UL), António Bravo (IST, UL), Catarina Carvalho (IST, UL),
Luís Castro (UA), Alexei Karlovich (FCT, UNL), and Helena Mascarenhas (IST,
UL).
IWOTA 2019 had 471 participants from all over the world. The program
consisted of 11 plenary lectures:
• J. Ball, Input/State/Output Linear Systems and Their Transfer Functions: From
Single-Variable to Multivariable to Free Noncommutative Function Theory;
• A. Böttcher, Lattices from Equiangular Tight Frames;
• K. Davidson, Noncommutative Choquet Theory;
• R. Exel, Statistical Mechanics on Markov Spaces with Infinitely Many States;
• H. Feichtinger, Classical Fourier Analysis and the Banach Gelfand Triple;
• R. Kaashoek, Inverting Structured Operators and Solving Related Inverse Prob-
lems;
• I. Klep, Bianalytic Maps between Matrix Convex Sets;
• L.-E. Persson, My Life with Hardy and His Inequalities;
• P. Semrl, Automorphisms of Effect Algebras;
• B. Silbermann, Invertibility Issues for Toeplitz Plus Hankel Operators;
• C. Tretter, Spectra and Essential Spectra of Non-Self-Adjoint Operators;
16 semi-plenary lectures:
• P. Ara, Separated Graphs and Dynamics;
• S. Belinschi, Analytic Transforms of Noncommutative Distributions;
• G. Blower, Linear Systems in Random Matrix Theory;
• A. Caetano, Function Spaces Techniques in Problems of Scattering by Fractal
Screens;
• A. B. Cruzeiro, On Some Stochastic Partial Differential Equations Obtained by
a Variational Approach;
• R. Duduchava, Boundary Value Problems on Hypersurfaces and -Convergence;
• P. Freitas, Spectral Determinants of Elliptic Operators: Dependence on Spatial
Dimension and Order of the Operator;
• E. Gallardo, Invariant Subspaces for Bishop Operators and Beyond;
• Yu. Karlovich, Algebras of Singular Integral Operators with Piecewise Quasi-
continuous Coefficients and Non-Smooth Shifts;
• S. Petermichl, Change of Measure;
• S. Roch, On Quasifractal Algebras;
Preface vii
ix
x Contents
Shoshana Abramovich
Abstract In this paper we extend the well known Heinz inequality which says that
√
2 a1 a2 ≤ H (t) ≤ a1 + a2 , a1 , a2 > 0, 0 ≤ t ≤ 1, where H (t) = a1t a21−t + a11−t a2t .
We discuss the bounds of H (t) in the intervals t ∈ [1, 2] and t ∈ [2, ∞) using the
subquadracity and the superquadracity of ϕ(x) = x t , x ≥ 0respectively. Further,
we extend H (t) to get results related to ni=1 Hi (t) = ni=1 ait ai+1 1−t
+ ai1−t ai+1 t ,
an+1 = a1 , ai > 0, i = 1, . . . , n, where H1 (t) = H (t). These results, obtained
by using rearrangement
techniques, show that the minimum and the maximum
of the sum ni=1 Hi (t) for a given t, depend only on the specific arrangements
called circular alternating order rearrangement and circular symmetrical order
rearrangement of a given set (a) = (a1 , a2 , . .
. , an ), ai > 0, i = 1, 2, . . . , n. These
lead to extended Heinz type inequalities of ni=1 Hi (t) for different intervals of
t. The results may also be considered as special cases of Jensen type inequalities
for concave, convex, subquadratic and superquadratic functions, which are also
discussed in this paper.
S. Abramovich ()
Department of Mathematics, University of Haifa, Mount Carmel, Haifa, Israel
e-mail: [email protected]
where
In the past few years attention has been put toward refining or reversing this
inequality. Recently, in 2019, new refinements of (1.1) have been proved in [5].
We start by discussing the bounds of H (t) in the intervals t ∈ [1, 2] and
t ∈ [2, ∞) using the subquadracity and the superquadracity of ϕ(x) = x t , x ≥ 0
respectively.
Then, we continue extending H (t) to get results related to
n n
1−t
Hi (t) = ait ai+1 + ai1−t ai+1
t
, an+1 = a1 , ai > 0, i = 1, . . . , n,
i=1 i=1
Then,
a
1 1 b
Gp (a, b) ≥ a 2 b 2 ϕ +ϕ . (1.2)
a b
Heinz and Jensen Type Inequalities and Rearrangements 3
when without loss of generality we assume that b ≥ a > 0. Therefore (1.2) follows
from (1.3).
When ϕ (x) ≡ 1 we get from Lemma 1.1:
Corollary 1.2 Let a, b, t ∈ R and let the symmetric function around t = 1
2 be
1 1
H (t) = a 1−t bt + a t b1−t ≥ 2a 2 b 2
When n ≤ t ≤ n + 1 and n = 1, 2, . . . ,
holds for all y ∈ [0, B), (see [2, Definition 2.1], there [0, ∞) instead [0, B)).
The function ϕ is called subquadratic if −ϕ is superquadratic.
4 S. Abramovich
Proposition
1.4 Suppose that f is superquadratic. Let 0 ≤ xi < B, i = 1, . . . , n
and let x = ni=1 ai xi , where ai ≥ 0, i = 1, . . . , n and ni=1 ai = 1. Then
n
n
ai f (xi ) − f (x) ≥ ai f (|xi − x|) . (1.4)
i=1 i=1
n
n
ai f (xi ) − f (x) ≤ ai f (|xi − x|) . (1.5)
i=1 i=1
Indeed, the first inequality in (1.7) follows from the superquadracity of φ(x) = x t ,
t ≥ 2, x ≥ 0. The second inequality follows from the convexity of φ(x) = x t ,
t ≥ 1, x ≥ 0, and the monotonicity of a t b1−t + a 1−t bt .
The proof is complete.
Heinz and Jensen Type Inequalities and Rearrangements 5
It is easy to verify that there are cases by which the superquadratic property leads to
a better result in (1.6) than the monotonicity property of H (t) and vice versa.
In Theorem 1.6 we employ for 1 ≤ t ≤ 2 the inequality satisfied by the
subquadratic function φ (x) = x t .
Theorem 1.6 Let a, b > 0 and 1 < t < 2. Then
a + b ≤ a t b1−t + a 1−t bt
a2 b2
≤ min (a + b) + a 1−t + b1−t |b − a|t , + (1.8)
b a
holds.
Proof The left hand-side of (1.8) follows from the monotonicity of a t b1−t +a 1−t bt ,
x ≥ 0, t ≥ 12 . The right hand-side follows from the subquadracity of φ (x) = x t ,
x ≥ 0, 1 ≤ t ≤ 2, and the monotonicity of a t b1−t + a 1−t bt , 1 ≤ t ≤ 2.
The proof is complete.
In the sequel we use the following lemma.
Lemma 1.7 Let ϕ be a continuous function on x ≥ 0 which is twice differentiable
on x > 0, with ϕ(0) = 0 and
lim xϕ (x) = 0.
x→0+
In [3] the authors deal with Jensen type inequalities and rearrangements. The
following definitions appear first in [6]. Theorem 2.6 below appears in [3].
Definition 2.1 ([6]) An ordered set (x) = (x1 , . . . , xn ) of n real numbers is
arranged in symmetrical decreasing order if
x1 ≤ xn ≤ x2 ≤ · · · ≤ x[(n+2/2)]
or if
xn ≤ x1 ≤ xn−1 ≤ · · · ≤ x[(n+1/2)] .
Definition 2.3 ([6]) A set (x) is arranged in circular symmetrical order if one of its
circular rearrangements is symmetrically decreasing.
The alternating order of (x) as definedin Definition 2.4 below was introduced and
proved in [8] to be the minimum of ni=1 xi xi+1 under rearrangement. Here the
minimum and maximum of
n
xi+1 xi
xi ϕ + xi+1 ϕ
xi xi+1
i=1
and of
n
1−t
xit xi+1 + xi1−t xi+1
t
i=1
under rearrangement of (x) is obtained, which leads to Heinz and Jensen type
inequalities.
Heinz and Jensen Type Inequalities and Rearrangements 7
or if
Definition 2.5 A set (x) is arranged in circular alternating order if one of its
circular rearrangements is arranged in an alternating order.
Theorem 2.6 ([3]) Let F (u, v) be differentiable and symmetric real function
defined on (α, β), −∞ ≤ α < β ≤ ∞ and α ≤ u, v, w ≤ β. Assume that
∂F (v, u) ∂F (v, w)
≤
∂v ∂v
for u ≤ min{w, v}. Then, for any set (x) = (x1 , x2 , . . . , xn ), α ≤ xi ≤ β, i =
1, . . . , n given except its arrangements
n
F (xi , xi+1 ) , xn+1 = x1
i=1
n
n
n
n
F (bi , bi+1 ) ≥ F (ci , ci+1 ) ≥ F (di , di+1 ) ≥ F (ei , ei+1 ) ,
i=1 i=1 i=1 i=1
and we make sure that the two first and two last numbers in (e) are e1 = a1 , e2 =
an−1 , en−1 = a2 , en = an which are already the two first and the two last in the
rearrangements of the alternating order of type (2.1). We realize also that when we
8 S. Abramovich
check (e2 , e3 , . . . , en−2 , en−1 ) we already have that e2 and en−1 are the largest and
the smallest numbers respectively in (e2 , e3 , . . . , en−2 , en−1 ).
Now we use the induction procedure: We assume the validity of Theorem 2.6
for the set of n − 2 numbers and show that this implies its validity for the set
of n numbers. More specifically, the n − 2 numbers if rearranged in alternating
order
n−2 of (2.2) give, according to the induction hypothesis, the smallest value of
i=2 F (ei , ei+1 ) and in the same time we get that (e1 , . . . , en ) is arranged in
alternating order too, this time according to (2.1) and therefore the proof by
induction for n numbers is obtained.
In Theorem 2.7 we introduce the symmetric function F (u, v) = uϕ uv +vϕ uv
and show when it satisfies Theorem 2.6. Similarly, in Theorem 2.8 we introduce the
symmetric function F (u, v) = us v t + ut v s and show when it satisfies Theorem 2.6.
We denote in the sequel ( x) = ( x1 ,
x2 , . . . , x) = (
xn ) and ( x1 ,
x2 , . . . ,
xn )
to be the circular alternating order rearrangement and circular symmetrical order
rearrangement of a given set (x) = (x1 , x2 , . . . , xn ) ∈ Rn .
Theorem 2.7 Assume ϕ is a concave differentiable function on R+ and
Then, for u ≤ v, w
n
n
xit s
xi+1 +t
xi+1xis ≤ xis xi+1
t
+ xi+1
t
xis
i=1 i=1
n
≤
xit s
xi+1 +t
xi+1xis , (2.4)
i=1
∂F (u, v) ∂F (w, v)
≤ when u ≤ min (v, w) .
∂v ∂v
10 S. Abramovich
Since
∂F (u, v) ∂ us v t + ut v s
= = tus v t −1 + sut v s−1 ,
∂v ∂v
we have to show that
n
≤ 2ϕ(1) xi . (2.6)
i=1
Heinz and Jensen Type Inequalities and Rearrangements 11
n
n
xi −
xi+1
xi −
xi+1
2ϕ(1)
xi +
xi ϕ +
xi+1 ϕ
xi
xi+1
i=1 i=1
n
xi+1
xi
≥
xi ϕ +
xi+1 ϕ
xi
xi+1
i=1
n
xi+1 xi
≥ xi ϕ + xi+1 ϕ
xi xi+1
i=1
n
xi+1
xi
≥
xi ϕ +
xi+1 ϕ
xi
xi+1
i=1
n
≥ 2ϕ (1) xi (2.7)
i=1
hold.
(c) For a superquadratic function ϕ the inequalities
n
xi+1
xi
xi ϕ +
xi+1 ϕ
xi
xi+1
i=1
n
xi+1 xi
≥ xi ϕ + xi+1 ϕ
xi xi+1
i=1
n
xi+1
xi
≥
xi ϕ +
xi+1 ϕ
xi
xi+1
i=1
n
xi −
xi+1
xi −
xi+1
≥
xi ϕ +
xi+1 ϕ
xi
xi+1
i=1
n
+2ϕ(1)
xi (2.8)
i=1
hold.
Proof We start with the proof of Case (a). From the concavity of ϕ we get that
xi+1 xi
xi ϕ + xi+1 ϕ ≤ ϕ(1) (xi + xi+1 ) , i = 1, . . . , n, xn+1 = x1 .
xi xi+1
12 S. Abramovich
Therefore,
n
n
xi+1 xi
xi ϕ + xi+1 ϕ ≤ 2ϕ(1) xi .
xi xi+1
i=1 i=1
n n
xi+1 xi
2ϕ(1) xi + xi ϕ 1 − + xi+1 ϕ 1 −
xi xi+1
i=1 i=1
n
xi+1 xi
≥ xi ϕ + xi+1 ϕ ,
xi xi+1
i=1
and together with Theorem 2.7 we get that inequalities (2.7) hold.
Case (c) follows similarly from (1.4) and Theorem 2.7. The proof is complete.
Theorem 2.11 combines the results related to rearrangements proved in Theorem 2.7
with the concavity of x t , x ≥ 0, 0 < t ≤ 1, convexity and subquadracity properties
of x t , x ≥ 0, for 1 ≤ t ≤ 2 and its superquadracity properties for t ≥ 2, to get what
we call: “Extended Heinz type inequalities”.
Alternatively the results of Theorem 2.11 can also be obtained from Theorem 2.8
for 0 ≤ t < 1, s = 1 − t and for t ≥ 1 and s = 1 − t, combined with the concavity
of x t , x ≥ 0, 0 < t ≤ 1, convexity and subquadracity properties of x t , x ≥ 0, for
1 ≤ t ≤ 2 and its superquadracity properties for t ≥ 2, to get the “Extended Heinz
type inequalities”.
Theorem 2.11 Let (x) ∈ Rn+ , (
x) and (
x) its circular alternating order rearrange-
ment and its circular symmetrical order rearrangement respectively, then when
Heinz and Jensen Type Inequalities and Rearrangements 13
xn+1 = x1 , xn+1 = x1 ,
xn+1 =
x1 , we get three cases of Extended Heinz type
inequalities:
(a) For 0 ≤ t < 1, the inequalities
n 1 1
n
1−t
2
xi
2 2
xi+1 ≤ xit
xi+1 +t
xi+1xi1−t
i=1 i=1
n
1−t
≤ xit xi+1 + xi+1
t
xi1−t
i=1
n
1−t
≤
xit
xi+1 +t
xi+1xi1−t
i=1
n
≤ 2xi
i=1
hold.
(b) For 1 ≤ t ≤ 2 and xi > 0, i = 1, 2, . . . , n, the inequalities
n n
n
2
xi + xi1−t + 1−t
xi+1 (| xi+1 |)t ≥
xi −
xit 1−t
xi+1 +t
xi+1xi1−t
i=1 i=1 i=1
n
≥ xi1−t xi+1
t
+ xi+1
t
xi1−t
i=1
n
1−t
≥
xit
xi+1 +t
xi+1xi1−t
i=1
n
≥2 xi
i=1
hold.
(c) For t ≥ 2 and xi > 0, i = 1, 2, . . . , n, the inequalities
n
n
1−t
xit
xi+1 +t
xi+1xi1−t ≥ xi1−t xi+1
t
+ xi+1
t
xi1−t
i=1 i=1
n
1−t
≥ xit
xi+1 +t
xi+1xi1−t
i=1
n n
≥2
xi + xi1−t +
1−t
xi+1 xi+1 |)t
xi −
(|
i=1 i=1
hold.
14 S. Abramovich
Proof From (2.6) in Theorem 2.10 for ϕ(x) = x t , 0 ≤ t ≤ 1 together with the left
hand-side inequality (1.1) we get that Case (a) holds.
From (2.7) in Theorem 2.10 for ϕ(x) = x t , 1 ≤ t ≤ 2 we get that Case (b) holds.
From (2.8) in Theorem 2.10 for ϕ(x) = x t , t ≥ 2 we get that Case (c) holds.
Alternatively we obtain the result of the theorem using Theorem 2.8 and the
concavity, convexity, subquadracity and superquadracity of x t , x > 0 in the relevant
intervals of t. The proof is complete.
References
Abstract This survey aims to highlight some of the consequences that repre-
sentable (and continuous) functionals have in the framework of Banach quasi
*-algebras. In particular, we look at the link between the notions of *-semisimplicity
and full representability in which representable functionals are involved. Then,
we emphasize their essential role in studying *-derivations and representability
properties for the tensor product of Hilbert quasi *-algebras, a special class of
Banach quasi *-algebras.
The investigation of (locally convex) quasi *-algebras was undertaken around the
beginning of the ’80s, in the last century, to give a solution to specific problems
concerning quantum statistical mechanics and quantum field theory, that required
instead a representation of observables as unbounded operators, see, e.g., [9, 28].
They were introduced by G. Lassner in the series of papers [21] and [22] in 1988.
A particular interest has been shown for the theory of *-representations of
quasi *-algebras in a specific family of unbounded densely defined and closable
operators. In this framework, a central role is played by representable functionals,
i.e., those functionals that admit a GNS-like construction. In the process of looking
M. S. Adamo ()
Dipartimento di Matematica, Università di Roma “Tor Vergata”, Roma, Italy
e-mail: [email protected]; [email protected]
L† (D, H) is a C-vector space with the usual sum and scalar multiplication. If we
define the involution † and partial multiplication as
π(a)π(x) = π(ax).
A *-representation π is
• cyclic if π(A0 )ξ is dense in Hπ for some ξ ∈ Dπ ;
• closed if π coincides with its closure
π defined in [30, Section 2].
If (A, A0 ) has a unit 1, then we suppose that π(1) = ID , the identity operator of D.
Theorem 1.4 ([30]) Let (A, A0 ) be a quasi *-algebra with unit 1 and let ω be a
linear functional on (A, A0 ) that satisfies the conditions (L.1)–(L.3) of Definition
1.2. Then, there exists a closed cyclic *-representation πω of (A, A0 ), with cyclic
vector ξω such that
(1) the map y → xy is continuous with respect to the norm defined by the inner
product;
(2) xy|z = y|x ∗ z for all x, y, z ∈ A0 ;
(3) x|y = y ∗ |x ∗ for all x, y ∈ A0 ;
(4) A20 is total in A0 .
Such a *-algebra A0 is said to be a Hilbert algebra. If H denotes the Hilbert space
completion of A0 with respect to the inner product ·|·, then (H, A0 ) is called a
Hilbert quasi *-algebra.
then ϕω (xn , xn ) → 0 as n → ∞.
By the condition (2.2), the closure of ϕω , denoted by ϕ ω , is a well-defined
sesquilinear form on D(ϕ ω ) × D(ϕ ω ) as
τn
D(ϕ ω ) = {a ∈ A : ∃{xn } ⊂ A0 s.t. xn −
→ a and
ϕω (xn − xm , xn − xm ) → 0}.
20 M. S. Adamo
For a locally convex quasi *-algebra (A, A0 ), ϕ ω always exists, [15]. Neverthe-
less, it is unclear whether D(ϕ ω ) is the whole space A. We show in Proposition 2.1
below that D(ϕ ω ) is A in the case of a Banach quasi *-algebra.
Proposition 2.1 ([4]) Let (A, A0 ) be a Banach quasi *-algebra with unit 1, ω ∈
Rc (A, A0 ) and ϕω the associated sesquilinear form on A0 × A0 defined as in (2.1).
Then D(ϕ ω ) = A; hence ϕ ω is everywhere defined and bounded.
Set now
n
+
A0 := xk∗ xk , xk ∈ A0 , n ∈ N .
k=1
Then A+ +
0 is a wedge in A0 and we call the elements of A0 positive elements of A0 .
τn τn
As in [15], we call positive elements of A the elements of A+
0 . We set A+ := A+
0 .
Definition 2.2 A linear functional on A is positive if ω(a) ≥ 0 for every a ∈ A+ .
A family of positive linear functionals F on (A, A0 ) is called sufficient if for every
a ∈ A+ , a = 0, there exists ω ∈ F such that ω(a) > 0.
Definition 2.3 A normed quasi ∗ -algebra (A, A0 ) is called fully representable if
Rc (A, A0 ) is sufficient and D(ϕ ω ) = A for every ω in Rc (A, A0 ).
We denote by QA0 (A) the family of all sesquilinear forms : A × A → C such
that
(i) (a, a) ≥ 0 for every a ∈ A;
(ii) (ax, y) = (x, a ∗y) for every a ∈ A, x, y ∈ A0 ;
We denote by SA0 (A) the subset of QA0 (A) consisting of all continuous sesquilinear
forms having the property that also
(iii) |(a, b)| ≤ ab, for all a, b ∈ A.
Definition 2.4 A normed quasi *-algebra (A, A0 ) is called *-semisimple if, for
every 0 = a ∈ A, there exists ∈ SA0 (A) such that (a, a) > 0.
Proposition 2.1 is useful to show the following result, clarifying the link between
*-semisimplicity and full representability. We need first to introduce the following
condition of positivity
Theorem 2.5 ([4]) Let (A, A0 ) be a Banach quasi *-algebra with unit 1. The
following statements are equivalent.
(i) Rc (A, A0 ) is sufficient.
(ii) (A, A0 ) is fully representable.
On Some Applications of Representable and Continuous Functionals 21
If the condition (P ) holds, (i) and (ii) are equivalent to the following
(iii) (A, A0 ) is *-semisimple.
The condition (P ) is not needed to show (iii) ⇒ (ii) of Theorem 2.5.
Theorem 2.5 shows the deep connection between full representability and *-
semisimplicity for a Banach quasi *-algebra. Under the condition of positivity (P ),
the families of sesquilinear forms involved in the definitions of full representability
and *-semisimplicity can be identified.
For a Hilbert quasi *-algebra (H, A0 ), representable and continuous functionals
are in 1-1 correspondence with a certain family of elements in H.
Definition 2.6 Let (H, A0 ) be a Hilbert quasi *-algebra. An element ξ ∈ H is
called
(i) weakly positive if the operator Lξ : A0 → H defined as Lξ (x) := ξ x is positive.
(ii) bounded if the operator Lξ : A0 → H is bounded.
+ (resp.
The set of all weakly positive (resp. bounded) elements will be denoted as Hw
Hb ), see [4, 29].
Theorem 2.7 ([4]) Suppose that (H, A0 ) is a Hilbert quasi *-algebra. Then ω ∈
Rc (H, A0 ) if, and only if, there exists a unique weakly positive bounded element
η ∈ H such that
ω(ξ ) = ξ |η , ∀ξ ∈ H.
Consider A0 to be L∞ (I, dλ), where I is a compact interval of the real line and λ is
the Lebesgue measure. Let τn be the topology generated by the p-norm
1
p
f p := |f |p dλ , ∀f ∈ L∞ (I, dλ),
I
for p ≥ 1. Then, the completion of L∞ (I, dλ) with respect to the · p -norm is
given by Lp (I, dλ).
We conclude that, for p ≥ 1, (Lp (I, dλ), L∞ (I, dλ)) is a Banach quasi *-
algebra. The same conclusion holds if we consider the *-algebra of continuous
functions over I = [0, 1], denoted by C(I ).
For p ≥ 2, (Lp (I, dλ), L∞ (I, dλ)) and (Lp (I, dλ), C(I )) are fully repre-
sentable and *-semisimple Banach quasi *-algebras. For 1 ≤ p < 2, we have
Rc (Lp (I, dλ), A0 ) = {0} for both A0 = L∞ (I, dλ) or A0 = C(I ). Note that the
Banach quasi *-algebras (Lp (I, dλ), L∞ (I, dλ)) and (Lp (I, dλ), C(I )) verify the
condition (P ) for all p ≥ 1.
22 M. S. Adamo
*-Derivations have been widely employed to describe the dynamics for a quantum
phenomenon. For a quantum system of finite volume V , the Hamiltonian belongs
to the local C*-algebra and implements the inner *-derivation for the dynamics.
Nevertheless, the thermodynamical limit in general fails to exist, see [7].
Under some assumptions, the limit turns out to be a weak *-derivation generating
a one parameter group of weak *-automorphisms, defined for a *-semisimple
Banach quasi *-algebra, as we will investigate in the following section. For detailed
discussion, see [5, 6, 20].
The derivation δ : A0 [ · ] → A[ · ] is densely defined. If δ is closable, then
its closure δ as a linear map is not a derivation in general.
• In this section, we only consider *-semisimple Banach quasi *-algebras, if not
otherwise specified.
For these Banach quasi *-algebras, define a weaker multiplication in A as in [29].
Definition 3.1 Let (A, A0 ) be a Banach quasi *-algebra. Let a, b ∈ A. We say that
the weak multiplication a b is well-defined if there exists a (necessarily unique)
c ∈ A such that:
La : A0 → A La (x) = ax ∀x ∈ A0 (3.1)
Ra : A0 → A Ra (x) = xa ∀x ∈ A0 . (3.2)
24 M. S. Adamo
and
βt (a) − a
δτ (a) = τ − lim , a ∈ D(δτ ).
t →0 t
when a is bounded.
In the following Proposition, the *-semisimplicity is automatically given by
assuming the existence of a representable and continuous functional with associated
faithful *-representation (see [1, 11]).
Proposition 3.8 ([1]) Let (A, A0 ) be a Banach quasi *-algebra with unit 1 and
let δ be a weak *-derivation of (A, A0 ) such that D(δ) = A0 . Suppose that there
exists a representable and continuous functional ω with ω(δ(x)) = 0 for x ∈ A0
and let (Hω , πω , λω ) be the GNS-construction associated with ω. Suppose that πω
26 M. S. Adamo
and δ is closable.
In this section, we recall the construction of the tensor product Hilbert quasi
*-algebra of two given Hilbert quasi *-algebras (H1 , A0 ) and (H2 , B0 ), giving
some results about the relationship between the representability properties for the
tensor product and those for the factors. For further reading on the algebraic and
topological tensor product, see [12–14, 19, 23, 24], for the tensor product Hilbert
quasi *-algebras refer to [2, 3].
For convenience, we assume that (H1 , A0 ) and (H2 , B0 ) are unital Hilbert quasi
*-algebras.
The algebraic tensor product A0 ⊗ B0 is the tensor product *-algebra of A0
and B0 , it is endowed with the canonical multiplication and involution and it is
considered as a subspace of the tensor product vector space H1 ⊗ H2 .
If H1 and H2 are endowed with the inner product ·|·1 and ·|·2 respectively,
then A0 ⊗ B0 satisfies the requirements of Definition 1.7 if we endow it with the
following well-defined inner product
m
n
z|z h
:= xi |xj yi |yj , ∀z, z ∈ A0 ⊗ B0 , (4.1)
1 2
i=1 j =1
n m
where z = i=1 xi ⊗ yi and z = j =1 xj ⊗ yj (see [24, 25, 27]). Then, the
completion of A0 ⊗ B0 with respect to the norm · h induced by the inner product
in (4.1) is a Hilbert quasi *-algebra. Since A0 , B0 are respectively dense in H1 , H2
and · h is a cross-norm, i.e. x ⊗ yh = x1 y2 for all x ⊗ y ∈ A0 ⊗ B0 , the
tensor product *-algebra A0 ⊗ B0 is · h -dense in H1 ⊗h H2 . We conclude that
h B0 ≡ H1 ⊗
A0 ⊗ h H2
semisimple and fully representable Hilbert quasi *-algebra, applying Theorems 2.5
and 2.7.
Employing Theorem 2.7 and Lemma 4.1 in [2], we can give an alternative proof
for Theorem 4.1. The proof will be given below after the proof of Theorem 4.2.
Theorem 4.1 ([2]) Let (H1 ⊗ h H2 , A0 ⊗ B0 ) be the tensor product Hilbert quasi *-
algebra of (H1 , A0 ) and (H2 , B0 ). Then, if ω1 , ω2 are representable and continuous
functionals on H1 and H2 respectively, then ω1 ⊗ ω2 extends to a representable and
continuous functional on the tensor product Hilbert quasi *-algebra H1 ⊗ h H2 .
We now look at what happens to the sesquilinear forms in SA0 (H1 ) and
SB0 (H2 ).
Theorem 4.2 Let (H1 ⊗ h H2 , A0 ⊗ B0 ) be the tensor product Hilbert quasi *-
algebra of (H1 , A0 ) and (H2 , B0 ). Let φ1 ∈ SA0 (H1 ) and φ2 ∈ SB0 (H2 ). Then,
φ2 ∈ SA0 ⊗B0 (H1 ⊗
φ1 ⊗ h H2 ), i.e., it satisfies the conditions (i), (ii) and (iii) after
the Definition 2.3.
Proof Let φ1 ∈ SA0 (H1 ) and φ2 ∈ SB0 (H2 ). These sesquilinear forms are
continuous, thus by the representation theorem for bounded sesquilinear forms
over a Hilbert space, there exist unique bounded operators T1 : H1 → H1 and
T2 : H2 → H2 such that
φ1 (ξ, ξ ) = ξ |T1 ξ and φ2 (η, η ) = η|T2 η ,
To conclude the proof, we show that φ1 ⊗ φ2 is in SA0 ⊗B0 (H1 ⊗ h H2 ). Indeed,
φ2 is a positive sesquilinear form, since T1 ⊗
φ1 ⊗ T2 is a positive operator as a tensor
product of positive operators on a Hilbert space. Thus, the condition (i) is verified.
The condition (ii) can be easily verified using the corresponding properties of φ1
and φ2 . For (iii), we know that
φ2 = T1 ⊗
φ1 ⊗ T2 ≤ T1 T2 ≤ 1.
Using the same argument as in Theorem 2.5 (refer to [4] for a complete proof), the
sesquilinear form ϕ ω associated with a representable and continuous functional ω
in a Hilbert quasi *-algebra (H, A0 ) with unit 1 is bounded and in QA0 (H). Hence,
it is represented by a bounded operator Sω such that
We want to show that Sω 1 is a weakly positive and bounded element. Indeed, for all
x ∈ A0 , we have
x|RSω 1 x = x|x(Sω 1) = x ∗ x|Sω 1 = ω(x ∗ x) ≥ 0,
A similar argument shows that Sω 1ξ is well-defined for all ξ ∈ H. Recall that an
element ξ ∈ H is bounded if, and only if, Rw (ξ ) = Lw (ξ ) = H, where Rw (ξ )
(resp. Lw (ξ )) is the space of universal right (resp. left) weak multipliers of ξ (see
[4, Proposition 4.10]). Then, Sω 1 is a bounded element.
On Some Applications of Representable and Continuous Functionals 29
By Theorem 2.7, we have that Sω 1 is the weakly positive and bounded element
in H corresponding to the representable and continuous functional ω.
For what we just argued, we can give an alternative proof of Theorem 4.1.
h H2 are corresponding to
The proof tells us explicitly what the elements in H1 ⊗
ω2 for ω1 ∈ Rc (H1 , A0 ),
representable and continuous functionals of the form ω1 ⊗
ω2 ∈ Rc (H2 , B0 ).
Proof of Theorem 4.1 Let ω1 , ω2 be representable and continuous functionals on
H1 , H2 respectively. Then,
for what we discussed above, ω1 (ξ ) = ξ |Sω1 1H1 for
all ξ ∈ H1 and ω2 (η) = η|Sω2 1H2 for η ∈ H2 .
Looking at their tensor product on H1 ⊗h H2 , we have
n
ω1 ⊗ ω(ζ ) = ξi ⊗ ηi |Sω1 1H1 ⊗ Sω2 1H2 ,
i=1
for every ζ = ni=1 ξi ⊗ ηi in H1 ⊗h H2 .
Since ϕ ω1 and ϕ ω2 are in QA0 (H1 ) and QB0 (H2 ) respectively, with the
same argument of Theorem 4.2, ϕ ω1 ⊗ ϕ ω2 is continuous and belongs to
QA0 ⊗B0 (H1 ⊗ H2 ). Furthermore, it is represented by Sω1 ⊗
h Sω2 . Hence
H2 .
where {zn } is a sequence of elements in A0 ⊗ B0 · h -converging to ψ ∈ H1 ⊗
The author wishes to thank the anonymous referees for their useful suggestions that improved
the presentation of this manuscript.
References
1. M.S. Adamo, The interplay between representable functionals and derivations on Banach
quasi *-algebras, in Proceedings of the International Conference on Topological Algebras and
Their Applications – ICTAA 2018, ed. by M. Abel. Mathematics Studies (Tartu), 7 (Estonian
Mathematical Society, Tartu, 2018), pp. 48–59
2. M.S. Adamo, About tensor product of Hilbert quasi *-algebras and their representability,
(accepted for publication in the Proceedings of OT27, Theta 2020)
3. M.S. Adamo, M. Fragoulopoulou, Tensor products of normed and Banach quasi *-algebras. J.
Math. Anal. Appl. 490, 2 (2020)
4. M.S. Adamo, C. Trapani, Representable and continuous functionals on a Banach quasi ∗ -
algebra. Mediterr. J. Math. 14, 157 (2017)
5. M.S. Adamo, C. Trapani, Unbounded derivations and *-automorphisms groups of Banach
quasi *-algebras. Ann. Mat. Pura Appl. (4) 198, 1711–1729 (2019)
6. J.-P. Antoine, A. Inoue, C. Trapani, O*-dynamical systems and *-derivations of unbounded
operator algebras. Math. Nachr. 204, 5–28 (1999)
7. J.-P. Antoine, A. Inoue, C. Trapani, Partial *-Algebras and Their Operator Realizations
(Kluwer Academic, Dordrecht, 2003)
8. F. Bagarello, C. Trapani, CQ∗ -algebras: structure properties. Publ. RIMS Kyoto Univ. 32, 85–
116 (1996)
9. F. Bagarello, C. Trapani, The Heisenberg dynamics of spin systems: a quasi *-algebra
approach. J. Math. Phys. 37, 4219–4234 (1996)
10. F. Bagarello, C. Trapani, Lp -spaces as quasi *-algebras. J. Math. Anal. Appl. 197, 810–824
(1996)
11. O. Bratteli, D.W. Robinson, Unbounded derivations of C*-algebras. Commun. Math. Phys. 42,
253–268 (1975)
12. L. Chambadal, J.L. Ovaert, Algèbre Linéaire et Algèbre Tensorielle (Dunod Université, Paris,
1968)
13. A. Defant, K. Floret, Tensor Norms and Operator Ideals (North-Holland, Amsterdam, 1993)
14. M. Fragoulopoulou, Topological Algebras with Involution (North-Holland, Amsterdam, 2005)
15. M. Fragoulopoulou, C. Trapani, S. Triolo, Locally convex quasi *-algebras with sufficiently
many *-representations. J. Math. Anal. Appl. 388, 1180–1193 (2012)
16. M. Fragoulopoulou, A. Inoue, M. Weigt, Tensor products of generalized B ∗ -algebras. J. Math.
Anal. Appl. 420, 1787–1802 (2014)
17. M. Fragoulopoulou, A. Inoue, M. Weigt, Tensor products of unbounded operator algebras.
Rocky Mt. 44, 895–912 (2014)
18. W.-D. Heinrichs, Topological tensor products of unbounded operator algebras on Frèchet
domains. Publ. RIMS Kyoto Univ. 33, 241–255 (1997)
19. A.Ya. Helemskii, Lectures and Exercises on Functional Analysis (American Mathematical
Society, Providence, 2006)
20. E. Hille, R. Phillips, Functional Analysis and Semi-groups (American Mathematical Society,
Providence, 1996)
21. G. Lassner, Topological algebras and their applications in quantum statistics. Wiss. Z. KMU-
Leipzig, Math. Naturwiss. R. 30, 572–595 (1981)
22. G. Lassner, Algebras of unbounded operators and quantum dynamics. Physica A 124, 471–480
(1984)
23. K.B. Laursen, Tensor products of Banach algebras with involution. Trans. Am. Math. Soc. 136,
467–487 (1969)
On Some Applications of Representable and Continuous Functionals 31
Yagub N. Aliyev
1 Introduction
This work was completed with the support of ADA University Faculty Research and Development
Fund.
Y. N. Aliyev ()
School of IT and Engineering, ADA University, Baku, Azerbaijan
e-mail: [email protected]
where c, d are real constants and c > 0, λ is the spectral parameter, q(x) is a real
valued and continuous function over the interval [0, 1].
The present article is about the minimality properties in L2 (0, 1) of the system
of root functions of the boundary value problem (1.1)–(1.3).
It was proved in [2] (see also [3]) that the eigenvalues of the boundary value
problem (1.1)–(1.3) form an infinite sequence accumulating only at +∞ and only
the following cases are possible:
(a) all the eigenvalues are real and simple;
(b) all the eigenvalues are real and all, except one double, are simple;
(c) all the eigenvalues are real and all, except one triple, are simple;
(d) all the eigenvalues are simple and all, except a conjugate pair of non-real, are
real.
Let {vn }∞
n=0 be a sequence of elements from L2 (0, 1) and Vk the closure (in the
norm of L2 (0, 1)) of the linear span of {vn }∞ ∞
n=0, n=k . The system {vn }n=0 is called
minimal in L2 (0, 1) if vk ∈/ Vk for all k = 0, 1, 2, . . ..
The eigenvalues λn (n ≥ 0) will be considered to be listed according to non-
decreasing real part and repeated according to algebraic multiplicity. Asymptotics of
eigenvalues and oscillation of eigenfunctions of the boundary value problem (1.1)–
(1.3), with linear function in the boundary condition, replaced by general rational
function, were studied in paper [3]. The case c < 0 in our problem does not involve
any non-real or non-simple eigenvalues and can be found as a special case in papers
[5, 6].
The main objective of the present paper is to show that the set of root functions
with an arbitrary function removed form a minimal system in L2 (0, 1), except some
cases where this system is neither complete nor minimal. In more general form this
result was obtained in [8, 9]. The main advantage of the current method is that it does
not use any heavy machinery or extensions into more general operators defined over
L2 ⊕ C. Also the special associated functions yk+1 ∗ , y # , y # , which are called in
k+1 k+2
the paper as auxiliary associated functions and used to describe the necessary and
sufficient conditions for minimality in a nice way, are very helpful when a concrete
boundary value problem is studied. One such worked out example is given at the
end of the current paper. The mere number of possibilities that arise in the simple
case of linear dependence shows how different the minimality properties are from
corresponding properties in the cases of boundary conditions free of any eigenvalue
parameters. These minimality results can be used to prove basis properties of the
root functions in L2 (0, 1).
The proofs in this and subsequent sections are similar to the corresponding results
in [1] so we will skip some of them. Let y(x, λ) be a non-zero solution of (1.1),
satisfying initial conditions y(0) = sin β and y (0) = sin β. Then we can write the
Minimality Properties of Sturm-Liouville Problems 35
characteristic equation as
We also note that y(x, λ) → y(x, λn ), uniformly, for x ∈ [0, 1], as λ → λn , and
the function yn (x) = y(x, λn ) is an eigenfunction of (1.1)–(1.3) corresponding to
eigenvalue λn .
By (1.1)–(1.3) we have,
−yn + q(x)yn = λn yn , yn (0) sin β = yn (0) cos β, yn (1) = (cλn + d)yn (1).
d
(y(x, λ)y (x, μ) − y (x, λ)y(x, μ)) = (λ − μ)y(x, λ)y(x, μ).
dx
By integrating this identity from 0 to 1, we obtain
1
(λ − μ)(y(·, λ), y(·, μ)) = (y(x, λ)y (x, μ) − y (x, λ)y(x, μ)) . (2.3)
0
By (2.1),
(μ) (λ)
(y(·, λ), y(·, μ)) = cy (1, λ)y (1, μ) − y (1, λ) + y (1, μ) . (2.6)
λ−μ λ−μ
Note the fact that (λn ) = (λm ) = 0. The required equality (2.2) is now obvious
if we substitute the parameters λ, μ by λn , λm , respectively.
Remark 2.2 Since λn , λm are the eigenvalues of (1.1)–(1.3), it is also possible to
prove (2.2) by letting λ → λn (μ = λn ) and then letting μ → λm in (2.6). It is also
possible to substitute the parameters λ, μ, respectively by λn , λm at the beginning of
the proof, in (2.3). Then the proof would be much simpler. But we chose the present
proof because we will need (2.6) in the subsequent arguments.
In the remaining part of this section we will collect some simple facts about the
inner products and norms of the eigenfunctions.
Lemma 2.3 If λn is a real eigenvalue then
2
yn 22 = (yn , yn ) = cyn (1) + yn (1) (λn ). (2.7)
2
yk 22 = (yk , yk ) = cyk (1) . (2.8)
2
yr 22 = c|yr (1)| . (2.9)
2
Bn = yn 22 − c|yn (1)| . (2.10)
The following corollary of (2.7)–(2.9) will be needed (cf. [2, Theorem 4.3]).
Corollary 2.6 Bn = 0 if and only if the corresponding eigenvalue λn is real and
simple.
If λk is a multiple (double or triple) eigenvalue (λk = λk+1 ) then Bk =
yk (1)ω (λk ) = 0 and Bk+1 is not defined, so we set Bk+1 = yk (1)ω (λk )/2. If
λk is a triple eigenvalue (λk = λk+1 = λk+2 ) then Bk+1 = 0 and Bk+2 is not
defined, so we set Bk+2 = yk (1)ω (λk )/6.
We conclude this section with the following.
Minimality Properties of Sturm-Liouville Problems 37
2
(yr , ys ) = cyr (1) + yr (1) (λr ). (2.11)
Since in the following text (λr ) will appear in denominators of some of the
fractions, it is useful to note here that all non-real eigenvalues of (1.1)–(1.3) are
simple and therefore (λr ) = 0 in (2.11).
In the previous section we collected some simple facts about the inner products and
norms of the eigenfunctions. But in the case of multiple eigenvalues there are also
some associated functions. In this section we will find formulae for inner products
and norms involving associated functions. These cases appear only for the real
eigenvalues so, throughout these sections we assume that all the eigenvalues and
eigenfunctions are real. In particular, we will not write complex conjugate sign that
appeared in the previous formulae.
If λk is a multiple eigenvalue (λk = λk+1 ) then for a first order associated
function yk+1 corresponding to the eigenfunction yk , following relations hold true
[7, p. 28]:
If λk is a triple eigenvalue (λk = λk+1 = λk+2 ) then together with the first order
associated function yk+1 there exists a second order associated function yk+2 for
which
(μ) (μ)
(yλ (·, λ), y(·, μ)) =cyλ (1, λ)y (1, μ) − yλ (1, λ) + y (1, λ)
λ−μ (λ − μ)2
(λ) (λ)
+ y (1, μ) − y (1, μ) . (3.2)
λ−μ (λ − μ)2
d
yk+1 yn − yk+1 yn = (λk − λn )yk+1 yn + yk yn ,
dx
which can be easily derived using the definition of yk+1 . By integrating this equality
from 0 to 1 we will obtain
1
(λk − λn )(yk+1 , yn ) + (yk , yn ) = yk+1 yn − yk+1 yn 0
.
By using the boundary conditions for yk+1 and yn , and the fact that (yk , yn ) =
cyk (1)yn (1) we obtain (3.1) again.
Lemma 3.2 If λk is a multiple eigenvalue then
(λk )
(yk+1 , yk ) = cyk+1 (1)yk (1) + yk (1) . (3.3)
2
Let us apply the procedure mentioned in the comments following the proof of (3.1)
to the functions yk+1 and yk . By integrating the identity
d
yk+1 yk − yk+1 yk = yk2 ,
dx
from 0 to 1, we will obtain
1
yk 22 = yk+1 yk − yk+1 yk 0
.
By using the boundary conditions for yk+1 and yk in the last equality we obtain (2.8)
again.
Lemma 3.3 If λk is a multiple eigenvalue then
2 (λk ) (λk )
yk+1 22 = cyk+1 (1) +
yk+1 (1) + yk (1) . (3.4)
2 6
where
yk+1 = yk+1 − C̃yk .
Proof Differentiating (3.2) with respect to μ we obtain
(μ) (μ)
(yλ (·, λ), yμ (·, μ)) = cyλ (1, λ)yμ (1, μ) − yλ (1, λ) − yλ (1, λ)
λ−μ (λ − μ)2
(μ) 2 (μ) (λ)
+ y (1, λ) + y (1, λ) + yμ (1, μ)
(λ − μ)2 (λ − μ)3 λ−μ
(λ) (λ) 2 (λ)
+ y (1, μ) − yμ (1, μ) − y (1, μ) .
(λ − μ) 2 (λ − μ) 2 (λ − μ)3
Letting μ → λk (λ = λk ) (cf. [1]) and then λ → λk we obtain (3.4).
40 Y. N. Aliyev
(μ)
(ỹk+2 , y(·, μ)) =cỹk+2 (1)y (1, μ) − ỹk+2 (1)
λk − μ
(μ) (μ)
+ ỹk+1 (1) − yk (1) ,
(λk − μ)2 (λk − μ)3
d
yk+2 yn − yk+2 yn = (λk − λn )yk+2 yn + yk+1 yn ,
dx
which can be again easily derived using the definition of yk+2 . By integrating this
equality from 0 to 1 we will obtain
1
(λk − λn )(yk+2 , yn ) + (yk+1 , yn ) = yk+2 yn − yk+2 yn 0
.
By using the boundary conditions for yk+2 and yn , and (3.1) we obtain a new proof
for (3.5).
Lemma 3.5 If λk is a triple eigenvalue then
(λk )
(yk+2 , yk ) = cyk+2 (1)yk (1) + yk (1) . (3.6)
6
Again, by applying the above mentioned procedure to the functions yk+2 and yk
we obtain
d
yk+2 yk − yk+2 yk = yk+1 yk .
dx
By integrating this equality from 0 to 1 we will obtain
1
(yk+1 , yk ) = yk+2 yk − yk+2 yk 0
.
(λk ) I V (λk )
(yk+2 , yk+1 ) = cyk+2 (1)yk+1 (1) +
yk+1 (1) + yk (1) .
6 24
By applying the above mentioned procedure one more time but now to the functions
yk+2 and yk+1 we obtain
d 2
yk+2 yk+1 − yk+2 yk+1 = yk+1 − yk+2 yk .
dx
Integration of this equality from 0 to 1 will give us
1
yk+1 22 − (yk+2 , yk ) = yk+2 yk+1 − yk+2 yk+1 0
.
which is again in perfect agreement with (3.4) and (3.6), because (λk ) = 0.
Lemma 3.7 If λk is a triple eigenvalue then
2 (λk )
yk+2 22 =cyk+2 (1) +
yk+2 (1)
6
I V (λk ) V (λk )
+
yk+1 (1) + yk (1) ,
24 120
where
yk+2 = yk+2 − C̃
yk+1 − D̃yk .
By comparing the equalities in Sects. 2 and 3 we can see that there are some
fundamental differences between the formulae for the inner products and norms of
the eigenfunctions and the corresponding formulae for the associated functions. In
this section we will prove that it is possible to find special associated functions (we
call them auxiliary associated functions) whose properties in inner products make
them more close to the eigenfunctions than the other associated functions. In the
last section, these functions will play a crucial role in our description of minimality
properties.
42 Y. N. Aliyev
yk (1) (λk ) + 3
yk+1 (1) (λk )
C1 = − ,
3yk (1) (λk )
such that
∗ ∗
(yk+1 , yk+1 ) = c(yk+1 ) (1)yk+1 (1). (4.1)
∗ ∗ (λk )
(yk+1 , yk ) = c(yk+1 ) (1)yk (1) + yk (1) .
2
We shall now concentrate on the triple eigenvalue case. Although we will not
∗
need the function yk+1 in the triple eigenvalue case, it is still worthwhile to note that
such a function does not exist in this case. Instead, we will need other associated
#
functions of yk which will be denoted by yk+1 # .
and yk+2
Lemma 4.2 If λk is a triple eigenvalue then there exists an associated function
#
yk+1 = yk+1 + C2 yk , where
yk (1) I V (λk ) + 4
yk+1 (1) (λk )
C2 = − ,
4yk (1) (λk )
for which
#
(yk+1 , yk+2 ) = c(yk+1
#
) (1)yk+2 (1).
#
(yk+1 , yn ) = c(yk+1
#
) (1)yn (1), (n = k + 1, k + 2),
# # (λk )
(yk+1 , yk+1 ) = c(yk+1 ) (1)yk+1 (1) + yk (1) .
6
Minimality Properties of Sturm-Liouville Problems 43
∗ , defined by y ∗
Note that the function yk+2 k+2 = yk+2 + C2 yk+1 , where C2 is the
same constant, also enjoys similar properties:
∗ ∗
(yk+2 , yk+1 ) = c(yk+2 ) (1)yk+1(1), (4.3)
∗ ∗
(yk+2 , yn ) = c(yk+2 ) (1)yn (1), (n = k, k + 1, k + 2), (4.4)
∗ ∗ (λk )
(yk+2 , yk ) = c(yk+2 ) (1)yk (1) + yk (1) . (4.5)
6
Lemma 4.3 If λk is a triple eigenvalue then there exists an associated function
#
yk+2 ∗
= yk+2 + D1 yk , where D1 is a constant, for which
#
(yk+2 , yk+1 ) = c(yk+2
#
) (1)yk+1(1), (4.6)
#
(yk+2 , yk+2 ) = c(yk+2
#
) (1)yk+2(1). (4.7)
where
#
It is not difficult to check that for the function yk+2 ∗
= yk+2 + D1 yk , where
6Qk
D1 = − ,
yk (1) (λk )
# #
(yk+2 , yn ) = c(yk+2 ) (1)yn (1), (n = k, k + 1, k + 2);
# # (λk )
(yk+2 , yk ) = c(yk+2 ) (1)yk (1) + yk (1) .
6
# ) (1) = 0 if and only if
We remark that (yk+2
5 I V (λk ) I V (λk ) − 4C̃ (λk ) = 4 (λk ) V (λk ) − 20D̃ (λk ) .
44 Y. N. Aliyev
In this section we will consider all possible cases of the choice of the root function
which will be deleted from the system to obtain a minimal system. In each case we
will construct explicitly a biorthogonal system.
Theorem 5.1 If all the eigenvalues of (1.1)–(1.3) are real and simple then the
system
1 yn (x) yn (1)
un (x) = . (5.3)
Bn yl (1) yl (x) yl (1)
(un , ym ) = δnm ,
{yn } (n = 0, 1, . . . ; n = k + 1),
1 yn (x) yn (1)
un (x) = (n = k, k + 1), (5.4)
Bn yk (1) yk (x) yk (1)
1 yk+1 (x) yk+1 (1)
uk (x) = .
Bk+1 yk (1) yk (x) yk (1)
∗ ) (1) = 0 then the system
Theorem 5.3 If λk is a double eigenvalue, and if (yk+1
1 yn (x) yn (1)
un (x) = ∗ ) (1) y ∗ (x) (y ∗ ) (1) (n = k, k + 1),
Bn (yk+1 k+1 k+1
1 yk (x) yk (1)
uk+1 (x) = ∗ ∗ (x) (y ∗ ) (1) .
y
Bk+1 (yk+1 ) (1) k+1 k+1
{yn } (n = 0, 1, . . . ; n = l),
1 yk (x) yk (1)
uk+1 (x) = ,
Bk+1 yl (1) yl (x) yl (1)
1 ∗ (x) (y ∗ ) (1)
yk+1
uk (x) = k+1 .
Bk+1 yl (1) yl (x) yl (1)
{yn } (n = 0, 1, . . . ; n = k + 2),
Proof The biorthogonal system is given by the formula (5.4) for n = k, k+1, k+2,
and
1 yn (x) yn (1)
un (x) = # ) (1) y # (x) (y # ) (1)
(n = k, k + 1, k + 2),
Bn (yk+1 k+1 k+1
1 yk (x) yk (1)
uk+2 (x) = # # # ,
Bk+2 (yk+1 ) (1) yk+1 (x) (yk+1 ) (1)
1 # (x) (y # ) (1)
yk+2
uk (x) = #
k+2
# (x) (y # ) (1) .
Bk+2 (yk+1 ) (1) yk+1 k+1
In analogy with Theorem 5.3, we may show that if (yk+1 # ) (1) = 0 then the function
#
yk+1 (x) is orthogonal to all the elements of the system (5.6); hence the system (5.6)
is not complete.
# ) (1) = 0 then the system
Theorem 5.7 If λk is a triple eigenvalue, and if (yk+2
1 yn (x) yn (1)
un (x) = # ) # (x) (y # ) (1) (n = k, k + 1, k + 2),
Bn (yk+2 (1) yk+2 k+2
1 yk (x) yk (1)
uk+2 (x) = # # (x) (y # ) (1) ,
y
Bk+2 (yk+2 ) (1) k+2 k+2
{yn } (n = 0, 1, . . . ; n = l),
1 yk (x) yk (1)
uk+2 (x) = ,
Bk+2 yl (1) yl (x) yl (1)
1 # (x) (y # ) (1)
yk+1
uk+1 (x) = k+1 ,
Bk+2 yl (1) yl (x) yl (1)
1 # (x) (y # ) (1)
yk+2
uk (x) = k+2 .
Bk+2 yl (1) yl (x) yl (1)
1 yn (x) yn (1)
un (x) = (n = r, s),
Bn ys (1) ys (x) ys (1)
1 yr (x) yr (1)
us (x) = .
yr (1)ys (1) (λr ) ys (x) ys (1)
1 ys (x) ys (1)
ur (x) = ,
ys (1)yl (1) (λs ) yl (x) yl (1)
1 yr (x) yr (1)
us (x) = .
yr (1)yl (1) (λr ) yl (x) yl (1)
48 Y. N. Aliyev
Remark 5.10 Using the method of the paper [5] these minimality results can also
be extended to basis properties. Then the sufficient conditions
∗ # #
(yk+1 ) (1) = 0, (yk+1 ) (1) = 0, (yk+2 ) (1) = 0
in Theorems 5.3, 5.6 and 5.7, respectively, will be necessary conditions, too.
6 Example
As an illustration of the above theory, we present a particular result for the following
problem
λ
−y = λy, 0 < x < 1, y(0) = 0, y(1) = + 1 y (1).
3
from which only the eigenfunction y0 is excluded but the associated function y1 is
included, is minimal in L2 (0, 1) if C = 21
2
.
We shall now apply√a different method to this problem. Note that for this
√
problem y(x, λ) = sin√ λx . We need λ in the denominator to make sure that
λ
Minimality Properties of Sturm-Liouville Problems 49
As was pointed out in the comments following the proof of Lemma 4.1, the
condition (y1∗ ) (1) = 0 is equivalent to (λk ) = 3C̃ (λk ). Since C = −C̃,
we obtain, once again, C = 21 2
.
3
If C = 21 2
then we obtain y1∗ (x) = − x6 + x
2 which is orthogonal to all the
elements of the system {yn } (n = 1, 2, . . .).
Acknowledgments Many thanks to ADA University and especially School of IT and Engineering
for their constant support.
References
1. Y.N. Aliyev, Minimality of the system of root functions of Sturm-Liouville problems with
decreasing affine boundary conditions. Colloq. Math. 109, 147–162 (2007)
2. P.A. Binding, P.J. Browne, Application of two parameter eigencurves to Sturm-Liouville
problems with eigenparameter-dependent boundary conditions. Proc. R. Soc. Edinb. 125A,
1205–1218 (1995)
3. P.A. Binding, P.J. Browne, B.A. Watson, Equivalence of inverse Sturm-Liouville problems with
boundary conditions rationally dependent on the eigenparameter. J. Math. Anal. Appl. 291, 246–
261 (2004)
4. A.P. Kartashev, B.L. Rojdestvenskiy, Ordinary Differential Equations and Foundations of
Calculus of Variations (Nauka, Moscow, 1980, in Russian)
5. N.B. Kerimov, Y.N. Aliyev, The basis property in Lp of the boundary value problem rationally
dependent on the eigenparameter. Stud. Math. 174, 201–212 (2006)
6. N.B. Kerimov, V.S. Mirzoev, On the basis properties of one spectral problem with a spectral
parameter in boundary conditions. Sib. Math. J. 44, 813–816 (2003)
7. M.A. Naimark, Linear Differential Operators, 2nd edn. (Nauka, Moscow, 1969, in Russian).
English trans. of 1st edn., Parts I, II (Ungar, New York, 1967, 1968)
8. A.A. Shkalikov, Boundary value problems for ordinary differential equations with a parameter
in the boundary conditions. Tr. Semin. Im. I. G. Petrovskogo 9, 190–229 (1983, in Russian)
9. A.A. Shkalikov, Basis properties of root functions of differential operators with spectral
parameter in the boundary conditions. Differ. Equ. 55, 631–643 (2019)
Scattering, Spectrum and Resonance
States Completeness for a Quantum
Graph with Rashba Hamiltonian
1 Introduction
The problem of resonances was studied during a long time starting with the
famous Rayleigh work on the Helmholtz resonator. The completeness problem for
resonance states is younger. Mathematicians dealt with it during a half of century.
The problem is related to the stability of the completeness under some perturbation.
Namely, one starts with a system with purely discrete spectrum and complete system
of eigenstates, e.g., a closed resonator. Then one perturbs the system in such a way
that eigenvalues turn to resonances and eigenstates—to resonance states, e.g., the
resonator with a boundary window [2, 7, 8, 10]. Correspondingly, a natural question
appears: is the system of quasi-eigenstates complete? One of the approaches to this
problem is related to the Sz.-Nagy functional model [14, 21, 25]. Starting with work
[1], it is known that the scattering matrix is the same as the characteristic function
from the functional model. In particular, root vectors in the functional model
Below, we will use the system of units in which h̄ = 1. The standard Lax-
Phillips approach is applied to the wave (acoustic) equation. There is a close relation
between the Schrödinger and wave cases. We will describe it briefly following [17,
Section 6.4]. Namely, it is necessary to consider the operator A2 (here A is the
generator of the evolution group U (t) for the wave equation, see below):
H 0
A =
2
.
0 H
Resonance States Completeness 53
One can see that A2 acts as the Hamiltonian H on each component of the data
of the acoustic problem. This allows one to use the “acoustic” construction for the
Schrödinger case and, as a result, comes to the relation between the Schrödinger
(S Schr ) and the acoustic (S) scattering matrices:
√
S Schr (z) = S( z).
Let us describe briefly the acoustic case. Consider the Cauchy problem for the wave
equation
ut t = uxx ,
(1.1)
u(x, 0) = u0 (x), ut (x, 0) = u1 (x), x ∈ .
The pair (u0 , u1 ) is called the Cauchy data. The unitary (in E) group U (t),
U (t)(u0 , u1 ) = (u(x, t), ut (x, t)), solves the problem (1.1). The unitary group
U (t)|t ∈R has two orthogonal (in E) subspaces, D− and D+ , called, correspondingly,
incoming and outgoing subspaces.
Definition 1.1 The outgoing (incoming) subspace D+ (D− ) is a subspace of E
having the following properties:
(a) U (t)D+ ⊂ D+ for t > 0; U (t)D− ⊂ D− for t < 0,
(b) ∩t >0 U (t)D+ = {0}; ∩t <0 U (t)D− = {0}
(c) ∪t <0 U (t)D+ = E, ∪t >0 U (t)D− = E.
The existence of the incoming and outgoing subspaces is related to the spectral
properties of the operator H . The property (c) from the definition is fulfilled if the
spectrum of the operator is continuous [17]. The operator H is self-adjoint. Using
the spectral expansion, one can obtain the continuous subspace Ec by removing the
discrete subspace (i.e. eigenspaces) from E. Below we will deal with H |Ec (we will
not introduce a new notation for this operator with the continuous spectrum). The
choice of incoming and outgoing subspaces is not unique. For the graph , one
can choose the subspace D+ containing four-component functions vanishing at the
ring 0 and satisfying the radiation condition on all leads (infinite edges). Due to
the radiating condition, one has only outgoing exponential at the edges. It gives
one properties (a), (b) for the subspace. As for (c), it takes place for Ec due to the
self-adjointness of the operator H :
∪t <0 U (t)D+ = Ec .
54 I. V. Blinova et al.
The definition of the subspace D− is analogous. Property (c) for D− takes the form
∪t >0 U (t)D− = Ec .
Z(t) = P+ U (t)P− , t ≥ 0.
The square roots of the eigenvalues of the generator B of Z(t) are resonances. The
following subspace K = E (D− ⊕ D+ ) is very important for the construction.
The operators {Z(t)}|t ≥0 map the subspace K into itself. Lax and Phillips proved
the following theorem [17].
Theorem 1.2 There is a pair of isometric maps T± : E → L2 (R, N) (the
outgoing and incoming spectral representations), N is an auxiliary space, having
the following properties:
where H±2 (N) is the Hardy space in the upper (lower) half-plane, the matrix-
function S is an inner function in C+ , and
Theorem 1.5 ([21]) Let dim N < ∞. The following statements are equivalent:
1. S is a Blaschke-Potapov product;
2.
2i
lim ln |det S(k)| dk = 0, (1.2)
r→1 (k + i)2
Cr
where
1 + r2 2r
C(r) = , R(r) = .
1 − r2 1 − r2
and after throwing away constants which are irrelevant for convergence, we obtain
the final form of the criterion (1.2), which is convenient for us and will be used
afterwards:
2π
R(r) ln(s(R(r)eit + iC(r))) it
lim e dt = 0. (1.3)
r→1 (R(r)eit + iC(r) + i)2
0
Remark 1.8 We recall that two components correspond to two directions of spin
in respect to magnetic field. It is interesting to compare the result with the
corresponding theorems for other operators. If one deals with the same geometric
graph with the Shrödinger operator, the Dirac operator or the Landau operator (the
Schrödinger operator with a magnetic field) and the Kirchhoff coupling condition
at the vertex then there is an incompleteness. But small perturbation of the coupling
condition (delta-potential at the vertex) restores the completeness. In our case the
spin-orbit interaction plays a role of such perturbation.
where ϕ is the azimuthal angle of a point on the ring, denotes the magnetic
flux encircled by the ring, 0 = h̄e , e is the electron charge, ωSO = h̄aα
is the
frequency associated with the spin-orbit interaction (with interaction constant α),
Fig. 1 Task 1 (upper row), Task 2 (lower row); directions of edges are shown
Resonance States Completeness 57
h̄2
h̄ = , m∗ is the effective mass of the electron. The radial spin operator is
2m∗ a 2
given by
0 e−iϕ
σr = σx cos ϕ + σy sin ϕ = iϕ .
e 0
∗
The constant in Zeeman term is as follows ωL = g4m eB
with g ∗ and m being the
effective gyromagnetic ratio and the free electron mass, respectively.
To construct the scattering matrix for the graph we solve two scattering problems
differing in solutions on the lines.
Task 1:
1 ikx R↑↑ −ikx T↑↑ ikx
ψ e1 ↑ (x) = e + e , ψ e2 ↑ (x) = e .
0 R↑↓ T↑↓
Task 2:
e1 ↓ 0 ikx R↓↑ −ikx e2 ↓ T↓↑ ikx
ψ (x) = e + e , ψ (x) = e .
1 R↓↓ T↓↓
For the both tasks one has the following form of the solution on the ring:
μ μ
ψ(ϕ) = aj eKj ϕ χ (μ) (ϕ),
μ=1,2 j =1,2
where
−i ϕ ϕ
1 e 2 cos θ2 1 e−i 2 sin θ2
χ (1)(ϕ) = √ ϕ
, χ (2) (ϕ) = √ ϕ
.
2π ei 2 sin θ2 2π −ei 2 cos θ2
Here θ = arctan(−α),
! 2 "1/2 1/2
2m∗ E
μ
1 AC α E
+ (−1)μ+j +1
μ
Kj = + + + , k= .
2 2π 2π 4 h̄ h̄2
58 I. V. Blinova et al.
We assume the standard coupling condition at vertex v for the spin-orbit Hamilto-
nian: the continuity of the wave functions and vanishing net spin current densities
[22]. For our graph (see Fig. 1) it has the following form:
(−1)[ej ] Dj ψj (v) = 0, ψ e1 (v) = ψ e2 (v) = · · · = ψ ej (v),
ej
where summation is over all edges ej adjacent to the vertex v, [ej ] = 0 for the
outgoing edge and [ej ] = 1 for the incoming edge. The third edge (see Fig. 1) gives
one two terms (outgoing and incoming) with
∂ α 0 e−iϕ
Dj = −i σr − , σ r = iϕ
∂ϕ 2 0 e 0
(in our case for the third edge, one has φ = 0 for the outgoing term and φ = 2π for
the incoming term). For straight semi-infinite edges (the first and the second edges
in Fig. 1) Dj = dxd
.
Remark 2.1 From mathematical point of view, these conditions ensure the self-
adjointness of the operator on the graph. In physical literature the conditions are
known as Griffith conditions (see, e.g., [13, 20]).
These conditions take the following form for our graph.
For Task 1:
e ↑ e ↑ e e
ψ1 1 (0) = ψ1 2 (0) = ψ1 3 (0) = ψ1 3 (2πa),
e ↑ e ↑ e e
ψ2 1 (0) = ψ2 2 (0) = ψ2 3 (0) = ψ2 3 (2πa),
∂ψ(0) ∂ψ(2πa) 1 T↑↑ R↑↑
−βψ(0)− +βψ(2πa)−ik +ik +ik = 0,
∂ϕ ∂ϕ 0 T↑↓ R↑↓
Resonance States Completeness 59
where
α
β=i σr − ,
2 0
ψ1 and ψ2 denote, correspondingly, the first and the second component of the
vector ψ, 2πa is the length of the ring.
For Task 2:
e ↓ e ↓
ψ1 1 (0) = ψ1 2 (0) = ψ1e3 (0) = ψ1e3 (2πa),
e ↓ e ↓
ψ2 1 (0) = ψ2 2 (0) = ψ2e3 (0) = ψ2e3 (2πa),
∂ψ(0) ∂ψ(2πa) T 0 R↓↑
−βψ(0)− +βψ(2πa)+ik ↓↑ −ik +ik = 0.
∂ϕ ∂ϕ T↓↓ 1 R↓↓
2.5 S-Matrix
Solutions of Task 1 and Task 2 give us the entries of the S-matrix (see [13]):
⎛ ⎞
R↑↑ R↓↑ T↑↑ T↓↑
⎜R↑↓ R↓↓ T↑↓ T↓↓ ⎟
S(k) = ⎜
⎝ T↑↑
⎟
T↓↑ R↑↑ R↓↑ ⎠
T↑↓ T↓↓ R↑↓ R↓↓
where
θ θ
R↑↑ = (1) cos2 + (2) sin2 − 1,
2 2
θ θ
R↑↓ = ((1) − (2) ) sin cos , (2.1)
2 2
R↓↑ = R↑↓ ,
θ θ
R↓↓ = (1) sin2 + (2) cos2 − 1
2 2
and
θ θ
T↑↑ = τ (1) cos2 + τ (2) sin2 ,
2 2
θ θ
T↑↓ = (τ (1) − τ (2)) sin cos , (2.2)
2 2
T↓↑ = T↑↓ ,
θ θ
T↓↓ = τ (1) sin2 + τ (2) cos2 ,
2 2
60 I. V. Blinova et al.
where
ŷ (μ) = 4ikaq (μ) sin(2q (μ)π) − 4(q (μ))2 cos (−1)μ+1 ω − 2 π
0
− 4(q (μ))2 cos(2q (μ)π),
) *
ωL 1 ω 2 E h̄2 k 2
SO
q (μ) = q 2 + (−1)μ , q= + , E= ,
ω 2 h̄ 2m∗
*
2
ωSO g ∗ eB α
ω= 1+ ωL = , ωSO = .
2 4m h̄a
2.6 Completeness
–5
–10
–15
–20
–25
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
Having explicit expression for the S-matrix, one can simply demonstrate the
behaviour of the integrand of (1.3) at infinity. It is shown in Fig. 2 for the main
part of the integrand:
ln(|det (S(k))|)
F (k) = .
|k|
Acknowledgments This work was partially financially supported by the Government of the
Russian Federation (grant 08-08), grant 16-11-10330 of Russian Science Foundation and grant
19-31-90164 of Russian Foundation for Basic Researches.
References
1. V.M. Adamyan, D.Z. Arov, On a class of scattering operators and characteristic operator-
functions of contractions. Dokl. Akad. Nauk SSSR 160, 9–12 (1965) (in Russian)
2. A. Aslanyan, L. Parnovski, D. Vassiliev, Complex resonances in acoustic waveguides. Q. J.
Mech. Appl. Math. 53, 429–447 (2000)
3. G. Berkolaiko, P. Kuchment, Introduction to Quantum Graphs (AMS, Providence, 2012)
62 I. V. Blinova et al.
4. I.V. Blinova, I.Y. Popov, Quantum graph with the Dirac operator and resonance states
completeness. Oper. Theor. Adv. Appl. 268, 111–124 (2018)
5. A. Chatterjee, I.Y. Popov, M.O. Smolkina, Persistent current in a chain of two Holstein–
Hubbard rings in the presence of Rashba spin-orbit interaction. Nanosyst. Phys. Chem. Math.
10, 50–62 (2019)
6. E. Dehghana, D.S. Khoshnoud, A.S. Naeimi, Logical spin-filtering in a triangular network of
quantum nanorings with a Rashba spin-orbit interaction. Physica B 529, 21–26 (2018)
7. P. Duclos, P. Exner, B. Meller, Open quantum dots: resonances from perturbed symmetry and
bound states in strong magnetic fields. Rep. Math. Phys. 47, 253–267 (2001)
8. J. Edward, On the resonances of the Laplacian on waveguides. J. Math. Anal. Appl. 272, 89–
116 (2002)
9. P. Exner, P. Keating, P. Kuchment, T. Sunada, A. Teplyaev (eds.), Analysis on Graphs and Its
Applications (AMS, Providence, 2008)
10. P. Exner, V. Lotoreichik, M. Tater, On resonances and bound states of Smilansky Hamiltonian.
Nanosyst. Phys. Chem. Math. 7, 789–802 (2016)
11. D.A. Gerasimov, I.Y. Popov, Completeness of resonance states for quantum graph with two
semi-infinite edges. Complex Var. Elliptic Equ. 63, 996–1010 (2018)
12. D. Gerasimov, I. Popov, I. Blinova, A. Popov, Incompleteness of resonance states for quantum
ring with two semi-infinite edges. Anal. Math. Phys. 9, 1287–1302 (2019)
13. O. Kálmán, P. Földi, M.G. Benedict, F.M. Peeters, Magnetoconductance of rectangular arrays
of quantum rings. Phys. Rev. B 78, 125306 (2008)
14. S.V. Khrushchev, N.K. Nikol’skii, B.S. Pavlov, Unconditional bases of exponentials and of
reproducing kernels, in Complex Analysis and Spectral Theory (Leningrad, 1979/1980), vol.
864. Lecture Notes in Mathematics (Springer, Berlin, 1981), pp. 214–335
15. V.K. Kozin, I.V. Iorsh, O.V. Kibis, I.A. Shelykh, Quantum ring with the Rashba spin-orbit
interaction in the regime of strong light-matter coupling. Phys. Rev. B 97, 155434 (2018)
16. V.V. Kudryashov, A.V. Baran, Rashba spin-orbit interaction in a circular quantum ring in the
presence of a magnetic field. Nonlinear Phenom. Complex Syst. Minsk 14(1), 89–95 (2011)
17. P.D. Lax, R.S. Phillips, Scattering Theory (Academic Press, New York, 1967)
18. J. Lipovsky, Quantum graphs and their resonance properties. Acta Physica Slovaca 66, 265–
363 (2016)
19. A.S. Naeimi, L. Eslami, M. Esmaeilzadeh, A wide range of energy spin-filtering in a Rashba
quantum ring using S-matrix method. J. Appl. Phys. 113, 044316 (2013)
20. A.S. Naeimi, L. Eslami, M. Esmaeilzadeh, M.R. Abolhassani, Spin transport properties in a
double quantum ring with Rashba spin-orbit interaction. J. Appl. Phys. 113, 014303 (2013)
21. N. Nikol’skii, Treatise on the Shift Operator: Spectral Function Theory (Springer, Berlin, 1986)
22. K. Pankrashkin, Localization effects in a periodic quantum graph with magnetic field and spin-
orbit interaction. J. Math. Phys. 47, 112105 (2006)
23. I.Y. Popov, A.I. Popov, Quantum dot with attached wires: Resonant states completeness. Rep.
Math. Phys. 80, 1–10 (2017)
24. I.Y. Popov, A.I. Popov, Line with attached segment as a model of Helmholtz resonator: resonant
states completeness. J. King Saud Univ. Sci. 29, 133–136 (2017)
25. B. Sz-Nagy, C. Foias, H. Bercovici, L. Kerchy, Harmonic Analysis of Operators on Hilbert
Space, 2nd edn. (Springer, Berlin, 2010)
Tau Functions Associated with Linear
Systems
Abstract Let (−A, B, C) be a linear system in continuous time t > 0 with input
and output space C and state space H . The function φ(x) (t) = Ce−(t +2x)AB
determines a Hankel integral operator φ(x) on L2 ((0, ∞); C); if φ(x) is trace class,
then the Fredholm determinant τ (x) = det(I + φ(x) ) defines the tau function
of (−A, B, C). Such tau functions arise in Tracy and Widom’s theory of matrix
models, where they describe the fundamental probability distributions of random
matrix theory. Dyson considered such tau functions in the inverse spectral problem
for Schrödinger’s equation −f + uf = λf , and derived the formula for the
d2
potential u(x) = −2 dx 2 log τ (x) in the self-adjoint scattering case (Commun Math
Phys 47:171–183, 1976). This paper introduces a operator function Rx that satisfies
Lyapunov’s equation dR dx = −ARx − Rx A and τ (x) = det(I + Rx ), without
x
1 Introduction
This paper is concerned with Fredholm determinants which arise in the theory of
linear systems and their application to inverse spectral problem for Schrödinger’s
equation. For φ ∈ L2 ((0, ∞); R), the Hankel integral operator corresponding to φ
is φ where
∞
φ f (x) = φ(x + y)f (y) dy (f ∈ L2 ((0, ∞); C).
0
Using the Laguerre system of orthogonal functions as in [30], one can express φ as
a matrix [γj +k ]∞ 2
j,k=1 on , which has the characteristic shape of a Hankel matrix,
and one can establish criteria for the operator to be bounded on L2 ((0, ∞); C).
Megretskii et al. [25] determined the possible spectrum and spectral multiplicity
function that can arise from a bounded and self-adjoint Hankel operator. Thus
they characterized the class of bounded self-adjoint Hankel operators up to unitary
equivalence. Their method involved introducing suitable linear systems on a state
space H , and this motivated the approach of our paper.
Following earlier works by Faddeev and others in the Russian literature, Dyson
[8] considered the inverse spectral problem for Schrödinger’s equation −f + uf =
λf , for u ∈ C 2 (R; R) that decays rapidly as x → ±∞. From the asymp-
totic solutions, he introduced a scattering function φ, considered the translations
φ(x)(y) = φ(y + 2x), and established connections with eigenvalue distributions in
random matrix theory which are described in [37]. He showed that the potential can
be recovered from the scattering data by means of the formula
d2
u(x) = −2 log det(I + φ(x) ), (1.1)
dx 2
These results were developed further by Ercolani and McKean [10] and others
[13, 38, 39] to describe the inverse spectral problem for self-adjoint Schrödinger
operators on R. Grudsky and Rybkin [17] describes the inverse scattering theory of
the KdV equation in terms of Hankel and Toeplitz operators. The latter paper uses
Sarason’s algebra H ∞ + C on the unit disc to describe compact Hankel operators.
In the current paper, we use Hankel operators within the setting of linear systems in
continuous time.
Remarkably, some of the methods of inverse scattering theory do not really need
self-adjointness. However, a significant obstacle in this approach is that Hankel
operators do not have a natural product structure, so it is unclear as to how one
can fully exploit the multiplicative properties of determinants. This paper seeks
to address this issue, by realizing Hankel operators from linear systems, and then
introducing algebras of operators on state space that reflect the properties of Hankel
operators and their Fredholm determinants. As in [25], the Lyapunov differential
equation is fundamental to the development of the theory.
Definition 1.1 (Lyapunov Equation)
(i) Let H be a complex Hilbert space, known as the state space, and L(H ) the
algebra of bounded linear operators on H with the usual operator norm. Let
(e−t A )t ≥0 be a strongly continuous (C0 ) semigroup of bounded linear operators
on H such that e−t A L(H ) ≤ M for all t ≥ 0 and some M < ∞. Let D(A)
Tau Functions Associated with Linear Systems 65
be the domain of the generator −A so that D(A) is itself a Hilbert space for the
graph norm
dRz
− = ARz + Rz A (z > 0), (1.2)
dz
dX
= −AX + BU, Y = CX.
dt
is trace class and gives the unique solution to (1.2) with the initial condition
dR
x
= −AR0 − R0 A = −BC. (1.4)
dx x=0
Definition 1.4 (Tau Function) Given a (2, 2) admissible linear system (−A, B, C),
we define
τ (x) = det(I + Rx ).
Using this general definition of τ , we can unify several results from the scattering
theory of ordinary differential equations. Under circumstances discussed in [17] and
[34], this becomes the well-known Hitota tau function of soliton theory. Such tau
functions are also strongly analogous to the tau functions introduced by Miwa et al.
[26] to describe the isomonodromy of rational differential equations and generalize
classical results on theta functions. The connection between Fredholm determinants
and rational differential equations is further described in [11] and [37]; see also [20].
The Gelfand–Levitan–Marchenko equation [12] provides the linkage between φ and
u via Rx . Consider
∞
T (x, y) + (x + y) + μ T (x, z)(z + y) dz = 0 (0 < x < y) (1.6)
x
Tau Functions Associated with Linear Systems 67
where T (x, y) and (x + y) are m × m matrices with scalar entries. In the context
of (−A, B, C) we assume that (x) = Ce−xA B is known and aim to find T (x, y).
In section two, we use Rx to construct solutions to the associated Gelfand–Levitan
equation (1.6), and introduce a potential
d2
u(x) = −2 log det(I + Rx ).
dx 2
In section three, we obtain a differential equation linking (x) to u(x). In examples
of interest in scattering theory, one can calculate det(I + λRx ) more easily than
the Hankel determinant of (x) directly [10], since Rx has additional properties
that originate from Lyapunov’s equation. In section four, we introduce a differential
algebra of operators on the state space, and a homomorphism to the differential
algebra C[u, u , . . . ] that is generated by the potential. In section five, we describe
the connection between this algebra and the stationary KdV hierarchy. There is a
fundamental connection between theta functions and equations of KdV and KP type;
see [27].
The following section proves existence and uniqueness of solutions of the Lyapunov
equation (1.2), in a style suggested by Peller [30, p. 503]. Peller discusses
scattering functions that produce bounded self-adjoint Hankel operators φ , and
their realization in terms of continuous time linear systems. He observes that in
some cases one needs a bounded semigroup with unbounded generator (−A). We
prove the uniqueness results for bounded and strongly continuous semigroups, then
specialize to holomorphic semigroups. The main application is to the Gelfand–
Levitan equation (1.6), and associated determinants.
Proposition 2.1 Let (e−t A )t ≥0 be a strongly continuous and weakly asymptotically
stable semigroup on a complex Hilbert space H , so e−t A f → 0 weakly as t → ∞
for all f ∈ H . Then
(i) St : R → e−t A Re−t A for t ≥ 0 defines a strongly continuous semigroup on
L1 (H ), which has generator (−L), with dense domain of definition D(L) such
that
L(R) = AR + RA (R ∈ D(L)).
68 G. Blower and S. L. Newsham
(ii) The linear operator L : D(L) → L1 (H ) is injective, and for each R0 ∈ D(L)
with L(R0 ) = X, there exists a weakly convergent integral
∞
R0 = e−t A Xe−t A dt. (2.1)
0
(iii) Suppose moreover that e−t0 A L(H ) < 1 for some t0 > 0. Then L : D(L) →
L1 (H ) is surjective, the integral (2.1) converges absolutely in L1 (H ) and R0
gives the unique solution to AR0 + R0 A = X.
Proof
(i) First observe that by the uniform boundedness theorem, there exists M such
that e−t A L(H ) ≤ M for all t ≥ 0, so (e−t A )t ≥0 is uniformly bounded. Also,
the adjoint semigroup (e−t A )t ≥0 is also strongly continuous and uniformly
†
∞
where Bj , Cj ∈ H satisfy XL1 (H ) = j =1 Bj H Cj H . Then
∞
∞
−t A −t A −t A
St (X) − X = (e Bj Cj e − Bj Cj e ) + (Bj Cj e−t A − Bj Cj )
j =1 j =1
d −t A
e R0 e−t A f, g = − e−t A (AR0 + R0 A)e−t A f, g = − e−t A Xe−t A f, g
dt
a continuous function of t > 0; so integrating we obtain
−sA −sA
s
R0 f, g − e R0 e f, g = e−t A Xe−t A f, g dt.
0
Tau Functions Associated with Linear Systems 69
(iii) The function t → e−t A Xe−t A takes values in the separable space L1 (H )
and is weakly continuous, hence strongly measurable, by Pettis’s theorem.
By considering the spectral radius, Engel and Nagel [9] show that there exist
δ > 0 and Mδ > 0 such that e−t A L(H ) ≤ Mδ e−δt for all t ≥ 0; hence (2.1)
converges as a Bochner–Lebesgue integral with
∞ Mδ2
Rx L1 (H ) ≤ Mδ2 XL1 (H ) e−2δt dt ≤ XL1 (H ) e−2δx .
x 2δ
as s → ∞ where
s
e−t A Xe−t A dt → Rx ;
x
A closed and densely defined linear operator −A is sectorial [9, 15] if there
exists π/2 < θ < π such that Sθ is contained in the resolvent set of −A and
|λ|(λI + A)−1 L(H ) ≤ M for all λ ∈ Sθ . Let D(A) be the domain of A and
D(A∞ ) = ∩∞ n=0 D(A ). See [15, p. 37].
n
classical results from wave equations, we can write eit A + e−it A = 2 cos(tA)
where u(x, t) = cos(tA)f (x) for f ∈ Cc∞ (R; C) is given by
1 t x+t J0 ( t 2 − (x − s)2 )
u(x, t) = f (x + t) + f (x − t) + f (y) ds,
2 2 x−t t 2 − (x − s)2
where J0 is Bessel’s function of the first kind of order zero, and u satisfies
∂ 2u ∂ 2u ∂u
− 2 = u(x, t), u(x, 0) = f (x), (x, 0) = 0.
∂x 2 ∂t ∂t
See [15, p. 121]. Note that (exp(t (iA)2j −1)) gives a unitary group on H
for j = 0, 1, 2, . . . . This can be used to deform the linear system in the
sense of Proposition 2.5(iii). Unitary deformation groups for tau functions are
considered in [26]
72 G. Blower and S. L. Newsham
Proposition 2.5
(i) In the notation of Theorem 2.3, there exists x0 > 0 such that
satisfies the integral equation (1.6) for x0 < x < y and |μ| < 1.
(ii) The determinant satisfies det(I + μRx ) = det(I + μ(x) ) and
d
μ trace Tμ (x, x) = log det(I + μRx ).
dx
(iii) Suppose that t → U (t) is a continuous function [0, 1] → L(H ) such that
U (t)A = AU (t) and U (t)L(H ) ≤ 1. Then there is a family of (2, 2)
admissible linear systems
In this section, we consider the Darboux addition rule for potentials and analyze
the transformation (−A, B, C) → (−A, B, −C) and the effect on the ratios and
derivatives of τ functions. This generalizes [10, section 3.4], and allows us to
introduce a version of the Baker–Akhiezer function for a family of linear systems
with properties that are similar to the classical case, as presented in [3] and [22].
74 G. Blower and S. L. Newsham
τζ (x)
ψζ (x) = exp ζ x . (3.1)
τ∞ (x)
(iii) Let τζ∗ (x) = τζ̄ (x̄) as in Schwarz’s reflection principle, and let
d2
− ψζ (x) + u∞ (x)ψζ (x) = −ζ 2 ψζ (x). (3.2)
dx 2
(ii) There exist hj ∈ C ∞ ((0, ∞); C) such that there is an asymptotic expansion
h1 (x) h2 (x)
ψζ (x) $ eζ x 1 + + 2
+...
ζ ζ
as ζ → ±i∞, and the expansion is uniform for x in compact subsets of (0, ∞).
Proof
(i) For all ζ ∈ C \ Spec(A), there exists x0 (ζ ) such that
τζ (x) det I + (ζ I + A)(ζ I − A)−1 Rx
=
τ∞ (x) det I + Rx
det I + (ζ I − A)−1 ((ζ I − A)Rx + ARx + Rx A)
=
det I + Rx
det I + Rx + (ζ I − A)−1 (ARx + Rx A)
= (3.3)
det I + Rx
so that when ARx + Rx A has rank one, the perturbing term (ζ I − A)−1 (ARx +
Rx A) has rank one; continuing we find
τζ (x)
= det I + (ζ I − A)−1 e−xA BCe−xA (I + Rx )−1
τ∞ (x)
= det I + Ce−xA (I + Rx )−1 (ζ I − A)−1 e−xA B
= 1 + Ce−xA (I + Rx )−1 (ζ I − A)−1 e−xA B,
∂ 2T ∂ 2T
− = u(x)T (x, y)
∂x 2 ∂y 2
2
where u(x) = −2 dx d
2 log τ (x). Then by integrating by parts, we see that ψζ
satisfies Schrödinger’s equation.
The solutions of the differential equation depend analytically on ζ at those
points where the potential depends analytically on ζ ; note that ζ → τζ (x) is
holomorphic and non zero for Rx < 1 and −ζ ∈ Sθ . Then we continue the
solutions analytically to all −ζ in the sector Sθ , on which ψζ (x) is holomorphic
as a function of ζ for x > 0.
(ii) Observe that Xθ = Sθ ∩ (−Sθ ) contains iR \ {0}. For ζ ∈ Sθ ∩ (−Sθ ), by (i)
there exist solutions ψζ (x) and ψ−ζ (x) to (3.2). In particular, ψik and ψ−ik (x)
76 G. Blower and S. L. Newsham
where the integral converges by the hypothesis of Theorem 2.3. Also, (e−zA ) is
an analytic semigroup in the sector Sθ−π/2 , so D(Aj ) is a dense linear subspace
of H for all j = 1, 2, . . . and Aj e−xA ∈ L(H ) and by Cauchy’s estimates
there exists C > 0 such that Aj e−xA L(H ) ≤ Cj !/x j for all x > 0. So we can
generate an asymptotic expansion of (3.3) with terms
1 d τ∞ 1 d
v= log , w= log τ0 τ∞ ,
μ dx τ0 μ dx
2 d2 2 d2
u∞ = − 2 2 log τ∞ , u0 = − 2 2 log τ0 .
μ dx μ dx
In the following result, we show how products and quotients of τ functions can be
linked by the Gelfand–Levitan equation for 2 × 2 matrices, and satisfy the identities
usually associated with Darboux transforms in the theory of integrable systems. See
[23].
Theorem 3.4 Let (−A, B, C) be a (2, 2)-admissible linear system with input and
output spaces C, and let φ(x) = Ce−xA B.
(i) Then there exists δ > 0 such that for all μ ∈ C such that |μ| < δ, the Gelfand-
Levitan equation (1.6) with
! " ! "
W (x, y) V (x, y) 0 φ(x + y)
T (x, y) = , (x + y) =
V (x, y) W (x, y) φ(x + y) 0
Tau Functions Associated with Linear Systems 77
and
1 d
W (x, x) = −V (x, x)2 ;
2μ dx
τ0 τ∞ − 2τ0 τ∞ + τ0 τ∞ = 0. (3.4)
Proof
(i) Let
and
! "
0 φ(x)
(x) = .
φ(x) 0
Now let
! "
1 T∞ + T0 T∞ − T0
T (x, y) =
2 T∞ − T0 T∞ + T0
so that
! "! "! "−1 ! −yA "! "
C 0 e−xA 0 I μRx e 0 0 B
T (x, y) = −
0 C 0 e−xA μRx I 0 e−yA B 0
1 d 1 d
T∞ (x, x) = log τ∞ (x), T0 (x, x) = log τ0 (x);
μ dx μ dx
d 2 d
T0 (x, x) + μ T0 (x, x) − T∞ (x, x) + T∞ (x, x) = 0, (3.5)
dx dx
78 G. Blower and S. L. Newsham
All of the terms begin with Ce−xA and end with e−xA B, and we can replace
e−xA μBCe−xA by μ(ARx + Rx A) to obtain
(3.6) = Ce−xA − 2(I − μRx )−1 A(I − μRx )−1
= 0.
This proves (3.5), and one can easily check that (3.4) is equivalent to
1 dv 1 dw
u0 (x) = + v(x)2 , v(x)2 = − .
μ dx μ dx
The entries of T satisfy the pair of coupled integral equations
∞
0 = W (x, y) + μ V (x, s)φ(s + y) ds
x
∞
0 = V (x, y) + φ(x + y) + μ W (x, s)φ(s + y) ds;
x
so W satisfies
∞
0 = −W (x, z) + μ φ(x + y)φ(y + z) dy
x
∞ ∞
2
+μ W (x, s) φ(s + y)φ(y + z) dyds,
x x
which explains how μ2 φ2 enters into several determinant formulas [37].
Tau Functions Associated with Linear Systems 79
u∞ → uζ = u∞ − 2(log ψζ ) .
d2
−2 log Wr(ψζ , ψη )
dx 2
to the potential.
Proof The definition is consistent with [10, p. 484]. In particular, ψ0 (x) =
d2
τ0 (x)/τ∞ (x), and u0 (x) = u∞ (x) − 2 dx 2 log ψ0 (x), which is consistent with (3.5).
For ζ1 = ζ2 , let "(x) = Wr(ψζ1 , ψζ2 )/ψζ2 , and observe that
" = ζ22 + u∞ − 2(log ψζ1 ) ".
This gives the basic composition rule for Mζ2 Mζ1 . The other statements follow from
Proposition 3.2 and Theorem 3.4. See [24].
d
(I + Rx )−1 = (I + Rx )−1 (ARx + Rx A)(I + Rx )−1 ,
dx
which implies
dFx
= AFx + Fx A − 2Fx AFx ,
dx
with the initial condition
dP
∂P = A(I − 2F )P + + P (I − 2F )A (P ∈ S). (4.1)
dx
(iii) Let ! · " : S → B be the linear map
and let H ∞ (Sδx0 ) the bounded holomorphic complex functions on Sδx0 . Then let
∞ = ∪ ∞ x0
H∞ x0 >0 H (Sδ ) be the algebra of complex functions which are bounded on
some translated sector Sδx0 , with the usual pointwise multiplication.
Theorem 4.4 Let (−A, B, C) be a (2, 2)-admissible linear system with H0 = C as
in Theorem 2.3, so (e−zA ) for z ∈ Sφ0 is a bounded holomorphic semigroup on H .
Let 0 = {P ∈ A : !P " = 0}.
(i) Then (A , ∗, ∂) is a differential ring with bracket !·";
(ii) there is a homomorphism of differential rings ! · " : (A , ∗, ∂) →
(H∞ ∞ , ·, d/dz);
which implies
+ ,+ ,
P Q = Ce−xA Fx P Fx e−xA BCe−xA Fx QFx e−xA B
= Ce−xA Fx P (AFx + Fx A − 2Fx AFx )QFx e−xA B
+ ,
= P (AFx + Fx A − 2Fx AFx )Q
= !P ∗ Q".
d d
Ce−xA Fx = Ce−xA Fx A(I − 2Fx ), Fx e−xA B = (I − 2Fx )AFx e−xA B,
dx dx
d + , - dP .
P = A(I − 2Fx )P + + P (I − 2Fx )A = !∂P ". (4.3)
dx dx
In this case A is possibly unbounded as an operator, so we use the holomorphic
semigroup to ensure that products (4.1) and brackets (4.2) are well defined.
We observe that A has a grading A = ⊕∞ n=1 An , where An is the span of
the elements that have total degree n when viewed as products of A and F .
For Xn ∈ An and Ym ∈ Am , we have Xn ∗ Ym ∈ An+m+2 ⊕ An+m+3 and
∂Xn ∈ An+1 ⊕ An+2 .
Also we have Ak e−zA ∈ L(H ) for all z ∈ Sφ0 and Ak e−zA L(H ) → 0
as z → ∞ in Sφ0 ; hence Rz Ak → 0 and Ak Rz → 0 in L(H ) as z → ∞
in Sφ0 . Hence there exists an increasing positive sequence (xk )∞k=0 such that
x
Ak Fz − Ak ∈ L(H ) for all z ∈ Sφk and Ak Fz − Ak → 0 in L(H ) as z → ∞ in
Sφxk . Let Xn ∈ An and consider a typical summand AFz Ak Fz . . . A in Xn ; we
replace each factor like Ak Fz by the sum of Ak (Fz − I ) and Ak where k ≤ n;
then we observe that there in an initial factor Ce−zA and a final factor e−zA B
in !Xn "; hence !Xn " determines an element of H ∞ (Sφxn ).
Tau Functions Associated with Linear Systems 83
∞ x +n
H∞ = lim H ∞ (Sφ0 ).
n→∞
∞ is consistently
By the principle of isolated zeros, the multiplication on H∞
∞
defined, and H∞ is an integral domain. Now each f ∈ H∞ ∞ gives f ∈
∞ x0 ∞ x0 +1 ∞ . From
H (Sφ ) so f ∈ H (Sφ ) by Cauchy’s estimates, so f ∈ H∞
(i) we deduce that
! · " : ⊕∞ ∞ ∞ xn
n=1 An → ∪n=1 H (Sφ )
domain.
Remark 4.5 Pöppe [31] introduced a linear functional & . ' on Fredholm kernels
K(x, y) on L2 (0, ∞) by &K' = K(0, 0). In particular, let K, G, H, L be integral
operators on L2 (0, ∞) that have smooth kernels of compact support, let = φ(x)
have kernel φ(s + t + 2x), let = dx
d
and G = ψ(x) be another Hankel operator;
then the trace satisfies
d
&' = − trace , (4.4)
dx
1 d
&KG' = − trace KG, (4.5)
2 dx
&(I + )−1 ' = −trace (I + )−1 , (4.6)
1
&K'&GL' = − &K( G + G )L', (4.7)
2
84 G. Blower and S. L. Newsham
where (4.7) is known as the product formula. The easiest way to prove (4.4)–(4.7)
is to observe that G + G is the integral operator with kernel −2φ(x)(s)ψ(x) (t),
which has rank one. These ideas were subsequently revived by McKean [24], and are
implicit in some results of [37]. Our formulas (4.2) and (4.3) incorporate a similar
idea, and are the basis of the proof of Theorem 4.4. The results we obtain appear to
be more general than those of Pöppe, and extend to periodic linear systems [6].
For the remainder of this section, we let A be a n × n complex matrix with
eigenvalues λj (j = 1, . . . , m) with geometric multiplicity nj such that λj +λk = 0
for all j, k ∈ {1, . . . , m}; let K = C(e−λ1 t , . . . , e−λm t , t). Also, let B ∈ Cn×1
and C ∈ C1×n . The formula (4.9) resembles the expressions used to obtain soliton
solutions of KdV, as in [19, (14.12.11)] and [16]. In [17, (6.25)], there is a discussion
of how the scattering data evolve under the time evolution associated with the KdV
flow.
Proposition 4.6
(i) There exists a solution Rt to Lyapunov’s equation (1.2) with R0 = BC, such
that the entries of Rt belong to K, and τ (t) ∈ K;
(ii) φ ∈ K satisfies a linear differential equation with constant coefficients.
(iii) Suppose further that all the eigenvalues of A are simple. Then there exists an
invertible matrix S such that
n
bj cj e−2λj t
τ (t) =1 +
2λj
j =1
/
n
b j cj / (λj − λk )2 −2 nj=1 λj t
+ e .
2λj (λj + λk )2
j =1 1≤j <k≤n
Proof
(i) By the hypothesis, we can introduce a chain of circles C that go once round
each λj in the positive sense and have all the points −λk in their exterior. Then
by [4], the matrix
−1
R0 = (A + λI )−1 BC(A − λI )−1 dλ
2πi C
Tau Functions Associated with Linear Systems 85
m
e−t A = qj (t)e−t λj pj (A), (4.8)
j =1
bj ck e−(λj +λk )t n
Rt =
λj + λk j,k=1
satisfies
d
Rt = −[bj ck e−(λj +λk )t ]nj,k=1 , −DRt − Rt D = −[bj ck e−(λj +λk )t ]nj,k=1 ;
dt
so Rt gives a solution of the Lyapunov equation with generator −D and initial
condition given by the rank-one matrix −S −1 BCS = −[bj ck ]nj,k=1 . Hence
the tau function is given by τ (t) = det(I + Rt ) for this matrix, and there is an
expansion
d d d d3
4 fm+1 (x) = 4f1 (x) fm (x) + 4 f1 (x)fm (x) − 3 fm (x). (5.1)
dx dx dx dx
(ii) If u satisfies fm = 0 for all m greater than or equal to some m0 , then u satisfies
the stationary KdV hierarchy and is said to be an algebro-geometric (finite
gap) potential; see [10, 11, 13, 29, 35].
(iii) Suppose that u(x) → 0 as x → ∞, and likewise for all the partial derivatives
∂ u/∂x ; suppose further that fj (x) → 0 as x → 0 as x → ∞ for
all j = 1, 2, . . . . Then we say that the fj are homogeneous solutions of
the KdV hierarchy, and we consider cases where the system of differential
equations (5.1) has no arbitrary constants of integration.
Proposition 5.2 Let A be as in Theorem 4.4]. Then
d d3 d
4 !A2m+3 " = 3 !A2m+1 " + 8 !A" !A2m+1 "
dx dx dx
d
+ 16!A" !A2m+1 " . (5.2)
dx
Tau Functions Associated with Linear Systems 87
Proof
(i) We have the basic identities
d
!A2m+1 " = !A(I − 2F )A2m+1 + A2m+1 (I − 2F )A",
dx
so
d2
!A2m+1 " = !A(I − 2F )A(I − 2F )A2m+1 + 2A(I − 2F )A2m+1 (I − 2F )A
dx 2
+ A2m+1 (I − 2F )A(I − 2F )A
− 2A(AF + AF − 2F AF )A2m+1
+ A2m+1 (I − 2F )A(I − 2F )A
d2
!A2m+1 " = 2!A(I − 2F )A2m+1(I − 2F )A" − 2!A2m+3 "
dx 2
+ 2!A(I − 2F )A(I − 2F )A2m+1"
+ 2!A2m+1 (I − 2F )A(I − 2F )A"
= 2!A(I − 2F )A2m+1(I − 2F )A" + 2!A2m+3 "
− 4!A2m+1 "!A" − 4!A"!A2m+1".
thus we obtain
d2
!A2m+1 " = 4!A(I − 2F )A(I − 2F )A2m+1 (I − 2F )A"
dx 2
+ 4!A(I − 2F )A2m+1 (I − 2F )A(I − 2F )A"
− 2!A(I − 2F )A2m+3 + A2m+3 (I − 2F )A"
= −8!A"!A2m+1(I − 2F )A" + 4!A2m+3 (I − 2F )A"
− 8!A"!A(I − 2F )A2m+1 "
d
+ 4!A(I − 2F )A2m+3 " − 2 !A2m+3 "
dx
= −8!A"!A(I − 2F )A2m+1 + A2m+1 (I − 2F )A"
d
+ 4!A(I − 2F )A2m+3 + A2m+3 (I − 2F )A" − 2 !A2m+3 "
dx
d d
= −8!A" !A2m+1 " + 2 !A2m+3 ";
dx dx
hence
d3 d d d
!A 2m+1
" = −8!A" !A 2m+1
" + 4 !A 2m+3
" − 8 !A"!A 2m+1
" ;
dx 3 dx dx dx
which gives the stated result (5.2).
Tau Functions Associated with Linear Systems 89
The notation g0 (x; ζ ) is chosen to indicate a generating function and also the
diagonal of a Green’s function; now in Theorem 5.4(iv) we explain the latter
connection. Let C+ = {λ ∈ C : (λ > 0}.
Theorem 5.4 Let (−A, B, C) be as in Theorem 2.3.
(i) Then g0 (x; ζ ) is bounded and continuously differentiable in x and has a unique
asymptotic expansion depending on the bracketed odd powers of A,
d 3 g0 dg0 du
= 4(u + ζ ) + 2 g0 (x > x0 ; −ζ > ω); (5.7)
dx 3 dx dx
(iii) there exists x1 > 0 such that
x
dy
ψ± (x, ζ ) = g0 (x, −ζ ) exp ∓ −ζ (5.8)
x1 2g0 (y; −ζ )
(iv) Suppose that u is a continuous real function that is bounded below, and that ψ±
from (iii) satisfy ψ+ (x; ζ ) ∈ L2 ((0, ∞); C) and ψ− (x; ζ ) ∈ L2 ((−∞, 0); C)
d2
for all ζ ∈ C+ . Then L = − dx 2 + u(x) defines an essentially self-adjoint
operator in L (R; C), and the Green’s function G(x, y; ζ ) which represents
2
g0 (x; −ζ )
G(x, x; ζ ) = √ .
−ζ
Proof
(i) Let π − θ < arg λ < θ , so λ and −λ both lie in Sθ , hence ζ = λ2 satisfies
2π − 2θ < arg ζ < 2θ , so ζ lies close to (−∞, 0). Then ζ I − A2 is invertible
90 G. Blower and S. L. Newsham
1
g0 (x; ζ ) = + Ce−xA (I + Rx )−1 A(ζ I − A2 )−1 (I + Rx )−1 e−xA B (x > 0)
2
is well defined by Theorem 2.3(iii).
To obtain the asymptotic expansion, we note that e−xA (I + Rx )−1 and (I +
Rx )e−xA involve the factor e−xA , where (e−zA ) is a holomorphic semigroup
on Sθ−π/2 . Hence A2j +1 e−xA ∈ L(H ) and by Cauchy’s estimates there exist
δ, x0 , M0 > 0 such that A2j +1 e−xA L(H ) ≤ M0 (2j + 1)! for all x ≥ x0 > 0,
and e−sA L(H ) ≤ M0 e−sδ . As in Proposition 3.2, we have an asymptotic
expansion of
e−zA (λI − A)−1 − (λI + A)−1
∞ ∞
= −e−zA eλs e−sA ds − e−zA e−λs e−sA ds
0 0
A A3 A2j −1
= e−zA + + ···+
λ2 λ4 λ2j
e−zA ∞
+ A2j +1 e−sA (esλ − e−λs ) ds,
λ2j +1 0
in which all the summands are in L(H ) due to the factor e−zA for z ∈ Sθ−π/2 .
Hence
∞
Ce−xA (I + Rx )−1 A2j +1 e−sA (esλ − e−sλ ) ds(I + Rx )−1 e−xA B → 0 (x > 0)
0
(0, ∞), with M, ω > 0 such that |!Ae−t A "| ≤ Meωt for t > 0, and that there
2
is a Maclaurin expansion
!A5 "t 2
!Ae−t A " = !A" − !A3 "t +
2
− · · · + O(t k )
2!
on some neighbourhood of 0+. Then by Watson’s Lemma [36, p. 188], the
integral
∞
!Ae−t A "et ζ dt
2
1 + ,
= 1 + 2A(ζ I + A2 − 2A)−1 ,
2g0 (x; −ζ )
d 2 g0 1 dg0 2 ζ
g0 − = 2(u − ζ )g02 + . (5.9)
dx 2 2 dx 2
So one can define ψ(x; ζ ) as in (5.8), and then one verifies the differential
equation for ψ(x; ζ ) by using (5.9).
(iv) By a theorem of Weyl [18, 10.1.4], L is of limit point type at ±∞, and
there exist nontrivial solutions ψ± (x; ζ ) to −ψ± (x; ζ ) + u(x)ψ± (x; ζ ) =
ζ ψ± (x; ζ ) such that ψ+ (x; ζ ) ∈ L2 (0, ∞) and ψ− (x; ζ ) ∈ L2 (−∞, 0),
and these are unique up to constant multiples. Also the inverse operator
(−ζ I + L)−1 may be represented as an integral operator in L2 (R; C) with
92 G. Blower and S. L. Newsham
Acknowledgments GB thanks Henry McKean for helpful conversations. SLN thanks EPSRC for
financially supporting this research. The authors thank the referee for drawing attention to recent
literature.
References
1. T. Aktosun, F. Demontis, C. van der Mee, Exact solutions to the focusing nonlinear Schrödinger
equation. Inverse Prob. 23, 2171–2195 (2007)
2. M.F. Atiyah, I.G. Macdonald, Introduction to Commutative Algebra (Addison Wesley, Read-
ing, 1969)
3. H.F. Baker, Abelian Functions: Abel’s Theorem and the Allied Theory of Theta Functions
(Cambridge University Press, Cambridge, 1995)
4. R. Bhatia, P. Rosenthal, How and why to solve the operator equation AX − XB = Y . Bull.
Lond. Math. Soc. 29, 1–21 (1997)
5. G. Blower, Linear systems and determinantal random point fields. J. Math. Anal. Appl. 335,
311–334 (2009)
Tau Functions Associated with Linear Systems 93
6. G. Blower, On tau functions for orthogonal polynomials and matrix models. J. Phys. A 44,
285202 (2011)
7. Y.V. Brezhnev, What does integrability of finite-gap or soliton potentials mean? Philos. Trans.
R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 366, 923–945 (2008)
8. F.J. Dyson, Fredholm determinants and inverse scattering problems. Commun. Math. Phys. 47,
171–183 (1976)
9. K.-J. Engel, R. Nagel, One Parameter Semigroups for Linear Evolution Equations (Springer,
New York, 2000)
10. N. Ercolani, H.P. McKean, Geometry of KdV. IV: Abel sums, Jacobi variety and theta function
in the scattering case. Invent. Math. 99, 483–544 (1990)
11. I.M. Gelfand, L.A. Dikii, Integrable nonlinear equations and the Liouville theorem. Funct.
Anal. Appl. 13, 6–15 (1979)
12. I.M. Gelfand, B.M. Levitan, On the determination of a differential equation from its spectral
function. Izvestiya Akad. Nauk SSSR Ser. Mat. 15, 309–360 (1951)
13. F. Gesztesy, H. Holden, Soliton Equations and Their Algebro-Geometric Solutions Volume I:
(1 + 1)-Dimensional Continuous Models (Cambridge University Press, Cambridge, 2003)
14. F. Gesztesy, B. Simon, The Xi function. Acta Math. 176, 49–71 (1996)
15. J.A. Goldstein, Semigroups of Linear Operators and Applications (Oxford University Press,
Oxford, 1985)
16. S. Grudsky, A. Rybkin, On classical solutions of the KdV equation. Proc. Lond. Math. Soc.
121, 354–371 (2020)
17. S. Grudsky, A. Rybkin, Soliton theory and Hankel operators. SIAM J. Math. Anal. 47, 2283–
2323 (2015)
18. E. Hille, Lectures on Ordinary Differential Equations (Addison-Wesley, Reading, 1968)
19. V.G. Kac, Infinite Dimensional Lie Algebras (Cambridge University Press, Cambridge, 1985)
20. V. Katsnelson, D. Volok, Rational solutions of the Schlesinger system and isoprincipal
deformations of rational matrix functions. I. Oper. Theory Adv. Appl. 149, 291–348 (2004)
21. S. Kotani, Construction of KdV flow I. τ -function via Weyl function. Zh. Mat. Fiz. Anal.
Geom. 14, 297–335 (2018)
22. I.M. Krichever, The integration of nonlinear equations by the methods of algebraic geometry.
Funct. Anal. Appl. 11, 12–26 (1977)
23. V.B. Matveev, Darboux transformation and explicit solutions of the Kadomtcev–Petviaschvily
equation, depending upon functional parameters. Lett. Math. Phys. 3, 213–216 (1979)
24. H.P. McKean, Fredholm determinants. Cent. Eur. J. Math. 9, 205–243 (2011)
25. A.V. Megretskii, V.V. Peller, S.R. Treil, The inverse spectral problem for self-adjoint Hankel
operators. Acta Math. 174, 241–309 (1995)
26. T. Miwa, M. Jimbo, E. Date, Solitons: Differential Equations, Symmetries, and Infinite
Dimensional Algebras (Cambridge University Press, Cambridge, 2000)
27. M. Mulase, Cohomological structure in soliton equations and Jacobian varieties. J. Differ.
Geom. 19, 403–430 (1984)
28. N.K. Nikolski, Operators, Functions and Systems: An Easy Reading, vol. 1 (American
Mathematical Society, Providence, 2002)
29. S. Novikov, S.V. Manakov, L.P. Pitaevskii, V.F. Zakharov, Theory of Solitons, the Inverse
Scattering Method (Consultants Bureau, New York and London, 1984)
30. V.V. Peller, Hankel Operators and Their Applications (Springer, New York, 2003)
31. C. Pöppe, The Fredholm determinant method for the KdV equations. Phys. D 13, 137–160
(1984)
32. C. Pöppe, D.H. Sattinger, Fredholm determinants and the τ function for the Kadomtsev–
Petviashvili hierarchy. Publ. Res. Inst. Math. Sci. 24, 505–538 (1988)
33. M. van der Put, M.F. Singer, Galois Theory of Linear Differential Equations (Springer, Berlin,
2003)
34. A. Rybkin, The Hirota τ -function and well-posedness of the KdV equation with an arbitrary
step-like initial profile decaying on the right half line. Nonlinearity 24, 2953–2990 (2011)
94 G. Blower and S. L. Newsham
35. G. Segal, G. Wilson, Loop groups and equations of KdV type. Inst. Hautes Études Sci. Publ.
Math. 61, 5–65 (1985)
36. I.N. Sneddon, The Use of Integral Transforms (McGraw-Hill, New York, 1972)
37. C.A. Tracy, H. Widom, Fredholm determinants, differential equations and matrix models.
Commun. Math. Phys. 163, 33–72 (1994)
38. M. Trubowitz, The inverse problem for periodic potentials. Commun. Pure Appl. Math. 30,
321–337 (1977)
39. T. Zhang, S. Venakides, Periodic limit of inverse scattering. Commun. Pure Appl. Math. 46,
819–865 (1993)
Groups of Orthogonal Matrices All
Orbits of Which Generate Lattices
Albrecht Böttcher
Abstract There are infinitely many finite groups of orthogonal matrices all orbits
of which, including those of irrational vectors, span lattices, that is, discrete additive
subgroups of the underlying Euclidean space. We show that, both up to isomorphism
and up to orthogonal similarity, exactly eight of these groups are irreducible: the
two trivial groups in one dimension, the cyclic groups of orders 3, 4, 6 in two
dimensions, and the quaternion, binary dihedral, binary tetrahedral groups in four
dimensions.
Let k ≥ 1 and let G be a finite subgroup of O(k), that is, let G be a finite group
of orthogonal k × k matrices. Suppose the order of G is n ≥ 1 and write G =
{G1 , . . . , Gn }. We think of Rk as a column space. For f ∈ Rk , consider the k × n
matrix
F = G1 f G2 f . . . Gn f (1)
A. Böttcher ()
Fakultät für Mathematik, TU Chemnitz, Chemnitz, Germany
e-mail: [email protected]
whose columns are the orbit of f under G. We denote by $(G, f ) ⊂ Rk the set of
all linear combinations of the columns of F with integer coefficients,
It follows that if $(G, f ) is a nontrivial lattice for at least one f , then each Gj ∈ G
must leave this lattice invariant. This indicates that we have to search for lattice
generating groups within the crystallographic point groups.
The 10 crystallographic point groups in R2 are the rotation groups C for =
1, 2, 3, 4, 6 and the dihedral groups D for = 1, 2, 3, 4, 6. It is not difficult to
check that the 7 groups C1 , C2 , C3 , C4 , C6 , D1 , D2 are lattice generating, whereas
the groups D3 , D4 , D6 are not lattice generating. It can be shown that exactly half
of the 32 crystallographic point groups in R3 are lattice generating. The table lists
the 32 groups (in Schoenflies notation). The 16 groups in boldface are the lattice
generating groups, the other 16 groups are not lattice generating.
C1 , C 2 , C 3 , C 4 , C 6 ,
C1h , C2h , C3h , C4h , C6h ,
C2v , C3v , C4v , C6v ,
D2 ,D3 , D4 , D6 , D2h ,D3h , D4h , D6h , D2d , D3d ,
S2 , S4 , S6 ,
T , Td , Th , O, Oh .
interesting groups are the irreducible ones. A finite group G ⊂ O(k) is said to be
irreducible if the members of the group do not share a common invariant subspace
except for {0} and all of Rk . Equivalently, G is irreducible if and only if spanR {Gf :
G ∈ G} = Rk for every f = 0, which in turn is equivalent to the requirement
that the rank of the matrix (1) is k for every f = 0. An irreducible group must in
particular have n ≥ k elements.
The direct sums of the 25 lattice generating groups we found do not yield
irreducible groups. But what about these groups themselves? It turns out that
C3 , C4 , C6 are the only irreducible lattice generating subgroups of O(2) and that
none of the 16 lattice generating subgroups of O(3) is irreducible (because each of
the latter leaves a rotation axis invariant). Tensor products behave better with regard
to irreducibility. Do we obtain more lattice generating groups in this way? Herewith
our first result.
Theorem 1.1 The tensor product of two irreducible and lattice generating groups
acting on at least two-dimensional spaces is never both irreducible and lattice
generating.
Note that the theorem is not true but becomes a triviality if one of the groups
acts on R1 : we have {1} ⊗ G = G and {1, −1} ⊗ G = −G ∪ G. Clearly, −G ∪ G is
irreducible if and only if so is G, and −G ∪ G is lattice generating if and only if G
has this property. Thus, to search for irreducible lattice generating groups we have
to proceed differently.
All unitary irreducible representations of a finite abelian group are in U(1) and
hence all irreducible representations of a finite abelian group in O(k) must have
degree k ≤ 2. Consequently, if k ≥ 3, the O(k) does not contain irreducible
finite abelian groups and thus all the more does not contain irreducible and lattice
generating abelian groups. So we are left with non-abelian groups.
Theorem 1.2 If k ≥ 2 and a subgroup of O(k) is irreducible and lattice generating,
then it is actually contained in SO(k).
In particular, irreducible Coxeter (reflection) groups are never lattice generating.
Theorem 1.3 If k ≥ 3 is odd, then SO(k) does not contain irreducible lattice
generating groups.
Eventually we have the following.
Theorem 1.4 If k ≥ 6 is even, then there are no irreducible and lattice generating
groups in SO(k). If a subgroup of SO(4) is irreducible and lattice generating, then it
is either isomorphic to the quaternion group Q8 of order 8 or to the binary dihedral
group Q12 of order 12 or to the binary tetrahedral group 2T of order 24. The
groups Q8 , Q12 , 2T have faithful irreducible and lattice generating representations
in SO(4).
The quaternion group Q8 may be given by a, b, c : a 2 = b2 = c2 = abc. The
binary dihedral group Q12 also goes under the notations Dic12 , Dic6 , or Dic3 . It has
98 A. Böttcher
a, b, c : a 3 = b2 = c2 = abc.
The binary tetrahedral group 2T is isomorphic to SL(2, 3), that is, to the 2 × 2
matrices over the field F3 with determinant 1. It is also isomorphic to the group of
units in the ring of Hurwitz integers. A group presentation of 2T is
a, b, c : a 2 = b3 = c3 = abc.
2 Proofs
We denote by (·, ·) and · the usual scalar product and norm in the Euclidean space
Rk . The transpose of a matrix A is denoted by A .
Assertion (a) of the following theorem is Corollary 10.5 of [8]. The other
assertions of the theorem were established in [1] for k ∈ {2, 3} and subsequently
in [2] for general k ≥ 2. For the reader’s convenience, we include the proof from [2].
Theorem 2.1 Suppose n ≥ k ≥ 2. Let G ⊂ O(k) be a finite irreducible group of
order n, let f ∈ Rk \ {0}, and let F be the k × n matrix (1). Then the following
hold.
(a) The matrix F F is a nonzero scalar multiple of the identity matrix,
F F = γ I.
Groups of Orthogonal Matrices and Lattices 99
(b) The set $(G, f ) is a lattice if and only if the Gram matrix
F F = ((Gj f, Gk f ))nj,k=1
the restriction β > 0 can always be achieved by changing U . We say that G has
equal blocks if all blocks of S are equal to each other. The trivial cases are G =
U I U = I and G = U (−I )U = −I .
100 A. Böttcher
(Gj f, Gk f ) ∈ γf Q
(Gf, f ) (U SU f, f ) (SU f, U f )
= = ,
(f, f ) (U U f, f ) (U f, U f )
We may without loss of generality assume that these are the first two blocks of S,
that is, S = diag(B, C, . . .). We take h = (f1 , f2 , g1 , g2 , 0, . . . , 0) ∈ Rk of norm
1 and put f = (f1 , f2 ) , g = (g1 , g2 ) . Thus, h2 = f 2 + g2 = 1. Then
Since −1 < α < 1, there are g such that this is irrational. Thus, the presence of
one 1 × 1 block forces all blocks to be 1 × 1. Finally, if S has two different 1 × 1
blocks, S = diag(1, −1, . . .), then h = (f, g, 0, . . .) with f 2 + g 2 = 1 gives
(Sh, h) = f 2 − g 2 = 2f 2 − 1, which may also be irrational.
Proof of Theorems 1.2 and 1.3 Let G ⊂ O(k) be irreducible and lattice generating.
If det G = −1 for some G = U SU ∈ G, then S must contain a 1 × 1 block (−1),
so Lemma 2.2 implies that G = −I , and thus G = {I, −I }. But this group is not
Groups of Orthogonal Matrices and Lattices 101
G = {I, B1 , . . . , Bq } (3)
(s, q ≥ 0, s + q > 0) where the ±Aj are of order 4 and satisfy (±Aj )2 = −I and
−Aj = A3j , each Bj is of order 6 and satisfies Bj3 = −I , and each −Bj is of order
3 and satisfies −Bj = Bj4 .
Proof We know from Lemma 2.2 that each G ∈ G \{I } has equal blocks. This block
is (−1), !6 , !4 , or !3 . If −I ∈
/ G, then the block cannot be !6 or !4 , since then
G3 or G2 would be −I . Thus, in this case G is of the form (3) in which the matrices
Bj have the block !3 . Now suppose −I ∈ G and let G ∈ G \ {I, −I }. If the block
of G is !4 , then G is of order 4 with G2 = −I and G3 = −G, which gives us the
elements ±Aj in (4). If the block is !6 , then G is of order 6 and we have G3 = −I
and G4 = −G. These elements G are the ±Bj in (4). Matrices with the block !3
are of the order 3. The map G → −G is a bijection of every group containing I and
−I and it changes the orders 4 and 3 to the orders 4 and 6, respectively. Thus, the
number r of elements of order 3 is equal to the number q of elements of the order
6, and since the elements −B1 , . . . , −Bq have the order 3, it follows that G must be
among these elements.
Proof of Theorem 1.1 By virtue of Theorem 1.3 we may suppose that the groups
are subgroups of O(2m) and O(2k) with m, k ≥ 1. Lemma 2.2 shows that the trace
of an element G of an irreducible and lattice generating subgroup of O(2) is
if the order of G is 6, equals 0 if G has the order 4, and is − if G is of order
3. Let G ⊂ O(2m) and H ⊂ O(2k) be irreducible and lattice generating groups.
Assume G ⊗ H is an irreducible and lattice generating subgroup of O(4mk). We
have tr (G⊗H ) = (tr G)(tr H ) for G ∈ G and H ∈ H. Thus if both G and H contain
elements G, H of orders 3 or 6, then tr (G ⊗ H ) = ±mk whereas the possible traces
of elements of G ⊗ H are ±4mk (for I, −I ), ±2mk, and 0. It follows that at least
one of the groups, say G, has no elements of the orders 3 and 6. By Corollary 2.3, G
contains a G of order 4 such that G2 = −I . If in the other group there is an element
H of order 3 or 6, then G ⊗ H has the order 12, which is impossible. Consequently,
102 A. Böttcher
W AW = diag(Uj !3 Uj ), W BW = diag(Vj !3 Vj ).
A := U W AW U = diag(!3 ) =: diag(Aj ),
B := U W BW U = diag(Zj !3 Zj ) =: diag(Bj )
with Zj ∈ O(2). But if Z ∈ O(2), then Z!3 Z = !3 if det Z = 1 and Z!3 Z = !23
if det Z = −1. It follows that each block Bj is !3 or !23 . If Bj = !3 , then the j th
block of U W ABW U is Aj Bj = !3 ·!3 = !23 , while if Bj = !23 , the j th block of
U W ABW U is Aj Bj = !3 ·!23 = I . Thus, as the number of blocks is at least two,
AB does not have equal blocks if B contains two different blocks. Therefore either
B = diag(!3 ) or B = diag(!23 ). We arrive at the conclusion that H is isomorphic
Groups of Orthogonal Matrices and Lattices 103
to the group generated by !3 alone, that is, to Z3 . Since H is isomorphic to Z23 , this
is a contradiction.
The following lemma proves part of Theorem 1.4.
Lemma 2.5 If k ≥ 4 is even and G ⊂ SO(k) is an irreducible and lattice generating
group, the G is either isomorphic to the quaternion group Q8 of order 8 or to the
binary dihedral group Q12 of order 12 or to the binary tetrahedral group 2T of
order 24.
Proof If G is a finite group and a prime number p divides the order of G, then G
contains an element of order p (Cauchy’s theorem). By Corollary 2.3, the orders of
the elements of our group are 3, 4, 6, and hence the order of G must be n = 2r 3s . The
Sylow theorems imply that if G is a finite group and p is a prime power dividing
the order of G, then G has at least one subgroup of order p . Thus, if r ≥ 4, then
G contains a subgroup of order 16. The groups Q16 and Z16 have an element of
order 8 and the other 8 groups of order 16 all have at least two elements of order
2. By Corollary 2.3, this is impossible. If s ≥ 2, the G contains a subgroup of
order 9. Lemma 2.4 shows that this subgroup cannot be isomorphic to Z23 , and it is
also impossible that it is isomorphic to Z9 , which contains elements of order 9. The
remaining possible orders are {2, 3, 4, 6, 8, 12, 24}. In what follows we permanently
employ Corollary 2.3 without mentioning this each time.
n = 2, 3, 4 These groups are abelian and hence not irreducible.
n = 6 C6 is abelian and the symmetric group S3 has 3 elements of order 2.
n = 8 C8 , C4 × C2 , C23 are abelian and D4 has more than one element of order
2. The only group remaining is Q8 .
n = 12 We could rule out the abelian groups immediately, but with the case n =
24 in mind, we argue as follows: C12 contains an element of order 12 and C3 ×C22
has 3 elements of order 2. The three non-abelian groups are A4 , the dihedral
group D6 , and the binary dihedral group Q12 . The first two of them have more
than one element of order 2, so that only Q12 is left.
n = 24 We may exclude the three abelian groups. There are 12 non-abelian
groups. Except for 2T , each of the remaining 11 groups contains a subgroup
of order 12. From our arguments to settle the case n = 12 we know that, this
time with the exception of Q12 , each of these subgroups has an element of order
12 or more than one element of order 2. The only of the 11 groups having Q12
as a subgroup is Q12 × C2 . But this group has at least three elements of order 2.
Thus, 2T is the only possible group.
Lemma 2.6 If k ≥ 6 is even and a group G ⊂ O(k) is isomorphic to Q8 , Q12 , or
2T , then G is reducible.
Proof The degrees of the irreducible representations over C of the three groups
Q8 , Q12 , 2T are
Q8 : 1, 1, 1, 1, 2, Q12 : 1, 1, 1, 1, 2, 2, 2T : 1, 1, 1, 2, 2, 2, 3, (5)
104 A. Böttcher
Q8 : 1, 1, 1, 1, 4, Q12 : 1, 1, 2, 2, 4 2T : 1, 2, 3, 4, 4. (6)
The assertion of the lemma is therefore immediate from (6). The lists (5) and the
list for Q8 in (6) are well known. The lists for Q12 and 2T in (6) are explicitly on
the website [4] and in the lecture notes [6]. An alternative proof of the lemma based
solely on (5) and the list for Q8 in (6) is as follows.
If, for a finite group, the maximal degree of an irreducible representation
over C is , then the maximal degree k of an irreducible representation over
R, that is, in O(k), satisfies k ≤ 2. It follows that Q8 and Q12 have no
irreducible representations in O(k) for even k ≥ 6 and that 2T has no irreducible
representations in O(k) for even k ≥ 8. So consider 2T ⊂ O(6).
Let H be a subgroup of O(6) which is isomorphic to Q8 . We claim that
this group has a 5-dimensional invariant subspace. The degrees of the irreducible
representations of Q8 in O(k) are 1, 1, 1, 1, 4. Thus, H is reducible in dimensions
different from 1 and 4. Taking this into account, we may proceed as in the proof of
Lemma 2.4 to get an orthogonal matrix W such that each element of W HW is of the
form diag(H1 , . . . , H ) with Hj ∈ O(mj ) and mj ∈ {1, 4} for all j . Accordingly,
R6 = X1 ⊕ · · · ⊕ X , (7)
with
10 0 −1
1= , i= .
01 1 0
This group contains I, −I and six elements of order 4. This is the case s = 3 and
q = 0 in (4). For f = (a, b, c, d) ∈ R4 \ {0} the matrix
F = G1 f G2 f . . . G8 f
is
⎛ ⎞
a −c −d b −a c d −b
⎜ b −d c −a −b d −c a ⎟
F =⎜ ⎟
⎝ c a −b −d −c −a b d ⎠ .
d b a c −d −b −a −c
X = x − p, Y = y − q, Z = z − r, W = w − s,
F ξ 2 = Av2 ≥ mv2
with some m > 0, and since mv2 does not assume values in (0, m), it follows that
F Z8 is a lattice. Finally, the span of {Gf : G ∈ G} is the span of the columns of F .
106 A. Böttcher
The first four columns form the matrix A, and since this matrix is invertible, F has
the rank 4, which shows that the columns of F span all of R4 . Thus, G is irreducible.
The group Q12 ⊂ SO(4) is faithfully represented by the 12 matrices
10 −1 0
I = , −I = ,
01 0 −1
0 −1 01
A1 = , −A1 = ,
1 0 −1 0
0 −ω 0ω
A2 = , −A2 = ,
−ω2 0 ω2 0
0 −ω2 0 ω2
A3 = , −A3 = ,
−ω 0 ω 0
2
ω 0 ω 0
B = , B2 = ,
0 −ω2 0 −ω
−ω 0 −ω2 0
B4 = , B5 = ,
0 ω2 0ω
where
√
10 α −β 1 3
1= , ω= , α= , β=− .
01 β α 2 2
This group contains I, −I , six elements of order 4, two elements of order 6 (B and
B 5 ), and two elements of order 3 (B 2 and B 4 ), that is, we have the situation s = 3
and q = 2 in (4).
That Q12 is irreducible and lattice generating can be proved as for Q8 . Given
f = (a, b, c, d) ∈ R4 \ {0}, let F be the matrix
F = G1 f G2 f . . . G12 f ,
The first four columns of F are a scalar multiple of an orthogonal matrix, which
implies that F has rank 4 and hence that Q12 is irreducible. If
ξ = (g, x, y, z, p, q, r, s, t, u, v, w) ∈ Z12 ,
Groups of Orthogonal Matrices and Lattices 107
with
F ξ 2 = CV 2 ≥ mV 2
V 2 =X2 + Y 2 + Z 2 + W 2
3 3
= (q + r − y − z)2 + (s + t − v − w)2
4 4
2 2
1 1
+ g − u + (s + w − t − v) + p − x + (z + q − y − r)
2 2
is either 0 or at least 1/4, which shows that F Z12 is discrete and thus a lattice.
The group 2T is faithfully represented in SO(4) by the eight matrices
10 0 −1
I= , A1 = , −I, −A1 ,
01 1 0
0i −i 0
A2 = , A3 = , −A2 , −A3 ,
i0 0i
with
10 0 −1
1= , i=
01 1 0
1
(±I ± A1 ± A2 ± A3 ).
2
108 A. Böttcher
We let B1 , . . . , B8 denote the latter matrices with I having the + sign and ordered
according the lexicographic order +++, ++−, +−+, +−−, . . . of the remaining
signs. The other eight matrices then are −B1 , . . . , −B8 . With these notations we are
in the situation of Corollary 2.3 with s = 3 and q = 8. The rest of the proof is as
for Q8 and Q12 . Given f = (a, b, c, d) ∈ R4 \ {0}, let F be the matrix
F = G1 f G2 f . . . G24 f
in this order. The first four columns of F are the same as the first four columns we
had for Q8 . Thus, they form scalar multiple of an orthogonal matrix, which implies
that F has rank 4 and hence that G is irreducible. Let ξ ∈ Z24 be the vector
ξ =(g1 , x1 , y1 , z1 , p1 , q1 , r1 , s1 , t1 , u1 , v1 , w1 ,
g2 , x2 , y2 , z2 , p2 , q2 , r2 , s2 , t2 , u2 , v2 , w2 )
and put
g = g1 − g2 , x = x1 − x2 , . . . , w = w1 − w2 .
with
X = 2g + p + q + r + s + t + u + v + w,
Y = −2x − p − q − r − s + t + u + v + w,
Z = −2y − p − q + r + s − t − u + v + w,
W = 2z + p − q + r − s + t − u + v − w.
with some m > 0 and since V 2 is either 0 or at least 1, we see that F Z24 is
discrete and thus a lattice.
As said, Lemmas 2.5, 2.6, 2.7 prove Theorem 1.4.
Proof of Theorem 1.5 The representation of 2T we used in the proof of Theo-
rem 1.4 is the quaternionic representation. The complex representation over R can
be constructed as follows. Put
10 0 −1 0 i −i 0
E= , I = , J = , K=
01 1 0 i0 0 i
1 1 1
B1 = (E + I + J + K), B2 = (E + I + J − K), B2 = (E + I − J + K), . . . .
2 2 2
Then the 24 matrices ±E, ±I, ±J, ±K, ±B1 , . . . , ±B8 are a faithful irreducible
representation of 2T in U(2). We denote this matrix group simply by 2T . The seven
conjugacy classes are {E}, {−E}, {±I, ±J, ±K},
C1 := {B1 , B4 , B6 , B7 }, C2 := {B2 , B3 , B5 , B8 },
√
and −C1 , −C2 . Let ω = −1/2 + i 3/2 and let : 2T → C be the homomorphism
sending ±E, ±I, ±J, ±K to 1, the elements of −C1 ∪ C1 to ω, and the elements
of −C2 ∪ C2 to ω2 . This is a representation of 2T in U(1). It follows that the map
τ : 2T → U(2), G → (G)G is a faithful representation of 2T . The representing
matrices are complex of the form A + iB. Replacing each such matrix by
A −B
,
B A
with
01 −1 0
W = , S= .
10 01
110 A. Böttcher
ξm = (xm , ym , zm , wm , pm , 0, . . . , 0) ∈ Z24
such that F ξm ∈ (0, 1/m), which shows that F Z24 is not discrete and proves that
σ (2T ) is not lattice generating.
The topic is trivial for unitary matrices, that is, for finite subgroups of U(k). A
group G ⊂ U(k) of order n is said to be lattice generating if, with F given by (1),
$(G, f ) = F Z[i]n is a discrete additive subgroup of Ck for every f ∈ Ck .
Theorem 3.1 The only irreducible and lattice generating subgroups of U(k) are
G = {1, i, −1, −i} ⊂ U(1) and its subgroups {1} and {1, −1}.
Proof Let G ⊂ U(k) be an irreducible and lattice generating group. In the complex
case, Theorem 2.1 reads as follows. Suppose n ≥ k ≥ 1. Let G ⊂ U(k) be a finite
Groups of Orthogonal Matrices and Lattices 111
irreducible group of order n, let f ∈ Ck \ {0}, and let F be the k × n matrix (1).
Then the following hold.
(a) We have F F ∗ = γ I with some real γ > 0.
(b) The set $(G, f ) is a lattice if and only if the Gram matrix F ∗ F =
((Gj f, Gk f ))nj,k=1 belongs to μQ[i]n×n for some nonzero μ ∈ C.
(c) In case $(G, f ) is a lattice, we actually have F ∗ F ∈ γ Q[i]n×n .
(d) If f = 1, then $(G, f ) is a lattice if and only if F ∗ F ∈ Q[i]n×n .
The reasoning of the proof of Lemma 2.2 shows that if G = U SU ∗ with a diagonal
matrix S, then S must have equal entries, that is, S = ωI with some ω ∈ T.
Consequently, G is composed of scalar multiples of the identity matrix. It follows
that G = {I, ωI, . . . , ωn−1 I } with ω = e2πi/n , which is reducible for k ≥ 2. So let
k = 1. For |f | = 1, the Gram matrix is F ∗ F = (ωk ωj )n−1 j,k=0 . Since ω is an entry
of this matrix, it must be in Q[i]. The only such roots of unity are ω = 1, −1, i.
Conversely, it is clear, that the (irreducible) subgroups of {1, i, −1, −i} are indeed
lattice generating.
Acknowledgments I sincerely thank Lenny Fukshansky, Christian Lehn, Josiah Park, and Dmytro
Shklyarov for stimulating and helpful discussions.
References
This work was supported by the Special Project on High-Performance Computing of the National
Key R&D Program of China (Grant No. 2016YFB0200604), the National Natural Science
Foundation of China (Grant No. 11731006) and the Science Challenge Project of China (Grant
No. TZ2018001).
V. D. Didenko ()
Department of Mathematics, SUSTech International Center for Mathematics, Southern University
of Science and Technology, Shenzhen, China
B. Silbermann
Technische Universität Chemnitz, Fakultät für Mathematik, Chemnitz, Germany
e-mail: [email protected]
1 Introduction
Toeplitz T (a) and Hankel H (b) operators appear in various fields of mathematics,
physics, statistical mechanics and they have been thoroughly studied [7, 42, 47].
Toeplitz plus Hankel operators T (a)+H (b) and Wiener-Hopf plus Hankel operators
W (a)+H (b) play an important role in random matrix theory [1, 6, 30] and scattering
theory [34–36, 43, 45, 46, 56]. Although Fredholm properties and index formulas
for such operators acting on different Banach and Hilbert spaces are often known—
cf. [7, 13, 38–41, 48–52], their invertibility is little studied. So far progress has
been made only in rare special cases. In this work, we want to present an approach,
which allows to treat invertibility problem for a wide classes of Toeplitz plus Hankel
operators on classic Hardy spaces and also for their close relatives: Toeplitz plus
Hankel operators on l p -spaces, generalized Toeplitz plus Hankel operators and
integral and difference Wiener-Hopf plus Hankel operators. The operators acting
on classical Hardy spaces are discussed in more details, whereas for other classes of
operators we only provide a brief overview of the corresponding results.
where fn , n ∈ N are the Fourier coefficients of function f . Moreover, let I denote
the identity operator, P : Lp (T) → H p (T) the projection defined by
∞
∞
P : fn einθ → fn einθ
n=−∞ n=0
We note that
J 2 = I, J P J = Q, J QJ = P , J aJ =
a,
a (t) := a(1/t),
and the operators T (a) and H (b) are related to each other as follows
We now consider the invertibility of Toeplitz plus Hankel operators T (a) + H (b)
generated by L∞ -functions a and b and acting on a Hardy space H p (T). Observe
that the matrix representation of such operators in the standard basis {t n }∞n=0 of
H p (T) is
ak−j +
T (a) + H (b) ∼ ( bk+j +1 )∞
k,j =0 .
There are a variety of approaches to the study of their invertibility and we briefly
discuss some of them.
Let L(X) and F (X) be, respectively, the sets of linear bounded and Fredholm
operators on the Banach space X. Besides, if A is a unital Banach algebra, then
GA stands for the group of all invertible elements in A.
Assume that a ∈ GL∞ , b ∈ L∞ and set
a − ba −1 d
b
V (a, b) := ,
−c a −1
where c := a −1 , d := b
b a −1. Writing T (a)±H (b)+Q for (T (a)±H (b))P +Q, we
consider the operator diag (T (a)+H (b)+Q, T (a)−H (b)+Q) on Lp (T)×Lp (T)
and represent it in the form
T (a) + H (b) + Q 0
0 T (a) − H (b) + Q
= A(T (V (a, b)) + diag (Q, Q))B (2.2)
116 V. D. Didenko and B. Silbermann
and the operator A is also known but its concrete form is not important now.
An immediate consequence of Eq. (2.2) is that both operators T (a) ± H (b) are
Fredholm if and only if the block Toeplitz operator T (V (a, b)) is Fredholm. This
representation is of restricted use because there are piecewise continuous functions
a, b such that only one of the operators T (a) ± H (b) is Fredholm. In addition, both
operators T (a) ± H (b) can be Fredholm but may have different indices. Therefore,
more efficient methods for studying Toeplitz plus Hankel operators are needed,
especially for discontinuous generating functions a and b.
Let us recall that there is a well-developed Fredholm theory for the operators
T (a) + H (b) with generating functions a, b from the set of piecewise continuous
functions P C = P C(T)—cf. [7] for operators acting on the space H 2 and [50] for
the ones on H p , p = 2. However, the defect numbers of these operators, conditions
for their invertibility, and inverse operators can be rarely determined directly from
Eq. (2.2).
a = a − χm a + , (2.3)
−1 −1
where a+ ∈ H q , a+ ∈ H p , a− ∈ H p , a− ∈ H q , χm (t) := t m , m ∈ Z, the term
−1 p
a+ P (a+ ϕ) belongs to L for any ϕ from the set of all trigonometrical polynomials
P = P(T) and there is a constant cp such that
−1
||a+ P (a+ ϕ)||p ≤ cp ||ϕ||p for all ϕ ∈ P.
Invertibility Issues for Toeplitz Plus Hankel Operators 117
−1 p
the term a+ P (a+ ϕ) belongs to LN for any ϕ ∈ PN and there is a constant cp such
that
−1
||a+ P (a+ ϕ)||p ≤ cp ||ϕ||p for all ϕ ∈ PN .
p
Moreover, if T (a) ∈ F (HN ), then
dim ker T (a) = − κj , dim coker T (a) = − κj .
κj <0 κj >0
The numbers κj , called partial indices, are uniquely defined. Moreover, in some
sense, the Wiener-Hopf factorization is unique if it exists. For example, for N = 1
the uniqueness of factorization can be ensured by the condition a− (∞) = 1. We
also note that if T (a) ∈ F (H p ) and ind T (a) ≥ 0 (ind T (a) ≤ 0) then T (a) is
right-invertible (left-invertible) and if κ := ind T (a) > 0, the functions
−1
a+ χj , j = 0, . . . , κ − 1
form a basis in the space ker T (a) and one of the right-inverses has the form
T −1 (aχκ )T (χκ ).
A comprehensive information about Wiener-Hopf factorization is provided in
[11, 44] and in books, which deal with singular integral and convolution operators
[7, 32, 32, 33]. In particular, Wiener-Hopf factorization furnishes conditions for
invertibility of related operators. However, generally there are no efficient methods
for constructing such factorizations and computing partial indices even for contin-
uous matrix-functions. Therefore, in order to study the invertibility of Toeplitz plus
Hankel operators, we have to restrict ourselves to suitable classes of generating
functions.
In the beginning of this century, Ehrhardt [28, 29] developed a factorization
theory to study invertibility for large classes of convolution operators with flips.
Toeplitz plus Hankel operators are included in this general framework. In particular,
118 V. D. Didenko and B. Silbermann
a = b
a b. (2.4)
This class of pairs of functions first appears in [4] and [13]. Equation (2.4) was
latter called the matching condition and the corresponding duo (a, b) a matching
pair—cf. [16]. Let us note that Toeplitz plus Hankel operators of the form
appear in random ensembles [1, 6, 30] and in numerical methods for singular integral
equations on intervals [37]. The generating functions of the operators (2.5) clearly
satisfy the matching condition (2.4).
It is notable that the Fredholmness of the operator T (a) + H (b) implies that
a ∈ GL∞ (T). Therefore, the term b in the matching pair (a, b) is also invertible in
L∞ (T) and one can introduce another pair (c, d), called the subordinated pair for
(a, b), with the functions c and d defined by
c := a/b = a , d := a/
b/ b = b/
a.
is a matching function.
2. If g1 , g2 are matching functions, then the product g = g1 g2 is also a matching
function.
3. If g is a matching function, then for any a ∈ GL∞ the duo (a, ag) is a matching
pair.
4. Any matching pair (a, b), a ∈ GL∞ can be represented in the form (a, ag),
where g = ab−1 is a matching function.
In this section we discuss the Basor-Ehrhardt approach to the study of defect
numbers of the operators T (a) + H (b) ∈ F (H p (T)) if a and b are piecewise
continuous functions satisfying the condition (2.4). Then we present an explicit
criterion for the Fredholmness of such operators. Recall that the circle T is
counterclockwise oriented and f ∈ P C if and only if for any t ∈ T, the one-sided
limits
The definition of d in [4] differs from the one used here. In fact, d in [4]
corresponds to d here. We keep this notation for sake of easy comparability of the
results.
Theorem 2.4 already shows the exceptional role of the endpoints +1 and −1 of
the upper semicircle T+ . To establish the index formula mentioned in [4], a more
geometric interpretation of the Fredholm conditions (2.6)–(2.8) is needed. Here
there are a few details from [4].
120 V. D. Didenko and B. Silbermann
For z1 , z2 ∈ C and θ ∈ (0, 1), we consider the open arc A(z1 , z2 , θ ) connecting
the points z1 and z2 and defined by
1 z − z1
A(z1 , z2 , θ ) := z ∈ C \ {z1 , z2 } : arg ∈ {θ + Z} .
2π z − z2
For θ = 1/2 this arc becomes a line segment and if z1 = z2 it is an empty set.
Assuming that a, b ∈ GP C, a a = b
b and using the auxiliary functions c and d,
one can show that the conditions (2.6)–(2.8) mean that any of the arcs
1 1 1 1
A 1, c+ (1); + , A c− (τ ), c+ (τ ); , A c− (−1), 1; ,
2 2p p 2p
+ (1); 1 + 1 , A (d)
A 1, (d) + (τ ); 1 , A (d)
− (τ ), (d) − (−1), 1; 1 ,
2 2q q 2q
where τ ∈ T+ , does not cross the origin. It is clear that one has to take into account
only the jump discontinuity points since c, d ∈ GP C and consequently c± (τ ) = 0
(τ ) = 0.
and (d) ±
and
−1
(1 + t)c+ (t) ∈ H q , (1 − t)c+ (t) ∈ H p ,
(1 + t)(d) −1
+ (t) ∈ H p , (1 − t)(d) + (t) ∈ H .
q
Note that the indices m, n are uniquely determined and the functions c+ and d+
are also unique up to a multiplicative constant. The representations (2.9), and (2.10)
respectively.
are called antisymmetric factorization of the functions c and d,
Theorem 2.7 (Basor and Ehrhardt [4]) If (a, b), a, b ∈ P C is a matching pair
and T (a) + H (b) ∈ F (H p ), then (a, b) satisfies the basic factorization condition.
The next result allows to determine the defect numbers of the operators T (a) +
H (b) in certain situations.
Theorem 2.8 (Basor and Ehrhardt [4]) Assume that the matching pair (a, b),
a, b ∈ L∞ satisfies the basic factorization condition in H p with m, n ∈ Z. If
T (a) + H (b) ∈ F (H p ), then
⎧
⎪
⎪ 0 if n > 0, m ≤ 0,
⎪
⎪
⎪
⎨ −n if n ≤ 0, m ≤ 0,
dim ker(T (a) + H (b)) =
⎪
⎪m − n
⎪
⎪
if n ≤ 0, m > 0,
⎪
⎩
dim ker An,m if n > 0, m > 0,
⎧
⎪
⎪ 0 if m > 0, n ≤ 0,
⎪
⎪
⎪
⎨ −m if m ≤ 0, n ≤ 0,
dim ker(T (a) + H (b))∗ =
⎪
⎪ n−m if m ≤ 0, n > 0,
⎪
⎪
⎪
⎩
dim ker(An,m ) if m > 0, n > 0,
and
Now we turn attention to another method based on the classical approach and
recently developed by the authors of this paper. Let us start with a special factor-
ization of the operator T (V (a, b)) for generating functions a and b constituting a
matching pair. If a ∈ GL∞ (T), b ∈ L∞ (T) satisfy the matching condition (2.4),
the matrix function V (a, b) in (2.2) has the form
0 d a a
V (a, b) = , c= , d= . (2.11)
a −1
−c b
b
It follows that the corresponding block Toeplitz operator T (V (a, b)) with the
generating matrix-function (2.11) can be represented in the form
0 T (d)
T (V (a, b)) =
a −1 )
−T (c) T (
−T (d) 0 0 −I −T (c) 0
= , (2.12)
0 I a −1 )
I T ( 0 I
This representations turns out to be extremely useful in the study of Toeplitz plus
Hankel operators. To show this we start with the Coburn-Simonenko theorem. For
Toeplitz operators this theorem indicates that for any non-zero a ∈ L∞ (T) one has
It follows that Fredholm Toeplitz operators with index zero are invertible. However,
in general for block-Toeplitz and for Toeplitz plus Hankel operators, Coburn-
Simonenko theorem is not true. This causes serious difficulties when studying the
invertibility of the operators involved. Nevertheless, the following theorem holds.
Theorem 2.9 Let a ∈ GL∞ (T) and A refer to one of the operators T (a) +
H (at), T (a) − H (at −1 ), T (a) ± H (a). Then
Proof For a ∈ P C(T) this result goes back to Basor and Ehrhardt [2, 3, 28] with
involved proofs. However, there is an extremely simple proof—cf. [16], based on
the representation (2.2) and valid for generating functions a ∈ GL∞ . We would
like to sketch this proof here. Thus one of the consequences of Eq. (2.2) is that
there is an isomorphism between the kernels of the operators T (V (a, b)) and
diag (T (a) + H (b), T (a) − H (b)). Let us start with the operators T (a) ± H (a).
The corresponding subordinated pairs (c, d) have the form (±1, a a −1), the third
operator in Eq. (2.12) is diag (∓I, I ), so that it does not influence the kernel and
the image of T (V (a, ±a)). Considering the two remaining operators in (2.12), we
note that the Coburn-Simonenko theorem is valid for the block Toeplitz operator
T (V (a, ±a)) and hence for T (a) ± H (a).
Consider now the operators T (a) + H (at). The duo (a, at) is a matching pair
with the subordinated pair (t −1 , d), d = aa −1 t. The operator T (t −1 ) is Fredholm
−1 −1
and ind T (t ) = 1, so that im T (t ) = H p . Besides, a direct check shows that
the function e := e(t) = 1, t ∈ T belongs to the kernels of both operator T (t −1 )
and T (a) − H (at). Assuming that dim ker T (d) > 0 and using Coburn-Simonenko
theorem for Toeplitz operators, we note that im T (d) is dense in H p . On the other
hand, the factorization (2.12) and Eq. (2.2) yield that both spaces im T (V (a, at))
and im diag (T (a) + H (at), T (a) − H (at)) are dense in H p × H p . Hence,
1 = dim ker T (t −1 )) = dim ker diag ((T (a) + H (at), T (a) − H (at)),
and since the kernel of the operator T (a) − H (at) contains the function e(t) = 1, it
follows that
3 Kernel Representations
As was already mentioned, Eq. (2.2) is of limited use in studying the Fredholmness
of the operators T (a) + H (b). However, if the generating functions a and b satisfy
the matching condition, the representation (2.2) allows to determine defect numbers
and obtain efficient representations for the kernels and cokernels of Toeplitz plus
Hankel operators. In order to present the method, we recall relevant results for oper-
ators acting on Hardy spaces—cf. [16]. Let us start with the connections between
the kernels of Toeplitz plus/minus operators and the kernels of the corresponding
block Toeplitz operators. The following lemma is a direct consequence of Eq. (2.2)
and is valid even if a and b do not constitute a matching pair.
Lemma 3.1 If a ∈ GL∞ , b ∈ L∞ and the operators T (a) ± H (b) are considered
on the space H p , 1 < p < ∞, then the following relations hold:
• If (ϕ, ψ)T ∈ ker T (V (a, b)), then
1
a −1 ψ, ϕ + J Qcϕ − J Q
(, ")T = (ϕ − J Qcϕ + J Q a −1 ψ)T (3.1)
2
∈ ker diag (T (a) + H (b), T (a) − H (b)).
defined, respectively, by Eqs. (3.1) and (3.2) are mutually inverses to each other.
Thus, if the kernel of the operator T (V (a, b)) is known, the kernels of both operators
T (a) + H (b) and T (a) − H (b) can be also determined. However, the kernels of the
Invertibility Issues for Toeplitz Plus Hankel Operators 125
operators T (V (a, b)) are known only for a few special classes of matrices V (a, b),
and in the case of general generating functions a, b ∈ L∞ the kernel T (V (a, b)) is
not available. The problem becomes more manageable if a and b form a matching
pair. In this case, V (a, b) is a triangular matrix—cf. (2.11) and the subordinated
functions c and d satisfy the equations
c = 1,
c d d = 1.
Moreover, it follows from Eq. (2.12) that for any function ϕ ∈ ker T (c), the vector
(ϕ, 0) belongs to the kernel of the operator T (V (a, b)) and the first assertion in
Lemma 3.1 shows that
Another remarkable fact is that both functions ϕ + J QcP ϕ and ϕ − J QcP ϕ also
belong to the kernel of the operator T (c). In order to show this, we need an auxiliary
result.
Lemma 3.2 Let g ∈ L∞ satisfy the relation g
g = 1 and f ∈ ker T (g). Then
1
P±
g := (I ± J QgP ) ,
2 ker T (g)
are complementary projections. This property and Eqs. (3.3) lead to the following
conclusion.
Corollary 3.3 If (c, d) is the subordinated pair for the matching pair (a, b) ∈
L∞ × L∞ , then
ker T (c) = im P− +
c im Pc ,
im P−
c ⊂ ker(T (a) + H (b)),
im P+
c ⊂ ker(T (a) − H (b)).
126 V. D. Didenko and B. Silbermann
Corollary 3.3 shows the impact of the operator T (c) on the kernels of T (a) +
H (b) and T (a) − H (b). The impact of the operator T (d) on ker(T (a) ± H (b)) is
much more involved. Thus if T (c) is right-invertible and s ∈ ker T (d), then
(Tr−1 (c)T (
a −1 )s, s) ∈ ker T (V (a, b)),
ker T (V (a, b)) = (c) (d)
where
(c) := (ϕ, 0)T : ϕ ∈ ker T (c) ,
(d) := (Tr−1 (c)T (
a −1 )s, s)T : s ∈ ker T (d) .
Taking into account this representation and using the first assertion in Lemma 3.1,
we obtain that if s ∈ ker T (d), then
The operators ϕ± can be referred as transition operators, since they transform the
kernel of T (d) into the kernels of the operators T (a)±H (b). An important property
of these operators ϕ± is expressed by the following lemma.
Lemma 3.5 Let (c, d) be the subordinated pair for a matching pair (a, b) ∈
L∞ × L∞ . If the operator T (c) is right-invertible, then for every s ∈ ker T (d)
the following relations
(P a J P )ϕ+ (s) = P+
bP + P d (s),
(P a J P )ϕ− (s) = P−
bP − P d (s),
hold. Thus the transition operators ϕ+ and ϕ− are injections on the spaces im P+
d
and im P−
d , respectively.
Using Lemmas 3.1–3.5, one can obtain a complete description for the kernel of
the operator T (a) + H (b) if (a, b) is a matching pair and T (c) is right-invertible.
Invertibility Issues for Toeplitz Plus Hankel Operators 127
Proposition 3.6 (VD and BS [16]) Let (c, d) be the subordinated pair for the
matching pair (a, b) ∈ L∞ × L∞ . If the operator T (c) is right-invertible, then
the kernels of the operators T (a) ± H (b) can be represented in the form
Remark 3.7 It was shown in [16] that the operators ϕ+ and ϕ− send the elements
of the spaces im P− + − +
d and im Pd into im Pc and im Pc , respectively. Therefore,
ϕ+ : im P− −
d → im Pc , ϕ− : im P+ +
d → im Pc
a −1 )s− , s− ∈ im P−
ϕ− (s− ) = Tr−1 (c)T ( d,
ϕ+ (s+ ) = Tr−1 (c)T (a −1 )s+ , s+ ∈ im P+
d .
Hence,
Tr−1 (c)T (
a −1 )s− = J QcP Tr−1 (c)T (
a −1 )s− − J Q
a −1 s− .
Thus
Thus in order to obtain an efficient description of the spaces ker(T (a) + H (b))
±
and ker(T (a)−H (b)), one has to characterize the projections P± c and Pd first. Such
a characterization can be derived from the Wiener-Hopf factorization (2.3) of the
subordinated functions c and d. The Wiener-Hopf factorization of these functions
can be described in more details, which yields a very comprehensive representation
±
of the kernels of T (c), T (d) and the related projections P±
c , Pd .
128 V. D. Didenko and B. Silbermann
We first consider related constructions for a matching function g such that the
operator T (g) is Fredholm on H p with the index n. One can show that under the
condition g− (∞) = 1, the factorization (2.3) takes the form
−1
g = σ (g)g+ t −n
g+ , (3.6)
−1
where σ (g) = g+ (0) = ±1 and g− = σ (g) g+ . The term σ (g) is called
factorization signature. The finding of σ (g) is a non-trivial problem but if T (g)
is invertible and g is continuous at t = 1 or t = −1, then n = 0 and σ (g) = g(1) or
σ (g) = g(−1), respectively. For piecewise continuous functions g, the term σ (g)
can be also determined.
Notice that T (a) − H (b) can be also written as Toeplitz plus Hankel operator
T (a) + H (−b). Thereby, the duo (a, −b) is again a matching pair with the
subordinated pair (−c, −d) and for the factorization signatures we have σ (−c) =
−σ (c), σ (−d) = −σ (d). This observation shows that we can restrict ourselves
to the study of Toeplitz plus Hankel operators. Nevertheless, in some cases it is
preferable to consider the operator T (a) − H (b) too. But then the leading role still
belongs to the operator T (a)+H (b) since the notions of matching pair, subordinated
pair and factorization signature are associated with this operator.
Let g stand for the subordinated function c or d, so that g g = 1. If T (g)
is Fredholm, then the factorization (3.6) exists with a function g+ satisfying the
conditions for factorization (2.3) and we can describe the spaces im P± g . This
description depends on the evenness of the index of T (g).
Theorem 3.8 (VD and BS [16]) Assume that g is a matching function, the operator
T (g) is Fredholm, ind T (g) = n ≥ 0 and g+ is the plus factor in the Wiener-Hopf
factorization (3.6) of g in H p . Then
• For n = 2r, r ∈ N, the systems of functions
−1 r−k−1
B± (g) := {g+ (t ± σ (g)t r+k ) : k = 0, 1, · · · , r − 1},
Thus if T (c), T (d) are Fredholm and T (c) is right-invertible, Proposition 3.6
provides complete description of the spaces ker(T (a) ± H (b)). On the other hand,
if T (c) is Fredholm but not right-invertible, the representation
can be used to study ker(T (a) + H (b)). This is because for any matching pair (a, b)
the duo (at −n , bt n ) is also a matching pair with the subordinated pair (ct −2n , d). A
suitable choice of n leads to the right-invertibility of the operator T (ct −2n ) and we
consequently obtain
The representation (3.7) has been used in [16], to derive the description of the
kernels of the operators T (a) ± H (b). It can be also employed to study one-sided
or generalized invertibility of Toeplitz plus Hankel operators and to construct the
corresponding one-sided and generalized inverses [17, 19, 22]. In the forthcoming
sections, invertibility problems will be discussed in more details. In this connection,
we note that the (familiar) adjoint operator (T (a) + H (b))∗ can be identified with
the operator T (a) + H ( b) acting on the space H q , 1/p + 1/q = 1. It is easily
seen that for any matching pair (a, b), the duo (a, b) is also a matching pair with
the subordinated pair (d, c) and σ (c) = σ (c), σ (d) = σ (d). Therefore, cokernel
description can be determined directly from the previous results for the kernels of
Toeplitz plus Hankel operators.
In some cases the approach above allows omitting the condition of Fredholmness
of the operator T (d). We note a few results, which can be viewed as an extension of
Coburn-Simonenko Theorem 2.9.
Proposition 3.9 (VD and BS [16]) Let (a, b) ∈ L∞ × L∞ be a matching pair with
the subordinated pair (c, d), and let T (c) be a Fredholm operator. Then:
(a) If ind T (c) = 1 and σ (c) = 1, then
(b) If ind T (
b) = −1 and σ (
b) = 1, then
(c) If ind T (
b) = 0, then
Proposition 3.11 (Šneı̆berg [54]) The set AF is open. Moreover, for each con-
nected component γ ∈ AF , the index of the operator A : Lp → Lp , p ∈ γ is
constant.
This result also holds for operators A acting on the spaces H p , 1 < p < ∞,
since any operator A : H p → H p can be identified with the operator AP + Q
acting already on Lp . For Toeplitz operators the structure of the set AF can be
characterized as follows.
Proposition 3.12 (Spitkovskiı̆ [55]) Let G be an invertible matrix–function with
entries from P C and let A := T (G). Then there is an at most countable subset
SA ⊂ (1, ∞) with the only possible accumulation points t = 1 and t = ∞ such
that AF = (1, ∞) \ SA .
Clearly, if G is piecewise continuous with only a finite number of discontinuities,
then SA is at most finite. This result can be used to describe the corresponding set
AF for Toeplitz plus Hankel operators with P C-generating functions.
Corollary 3.13 Let a, b ∈ P C and
Then there is at most countable subset SA ⊂ (1, ∞) with only possible accumula-
tion points t = 1 and t = ∞ such that AF = (1, ∞) \ SA .
Invertibility Issues for Toeplitz Plus Hankel Operators 131
Proof It follows directly from Proposition 3.12 since diag (T (a) + H (b), T (a) −
H (b)) is Fredholm if and only if so is the operator T (V (a, b)).
Thus if a, b ∈ P C and the operator T (a) + H (b) is Fredholm on H p , there is
an interval (p , p ) containing p such that T (a) + H (b) is Fredholm on all spaces
H r , r ∈ (p , p ) and the index of this operator does not depend on r. Moreover,
there is an interval (p, p0 ) ⊂ (p , p ), p < p0 such that T (a) − H (b) is Fredholm
on H r , r ∈ (p, p0 ) and its index does not depend on r. Recalling that for ν < s
the space H s is continuously embedded into H ν , we note that the kernel of T (a) +
H (b) : H r → H r does not depend on r ∈ (p , p ). The same is also true for
ker(T (a) − H (b)), r ∈ (p, p0 ). We want to note that both claims are based on the
following well-known result.
Lemma 3.14 (Gohberg and Feldman [31]) Let a Banach space B1 be continu-
ously and densely embedded into a Banach space B2 . Further, let A1 and A2 be
bounded Fredholm operators which respectively act on B1 and B2 and have equal
indices. If A2 is an extension of A1 , then
ker A1 = ker A2 .
Hence, the kernel of the operator T (a) + H (b) acting on the space H p coincides
with the kernel of this operator acting on the space H r for r ∈ (p, p0 ) and the
latter can be studied by the approach above. Therefore, if T (a) + H (b) ∈ L(H p ) is
Fredholm and a, b ∈ P C form a matching pair, the kernel of the operator T (a) +
H (b) can be described.
Let a, b ∈ L∞ be a matching pair with the subordinated pair (c, d). The pair is
called a Fredholm matching pair (with respect to H p ) if the operators T (c), T (d) ∈
p
L(H p ) are Fredholm. We write T (a) + H (b) ∈ FT H (κ1 , κ2 ) if (a, b) is a Fredholm
matching pair with the subordinated pair (c, d) such that ind T (c) = κ1 , ind T (d) =
κ2 . It was first noted in [15] that if κ1 ≥ 0, κ2 ≥ 0 or if κ1 ≤ 0, κ2 ≤ 0, then (2.2)
and (2.12) yield one-sided invertibility of the operator T (a) + H (b). However, if
κ1 κ2 < 0, the invertibility issues become more involved. We start this section by
considering the generalized invertibility of the operators T (a) + H (b), a, b ∈ L∞ .
Set
B := PV (a, b)P + Q,
Theorem 4.1 (VD and BS [17]) Assume that (a, b) is a matching pair with the
subordinated pair (c, d) and B is generalized invertible operator, which has a
generalized inverse B −1 of the form
AB
B −1 = + Q, (4.1)
D0
where A, B and D are operators acting in the space H p . Then the operator T (a) +
H (b) : H p → H p is also generalized invertible and one of its generalized inverses
has the form
(T (a) + H (b))−1
g = −H ( a −1))
c)(A(I − H (d)) − BH (
+ H (a −1 )D(I − H (d)) + T (a −1 ).
This result can now be used to construct generalized inverses for the operator
T (a) + H (b) in the following cases—cf. [17]:
(a) If κ1 ≥ 0 and κ2 ≥ 0, then B has generalized inverse of the form (4.1) with
A = Tr−1 (c)T (
a −1 )Tr−1 (d), B = −Tr−1 (c) and D = Tr−1 (d).
(b) If κ1 ≤ 0 and κ2 ≤ 0, then B has generalized inverse of the form (4.1) with
A = Tl−1 (c)T (
a −1 )Tl−1 (d), B = −Tl−1 (c) and D = Tl−1 (d).
(c) If κ1 ≥ 0 and κ2 ≤ 0, then B has generalized inverse of the form (4.1) with
a −1 )Tl−1 (d), B = −Tr−1 (c) and D = Tl−1 (d).
A = Tr−1 (c)T (
It is clear that in the cases (a) and (b), generalized inverses are, respectively, right
and left inverses. We also note that under conditions of assertion (a), a right inverse
of T (a) + H (b) can be written in a simpler form—cf. [19]
c))Tr−1 (c)T (
B := (I − H ( a −1 )Tr−1 (d) + H (a −1)Tr−1 (d). (4.2)
The proof of this result is straightforward—i.e. one can use the relations (2.1) to
verify that (T (a) + H (b))B = I . On the other hand, under conditions of (b), a
simpler representation of the left-inverse of T (a) + H (b) can be derived from (4.2)
by passing to the adjoint operator.
In addition to the cases considered, there is one more situation—viz.
(d) κ1 < 0, κ2 > 0.
This case is much more involved and we will deal with it later on. At the moment,
p
we focus on invertibility of operators from FT H (κ1 , κ2 ), 1 < p < ∞.
p
Theorem 4.2 (VD and BS [19]) Assume that T (a) + H (b) ∈ FT H (κ1 , κ2 ) is
invertible. Then:
(i) If κ1 ≥ κ2 or κ1 κ2 ≥ 0, then
c )(Tl−1 (c)T (
(T (a) + H (b))−1 = − H ( a −1 )Tl−1 (d)(I − H (d))
+ Tl−1 (c)H (
a −1 )) + H (a −1 )Tl−1 (d)(I − H (d)) + T (a −1 ).
Theorem 4.3 is, in fact, the collection of various results from [19]. On the other
hand, conditions (i)–(ix) are not necessary for the invertibility of T (a) + H (b) in
the case (4.3). Thus if κ1 = −1, κ2 = 1, then the operator T (a) + H (b) can be
invertible even if σ (c) and σ (d) do not satisfy condition (ix). This case, however,
requires special consideration and has been not yet studied.
Consider now the situation (ix) in more detail. This is a subcase of assertion (ii)
in Theorem 4.2 and a closer inspection shows substantial difference from the other
cases in Theorem 4.3. What makes it very special is the presence of the factorization
factor of c in the representation of the inverse operator. It is also worth noting that
the construction of (T (a) + H (b))−1 is more involved and requires the following
result.
Lemma 4.4 (VD and BS [19, 22]) Let C, D be operators acting on a Banach space
X. If A = CD is an invertible operator, then C and D are, respectively, right- and
left-invertible operators. Moreover, the operator D : X → im D is invertible and
if D0−1 : im D → X is the corresponding inverse, then the operator A−1 can be
represented in the form
where P0 is the projection from X onto im D parallel to ker C and Cr−1 is any right-
inverse of C.
Theorem 4.2(ii) provides necessary conditions for the invertibility of T (a) +
p
H (b) ∈ FT H (κ1 , κ2 ) for negative κ1 and positive κ2 . In the next section, we take a
closer look at the condition (iia). The related ideas can be seen as a model to study
invertibility in cases (iib)–(iid) of Theorem 4.2.
Now we would like to discuss a few examples.
Example 4.5 Let us consider the operator T (a) + H (b) in the case where a = b. In
this situation c(t) = 1 and d(t) = a(t)a −1 (t). Hence, H (
c) = 0, T (c) = I and if
ind T (d) = 0, then the operator T (a) + H (a) is also invertible and
(T (a) + H (a))−1 = (T (
a −1 ) + H (a −1 ))T −1 (a
a −1).
a ))−1 = (I − H (
(T (a) + H ( aa −1 ))T −1 (a
a −1)T (
a −1 ) + H (a −1 ).
Example 4.7 Consider the operator I +H (φ0 b), where b(t) b(t) = 1 and φ0 (t) = t,
φ0 (t) = −t −1 or ϕ0 (t) = ±1 for all t ∈ T. Assume that the operator T ( b) is
Fredholm. Corollary 3.10 indicates that if one of the conditions
(a) ind T (
b) = 0 and ϕ0 (t) = ±1,
(b) ind T (
b) = 0, σ (
b) = 1 and ϕ0 (t) = t,
(c) ind T (b) = 0, σ (
b) = 1 and ϕ0 (t) = −t −1 ,
Invertibility Issues for Toeplitz Plus Hankel Operators 135
holds, then
Therefore, if I +H (ϕ0 b) is Fredholm with index zero, then this operator is invertible.
However, for b ∈ P C, the Fredholmness of the operators T ( b) and I + H (ϕ0b) can
be studied by Theorems 2.4 and 2.5. It is also possible to construct the inverse
operator using the corresponding results on the factorization of P C-functions.
However, instead of going into details, we would like to observe that if T (b),
b ∈ L∞ is invertible, then I + T (ϕ0 b) is also invertible under the conditions of
Theorem 4.3(i), (viii), and (ix), respectively. Moreover, the inverse operator can be
explicitly constructed.
Using a distinct method, Basor and Ehrhardt [5] also proved the invertibility
of this operator on H 2 under the condition that T ( b) : H 2 → H 2 is invertible.
2
For the H -space, the invertibility of T (b) automatically follows from that of
T (b). However, if p = 2, this is not true and the corresponding examples can be
found already among operators with piecewise continuous generating functions. It
is interesting enough that in each case ϕ0 (t) = ±1, ϕ0 (t) = t or ϕ0 (t) = −t −1 , the
inverse operator can be represented in the form
(I + H (ϕ0b))−1 = (T (
b) + H (ϕ0))−1 (I + H (ϕ0
b))(T (b) + H (ϕ0 ))−1 .
p
5 Invertibility of Operators from FT H (−2n, 2n), n ∈ N
It is easily seen that the operator T (a) + H (b) can be represented in the form
where a1 = at −1 and b1 = tb. The duo (a1 , b1 ) = (at −1 , bt) is a matching pair
with the subordinated pair (c1 , d1 ) = (ct −2 , d). Hence, T (a1 )+H (b1 ) ∈ FT H (0, 2)
p
136 V. D. Didenko and B. Silbermann
and we note that T (c1 ) is invertible. The invertibility of T (c1 ) implies that both
projections P+ −
c1 are Pc1 are the zero operators. According to Remark 3.7, the
functions ϕ± admit the representations
so that
a1−1 )(im P±
ker(T (a1 ) ± H (b1)) = Tr−1 (c1 )T ( d ).
im P± −1
d = {νd+ (1 ± σ (d)t : ν ∈ C}
which respectively belong to the kernels of the operators T (a1 ) ± H (b1). It is clear
that ωa,b,± also depend on p.
Representation (5.1) shows that T (a) + H (b) has trivial kernel if and only if
It is possible only if
ω0a,b,+ = 0,
where ω0a,b,+ is the zero Fourier coefficient of the function ωa,b,+ (t). Similar result
is valid for the operator T (a) − H (b). Note that if T (a) + H (b) belongs to the set
p
FT H (−2, 2), then so is the operator T (a) − H (b), since T (a) − H (b) = T (a) +
H (−b).
p
Theorem 5.1 (VD and BS [22]) Let T (a) + H (b) ∈ FT H (−2, 2).
(a) The operator T (a) + H (b) (T (a) − H (b)) is left-invertible if and only if
ω0a,b,+ = 0 (
ω0a,b,− = 0).
(b) The operator T (a) + H (b) (T (a) − H (b)) is right-invertible if and only if
0a,b,+ = 0 (
ω ω0a,b,− = 0).
ω0a,b,+ = 0
(c) The operator T (a)+H (b) (T (a)−H (b)) is invertible if and only if
and ω0a,b,+ = 0 (
ω0a,b,− = 0 and ω0a,b,− = 0).
a,b,+ a,b,−
(d) If
ω0 = 0 and ω0 = 0, then both operators T (a) + H (b) and T (a) −
H (b) are invertible.
Invertibility Issues for Toeplitz Plus Hankel Operators 137
Let us sketch the proof of Assertion (d). It follows from the representations (2.2)
and (2.12) that
By Assertion (a), both operators T (a) + H (b) and T (a) − H (b) are left-invertible.
Therefore, ind (T (a) + H (b)) ≤ 0, ind (T (a) − H (b)) ≤ 0 and taking into account
(5.3), we obtain that
(T (a) + H (b))−1
1 −1
= T (t −1 ) I − T −1 (ct −2 )T (
a −1 t −1 )d+ (t)(1 + σ (d)t)tQt −1
ω0a,b,+
c))T −1 (t −2 c)T (
× (I − H (t 2 a −1 t −1 )Tr−1 (d) + H (a −1t)Tr−1 (d) . (5.4)
Example 5.3 We now consider the operator A = T (a) + H (t −2 a), defined by the
function
Hence, σ (d) = 1 and computing the zero Fourier coefficients of the corresponding
functions (5.2), we obtain
−2 ,+ −2 ,−
0a,at
w 0a,at
=w = γ 2 − γ + 1.
138 V. D. Didenko and B. Silbermann
It is easily seen that for any γ ∈ (0, 1) these coefficients are not equal to zero, so that
by Theorem 5.1(c), the operator T (a) + H (at −2 ) is invertible. The corresponding
inverse operator, which is constructed according to the representation (5.4), has the
form
(T (a) + H (t −2 a))−1
1
= T (t −1 ) I − 2 P (1 − γ t −1 )(1 − γ t)(1 + t) Qt −1
γ −γ +1
(1 − γ t −1 )t −1 (1 − γ t)t
× T +H
1 − γt 1 − γ t −1
× T ((1 − γ t)2 )T ((1 − γ t −1 )−2 )T (t 2 ).
Consider now the invertibility of the operators T (a) + H (b) from the set
p
FT H (−2n, 2n) for n greater than 1. Thus we assume that the subordinated operator
T (c) = T (ab−1 ) and T (d) = T (ab −1) are Fredholm and
with respect to Lp . Notice that the functions ωka,b form a basis in ϕ+ (im P+
d ), where
+
ϕ+ : im Pd → ker(T (a1 ) + H (b1 )) is defined by
ϕ+ = T −1 (ct −2n )T (
a −1 t −n )).
p
The invertibility of the operators from FT H (−2n, 2n) is described by the following
theorem.
Invertibility Issues for Toeplitz Plus Hankel Operators 139
p
Theorem 5.4 (VD and BS [22]) If T (a) + H (b) ∈ FT H (−2n, 2n), then:
(a) T (a)+H (b) is left-invertible if and only if Wn (a, b) is a non-degenerate matrix.
(b) T (a) + H (b) is right-invertible if and only if Wn (a, b) is a non-degenerate
matrix.
(c) T (a) + H (b) is invertible if and only if Wn (a, b) and Wn (a, b) are non-
degenerate matrices.
(d) If Wn (a, b) and Wn (a, −b) are non-degenerate matrices, then both operators
T (a) + H (b) and T (a) − H (b) are invertible.
Example 5.5 Consider operator T (a) + H (at −2n ), n ∈ N and
1 − γ t −1
a(t) = , γ ∈ (0, 1). (5.5)
1 − γt
It was shown in [22] that for any n ∈ N, the above operator is left-invertible in any
space H p , 1 < p < ∞. Moreover, since H (at −2n ) is compact and ind T (a) = 0,
the operator at hand is even invertible. The inverse of T (a) + H (at −2n ) can be
constructed in explicit form.
p
Remark 5.6 If m, n ∈ N and m = n, the set FT H (−2m, 2n) does not contain
invertible operators, but it still includes one-sided invertible operators.
Let l p (Z) denote the complex Banach space of all sequences ξ = (ξn )n∈N of
complex numbers with the norm
1/p
||ξ ||p = |ξn | p
, 1 ≤ p < ∞.
n∈Z
140 V. D. Didenko and B. Silbermann
As usual, Z denotes the set of all integers. If we replace Z by the set of all non-
negative integers Z+ , we get another Banach space l p (Z+ ). It can be viewed as a
subspace of l p (Z) and we will often write l p for l p (Z+ ). By P , we now denote the
canonical projection from l p (Z) onto l p (Z+ ) and let Q := I − P . Further, let J
refer to the operator on l p (Z) defined by
(J ξ )n = ξ−n−1 , n ∈ Z.
The operators J, P and Q are connected with each other by the relations
J 2 = I, J P J = Q, J QJ = P .
For each a ∈ Lp = Lp (T), let ( ak )k∈Z be the sequence of its Fourier coefficients.
The Laurent operator L(a) associated with a ∈ L∞ (T) acts on the space l 0 (Z) of
all finitely supported sequences on Z by
(L(a)x)k :=
ak−m xm , (6.1)
m∈Z
where the sum in the right-hand side of (6.1) contains only a finite number of non-
zero terms for every k ∈ Z. We say that a is a multiplier on l p (Z) if L(a)x ∈ l p (Z)
for any x ∈ l0 (Z) and if
is finite. In this case, L(a) extends to a bounded linear operator on l p (Z), which is
again denoted by L(a). The set M p of all multipliers on l p (Z) is a Banach algebra
under the norm ||a||Mp := ||L(a)||—cf. [7]. It is well-known that M 2 = L∞ (T).
Moreover, every function a ∈ L∞ (T) with bounded total variation Var (a) is in M p
for every p ∈ (1, ∞) and the Stechkin inequality
W ⊂ Cp ⊂ P Cp ⊂ P C and M p ⊂ L∞ .
Invertibility Issues for Toeplitz Plus Hankel Operators 141
For this and other properties of multiplier cf. [7]. We also recall the equation
J L(a)J = L( a ) often used in what follows.
Let a ∈ M p . The operators T (a) : l p → l p , x → P L(a)x and H (a) : l p → l p ,
x → P L(a)J x = P L(a)QJ are, respectively, called Toeplitz and Hankel operators
with generating function a. It is well-known that ||T (a)|| = ||a||M p and ||H (a)|| ≤
||a||M p for every multiplier a ∈ M p . In this section we also use the notation Tp (a)
or Hp (a) in order to underline that the corresponding Toeplitz or Hankel operator
is considered on the space l p for a fixed p ∈ (1, ∞). The action of the operators
Tp (a) and Hp (a) on the elements from l p can be written as follows
⎛ ⎞
T (a) : (ξj )j ∈Z+ → ⎝ aj −k ξk ⎠
,
k∈Z+ j ∈Z+
⎛ ⎞
H (a) : (ξj )j ∈Z+ → ⎝ aj +k+1 ξk ⎠
.
k∈Z+ j ∈Z+
The goal of this subsection is to present a method on how to study certain problems
for Toeplitz plus Hankel operator Tp (a) + Hp (b) defined on l p via known results
obtained in Sects. 2–5. Since M p ⊂ L∞ (T), for any given elements a, b ∈ M p the
operator Tp (a) + Hp (b) ∈ L(l p ) generates the operator Ts (a) + Hs (b) ∈ L(H s ),
1 < s < ∞ in an obvious manner. We denote by Lp,q (l p ) the collection of
all Toeplitz plus Hankel operators acting on the space l p such that the following
conditions hold:
(a) a, b ∈ M p .
(b) Tp (a) + Hp (b) ∈ L(l p ) is Fredholm.
(c) Tq (a) + Hq (b) ∈ L(H q ), 1/p + 1/q = 1 is Fredholm.
(d) Both operators acting on the spaces mentioned have the same Fredholm index.
142 V. D. Didenko and B. Silbermann
The following observation is crucial for what follows. The famous Hausdorff-Young
Theorem connects the spaces l p and H q , 1/p + 1/q = 1 via Fourier transform
F (a) = (
an )n∈Z , a ∈ H q .
Theorem 6.2 (Hausdorff and Young [27])
(a) If g ∈ H p and 1 ≤ p ≤ 2, then F g ∈ l q and
||ϕ̆||q ≤ ||ϕ||p .
embedded into H q .
Theorem 6.3 (VD and BS [21]) Let p ∈ (1, ∞). If Tp (a) + Hp (b) ∈ Lp,q , then
the Fourier transform F is an isomorphism between the spaces ker(Tq (a) + Hq (b))
and ker(Tp (a) + Hp (b)).
Let us give a sketch of the proof. The matrix representation [Tq (a) + Hq (b)] of
the operator Tq (a) + Hq (b) in the standard basis (t n )n∈Z+ induces a linear bounded
operator on Hq , so that
ai−j +
[Tq (a) + Hq (b)] = ( bi+j +1 )∞
i,j =0
The operators Tq (a) + Hq (b) ∈ L(H q ) and [Tq (a) + Hq (b)] ∈ L(H q ) have the
same Fredholm properties as Tp (a) + Hp (b) ∈ L(l ) for 1 < q ≤ 2. Moreover, the
p
operator Tp (a) + Hp (b) ∈ L(l p ) is the extension of [Tq (a) + Hq (b)] with the same
index. However, in this case Lemma 3.14 then indicates that
and for p ≥ 2 the assertion follows. The case 1 < p ≤ 2 can be treated analogously.
Thus the main problem in using Theorem 6.2 is whether it is known that Tp (a) +
Hp (b) ∈ Lp,q . This is a non-trivial fact but the following result holds.
Proposition 6.4 (VD and BS [21]) Let a, b ∈ P Cp , 1 < p < ∞. If the operator
Tp (a) + Hp (b) ∈ L(l p ) is Fredholm, then Tp (a) + Hp (b) ∈ Lp,q .
Invertibility Issues for Toeplitz Plus Hankel Operators 143
The proof of this proposition can be carried out using ideas from [50] and [51].
However, a simpler proof can be obtained if the generating function a and b satisfy
the matching condition a a = bb and the operators Tp (c), Tp (d) are Fredholm say
in l . Then Tp (c), Tp (d) ∈ Lp,q —cf. [7, Chapters 4 and 6], and using classical
p
l = 0, · · · , (κ1 − 1)/2}.
l = 0, 1, · · · , κ1 /2 − 1}.
Remark 6.6 Sometime the study of invertibility of Toeplitz plus Hankel operators
Tp (a) + Hp (b) can be carried out even if it is not known, whether this operator
belongs to Lp,q . Thus l p -versions of Theorem 2.9, Proposition 3.9 and Corol-
lary 3.10 can be directly proved.
This section is devoted to Wiener-Hopf plus Hankel operators. Let χE refer to the
characteristic function of the subset E ⊂ R and let F and F −1 be the direct and
144 V. D. Didenko and B. Silbermann
In what follows, we identify the spaces Lp (R+ ) and Lp (R− ), 1 ≤ p ≤ ∞ with the
subspaces P (Lp (R)) and Q(Lp (R)) of the space Lp (R), where P and Q are the
projections in Lp (R) defined by Pf := χR+ f and Q := I − P , respectively.
Consider the set G of functions g having the form
g(t) = (F k)(t) + gj eiδj t , t ∈ R, (7.1)
j ∈Z
Here and throughout this section, J : Lp (R) → Lp (R) is the reflection operator
defined by J ϕ := ϕ and
ϕ (t) := ϕ(−t) for any ϕ ∈ Lp (R), p ∈ [1, ∞]. Operators
W (g) and H (g) are, respectively, called Wiener-Hopf and Hankel operators on the
semi-axis R+ . It is well-known [31] that for g ∈ G, all three operators above are
bounded on any space Lp , p ∈ [1, ∞).
In particular, the operator W (g) has the form
∞
∞
W (g)ϕ(t) = gj Bδj ϕ(t) + k(t − s)ϕ(s) ds, t ∈ R+ ,
j =−∞ 0
where
if δ > 0.
For various classes of generating functions, the Fredholm properties of operators
W (a) are well studied [7, 8, 12, 24–26, 31]. In particular, Fredholm and semi-
Fredholm Wiener-Hopf operators are one-sided invertible, and for a ∈ G there
is efficient description of the kernels and cokernels of W (a) and formulas for the
corresponding one-sided inverses.
The study of Wiener-Hopf plus Hankel operators
The assumption (7.3) allows us to employ the method developed in Sects. 2–5 and
establish necessary and also sufficient conditions for invertibility and one-sided
invertibility of the operators under consideration and obtain efficient representations
for the corresponding inverses. Note that here we only provide the main results. For
more details the reader can consult [14, 20, 23].
Let G+ ⊂ G and G− ⊂ G denote the sets of functions, which admit holomorphic
extensions to the upper and lower half-planes, respectively. If g ∈ G is a matching
function—i.e. g g = 1, then according to [14, 20], it can be represented in the form
t − i n
−1
g(t) = σ (g)
g+ (t) eiνt g+ (t),
t +i
±1
where ν = ν(g) ∈ R, n = n(g) ∈ N, σ (g) = (−1)n g(0), g+ (t) ∈ G− and
g+ (∞) = 1. This representation is unique and the numbers ν(g) and n(g) in the
representation (7.1) are defined as follows:
1 1
ν(g) := lim [arg g(t)]l−l , n(g) := [arg(1 + g −1 (t)(F k)(t)]∞
t =−∞ .
l→∞ 2l 2π
Moreover, following the considerations of Sects. 2–3, for any right-invertible oper-
ator W (g) generated by a matching function g, we can introduce complementary
projections P±g on ker W (g). More precisely, if ν < 0 or if ν = 0 and n < 0, then
W (g) is right-invertible and the projections P±
g are defined by
P±
g := (1/2)(I ± J QP W (g)P ) : ker W (g) → ker W (g).
0
1 −1
ϕ + = ϕ + (a, b) := (W (c)W (ã −1 ) − J QW 0 (c)P Wr−1 (c)W (ã −1 ))
2 r
1
+ J QW 0 (ã −1 ),
2
We are ready to discuss the invertibility of Wiener-Hopf plus Hankel operators
starting with necessary conditions in the case where at least one of the indices
ν1 := ν(c) or ν2 := ν(d) is not equal to zero. The situation ν1 = ν2 = 0 will
be considered later on. Let n1 and n2 denote the indices n(c) and n(d), respectively.
Theorem 7.1 (VD and BS [23]) If a, b ∈ G, the operator W (a) + H (b) is one-
sided invertible in Lp (R+ ) and at least one of the indices ν1 or ν2 is not equal to
zero, then:
(i) Either ν1 ν2 ≥ 0 or ν1 > 0 and ν2 < 0.
(ii) If ν1 = 0 and ν2 > 0, then n1 > −1 or n1 = −1 and σ (c) = −1.
(iii) If ν1 < 0 and ν2 = 0, then n2 < 1 or n2 = 1 and σ (d) = −1.
Invertibility Issues for Toeplitz Plus Hankel Operators 147
Consider now the case ν1 > 0 and ν2 < 0 in more detail. It can be specified as
follows.
p
Theorem 7.2 (VD and BS [23]) Let ν1 > 0, ν2 < 0, n1 = n2 = 0 and Nν , ν > 0
denote the set of functions f ∈ Lp (R+ ) such that f (t) = 0 for t ∈ (0, ν).
(i) If the operator W (a) + H (b) : Lp (R+ ) → Lp (R+ ), 1 < p < ∞ is invertible
from the left, then
ϕ + (P+
p
d ) ∩ Nν1 /2 = {0},
ϕ + (P+
p
c ) ∩ N−ν2 /2 = {0}, (7.4)
n1 ≥ −1
n1 ≥ 1,
Theorem 7.4 (VD and BS [23]) Let a, b ∈ G and indices ν1 , ν2 , n1 and n2 satisfy
any of the following conditions:
(i) ν1 < 0 and ν2 < 0.
(ii) ν1 > 0, ν2 < 0, n1 = n2 = 0, operator W (a) + H (b) is normally solvable
and satisfies the condition (7.4).
(iii) ν1 < 0, ν2 = 0 and n2 < 1 or n2 = 1 and σ (d) = −1.
(iv) ν1 = 0, n1 ≤ 0 and ν2 < 0.
(v) ν1 = 0 and ν2 = 0
(a) n1 ≤ 0, n2 < 1;
(b) n1 ≤ 0, n2 = 1 and σ (d) = −1;
Then the operator W (a) + H (b) is right-invertible.
The sufficient conditions for left-invertibility of W (a) + H (b) can be obtained
by passing to the adjoint operator. Here we are not going to discuss this problem
in whole generality. However, we use assumptions, which allow to derive simple
formulas for left- or right-inverses. These conditions are not necessary for one-
sided invertibility and the corresponding inverses can be also constructed even if
the conditions above are not satisfied.
Theorem 7.5 (VD and BS [23]) Let (a, b) ∈ G × G be a matching pair. Then:
1. If W (c) and W (d) are left-invertible, the operator W (a) + H (b) is also left-
invertible and one of its left-inverses has the form
(W (a) + H (b))−1 −1 −1 −1
l =Wl (c)W (ã )Wl (d)(I − H (d̃))
2. If W (c) and W (d) are right-invertible, the operator W (a) + H (b) is also right-
invertible and one of its right-inverses has the form
(W (a) + H (b))−1 −1 −1 −1
r =(I − H (c̃))Wr (c)W (ã )Wr (d)
Here we briefly discuss generalized Toeplitz plus Hankel operators. These operators
are similar to classical Toeplitz plus Hankel operators considered in Sect. 2, but the
flip operator J is replaced by another operator Jα generated by a linear fractional
shift α. It turns out that the classical approach of Sects. 2–5 can also be used but
the application of the method is not straightforward and requires solving various
specific problems. Therefore, in this section we mainly focus on the description of
the kernels and cokernels of the corresponding operators. These results lay down
foundation for the invertibility study. We also feel that the Basor-Ehrhardt method
can be realized in this situation, but we are not going to pursue this problem here.
The following construction is based on the considerations of [18]. Let S denote
the Riemann sphere. We consider a mapping α : S → S defined by
z−β
α(z) := , (8.1)
βz − 1
α(D) = S2 \ D, α(S2 \ D) = D.
Jα ϕ(t) := t −1 α− (t)ϕ(α(t)), t ∈ T.
Further, for a ∈ L∞ let aα denote the composition of the functions a and α—i.e.
aα (t) := a(α(t)), t ∈ T.
The operators Jα , P , Q and aI are connected with each other by the equations
Jα2 = I, Jα aI = aα Jα , Jα P = QJα , Jα Q = P Jα ,
and for any n ∈ Z, one has (a n )α = (aα )n := aαn . In addition to the Toeplitz
operator T (a), any element a ∈ L∞ defines another operator Hα := P αQJα ,
called generalized Hankel operator. Generalized Hankel operators are similar to
classical Hankel operators H (a). For example, analogously to (2.1) operators Hα
are connected with Toeplitz operators by the relations
On the space Lp , we now consider the operators of the form T (a) + Hα (b) and call
them generalized Toeplitz plus Hankel operators generated by the functions a, b and
the shift α.
The classical approach of Sect. 2 can be also employed to describe the kernels and
cokernels of T (a) + Hα (b). To this aim we develop a suitable framework, which is
not a straightforward extension of the methods of Sect. 2. Let us now assume that
a belongs to the group of invertible elements GL∞ and the duo a, b satisfy the
condition
Relation (8.3) is again called the matching condition and the duo (a, b) are α-
matching functions. To each matching pair, one can assign another α-matching
pair (c, d) := (ab−1, abα−1 ) called the subordinated pair for (a, b). It is easily
seen that ccα = ddα = 1. In what follows, any element g ∈ L∞ satisfying the
relation ggα = 1 is referred to as α-matching function. The functions c and d can
also be expressed in the form c = bα aα−1 , d = bα−1 a. Besides, if (c, d) is the
subordinated pair for an α-matching pair (a, b), then (d, c) is the subordinated pair
for the matching pair (a, bα ), which defines the adjoint operator
Jα ϕ(t) := χ −1 (t)ϕ(α(t)),
where χ(t) = t/(α− (t)), and note that if α is the shift (8.1), then:
1. χ ∈ H ∞ is an α-matching function and wind χ = 1, where wind χ denotes the
winding number of the function χ with respect to the origin.
2. The function χα ∈ H ∞ and χα (∞) = 0.
3. If a, b ∈ L∞ and n is a positive integer, then
Further development runs similar to the one presented in Sect. 2.3 and all results
are valid if the operator H (b) is replaced by Hα (b), a by aα and b by bα . For
example, if (a, b) is an α-matching pair with the subordinated pair (c, d), then the
block Toeplitz operator T (V (a, b)) with the matrix function
0 d
V (a, b) = ,
−c aα−1
Moreover, assuming that T (c) is invertible from the right and Tr−1 (c) is one of the
right inverses, we can again represent the kernel of T (V (a, b)) as the direct sum
(d),
ker T (V (a, b)) = (c)
152 V. D. Didenko and B. Silbermann
where
(c) := (ϕ, 0)T : ϕ ∈ ker T (c) ,
(d) := (Tr−1 (c)T (aα−1 )s, s)T : s ∈ ker T (d) .
1
P±
α (c) := (I ± Jα QcP ) : ker T (c) → ker T (c),
2
1
P±
α (d) := (I ± Jα QdP ) : ker T (d) → ker T (d),
2
are projections on the corresponding spaces and consider the translation operators
ϕα± : P±
α (d) → ker(T (a) ± Hα (b)),
which are defined similar to (3.5), but with J and a −1 replaced by Jα and aα−1 ,
respectively.
In this situation, Proposition 3.6 reads as follows.
Proposition 8.2 Assume that (a, b) ∈ L∞ × L∞ is a Fredholm matching pair. If
the operator T (c) is right-invertible, then
g = g− t −n g+ , g− (∞) = 1,
Invertibility Issues for Toeplitz Plus Hankel Operators 153
1. If n = 2m, m ∈ N, then
−1 m−k−1
Bα± (g) := {g+ (χ ± σα (g)χ m+k ) : k = 0, 1, · · · , m − 1},
and dim im P±
α (g) = m.
2. If n = 2m + 1, m ∈ Z+ , then
−1 m+k
Bα± (g) := {g+ (χ ± σα (g)χ m−k ) : k = 0, 1, · · · , m} \ {0},
dim im P±
α (g) = m + (1 ± σα (g))/2 ,
and the zero element belongs to one of the sets Bα+ (g) or Bα− (g)—viz. for k = 0
one of the terms χ m (1 ± σα (g)) is equal to zero.
The above considerations provide a powerful tool for the study of invertibility of
generalized Toeplitz plus Hankel operators and it should be clear that the results
obtained in Sects. 4 and 5 can be extended to this class of operators. However,
additional studies may be needed in certain situations. Nevertheless, let us mention
one of such results here.
Proposition 8.6 Assume that (a, b) ∈ L∞ × L∞ is a Fredholm matching pair and
the operators T (c) and T (d) are right-invertible. Then T (a) + Hα (b) and T (a) −
Hα (b) are also right-invertible and corresponding right inverses are given by
(T (a) ± Hα (b))−1 −1 −1 −1 −1 −1
r = (I ∓ Hα (cα ))Tr (c)T (aα )Tr (d) ± Hα (a )Tr (d).
154 V. D. Didenko and B. Silbermann
In conclusion of this section, we note the works [40, 41] where more general
operators with the shift (8.1) are considered. However, the conditions imposed on
coefficient functions are more restrictive and the results obtained are less complete.
9 Final Remarks
Acknowledgments The authors express their sincere gratitude to anonymous referees for insight-
ful comments and suggestions that helped to improve the paper.
References
1. J. Baik, E.M. Rains, Algebraic aspects of increasing subsequences. Duke Math. J. 109, 1–65
(2001)
2. E.L. Basor, T. Ehrhardt, On a class of Toeplitz + Hankel operators. New York J. Math. 5, 1–16
(1999)
3. E.L. Basor, T. Ehrhardt, Factorization theory for a class of Toeplitz + Hankel operators. J.
Oper. Theory 51, 411–433 (2004)
4. E.L. Basor, T. Ehrhardt, Fredholm and invertibility theory for a special class of
Toeplitz + Hankel operators. J. Spectral Theory 3, 171–214 (2013)
5. E.L. Basor, T. Ehrhardt, Asymptotic formulas for determinants of a special class of
Toeplitz + Hankel matrices, in Large Truncated Toeplitz Matrices, Toeplitz Operators, and
Related Topics, vol. 259. The Albrecht Böttcher Anniversary Volume. Operator Theory:
Advances and Applications (Birkhäuser, Basel, 2017), pp. 125–154
6. E. Basor, Y. Chen, T. Ehrhardt, Painlevé V and time-dependent Jacobi polynomials. J. Phys. A
43, 015204, 25 (2010)
7. A. Böttcher, B. Silbermann, Analysis of Toeplitz Operators, 2nd edn. Springer Monographs in
Mathematics (Springer, Berlin, 2006)
8. A. Böttcher, Y.I. Karlovich, I.M. Spitkovsky, Convolution Operators and Factorization of
Almost Periodic Matrix Functions (Birkhäuser, Basel, 2002)
Invertibility Issues for Toeplitz Plus Hankel Operators 155
9. L.P. Castro, A.P. Nolasco, A semi-Fredholm theory for Wiener-Hopf-Hankel operators with
piecewise almost periodic Fourier symbols. J. Oper. Theory 62, 3–31 (2009)
10. L.P. Castro, A.S. Silva, Wiener-Hopf and Wiener-Hopf-Hankel operators with piecewise-
almost periodic symbols on weighted Lebesgue spaces. Mem. Diff. Equ. Math. Phys. 53, 39–62
(2011)
11. K. Clancey, I. Gohberg, Factorization of Matrix Functions and Singular Integral Operators
(Birkhäuser, Basel, 1981)
12. L.A. Coburn, R.G. Douglas, Translation operators on the half-line. Proc. Nat. Acad. Sci. USA
62, 1010–1013 (1969)
13. V.D. Didenko, B. Silbermann, Index calculation for Toeplitz plus Hankel operators with
piecewise quasi-continuous generating functions. Bull. London Math. Soc. 45, 633–650 (2013)
14. V.D. Didenko, B. Silbermann, The Coburn-Simonenko Theorem for some classes of Wiener–
Hopf plus Hankel operators. Publ. de l’Institut Mathématique 96(110), 85–102 (2014)
15. V.D. Didenko, B. Silbermann, Some results on the invertibility of Toeplitz plus Hankel
operators. Ann. Acad. Sci. Fenn. Math. 39, 443–461 (2014)
16. V.D. Didenko, B. Silbermann, Structure of kernels and cokernels of Toeplitz plus Hankel
operators. Integr. Equ. Oper. Theory 80, 1–31 (2014)
17. V.D. Didenko, B. Silbermann, Generalized inverses and solution of equations with Toeplitz
plus Hankel operators. Bol. Soc. Mat. Mex. 22, 645–667 (2016)
18. V.D. Didenko, B. Silbermann, Generalized Toeplitz plus Hankel operators: kernel structure
and defect numbers. Compl. Anal. Oper. Theory 10, 1351–1381 (2016)
19. V.D. Didenko, B. Silbermann, Invertibility and inverses of Toeplitz plus Hankel operators. J.
Oper. Theory 72, 293–307 (2017)
20. V.D. Didenko, B. Silbermann, Kernels of Wiener-Hopf plus Hankel operators with matching
generating functions, in Recent Trends in Operator Theory and Partial Differential Equations,
vol. 258. The Roland Duduchava Anniversary Volume. Operator Theory: Advances and
Applications (Birkhäuser, Basel, 2017), pp. 111–127
21. V.D. Didenko, B. Silbermann, Kernels of a class of Toeplitz plus Hankel operators with
piecewise continuous generating functions, in Contemporary Computational Mathematics –
A Celebration of the 80th Birthday of Ian Sloan, ed. by J. Dick, F.Y. Kuo, H. Woźniakowski
(eds). (Springer, Cham, 2018), pp. 317–337
22. V.D. Didenko, B. Silbermann, The invertibility of Toeplitz plus Hankel operators with
subordinated operators of even index. Linear Algebra Appl. 578, 425–445 (2019)
23. V.D. Didenko, B. Silbermann, Invertibility issues for a class of Wiener-Hopf plus Hankel
operators. J. Spectral Theory 11 (2021)
24. R.V. Duduchava, Wiener-Hopf integral operators with discontinuous symbols. Dokl. Akad.
Nauk SSSR 211, 277–280 (1973) (in Russian)
25. R.V. Duduchava, Integral operators of convolution type with discontinuous coefficients. Math.
Nachr. 79, 75–98 (1977)
26. R.V. Duduchava, Integral Equations with Fixed Singularities (BSB B. G. Teubner Verlagsge-
sellschaft, Leipzig, 1979)
27. R. Edwards, Fourier Series. A Modern Introduction, vol. 1. Graduate Texts in Mathematics,
vol. 85 (Springer, Berlin, 1982)
28. T. Ehrhardt, Factorization theory for Toeplitz+Hankel operators and singular integral opera-
tors with flip. Habilitation Thesis, Technische Universität Chemnitz (2004)
29. T. Ehrhardt, Invertibility theory for Toeplitz plus Hankel operators and singular integral
operators with flip. J. Funct. Anal. 208, 64–106 (2004)
30. P.J. Forrester, N.E. Frankel, Applications and generalizations of Fisher-Hartwig asymptotics.
J. Math. Phys. 45, 2003–2028 (2004)
31. I.C. Gohberg, I.A. Feldman, Convolution Equations and Projection Methods for Their Solution
(American Mathematical Society, Providence, 1974)
32. I. Gohberg, N. Krupnik, One-Dimensional Linear Singular Integral Equations. I, vol. 53.
Operator Theory: Advances and Applications (Birkhäuser Verlag, Basel, 1992)
156 V. D. Didenko and B. Silbermann
33. I. Gohberg, N. Krupnik, One-Dimensional Linear Singular Integral Equations. II, vol. 54.
Operator Theory: Advances and Applications (Birkhäuser Verlag, Basel, 1992)
34. S. Grudsky, A. Rybkin, On positive type initial profiles for the KdV equation. Proc. Am. Math.
Soc. 142, 2079–2086 (2014)
35. S. Grudsky, A. Rybkin, Soliton theory and Hankel operators. SIAM J. Math. Anal. 47, 2283–
2323 (2015)
36. S.M. Grudsky, A.V. Rybkin, On the trace-class property of Hankel operators arising in the
theory of the Korteweg-de Vries equation. Math. Notes 104, 377–394 (2018)
37. P. Junghanns, R. Kaiser, A note on Kalandiya’s method for a crack problem. Appl. Numer.
Math. 149, 52–64 (2020)
38. N.K. Karapetiants, S.G. Samko, On Fredholm properties of a class of Hankel operators. Math.
Nachr. 217, 75–103 (2000)
39. N.K. Karapetiants, S.G. Samko, Equations with Involutive Operators (Birkhäuser Boston Inc.,
Boston, 2001)
40. V.G. Kravchenko, A.B. Lebre, J.S. Rodríguez, Factorization of singular integral operators with
a Carleman shift via factorization of matrix functions: the anticommutative case. Math. Nachr.
280, 1157–1175 (2007)
41. V.G. Kravchenko, A.B. Lebre, J.S. Rodríguez, Factorization of singular integral operators with
a Carleman backward shift: the case of bounded measurable coefficients. J. Anal. Math. 107,
1–37 (2009)
42. N.Y. Krupnik, Banach Algebras with Symbol and Singular Integral Operators, vol. 26.
Operator Theory: Advances and Applications (Birkhäuser Verlag, Basel, 1987)
43. A.B. Lebre, E. Meister, F.S. Teixeira, Some results on the invertibility of Wiener-Hopf-Hankel
operators. Z. Anal. Anwend. 11, 57–76 (1992)
44. G.S. Litvinchuk, I.M. Spitkovskii, Factorization of Measurable Matrix Functions, vol. 25.
Operator Theory: Advances and Applications (Birkhäuser Verlag, Basel, 1987)
45. E. Meister, F. Penzel, F.-O. Speck, F.S. Teixeira, Two-media scattering problems in a half-
space, in Partial Differential Equations with Real Analysis. Dedicated to Robert Pertsch Gilbert
on the occasion of his 60th birthday (Longman Scientific & Technical, Harlow; John Wiley &
Sons, Inc., New York, 1992), pp. 122–146
46. E. Meister, F.-O. Speck, F.S. Teixeira, Wiener-Hopf-Hankel operators for some wedge
diffraction problems with mixed boundary conditions. J. Integral Equ. Appl. 4, 229–255 (1992)
47. V.V. Peller, Hankel Operators and Their Applications. Springer Monographs in Mathematics
(Springer, New York, 2003)
48. S.C. Power, C*-algebras generated by Hankel operators and Toeplitz operators. J. Funct. Anal.
31, 52–68 (1979)
49. S. Roch, B. Silbermann, Algebras of convolution operators and their image in the Calkin
algebra, vol. 90. Report MATH (Akademie der Wissenschaften der DDR Karl-Weierstrass-
Institut für Mathematik, Berlin, 1990)
50. S. Roch, B. Silbermann, A handy formula for the Fredholm index of Toeplitz plus Hankel
operators. Indag. Math. 23, 663–689 (2012)
51. S. Roch, P.A. Santos, B. Silbermann, Non-Commutative Gelfand Theories. A Tool-Kit for
Operator Theorists and Numerical Analysts. Universitext (Springer, London, 2011)
52. B. Silbermann, The C ∗ -algebra generated by Toeplitz and Hankel operators with piecewise
quasicontinuous symbols. Integr. Equ. Oper. Theory 10, 730–738 (1987)
53. I.B. Simonenko, Some general questions in the theory of Riemann boundary problem. Math.
USSR Izvestiya 2, 1091–1099 (1968)
54. I.J. Šneı̆berg, Spectral properties of linear operators in interpolation families of Banach spaces.
Mat. Issled. 9, 2(32), 214–229 (1974) (in Russian)
55. I.M. Spitkovskiı̆, The problem of the factorization of measurable matrix-valued functions.
Dokl. Akad. Nauk SSSR 227, 576–579 (1976) (in Russian)
56. F.S. Teixeira, Diffraction by a rectangular wedge: Wiener-Hopf-Hankel formulation. Integr.
Equ. Oper. Theory 14, 436–454 (1991)
K-Inner Functions and K-Contractions
1 Introduction
be an analytic function without zeros on the unit disc D in C such that a0 = 1, an >
0 for all n ∈ N and such that
an an
0 < inf ≤ sup < ∞.
n∈N an+1 n∈N an+1
Since k has no zeros, the reciprocal function 1/k ∈ O(D) admits a Taylor expansion
∞
(1/k)(z) = cn z n (z ∈ D).
n=0
defines an analytic functional Hilbert space HK such that the row operator
Mz : HKd → HK is bounded and has closed range [2, Theorem A.1]. Typical
examples of functional Hilbert spaces of this type on the unit ball Bd are the Drury–
Arveson space, the Dirichlet space, the Hardy space and the weighted Bergman
spaces.
Let T = (T1 , . . . , Td ) ∈ L(H )d be a commuting tuple of bounded linear
Hilbert space H and let σT : L(H ) → L(H ) be the map
operators on a complex
defined by σT (X) = di=1 Ti XTi∗ . The tuple T is called a K-contraction if the limit
∞
1
(T ) = SOT− cn σTn (1H ) = SOT− lim c|α| γα T α T ∗α
K N→∞
n=0 |α|≤N
exists and defines a positive operator. Here and in the following we use the
abbreviation γα = |α|!/α! for α ∈ Nd .
If K(z, w) = 1/(1 − z, w) is the Drury–Arveson kernel, then under a
natural pureness condition the K-contractions coincide with the commuting row
contractions of class C·0 . If m is a positive integer and Km (z, w) = 1/(1 −z, w)m ,
then the pure Km -contractions are precisely the row-m-hypercontractions of class
C·0 [17, Theorem 3.49] and [12, Lemma 2].
An operator-valued analytic function W : Bd → L(E∗ , E ) with Hilbert spaces
E and E∗ is called K-inner if the map E∗ → HK (E ), x → W x, is a well-defined
isometry and
It was shown by Olofsson [13] that, for d = 1 and the Bergman-type kernel
1
Km : D × D → C, Km (z, w) = (m ∈ N \ {0}),
(1 − zw)m
the Km -inner functions W : D → L(E∗ , E ) are precisely the functions of the form
m
W (z) = D + C (1 − zT ∗ )−k B,
k=1
m−1
m
!T = (−1)k T k T ∗k .
k+1
k=0
In the present note we show that results of Olofsson from [13, 14] hold true for a
large class of kernels
∞
K : Bd × Bd → C, K(z, w) = an z, wn
n=0
extend a uniqueness result for minimal K-dilations due to Arveson to our class of
kernels.
2 Wandering Subspaces
N
1
SOT− lim 1H − an σTn ( (T )) = 0.
N→∞ K
n=0
Let us define the defect operator and the defect space of a K-contraction T by
1 1
C= (T ) 2 and D = Im C.
K
M = HK (D) Im j ⊂ HK (D)
W ⊥ Sα W (α ∈ Nd \ {0}).
The
6 space W is called a generating wandering subspace for S if in addition H =
(S α W; α ∈ Nd ). For each closed S-invariant subspace L ⊂ H , the space
d
WS (L) = L Si L
i=1
for the wandering subspace associated with the restriction of Mz to the invariant
subspace M = HK (D) Im j . Our main tool will be the matrix operator
Since the row operator Mz : HK (D)d → HK (D) has closed range [2, Theorem
A.1], the operator
and
⎛ ⎞
∞ ∞
an+1
! : HK (D) → HK (D), !⎝ fα zα ⎠ = fα zα .
an
n=0 |α|=n n=0 |α|=n
δMzi = Mzi !
for i = 1, . . . , d.
Lemma 2.2 For f ∈ HK (D), we have
Proof Since the column operator Mz∗ annihilates the constant functions, to prove
suppose that f (0) = 0. With respect to the orthogonal
the first identity, we may 7
decomposition HK (D) = ∞ ∗
n=0 Hn (D) the operator Mz Mz acts as (Lemma 4.3 in
[11])
∞ ∞
an−1
Mz Mz∗ fn = fn .
an
n=0 n=1
K-Inner Functions and K-Contractions 163
!T = j ∗ !j
f = f0 + Mz (j xi )di=1
Thus if f ∈ W (M), then (xi )di=1 = (j ∗ Mz∗i f )di=1 defines a tuple in H d with
(j xi )di=1 = Mz∗ f such that Cf (0) + T (!T xi )di=1 = j ∗ f = 0 and
f = f0 + Mz (j xi )di=1
d
f 2 = f0 2 + !T xi , xi .
i=1
it follows that
(x, y) = !T x, y
defines a scalar product on H such that the induced norm · T is equivalent to the
original norm with
IT : H → H̃ , x → x
IT∗ x, y = !T x, y (x ∈ H̃ , y ∈ H ).
d
T̃ T̃ ∗ = Ti (IT∗ IT )Ti∗ = σT (!T ) = σT (j ∗ !j ) = j ∗ Mz (⊕!)Mz∗ j
i=1
Here the identity (j ∗ PD j )1/2 = C follows from the definition of j and the
representation of j ∗ explained in the section following Theorem 2.1. We write
DT̃ = DT̃ H̃ d ⊂ H̃ d and DT̃ ∗ = DT̃ ∗ H = D for the defect spaces of T̃ . As in
the classical single-variable theory of contractions it follows that T̃ DT̃ = DT̃ ∗ T̃
and that
T̃ DT̃ ∗
U= : H̃ d ⊕ DT̃ ∗ → H ⊕ DT̃
DT̃ −T̃ ∗
such that
f = f0 + Mz (j xi )di=1
or equivalently, with
(IT xi ) 0∗ DT̃ y
=U = .
f0 y −T̃ y
But then y ∈ D̃ and f = −T̃ y + Mz (⊕j IT−1 )DT̃ y. Conversely, if f is of this form,
then using the definitions of T̃ , D̃ and the intertwining relation T̃ DT̃ = DT̃ ∗ T̃ one
can easily show that the vectors defined by
d
d
f 2 = f0 2 + !T xi , xi = T̃ y2 + IT xi 2H̃
i=1 i=1
exists. Then r ∈ [1, ∞) is the radius of convergence of the power series defining k
and by Theorem 4.5 in [11] the Taylor spectrum of Mz ∈ L(HK (D))d is given by
√
σ (Mz ) = {z ∈ Cd ; z ≤ r}.
k(z) − 1
F (z) = (z ∈ Dr (0) \ {0}).
z
d
For z ∈ Bd , let us denote by Z : H d → H, (hi )di=1 → i=1 zi hi , the row
operator induced by z. As a particular case of a much more general analytic spectral
168 J. Eschmeier and S. Toth
mapping theorem for the Taylor spectrum [9, Theorem 2.5.10] we find that
d
σ (ZT ∗ ) = { zi wi ; w ∈ σ (T ∗ )} ⊂ Dr (0)
i=1
∞
d
= an+1 γα CT ∗α xi zα+ei ,
i=1 n=0 |α|=n
where the series converge in HK (D). Since the point evaluations are continuous on
HK (D), we obtain
d
∞
∗α
δMz (j xi )di=1 (z) = an+1 γα CT zi xi zα
n=0 |α|=n i=1
∗
= CF (ZT )Z(xi )di=1
for all z ∈ Bd .
By Lemma 2.6 the map WT : Bd → L(D̃, D),
MW : Hd2 (E ) → HK , f → Wf
from the E -valued Drury–Arveson space Hd2 (E ) to HK (E˜ ) [3, Theorem 6.2].
3 K-Inner Functions
1
(K1) C ∗ C = (T ),
K
(K2) D ∗ C + B ∗ (⊕!T )T ∗ = 0,
170 J. Eschmeier and S. Toth
is a well defined isometry that intertwines the tuples T ∗ ∈ L(H )d and Mz∗ ∈
L(HK (E ))d componentwise. As in the section following Theorem 2.1 one can show
that
jC∗ f = T α C ∗ fα
α∈Nd
for f = α∈Nd fα zα ∈ HK (E ). Hence we find that
jC∗ !jC x = jC∗ ! a|α| γα (CT ∗α x)zα
α∈Nd
= jC∗ a|α|+1 γα (CT ∗α x)zα
α∈Nd
= a|α|+1 γα (T α C ∗ CT ∗α x)
α∈Nd
1
= a|α|+1 γα (T α (T )T ∗α x)
K
α∈Nd
jC∗ !jC = j ∗ !j = !T .
and that, conversely, under a natural condition on the kernel K each K-inner
function is of this form.
Theorem 3.1 Let W : Bd → L(E∗ , E ) be an operator-valued function between
Hilbert spaces E∗ and E such that
for all z ∈ Bd . Since δ(Mz Mz∗ ) = PIm Mz is an orthogonal projection and since
δMz = Mz (⊕!), we find that
W E∗ ⊥ zα (W E∗ )
1
Kν : Bd × Bd → C, Kν (z, w) = (ν ∈ (0, ∞))
(1 − z, w)ν
of the Drury–Arveson kernel (see the discussion following Theorem 4.2). In the
proof we shall use a uniqueness result for minimal K-dilations whose proof we
postpone to Sect. 4.
Theorem 3.2 Let Mz ∈ L(HK )d be a K-contraction. If W : Bd → L(E∗ , E ) is a
K-inner function, then there exist a pure K-contraction T ∈ L(H )d and a matrix
operator
T∗ B
∈ L(H ⊕ E∗ , H d ⊕ E )
C D
W = W E∗ ⊂ HK (E )
K-Inner Functions and K-Contractions 173
i = (1HK ⊗ U )j.
Define Eˆ = E (R ∩ E ). By construction
HK (Eˆ ) = HK (E ) HK (R ∩ E ) = HK (E ) R ⊂ S
M = HK (D) Im j
1HK ⊗ U : M → HK (R ∩ E ) S ⊥ = HK (R ∩ E ) ∩ S
174 J. Eschmeier and S. Toth
= Eˆ ⊕ (1HK ⊗ U )W (M).
such that
and W (M) = {WT x; x ∈ D̃} (see the beginning of Sect. 3 and Theorem 2.7). Let
us denote by
E∗ → W , x → W x
and
D̃ → W (M) x → WT x.
U1 : E∗ → Eˆ , U1 x = P1 W x
and
(U1 , U2 ) : E∗ → Eˆ ⊕ D̃
K-Inner Functions and K-Contractions 175
holds for z ∈ Bd and x ∈ E∗ . To complete the proof we show that the operators
1
C̃ ∗ C̃ = C ∗ U ∗ U C = C ∗ C = (T )
K
and
D̃ ∗ C̃ = U2∗ D ∗ U ∗ U C = U2∗ D ∗ C
= −U2∗ B ∗ (⊕!T ) T ∗ = −B̃ ∗ (⊕!T ) T ∗ .
D̃ = (U1 , U DU2 ) : E∗ → E = Eˆ ⊕ (R ∩ E ).
holds for all x ∈ E∗ . Thus the K-inner function W : Bd → L(E∗ , E ) admits a matrix
representation of the claimed form.
4 Minimal K-Dilations
B = span· {AA∗ }
Arveson proved in [1, Lemmma 8.6] that every unitary operator that intertwines
two A-morphisms ϕi : B → L(Hi ) (i = 1, 2) pointwise on A extends to a unitary
operator that intertwines the minimal Stinespring representations of ϕ1 and ϕ2 .
Straightforward modifications of the arguments given in [1] show that Arveson’s
result remains true if B is a von Neumann algebra which is the w∗ -closed linear
span
∗
B = spanw {AA∗ }
But then the hypothesis that Mz is essentially normal implies that L ⊂ L(HK ) is
a subalgebra. Since the involution on L(HK ) is w∗ -continuous, the algebra L ⊂
L(HK ) is a von Neumann algebra and hence L = W ∗ (Mz ).
The tuple Mz ∈ L(HK )d is known to be a K-contraction if there is a natural
number p ∈ N such that cn ≥ 0 for all n ≥ p or cn ≤ 0 for all n ≥ p [7, Lemma
2.2] or [17, Proposition 2.10]. The latter condition holds, for instance, if HK is a
complete Nevanlinna–Pick space or if K is a kernel of the form
1
Kν : Bd × Bd → C, Kν (z, w) =
(1 − z, w)ν
with a positive real number ν > 0 (see [8, Lemma 2.1] and [17, Section 1.5.2] for
these results and further examples).
Let T ∈ L(H )d be a commuting tuple and let j : H → HK (E ) be a K-dilation
of T . We denote by B = W ∗ (Mz ) ⊂ L(HK ) the von Neumann algebra generated
by Mz and set A = {p(Mz ); p ∈ C[z]}. The unital C ∗ -homomorphism
π : B → L(HK (E )), X → X ⊗ 1E
for all p ∈ C[z] and X ∈ B. Standard duality theory for Banach space operators
shows that π is w∗ -continuous. Indeed, as an application of Krein–Smulian’s
theorem (Theorem IV. 6.4 in [16]) one only has to check that τw∗ −limα (Xα ⊗1E ) =
X ⊗ 1E for each norm-bounded net (Xα ) in B with τw∗ − limα Xα = X. To
complete the argument it suffices to recall that on norm-bounded sets the w∗ -
topology and the weak operator topology coincide. Thus we have shown that ϕ is
a w∗ -continuous A-morphism with Stinespring representation π. By definition the
K-dilation j : H → HK (E ) is minimal if and only if
8
π(X)(j H ) = HK (E ),
X∈W ∗ (Mz )
Since
for all p ∈ C[z] and i = 1, 2, Theorem 4.1 implies that there is a unitary operator
W : HK (E1 ) → HK (E2 ) with Wj1 = j2 and W (X ⊗ 1E1 ) = (X ⊗ 1E2 )W for all
X ∈ B. In particular, the unitary operator W satisfies the intertwining relations
Proof The hypothesis implies that M is reducing for the von Neumann algebra
W ∗ (Mz ) ⊂ L(HK (E )) generated by Mz1 , . . . Mzd ∈ L(HK (E )). Standard results
on von Neumann algebras (Corollary 17.6 and Proposition 24.1 in [18]) show that
Hence PE M ⊂ M. Let f = α∈Nd fα zα ∈ HK (E ) be arbitrary. An elementary
calculation yields that
PE (Mz∗β f ) ∈ (C \ {0})fβ (β ∈ Nd ).
and that
CH = PD (Im j ) ⊂ PD (M) ⊂ M ∩ D.
6
It follows that D = CH = M ∩ D and that M = α∈Nd zα D = HK (D).
It should be interesting to compare the uniqueness result proved in this section
with a related result proved by Olofsson [15, Theorem 7.6] for single contractions
T ∈ L(H ) satisfying a slightly different K-contractivity condition. In [15] it is
shown that even in the non-pure case each dilation factors through a canonical
defined dilation of T .
References
1. W. Arveson, Subalgebras of C ∗ -algebras. III: multivariable operator theory. Acta Math. 181,
159–228 (1998)
2. W. Arveson, Quotients of standard Hilbert modules. Trans. Am. Math. Soc. 359, 6027–6055
(2007)
3. M. Bhattacharjee, J. Eschmeier, D.K. Keshari, J. Sarkar, Dilations, wandering subspaces and
inner functions. Linear Algebra Appl. 523, 263–280 (2017)
4. C. Barbian, A characterization of multiplication operators on reproducing kernel Hilbert
spaces. J. Oper. Theory 65, 235–240 (2011)
5. J.A. Ball, V. Bolotnikov, Weighted Bergman spaces: shift-invariant subspaces and
input/state/output linear systems. Integr. Equ. Oper. Theor. 76, 301–356 (2013)
6. J.A. Ball, V. Bolotnikov, Weighted Hardy spaces: shift-invariant subspaces and coinvariant
subspaces, linear systems and operator model theory. Acta Sci. Math. (Szeged) 79, 623–686
(2013)
7. Y. Chen, Quasi-wandering subspaces in a class of reproducing analytic Hilbert spaces. Proc.
Am. Math. Soc. 140, 4235–4242 (2012)
8. R. Clouâtre, M. Hartz, Multiplier algebras of complete Nevanlinna–Pick spaces: dilations,
boundary representations and hyperrigidity. J. Funct. Anal. 274, 1690–1738 (2018)
9. J. Eschmeier, M. Putinar, Spectral Decompositions and Analytic Sheaves. London Mathemati-
cal Society Monographs, New Series, vol. 10 (Clarendon Press, Oxford, 1996)
10. J. Eschmeier, Bergman inner functions and m-hypercontractions. J. Funct. Anal. 275, 73–102
(2018)
11. K. Guo, J. Hu, X. Xu, Toeplitz algebras, subnormal tuples and rigidity on reproducing
C[z1 , . . . , zd ]-modules. J. Funct. Anal. 210, 214–247 (2004)
12. V. Müller, F.-H. Vasilescu, Standard models for some commuting multioperators. Proc. Am.
Math. Soc. 117, 979–989 (1993)
13. A. Olofsson, A characteristic operator function for the class of n-hypercontractions. J. Funct.
Anal. 236, 517–545 (2006)
K-Inner Functions and K-Contractions 181
14. A. Olofsson, Operator-valued Bergman inner functions as transfer functions. Algebra Anal. 19,
146–173 (2007); St. Petersberg Math. J. 19, 603–623 (2008)
15. A. Olofsson, Parts of adjoint weighted shifts. J. Oper. Theory 74, 249–280 (2015)
16. H.H. Schaefer, Topological Vector Spaces (Macmillan, New York, 1966)
17. D. Schillo, K-contractions and perturbations of Toeplitz operators. Ph.D. thesis, Saarland
University, 2018
18. K. Zhu, An Introduction to Operator Algebras. Studies in Advanced Mathematics (CRC Press,
Boca Raton, 1993)
Tight and Cover-to-Join Representations
of Semilattices and Inverse Semigroups
Ruy Exel
1 Introduction
R. Exel ()
Departamento de Matemática, Universidade Federal de Santa Catarina, Florianópolis, SC, Brazil
a ≤ b ∈ C ⇒ a ∈ C.
Such an ideal is evidently also closed under ∧ and under relative complements, so
it is a generalized Boolean algebra in itself.
Given any nonempty subset S of B, notice that the subset C defined by
6
C = a ∈ B : a ≤ z∈Z z, for some nonempty finite subset Z ⊆ S ,
From now on let us fix a meet semilattice E (always assumed to have a zero
element).
Definition 3.1 A representation of E in a generalized Boolean algebra B is any
map π : E → B, such that
(i) π(0) = 0, and
(ii) π(x ∧ y) = π(x) ∧ π(y), for every x and y in E.
In order to spell out the definition of the notion of tight representations,
introduced in [4], let F be any subset of E. We then say that a given subset Z ⊆ F
is a cover for F , if for every nonzero x in F , there exists some z in Z, such that
z ∧ x = 0.
Furthermore, if X and Y are finite subsets of E, we let
E X,Y = {z ∈ E : z ≤ x, ∀x ∈ X, and z ⊥ y, ∀y ∈ Y }.
The latter condition is useful for dealing with characters, i.e. with representations
of E in the Boolean algebra {0, 1}, because the requirement that a character
be nonzero immediately implies (3.3), so again cover-to-join suffices to prove
tightness.
On the other hand, an advantage of the notion of cover-to-join representations is
that it makes sense for representations in generalized Boolean algebras, while the
reference to the unary operation ¬ in (3.1) precludes it from being applied when the
target algebra lacks a unit, that is, for a representation into a generalized Boolean
algebra.
Again referring to the occurrence of ¬ in (3.1), observe that if X is nonempty,
then the right hand side of (3.1) lies in the ideal of B generated by the range of
Tight and Cover-to-Join Representations 187
π. This is because, even though ¬ π(y) is not necessarily in π(E) , this term will
appear besides π(x), for some x in X, and hence
π(x) ∧ ¬π(y) = π(x) \ π(x) ∧ π(y) ∈ π(E) .
we claim that
π(x) ≤ e, ∀x ∈ E. (3.5)
To see this, pick x in E and notice that, since Z is a cover for E, we have in particular
that the set
{z ∧ x : z ∈ Z}
B = {a ∈ B : a ≤ e},
The following is perhaps the most obvious adaptation of the notion of non-
degenerate representations extensively used in the theory of operator algebras [15,
Definition 9.3].
Definition 4.1 We shall say that a representation π of a semilattice E in a
generalized Boolean algebra B is 6 non-degenerate if, for every a in B, there is a
finite subset Z of E such that a ≤ z∈Z π(z). In other words, π is non-degenerate
if and only if B coincides with the ideal generated by the range of π.
Observe that, if both E and B have a unit, and if π is a unital map, then π is
evidently non-degenerate. More generally, if π satisfies (3.3), then the same is also
clearly true.
The following result says that, by adjusting the codomain of a representation, we
can always make it non-degenerate.
Proposition 4.2 Let π be a representation of E in the generalized Boolean algebra
B. Letting C be the ideal of B generated by the range of π, one has that π is a
non-degenerate representation of E in C.
Proof Obvious.
Tight and Cover-to-Join Representations 189
By its very nature, the concept of a tight representation pertains to the realm of
semilattices and Boolean algebras. However, given the relevance of the study of
semilattices in the theory of inverse semigroups, tight representations have had a
strong impact on the latter.
Recall that a Boolean inverse semigroup (see [5] but please observe that this
notion is not equivalent to the homonym studied in [9] and [16]) is an inverse
semigroup whose idempotent semilattice E(S) is a Boolean algebra. In accordance
with what we have been discussing up to now, it is sensible to give the following:
Definition 5.1
(i) A generalized Boolean inverse semigroup is an inverse semigroup whose
idempotent semilattice is a generalized Boolean algebra.
(ii) (cf. [4, Definition 13.1] and [5, Proposition 6.2]) If S is any inverse semigroup1
and T is a generalized Boolean inverse semigroup, we say that a homomor-
phism π : S → T (always assumed to preserve zero) is tight if the restriction
of π to E(S) is a tight representation into E(T ), in the sense of Definition 3.4.
(iii) If π is as above, we say that π is cover-to-join if the restriction of π to E(S) is
cover-to-join.
Addressing the already mentioned slight imprecision in the proof of [2, Theorem
2.2], we then have the following version of Theorem 3.5 and Proposition 4.2:
T = {t ∈ T : t ∗ t ≤ e, tt ∗ ≤ e},
where the last inequality above follows as in (3.5). By a similar reasoning one shows
that also π(s)π(s)∗ ≤ e, so we see that π(s) lies in T , and we may then think of π
as a representation of S in T . As in Theorem 3.5, one may now easily prove that π
becomes a tight representation into T .
6 Conclusion
π : S → B(H )
π(s) = π (s) ⊕ 0,
References
1. G. Boava, G.G. de Castro, F.de L. Mortari, Inverse semigroups associated with labelled spaces
and their tight spectra. Semigroup Forum 94, 582–609 (2017)
2. A.P. Donsig, D. Milan, Joins and covers in inverse semigroups and tight C*-algebras. Bull.
Aust. Math. Soc. 90, 121–133 (2014)
3. R. Exel, Inverse Semigroups and Combinatorial C*-algebras (2007). arXiv:math/ 0703182v1
[math.OA]
4. R. Exel, Inverse semigroups and combinatorial C*-algebras. Bull. Braz. Math. Soc. (N.S.) 39,
191–313 (2008)
5. R. Exel, Tight representations of semilattices and inverse semigroups. Semigroup Forum 79,
159–182 (2009)
6. R. Exel, E. Pardo, Self-similar graphs, a unified treatment of Katsura and Nekrashevych C∗ -
algebras. Adv. Math. 306, 1046–1129 (2017)
7. R. Exel, C. Starling, Self-similar graph C ∗ -algebras and partial crossed products. J. Oper.
Theory 75, 299–317 (2016)
8. R. Exel, D. Goncalves, C. Starling, The tiling C*-algebra viewed as a tight inverse semigroup
algebra. Semigroup Forum 84, 229–240 (2012)
9. M.V. Lawson, Non-commutative Stone duality: inverse semigroups, topological groupoids and
C*-algebras. Internat. J. Algebra Comput. 22(6), 1250058, 47 (2012)
10. M.V. Lawson, D.G. Jones, Graph inverse semigroups: their characterization and completion. J.
Algebra 409, 444–473 (2014)
192 R. Exel
11. M.V. Lawson, A. Vdovina, The universal Boolean inverse semigroup presented by the abstract
Cuntz-Krieger relations. J. Noncommut. Geom. (2019, to appear). arXiv:1902.02583v3
[math.OA], February, 28, 2019.
12. C. Starling, Boundary quotients of C∗ -algebras of right LCM semigroups. J. Funct. Anal. 268,
3326–3356 (2015)
13. C. Starling, Inverse semigroups associated to subshifts. J. Algebra 463, 211–233 (2016)
14. M.H. Stone, Postulates for Boolean algebras and generalized Boolean algebras. Am. J. Math.
57, 703–732 (1935)
15. M. Takesaki, Theory of Operator Algebras. I (Springer, Heidelberg, 1979)
16. F. Wehrung, in Refinement Monoids, Equidecomposability Types, and Boolean Inverse Semi-
groups. Lecture Notes in Mathematics, vol. 2188 (Springer, Berlin, 2017)
Calkin Images of Fourier Convolution
Operators with Slowly Oscillating
Symbols
3
Abstract Let be a C ∗ -subalgebra of L∞ (R) and SOX(R) be the Banach algebra
of slowly oscillating Fourier multipliers on a Banach function space X(R). We show
that the intersection of the Calkin image of the algebra generated by the operators of
multiplication aI by functions a ∈ and the Calkin image of the algebra generated
3
by the Fourier convolution operators W 0 (b) with symbols in SOX(R) coincides with
the Calkin image of the algebra generated by the operators of multiplication by
constants.
This work was partially supported by the Fundação para a Ciência e a Tecnologia (Portuguese
Foundation for Science and Technology) through the project UID/MAT/00297/2019 (Centro de
Matemática e Aplicações). The third author was also supported by the SEP-CONACYT Project
A1-S-8793 (México).
1 Introduction
W 0 (a) := F −1 aF (1.1)
[aI ]π := aI + K(X(R))
For a unital Banach subalgebra " of the algebra MX(R) , we also consider the
quotient algebra COπ (") consisting of the cosets
It is easy to see that MOπ () and COπ (") are commutative unital Banach
subalgebras of the Calkin algebra B π (X(R)). It is natural to refer to the algebras
MOπ () and COπ (") as the Calkin images of the algebras
respectively. The algebras MO() and CO(") are building blocks of the algebra
of convolution type operators
A(, "; X(R)) = algB(X(R)) aI, W 0 (b) : a ∈ , b ∈ " ,
the smallest closed subalgebra of B(X(R)) that contains the algebras MO() and
CO(").
3
Let SO 3 be the C ∗ -algebra of slowly oscillating functions and SOX(R) be
the Banach algebra of all slowly oscillating Fourier multipliers on the space
X(R), which are defined below in Sects. 2.5–2.7. The third author proved in [22,
Lemma 4.3] in the case of Lebesgue spaces Lp (R, w), 1 < p < ∞, with
Muckenhoupt weights w ∈ Ap (R) that
where
This result allowed him to describe the maximal ideal space of the commutative
Banach algebra
Aπ (SO 3 , SOL3 p (R,w) ; Lp (R, w)) = A(SO 3 , SOL3 p (R,w) ; Lp (R, w))/K(Lp (R, w))
(see [22, Theorem 3.1]). In turn, this description plays a crucial role in the study
of the Fredholmness of operators in more general algebras of convolution type
operators with piecewise slowly oscillating data on weighted Lebesgue space
Lp (R, w) (see [22, 24, 25]).
196 C. A. Fernandes et al.
where the supremum is taken over all intervals I ⊂ R of finite length containing
x. The Hardy-Littlewood maximal operator M defined by the rule f → Mf is a
sublinear operator.
The aim of this paper is to extend (1.2) to the case of separable Banach function
spaces such that the Hardy-Littlewood maximal operator M is bounded on X(R)
and on its associate space X (R) and to the case of arbitrary algebras of functions
⊂ L∞ (R) in place of SO 3 .
The following statement extends [22, Lemma 4.3].
Theorem 1.1 (Main Result) Let X(R) be a separable Banach function space such
that the Hardy-Littlewood maximal operator M is bounded on the space X(R) and
on its associate space X (R). If is a unital C ∗ -subalgebra of L∞ (R), then
3
MOπ () ∩ COπ (SOX(R) ) = MOπ (C), (1.4)
Aπ (SO 3 , SOX(R)
3
; X(R)) = A(SO 3 , SOX(R)
3
; X(R))/K(X(R))
is commutative. It seems, however, that the proofs of both hypotheses will require
tools, which are not available in the setting of general Banach function spaces.
We plan to return to these questions in a forthcoming work, restricting ourselves
to particular Banach function spaces, like rearrangement-invariant spaces with
Muckenhoupt weights or variable Lebesgue spaces, where interpolation theorems
are available.
The paper is organized as follows. In Sect. 2, we collect necessary facts on
Banach function spaces and Fourier multipliers on them. Further, we recall the
definition of the C ∗ -algebra SO 3 of slowly oscillating functions and introduce
3
the Banach algebra of slowly oscillating Fourier multipliers SOX(R) on a Banach
function space X(R). In Sect. 3, we discuss the structure of the maximal ideal
3
spaces M(SO 3 ) and M(SOX(R) ) of the C ∗ -algebra SO 3 of slowly oscillating
3
functions and the Banach algebra SOX(R) of slowly oscillating Fourier multipliers
on a Banach function space X(R). In particular, we show that the fibers Mt (SO 3 )
Calkin Images of Fourier Convolution Operators 197
of M(SO 3 ) over the points t ∈ Ṙ := R ∪ {∞} can be identified with the fibers
Mt (SOt ), where SOt is the C ∗ -algebra of all bounded continuous functions on
Ṙ \ {t} that slowly oscillate at the point t. An analogous result is also obtained
for the fibers of the maximal ideal spaces of algebras of slowly oscillating Fourier
multipliers on a Banach function space X(R). In Sect. 4, we show that the maximal
ideal spaces of the algebras MOπ () and COπ (") are homeomorphic to the
maximal ideal spaces of the algebras and ", respectively, where is a unital C ∗ -
subalgebra of L∞ (R) and " is a unital Banach subalgebra of MX(R). In Sect. 5,
we recall the definition of a limit operator (see [26] for a general theory of limit
operators), as well as, a known fact about limit operators of compact operators acting
on Banach function spaces. Further, we calculate the limit operators of the Fourier
3
convolution operator W 0 (b) with a slowly oscillating symbol b ∈ SOX(R) . Finally,
gathering the above mentioned results on limit operators, we prove Theorem 1.1.
2 Preliminaries
Under the natural linear space operations and under this norm, the set X(R) becomes
a Banach space (see [4, Chap. 1, Theorems 1.4 and 1.6]). If ρ is a Banach function
norm, its associate norm ρ is defined on M+ (R) by
ρ (g) := sup f (x)g(x) dx : f ∈ M+ (R), ρ(f ) ≤ 1 , g ∈ M+ (R).
R
It is a Banach function norm itself [4, Chap. 1, Theorem 2.2]. The Banach function
space X (R) determined by the Banach function norm ρ is called the associate
space (Köthe dual) of X(R). The associate space X (R) is naturally identified with
a subspace of the (Banach) dual space [X(R)]∗ .
As usual, let C0∞ (R) denote the set of all infinitely differentiable functions with
compact support.
Lemma 2.1 ([8, Lemma 2.1] and [23, Lemma 2.12(a)]) If X(R) is a separable
Banach function space, then the sets C0∞ (R) and L2 (R) ∩ X(R) are dense in the
space X(R).
Let S(R) be the Schwartz space of rapidly decreasing smooth functions and let
S0 (R) denote the set of functions f ∈ S(R) such that their Fourier transforms F f
have compact support.
Theorem 2.2 ([10, Theorem 4]) Let X(R) be a separable Banach function space
such that the Hardy-Littlewood maximal operator M is bounded on X(R). Then the
set S0 (R) is dense in the space X(R).
Inequality (2.1) was established earlier in [18, Theorem 1] with some constant
on the right-hand side that depends on the space X(R).
Since (2.1) is available, an easy adaptation of the proof of [13, Proposition 2.5.13]
leads to the following (we refer to the proof of [18, Corollary 1] for details).
Corollary 2.4 Let X(R) be a separable Banach function space such that the
Hardy-Littlewood maximal operator M is bounded on X(R) and on its associate
space X (R). Then the set of Fourier multipliers MX(R) is a Banach algebra under
pointwise operations and the norm · MX(R) .
Let V (R) be the Banach algebra of all functions a : R → C with finite total
variation
n
V (a) := sup |a(ti ) − a(ti−1 )|,
i=1
Theorem 2.5 Let X(R) be a separable Banach function space such that the Hardy-
Littlewood maximal operator M is bounded on X(R) and on its associate space
X (R). If a ∈ V (R), then the convolution operator W 0 (a) is bounded on the space
X(R) and
We refer to [7, Theorem 2.11] for the proof of (2.2) in the case of Lebesgue
spaces Lp (R) with cLp = SB(Lp (R)) and to [12, Chap. 13, Theorem 1.3]
for the calculation of the norm of S given in the second equality in (2.3). For
Lebesgue spaces with Muckenhoupt weights Lp (R, w), the proof of Theorem 2.5
with cLp (w) = SB(Lp (R,w)) is contained in [5, Theorem 17.1]. Further, for variable
Lebesgue spaces Lp(·) (R), Theorem 2.5 with cLp(·) = SB(Lp(·) (R)) was obtained
in [20, Theorem 2].
Following [3, Section 4], [24, Section 2.1], and [25, Section 2.1], we say that a
function f ∈ L∞ (R) is slowly oscillating at a point λ ∈ Ṙ if for every r ∈ (0, 1) or,
equivalently, for some r ∈ (0, 1), one has
lim osc f, λ + ([−x, −rx] ∪ [rx, x]) = 0 if λ ∈ R,
x→0+
lim osc f, [−x, −rx] ∪ [rx, x] = 0 if λ = ∞.
x→+∞
For every λ ∈ Ṙ, let SOλ denote the C ∗ -subalgebra of L∞ (R) defined by
SOλ := f ∈ Cb (Ṙ \ {λ}) : f slowly oscillates at λ ,
For a point λ ∈ Ṙ, let C 3 (R \ {λ}) be the set of all three times continuously
differentiable functions a : R \ {λ} → C. Following [24, Section 2.4] and [25,
Section 2.3], consider the commutative Banach algebras
SOλ3 := a ∈ SOλ ∩ C 3 (R \ {λ}) : lim (Dλk a)(x) = 0, k = 1, 2, 3
x→λ
1 ; ;
3
; k ;
aSO 3 := ;Dλ a ; ∞ ,
λ k! L (R)
k=0
where (Dλ a)(x) = (x − λ)a (x) for λ ∈ R and (Dλ a)(x) = xa (x) for λ = ∞.
Lemma 2.6 For every λ ∈ Ṙ, the set SOλ3 is dense in the C ∗ -algebra SOλ .
Proof In view of [2, Lemma 2.3], the set
∞
SO∞ := f ∈ SO∞ ∩ Cb∞ (R) : lim (D∞
k
f )(x) = 0, k ∈ N (2.4)
x→∞
is dense in the Banach algebra SO∞ . Here Cb∞ (R) denotes the set of all infinitely
differentiable functions f : R → C, which are bounded with all their derivatives.
Note that SO∞ ∞ can be equivalently defined by replacing C ∞ (R) in (2.4) by C ∞ (R),
b
because f ∈ SO∞ is bounded and its derivatives f (k) are bounded for all k ∈ N in
view of limx→∞ (D∞ k f )(x) = 0. Since SO ∞ ⊂ SO 3 , this completes the proof in
∞ ∞
the case λ = ∞.
If λ ∈ R, then by Karlovich and Loreto Hernández [25, Corollary 2.2], the
mapping T a = a ◦ βλ , where βλ : Ṙ → Ṙ is defined by
λx − 1
βλ (x) = , (2.5)
x+λ
is an isometric isomorphism of the algebra SOλ onto the algebra SO∞ . Hence each
function a ∈ SOλ can be approximated in the norm of SOλ by functions cn =
bn ◦ βλ−1 , where bn ∈ SO∞∞ for n ∈ N and
λy + 1
βλ−1 (y) = = x, x, y ∈ Ṙ. (2.6)
λ−y
202 C. A. Fernandes et al.
It remains to show that cn ∈ SOλ3 . Taking into account (2.5)–(2.6), we obtain for
y = βλ (x) ∈ R \ {λ} and x = βλ−1 (y) ∈ R:
λ2 + 1
(Dλ cn )(y) =bn βλ−1 (y) = −bn (x)(x + λ), (2.7)
y−λ
(λ2 + 1)2 λ2 + 1
(Dλ2 cn )(y) =bn βλ−1 (y) − bn βλ−1 (y)
y−λ y−λ
= − bn (x)(x + λ)(λ2 + 1) + bn (x)(x + λ), (2.8)
Since
k
lim (D∞ bn )(x) = 0 for k ∈ {1, 2, 3},
x→∞
we see that
The following result leads us to the definition of slowly oscillating Fourier multipli-
ers.
Theorem 2.7 ([19, Theorem 2.5]) Let X(R) be a separable Banach function space
such that the Hardy-Littlewood maximal operator M is bounded on X(R) and on
Calkin Images of Fourier Convolution Operators 203
its associate space X (R). If λ ∈ Ṙ and a ∈ SOλ3 , then the convolution operator
W 0 (a) is bounded on the space X(R) and
3
Proof The continuous embedding SOX(R) ⊂ SOL3 2 (R) (with the embedding
constant one) follows immediately from Theorem 2.3 and the definitions of the
3
Banach algebras SOX(R) and SOL3 2 (R) . It is clear that SOL3 2 (R) ⊂ SO 3 . The
embedding SO 3 ⊂ SOL3 2 (R) follows from Lemma 2.6.
3 Maximal Ideal Spaces of the Algebras SO and SOX(R)
For a C ∗ -algebra (or, more generally, a Banach algebra) A with unit e and an element
a ∈ A, let spA (a) denote the spectrum of a in A. Recall that an element a of a C ∗ -
algebra A is said to be positive if it is self-adjoint and spA (a) ⊂ [0, ∞). A linear
functional φ on A is said to be a state if φ(a) ≥ 0 for all positive elements a ∈ A
and φ(e) = 1. The set of all states of A is denoted by S(A). The extreme points of
S(A) are called pure states of A (see, e.g., [15, Section 4.3]).
Following [1, p. 304], for a state φ, let
and let Gφ+ (A) denote the set of all positive elements of Gφ (A). Let A and B be
C ∗ -algebras such that e ∈ B ⊂ A. Let φ be a state of B. Following [1, p. 310], we
say that A is B-compressible modulo φ if for each x ∈ A and each ε > 0 there is
b ∈ Gφ+ (B) and y ∈ B such that bxb − yA < ε.
204 C. A. Fernandes et al.
For t ∈ Ṙ and ω > 0, let ψt,ω be a real-valued function in C(Ṙ) such that 0 ≤
ψt,ω (x) ≤ 1 for all x ∈ R. Assume that for t ∈ R,
and for t = ∞,
Let M(A) denote the maximal ideal space of a commutative Banach algebra A.
Lemma 3.2 For t ∈ Ṙ and ω > 0, the function ψt,ω is a positive element of the
C ∗ -algebras C(Ṙ), SOt , and SO 3 .
Proof Since M(C(Ṙ)) = Ṙ, it follows from the Gelfand theorem (see, e.g., [28,
Theorem 2.1.3]) that spC(Ṙ) (ψt,ω ) = [0, 1] for all t ∈ Ṙ and all ω > 0. Since
C(Ṙ) ⊂ SOt ⊂ SO 3 , we conclude that the functions ψt,ω for t ∈ Ṙ and ω > 0 are
positive elements of the C ∗ -algebras C(Ṙ), SOt , and SO 3 because their spectra in
each of these algebras coincide with [0, 1] in view of [15, Proposition 4.1.5].
Mλ (A) := {ξ ∈ M(A) : ξ |B = λ}
is called the fiber of M(A) over λ ∈ M(B). Hence for every Banach algebra ⊂
L∞ (R) with M(C(Ṙ) ∩ ) = Ṙ and every t ∈ Ṙ, the fiber Mt () is the set of
all multiplicative linear functionals (characters) on that annihilate the set {f ∈
C(Ṙ) ∩ : f (t) = 0}. As usual, for all a ∈ and all ξ ∈ M(), we put a(ξ ) :=
ξ(a). We will frequently identify the points t ∈ Ṙ with the evaluation functionals δt
Calkin Images of Fourier Convolution Operators 205
defined by
Lemma 3.3 For every point t ∈ Ṙ, the fibers Mt (SOt ) and Mt (SO 3 ) can be
identified as sets:
Since
<
M() = Mt () for ∈ {SO 3 , SOλ : λ ∈ Ṙ},
t ∈Ṙ
where
1
0<ω< min |λ − t|
2 λ∈F \{t }
and b := ψ∞,ω . Then the function y defined by (3.5) is equal to zero on (−ω, ω)
and y ∈ SO∞ .
For t ∈ Ṙ, we have
; ; ; ;
; ; ; ;
; ; ; ;
bxb − yL∞ (R) = ;b x − xλ b ; ≤ ;x − xλ ; < ε.
; ; ; ;
λ∈F L∞ (R) λ∈F L∞ (R)
Since b is a positive element of SOt in view of Lemma 3.2, we have b ∈ Gη+ (SOt ),
which completes the proof of the fact that SO 3 is SOt -compressible modulo the
multiplicative linear functional η ∈ Mt (SOt ).
In view of Lemma 3.1, there exists a unique extension η of the multiplicative
linear functional η to the whole algebra SO 3 . By the definition of the fiber
Mt (SO 3 ), we have η ∈ Mt (SO 3 ). Thus, we can identify Mt (SOt ) with a subset of
Mt (SO 3 ):
On the other hand, functionals ψ, ψ2 ∈ M(B) are continuous on B (see, e.g., [16,
Lemma 2.1.5]). Since A is dense in B, for every b ∈ B there exists a sequence
{an }n∈N ⊂ A such that an − bB → 0 as n → ∞. It follows from this observation
and (3.7) that for every b ∈ B,
Proof It follows from Theorem 2.3 that SOt3 ⊂ SOt,X(R) ⊂ SOt , where the
imbeddings are homomorphic. By the definition of the algebra SOt,X(R), the algebra
SOt3 is dense in SOt,X(R) with respect to the norm of MX(R). Taking into account
these observations and Lemma 3.6, we see that the commutative Banach algebras
satisfy all the conditions of Theorem 3.5. By this theorem, every multiplicative
linear functional on SOt,X(R) admits a unique extension to a multiplicative linear
208 C. A. Fernandes et al.
3
Proof Since SOt,X(R) ⊂ SOX(R) for every t ∈ Ṙ, we conclude by the restriction of
a multiplicative linear functional defined on the bigger algebra to the smaller algebra
3
that M(SOX(R) ) ⊂ M(SOt,X(R)). Hence
3
Mt (SOX(R) ) ⊂ Mt (SOt,X(R)). (3.12)
On the other hand, in view of Lemma 3.7, any multiplicative linear functional ξ ∈
Mt (SOt,X(R)) admits a unique extension ξ ∈ M(SOt ). Moreover, ξ belongs to
Mt (SOt ) as well. By Lemma 3.3, the functional ξ ∈ Mt (SOt ) admits a unique
3
extension ξ ∈ Mt (SO 3 ). It is clear that the restriction of ξ to SOX(R) belongs to
3 3
Mt (SOX(R) ). Thus Mt (SOt,X(R)) can be identified with a subset of Mt (SOX(R) ):
3
Mt (SOt,X(R)) ⊂ Mt (SOX(R) ). (3.13)
3.6 Maximal Ideal Space of the Banach Algebra SOX(R)
Now we are in a position to prove that the maximal ideal spaces of the commutative
3
Banach algebra SOX(R) and the C ∗ -algebra SO 3 can be identified as sets.
Theorem 3.9 Let X(R) be a separable Banach function space such that the Hardy-
Littlewood maximal operator M is bounded on the space X(R) and on its associate
Calkin Images of Fourier Convolution Operators 209
3
space X (R). Then the maximal ideal space of the Banach algebra SOX(R) can be
∗ 3
identified with the maximal ideal space of the C -algebra SO :
3
M(SOX(R) ) = M(SO 3 ).
Proof It follows from Lemmas 3.8, 3.7, and 3.3 that for every t ∈ Ṙ,
3
Mt (SOX(R) ) = Mt (SOt,X(R)) = Mt (SOt ) = Mt (SO 3 ).
Hence
< <
3 3
M(SOX(R) )= Mt (SOX(R) )= Mt (SO 3 ) = M(SO 3 ),
t ∈Ṙ t ∈Ṙ
We start with the following known result [14, Theorem 2.4] (see also [9, Theo-
rem 3.1]).
Theorem 4.1 Let X(R) be a separable Banach function space and a ∈ L∞ (R).
Then the multiplication operator aI is compact on the space X(R) if and only if
a = 0 almost everywhere on R.
The next theorem says that one can identify the maximal ideal spaces of the
algebras MOπ () and for an arbitrary unital C ∗ -subalgebra of L∞ (R).
Theorem 4.2 Let X(R) be a separable Banach function space. If is a unital C ∗ -
subalagebra of L∞ (R), then the maximal ideal spaces of the commutative Banach
algebra MOπ () and the commutative C ∗ -algebra are homeomorphic:
Proof Consider the mapping F : → MOπ () defined by F (a) = [aI ]π for
every a ∈ . It is clear that this mapping is surjective. If [aI ]π = [bI ]π for some
a, b ∈ , then (a − b)I ∈ K(X(R)). It follows from Theorem 4.1 that a = b a.e.
on R. This implies that the mapping F is injective. Thus, F : → MOπ ()
is an algebraic isomorphism of commutative Banach algebras. It follows from
[16, Lemma 2.2.12] that the maximal ideal spaces M(MOπ ()) and M() are
homeomorphic.
210 C. A. Fernandes et al.
The following analogue of Theorem 4.1 for Fourier convolution operators was
obtained recently by the authors [8, Theorem 1.1].
Theorem 4.3 Let X(R) be a separable Banach function space such that the Hardy-
Littlewood maximal operator M is bounded on X(R) and on its associate space
X (R). Suppose that b ∈ MX(R). Then the Fourier convolution operator W 0 (a) is
compact on the space X(R) if and only if b = 0 almost everywhere on R.
The next theorem is an analogue of Theorem 4.2 for Fourier multipliers.
Theorem 4.4 Let X(R) be a separable Banach function space such that the Hardy-
Littlewood maximal operator M is bounded on X(R) and on its associate space
X (R). If " is a unital Banach subalagebra of MX(R), then the maximal ideal
spaces of the commutative Banach algebras COπ (") and " are homeomorphic:
Proof The proof is analogous to the proof of Theorem 4.2. Consider the mapping
F : " → COπ (") defined by F (a) = [W 0 (a)]π for every a ∈ ". It is obvious
that this mapping is surjective. If [W 0 (a)]π = [W 0 (b)]π for some a, b ∈ ", then
W 0 (a − b) = W 0 (a) − W 0 (b) ∈ K(X(R)). By Theorem 4.3, we conclude that
a = b a.e. on R. Therefore, the mapping F is injective. Thus, F : " → COπ (") is
an algebraic isomorphism of commutative Banach algebras. In this case it follows
from [16, Lemma 2.2.12] that the maximal ideal spaces M(COπ (")) and M(") are
homeomorphic.
Let X(R) be a Banach function space. For a sequence of operators {An }n∈N ⊂
B(X(R)), let
s-lim An
n→∞
denote the strong limit of this sequence, if it exists. For λ, x ∈ R, consider the
function
eλ (x) := eiλx .
Calkin Images of Fourier Convolution Operators 211
Let T ∈ B(X(R)) and let h = {hn }n∈N be a sequence of numbers hn > 0 such
that hn → +∞ as n → ∞. The strong limit
is called the limit operator of T related to the sequence h = {hn }n∈N , if it exists.
In our previous paper [9] we calculated the limit operators for all compact
operators.
Lemma 5.1 ([9, Lemma 3.2]) Let X(R) be a separable Banach function space and
K be a compact operator on X(R). Then for every sequence {hn }n∈N of positive
numbers satisfying hn → +∞ as n → ∞, one has
Now we will calculate the limit operators for the Fourier convolution operator with
a slowly oscillating symbol.
Theorem 5.2 Let X(R) be a separable Banach function space such that the Hardy-
Littlewood maximal operator M is bounded on the space X(R) and on its associate
3
space X (R). If b ∈ SOX(R) , then for every ξ ∈ M∞ (SO 3 ) there exists a sequence
{hn }n∈N of positive numbers such that hn → +∞ as n → ∞ and
where Fm ⊂ Ṙ are finite sets and bm,λ ∈ SOλ3 for λ ∈ Fm and all m ∈ N, such that
B∞ := {bm,∞ ∈ SO∞
3
: m ∈ N}
is not empty. Since the set B∞ is at most countable, it follows from [2, Corollary 3.3]
or [25, Proposition 3.1] that there exists a sequence {hn }n∈N such that hn → +∞
as n → ∞ and
If the set B∞ is empty, we can take an arbitrary sequence {hn }n∈N such that hn →
+∞ as n → ∞.
Let f ∈ S0 (R). Then, by a smooth version of Urysohn’s lemma (see, e.g., [11,
Proposition 6.5]), there is a function ψ ∈ C0∞ (R) such that 0 ≤ ψ ≤ 1, supp F f ⊂
supp ψ and ψ|supp F f = 1. Therefore, for all n ∈ N,
ehn W 0 (b)eh−1
n
f − b(ξ )f = W 0 [b(· + hn )]f − ξ(b)f
= F −1 [b(· + hn ) − ξ(b)]ψF f
and
; ; ;4 5
; ehn W 0 (b)e−1 − b(ξ ) f ; ≤ ; b(· + hn ) − ξ(b) ψMX(R) f X(R) . (5.6)
hn X(R)
; 5
+ ;ξ(bm ) − ξ(b) ψMX(R)
≤2b − bm MX(R) ψMX(R)
;4 5
+ ; bm (· + hn ) − ξ(bm ) ψMX(R) . (5.7)
Fix ε > 0. By Theorem 2.5, ψMX(R) < ∞. It follows from (5.2) that there
exists a sufficiently large number m ∈ N (which we fix until the end of the proof)
such that
Let
⎧
⎨ max |λ| if Fm \ {∞} = ∅,
$ := λ∈Fm \{∞}
⎩0 if Fm \ {∞} = ∅,
For x ∈ K and n ∈ N, let In (x) be the segment with the endpoints hn and x + hn .
Then In (x) ⊂ [hn − k, hn + k]. Since hn → +∞ as n → ∞, there exists N1 ∈ N
such that for all n > N1 , one has
1 ; 5 ;
3
; j 4 ;
= cX ;D∞ bm (· + hn ) − bm (hn ) ψ ; ∞ . (5.11)
j! L (R)
j =0
j 4 5
D∞ bm (· + hn ) − bm (hn ) ψ
j
j 4 5 j −ν
= ν
D∞ bm (· + hn ) − bm (hn ) D∞ ψ . (5.12)
ν
ν=0
and
3
4 5
D∞ bm (· + hn ) − bm (hn ) (x)
x3 3 3x 2 hn
= (D∞ bm )(x + hn ) + (D 2 bm )(x + hn )
(x + hn ) 3 (x + hn )3 ∞
xh2n − x 2 hn
+ (D∞ bm )(x + hn ). (5.16)
(x + hn )3
k2 ; ;
; 2 ; khn
≤ ;D∞ b m ; ∞ + D∞ bm L∞ (R) , (5.18)
(hn − k) 2 L (R) (hn − k)2
and
; 5 ;
; 3 4 ;
; D∞ bm (· + hn ) − bm (hn ) χK ;
L∞ (R)
k3 ; ; 3k 2 hn ; ;
; 3 ; ; 2 ;
≤ ;D∞ b m ; + ;D∞ b m ; ∞
(hn − k) 3 ∞
L (R) (hn − k) 3 L (R)
kh2n + k 2 hn
+ D∞ bm L∞ (R) . (5.19)
(hn − k)3
Since
; ;
; j ;
max ;D∞ ψ ; < ∞,
j ∈{0,1,2,3} L∞ (R)
it follows from (5.12)–(5.13) and (5.17)–(5.19) that for all j ∈ {0, 1, 2, 3},
; 5 ;
; j 4 ;
lim ;D∞ bm (· + hn ) − bm (hn ) ψ ; = 0. (5.20)
n→∞ L∞ (R)
We deduce from (5.11) and (5.20) that there exists N3 ∈ N such that N3 ≥ N2 and
for all n > N3 ,
;4 5 ;
; bm (· + hn ) − bm (hn ) ψ ; < ε/4. (5.21)
MX(R)
216 C. A. Fernandes et al.
Combining (5.7)–(5.10) and (5.21), we see that for every f ∈ S0 (R) and every
ε > 0 there exists N3 ∈ N such that for all n > N3 ,
; ;
; ;
; ehn W 0 (b)eh−1
n
− b(ξ ) f; < εf X(R) ,
X(R)
Since S0 (R) is dense in X(R) (see Theorem 2.2), this equality immediately implies
(5.1) in view of [28, Lemma 1.4.1(ii)], which completes the proof.
Since the function e0 ≡ 1 belongs to and "SOX(R) 3 , we see that the set of all
3
constant functions is contained in and in SOX(R) . Therefore
3
MOπ (C) ⊂ MOπ () ∩ COπ (SOX(R) ). (5.22)
3
Let Aπ ∈ MOπ () ∩ COπ (SOX(R) ). Then Aπ = [aI ]π = [W 0 (b)]π , where
3
a ∈ and b ∈ SOX(R) . Therefore, there is an operator K ∈ K(X(R)) such that
aI = W 0 (b) + K. (5.23)
By Theorem 5.2, for every ξ ∈ M∞ (SO 3 ) there exists a sequence {hn }n∈N of
positive numbers such that hn → +∞ as n → ∞ and
Acknowledgments We would like to thank the anonymous referee for pointing out a gap in the
original version of the paper. To fill in this gap, we strengthened the hypotheses in the main result.
References
24. Y.I. Karlovich, I. Loreto Hernández, Algebras of convolution type operators with piecewise
slowly oscillating data. I: Local and structural study. Integr. Equ. Oper. Theory 74, 377–415
(2012)
25. Y.I. Karlovich, I. Loreto Hernández, On convolution type operators with piecewise slowly
oscillating data. Oper. Theory Adv. Appl. 228, 185–207 (2013)
26. V. Rabinovich, S. Roch, B. Silbermann, Limit Operators and Their Applications in Operator
Theory (Birkhäuser, Basel, 2004)
27. M. Reed, B. Simon, Methods of Modern Mathematical Physics. I: Functional Analysis
(Academic Press, New York, 1980)
28. S. Roch, P.A. Santos, B. Silbermann, Non-Commutative Gelfand Theories. A Tool-kit for
Operator Theorists and Numerical Analysts (Springer, Berlin, 2011)
29. I.B. Simonenko, Local Method in the Theory of Shift Invariant Operators and Their Envelopes
(Rostov University Press, Rostov on Don, 2007, in Russian)
30. I.B. Simonenko, C.N. Min, Local Method in the Theory of One-Dimensional Singular Integral
Equations with Piecewise Continuous Coefficients. Noetherity (Rostov University Press,
Rostov on Don, 1986, in Russian)
Inner Outer Factorization of Wide
Rational Matrix Valued Functions on the
Half Plane
Abstract The main purpose of this note is to use operator methods to solve
a rational inner-outer factorization problem for wide functions. It is believed
that this will provide valuable insight into the inner-outer factorization problem.
Our approach involves Wiener–Hopf operators, Hankel operators and invariant
subspaces for the backward shift. It should be emphasized that the formulas for
the inner and outer factor are derived in a computational manner.
1 Introduction
In this note we will use operator techniques to develop a method to compute the
inner-outer factorization for certain matrix valued rational functions defined on the
closure of the right half plane. We shall focus on the “wide” case, i.e., the case where
the matrix function has more columns than rows (or an equal number, thereby also
including the square case). The “tall” case, where the matrix function has more
A. E. Frazho
Department of Aeronautics and Astronautics, Purdue University, West Lafayette, IN, USA
e-mail: [email protected]
A. C. M. Ran ()
Department of Mathematics, Faculty of Science, Vrije Universiteit Amsterdam, Amsterdam,
The Netherlands
Research Focus: Pure and Applied Analytics, North-West University, Potchefstroom,
South Africa
e-mail: [email protected]
rows than columns (or an equal number, again including the square case) is well
understood, and presented in, e.g., [3, 6, 18] and elsewhere.
It should be emphasized that we obtain explicit formulas for the inner and outer
factors in terms of the matrices appearing in a state space realization of the original
rational matrix valued function. Finally, we shall always assume that the rational
matrix function is proper, i.e., it has a finite value at infinity.
To set the stage, let E, U and Y be finite dimensional, complex vector spaces and
dim Y ≤ dim U. The Hilbert space of all Lebesgue measurable square integrable
functions over [0, ∞) with values in E is denoted by L2+ (E). Throughout H ∞ (U, E)
is the set of all analytic functions G in the open right hand plane {s : #(s) > 0}
such that
an invertible outer function. To avoid this we simply assume that G(s) is full rank.
Moreover, from an operator perspective if G(s) is not full rank, then one has to
invert an unbounded Wiener–Hopf operator which is difficult. Furthermore, if G
does not admit a full rank inner-outer factorization, then a small H ∞ perturbation
of G does admit such a factorization. First we will present necessary and sufficient
conditions to determine when G admits a full rank inner-outer factorization. Then
we will give a state space algorithm to compute Gi and then Go .
Inner-outer factorization for the “wide” case has received attention before. In
several papers procedures to find the inner and outer factors have been derived.
For instance, in [4] the factors were derived by considering G∗ G + ε2 I , and
deducing the outer factor from this by solving a reduced Riccati equation. In [21]
the reduced Riccati equation is replaced by a different approach, avoiding in fact an
approximation procedure. In that paper, first the inner-outer factorization for G(s)∗
is computed, and then balanced coordinates are used to further factor the inner part
of G(s)∗ . Then they compute another inner-outer factorization of the remaining part.
In [16], see also [13] a different approach is chosen, [16] uses state space methods
along with some interesting state space decompositions to derive algorithms for
the inner and outer factorization in a general rational setting. Their methods are
algebraic in nature and quite different from our operator approach. Yet another
approach is taken in [17]. In all of these papers state space formulas are given and
their approach is based on finite dimensional techniques with properties of Riccati
equations. A common method is to compute the outer factor first and then solve for
the inner factor using the outer factor. Our approach is different: we derive a formula
for the inner factor from the fact that the subspace ker TG∗ is shift invariant, and thus,
by the Beurling–Lax–Halmos theorem (see[14]) there is an inner function Gi such
that ker TG∗ i = ker TG∗ . It is this inner function which we construct first, providing
a state space formula for Gi , and then deduce the outer factor from Go = G∗i G on
the imaginary axis.
Finally, it is emphasized that the previous methods to compute the inner-outer
factorization rely mainly on state space methods. Roughly speaking, the previous
methods involve solving Riccati equations, Hamilton methods, decomposing the
state space with balanced coordinates or other decompositions, or eigenvectors and
eigenvalues for certain state space operators; see for instance [4, 16, 17, 21]. Our
approach is quite different. We solve the inner-outer factorization problem by using
operator techniques for the general formulas and then use state space methods to
write down an explicit formula in the rational case. The operator methods give us
valuable insight into the underlying framework. Once the foundation is constructed,
then one simply uses state space techniques to formulate a realization of the inner
and outer part.
222 A. E. Frazho and A. C. M. Ran
2 Preliminaries
In this paper we shall make heavy use of the results and methods from [10]. Let
be a proper rational matrix function with values in L(U, Y) and no poles on the
imaginary axis iR. Let ϕ be the Lebesgue integrable (continuous) matrix function
on the imaginary axis determined by via the Fourier transform, that is,
∞
(iω) = (∞) + e−iωt ϕ(t) dt.
−∞
The Wiener–Hopf operator T associated with and the Hankel operator H , both
mapping L2+ (U) into L2+ (Y), are defined as follows for t ≥ 0 and f in L2+ (U) :
t
(T f )(t) = (∞)f (t) + ϕ(t − τ )f (τ )dτ (2.1)
0
∞
(H f )(t) = ϕ(t + τ )f (τ )dτ. (2.2)
0
In the sequel we shall freely use the basic theory of Wiener–Hopf and Hankel
operators which can be found in Chapters XII and XIII of [11]. Note that in [11]
the Fourier transform is taken with respect to the real line instead of the imaginary
axis as is done here.
Now let G be the stable rational function in H ∞ (U, Y) given by the following
state space realization:
Hence the corresponding Wiener–Hopf operator TG and the Hankel operator HG can
be expressed in terms of the matrices appearing in the realization of G as follows
t
(TG f )(t) = Df (t) + CeA(t −τ )Bf (τ )dτ (for t ≥ 0) (2.4)
0
∞
(HG f )(t) = CeA(t +τ )Bf (τ )dτ (for t ≥ 0). (2.5)
0
Inner Outer Factorization on the Half Plane 223
Throughout this note we assume that G is full row rank and DD ∗ is invertible.
A realization is called controllable if ∨∞ j =0 Im A B equals the state space X , and
j
systems theory that the dimension of the state space X is as small as possible
(over all possible realizations) if and only if the realization is both observable and
controllable. Such a realization is called minimal, and it will be assumed throughout
that the realization (2.3) for G is minimal.
With G we also associate the rational matrix function R given by R(s) =
G(s)G(−s)∗ . Notice that R is a proper rational matrix function with values in
L(Y, Y) (the set of bounded linear operators mapping Y to itself) and has no poles
on the imaginary axis. By TR we denote the corresponding Wiener–Hopf operator
acting on L2+ (Y). It is well-known (see, e.g., formula (24) in Section XII.2 of [11])
that
Let P be the controllability Grammian for the pair {A, B}. In other words,
∞ ∗
P = eAt BB ∗ eA t dt.
0
AP + P A∗ + BB ∗ = 0. (2.7)
Wo x = CeAt x (x ∈ X ).
= BD ∗ + P C ∗ .
Q = Wo∗ TR−1 Wo ;
TR−1 Wo x = Co eAo t x (x ∈ X ).
In other words, TR−1 Wo equals the observability operator Co eAo t mapping X into
L2+ (Y) determined by the pair {Co , Ao }. Since TR−1 Wo is one to one, the pair
{Co , Ao } is observable.
The Hankel operator HG can be written as HG = Wo Wc ; see (2.5). Using P =
Wc Wc∗ , we have HG HG∗ = Wo P Wo∗ . This, with Eq. (2.6) implies that
TG TG∗ = TR − Wo P Wo∗ .
Hence ker TG∗ = ker(TR − Wo P Wo∗ ). By consulting Lemma 4.1 in [9], restated as
Lemma 4.4 at the end of this paper for the readers convenience, we have
Later we will see that the McMillan degree of the inner part Gi of G equals
dim ker(I − QP ).
The next lemma characterizes the existence of a full rank inner-outer factoriza-
tion. The proof of this lemma is essentially the same as that of Lemma 3.1 in [9],
and is therefore omitted.
Lemma 2.1 Let G be a rational function in H ∞ (U, Y) where U and Y are finite
dimensional spaces satisfying dim Y ≤ dim U. Then G admits a full rank inner-
outer factorization if and only if
3 Inner Functions
Because Gi is a rational function in H ∞ (Y, Y), and the realization is minimal, the
state space operator Ai is stable. Let Qi be the observability Grammian for the pair
{Ci , Ai }. In other words, Qi is the unique solution to the Lyapunov equation
(The state space formula for an inner function is classical; for example, see [1, 3, 21]
and also Theorem 19.15 in [5].) In this case, Q−1 i is the controllability Grammian
for the pair {Ai , Bi }, that is, Q−1
i is the unique solution to the Lyapunov equation
Ai Q−1 −1 ∗ ∗
i + Qi Ai + Bi Bi = 0. (3.2)
Let Sτ∗ be the backward shift on L2+ (Y) for τ ≥ 0, that is, (Sτ∗ f )(t) = f (t + τ )
for f in L2+ (Y). It is noted that H() is an invariant subspace for the backward
shift Sτ∗ for all τ ≥ 0. According to the Beurling–Lax–Halmos Theorem if H is any
invariant subspace for the backward shift Sτ∗ for all τ ≥ 0, then there exists a unique
inner function in H ∞ (E, Y) such that H = H(). By unique we mean that if
H = H(") where " is an inner function in H ∞ (L, Y), then there exists a constant
unitary operator V from E onto L such that = "V ; see [2], Section 3.9 for the
half plane case we use here, see also [6, 12, 15, 18–20] for further details in the unit
disc case.
226 A. E. Frazho and A. C. M. Ran
Now we present the following classical result; see Theorem 7.1 in [8], Sections
4.2 and 4.3 in [6] and Section XXVIII.7 in [12] for the disc case, and [2], Lemma
3.45 and Lemma 3.46 for the half plane case.
Lemma 3.1 Let be an inner function in H ∞ (Y, Y) where Y is finite dimensional.
Then the Hankel operator H is a contraction. Moreover,
4 Main Result
Let us combine several results of [10] into one theorem, which is the right half plane
analogue of Theorem 3.2 in [9] in the open unit disc case.
Theorem 4.1 Let {A on X , B, C, D} be a minimal realization for a rational
function G in H ∞ (U, Y) where dim Y ≤ dim U. Let R be the function in L∞ (Y, Y)
defined by R(iω) = G(iω)G(iω)∗ . Let P the controllability Grammian for the pair
{A, B}; see (2.7). Then the following statements are equivalent.
(i) The function G admits a full rank inner-outer factorization;
(ii) The Wiener–Hopf operator TR is strictly positive.
(iii) There exists a (unique) stabilizing solution Q to the algebraic Riccati equation
(2.8).
In this case, the solution Q is given by Q = Wo∗ TR−1 Wo and Q is strictly positive.
Moreover, the following holds.
Inner Outer Factorization on the Half Plane 227
(iv) The eigenvalues of QP are real numbers contained in the interval [0, 1].
(v) If Gi is the inner factor of G, then the dimension of H(Gi ) is given by
dim H(Gi ) = dim ker TG∗ i = dim ker TG∗ = dim ker(I − QP ),
k = dim ker(I − QP ).
Ai = U ∗ QAo P U and Ci = Co P U.
The following lemma is a crucial part of the proof. We shall give two proofs, one
algebraic in nature, and a second operator theoretic proof using the fact that H(Gi )
is an invariant subspace for the backward shift.
Lemma 4.3 Assume that the hypotheses of Theorem 4.2 hold. Then there exists
a unique stable operator Ai on Ck such that Ao P U = P U Ai . Finally, Ai =
U ∗ QAo P U .
Algebraic Proof First we prove the existence of an operator Ai such that
QAo P U = QP U Ai . We claim that Im QAo P U ⊂ Im QP U = Im U. Since
Im U equals ker(I − QP ), we have U = QP U . Let us show that
(I − QP )QAo P U = 0. (4.3)
0 = U ∗ P (I − QP )QAo P U = −U ∗ P C1∗ C1 P U.
Inner Outer Factorization on the Half Plane 229
In other words, C1 P U = 0. Hence (4.3) holds, and thus, the range of QA0 P U
is contained in the kernel of I − QP . So there exists an operator Ai on Ck such
that QA0 P U = U Ai . Multiplying by U ∗ on the left yields, U ∗ QA0 P U = Ai . In
particular, Ai is uniquely determined by our choice of the isometry U .
Equation (4.3) also implies that
QAo P U = QP QAo P U = QP U Ai .
Ao P U = P U Ai .
TR−1 Wo P U x = Co eA0 t P U x (x ∈ Ck )
H(Gi ) = Im TR−1 Wo P U . (4.4)
Since H(Gi ) is an invariant subspace of the backward shift on L2+ (Y) (see [2,
Chapter 3]), we have for all t > 0 and t1 > 0 and all x ∈ Ck
This readily implies that Co eAo (t +t1 ) P U x = Co eAo t P U F (t1 )x for some linear
operator F (t1 ) on Ck . One easily checks that F satisfies the semigroup property
F (t1 + t2 ) = F (t1 )F (t2 ). Hence there exists an operator Ai on Ck such that
F (t) = eAi t . In other words,
Since the observability operator Co eAo t is one to one, we have eAo t P U = P U eAi t
(for all t). It is now clear that Ai must be stable. To see this simply note that P is
invertible, U is an isometry, and Ao is stable. Therefore eAi t converges to zero as
t tends to infinity, and thus, Ai is stable. By taking the derivative and then letting
t approach 0, we see that Ao P U = P U Ai as desired. Multiplying both sides by
U ∗ Q with U = QP U , we obtain Ai = U ∗ QAo P U .
Proof of Theorem 4.2 Note that using the definition of Ci = Co P U and Ai we
have
A∗i Qi + Qi Ai + Ci∗ Ci = 0,
and let Bi and Di be the complementary operators for the pair {Ci , Ai }. Finally,
{Ai , Bi , Ci , Di } is the state space realization for our inner function Gi (s); see (4.2).
By construction H(Gi ) = ker TG∗ i .
By a direct calculation or consulting formula (2.12) in [10], we also have
The formula for the outer factor Go (s) can now be derived in the same way as the
last section of the paper [9]. Indeed, for s ∈ iR, we have
This yields
Gi (s)∗ G(s) = Di∗ − Bi∗ (sI + A∗i )−1 Ci∗ D + C(sI − A)−1 B
Rewrite this as
(sI + A∗i )−1 Ci∗ C(sI − A)−1 = −U ∗ (sI − A)−1 + (sI + A∗i )−1 U ∗ .
Inner Outer Factorization on the Half Plane 231
where C1 = B ∗ Q + D ∗ Co ; see also formula (3.3) in [10]. Now use equation (3.8)
in [10], which states that
Ai = −U ∗ A∗ U + U ∗ C ∗ (DD ∗ )−1 DB ∗ U
∗
= −U ∗ A − BD ∗ (DD ∗ )−1 C U.
232 A. E. Frazho and A. C. M. Ran
The latter formula compares well with the formula for the inner factor in the tall
case as presented in [3],Theorem 17.26.
Finally, Lemma 4.1 in [9] is restated in the following Lemma.
Lemma 4.4 Let T be a strictly positive operator on a Hilbert space H and P a
strictly positive operator on a Hilbert space X . Let W be an operator mapping X
into H and set Q = W ∗ T −1 W . Then the following two assertions hold.
(i) Let X and H be the subspaces defined by
X = ker(I − QP ) and H = ker T − W P W ∗ .
$1 = W ∗ |H : H → X and $2 = T −1 W P |X : X → H
are both well defined and invertible. Moreover, $−1 1 = $2 . In particular, X and
H have the same dimension.
(ii) The operator T − W P W ∗ is positive if and only if P −1 − Q is positive, or
1 1
equivalently, P 2 QP 2 is a contraction. In this case, the spectrum of QP is
contained in [0, 1]. In particular, if X is finite dimensional, then the eigenvalues
for QP are contained in [0, 1].
References
6. A.E. Frazho, W. Bosri, An Operator Perspective on Signals and Systems. Operator Theory:
Advances and Applications, vol. 204 (Birkhäuser Verlag, Basel, 2010)
7. C. Foias, A. Frazho, The Commutant Lifting Approach to Interpolation Problems. Operator
Theory: Advances and Applications, vol. 44 (Birkhäuser Verlag, Basel, 1990)
8. C. Foias, A. Frazho, I. Gohberg, M.A. Kaashoek, Metric Constrained Interpolation, Commu-
tant Lifting and Systems. Operator Theory: Advances and Applications, vol. 100 (Birkhäuser
Verlag, Basel, 1998)
9. A.E. Frazho, A.C.M. Ran, A note on inner-outer factorization for wide matrix-valued functions,
in: Operator Theory, Analysis and the State Space Approach. Operator Theory: Advances and
Applications, vol. 271 (Birkhäuser Verlag, Basel, 2018), pp. 201–214
10. A.E. Frazho, M.A. Kaashoek, A.C.M. Ran, Rational matrix solutions of a Bezout type equation
on the half plane, in: Advances in Structured Operator Theory and Related Areas. Operator
Theory: Advances and Applications, vol. 237 (Birkhäuser Verlag, Basel, 2013), pp. 145–160
11. I. Gohberg, S. Goldberg, M.A. Kaashoek, Classes of Linear Operators, Volume I. Operator
Theory: Advances and Applications, vol. 49 (Birkhäuser Verlag, Basel, 1990)
12. I. Gohberg, S. Goldberg, M.A. Kaashoek, Classes of Linear Operators, Volume II. Operator
Theory: Advances and Applications, vol. 63 (Birkhäuser Verlag, Basel, 1993)
13. V. Ionescu, C. Oară, M. Weiss, Generalized Riccati Theory and Robust Control, A Popov
Function Approach (Wiley, Chichester, 1999)
14. P.D. Lax, Translation invariant spaces. Acta Math. 101, 163–178 (1959)
15. N.K. Nikol’skii, Treatise on the Shift Operator. Grundlehren, vol. 273 (Springer, Berlin, 1986)
16. C. Oară, A. Varga, Computation of general inner–outer and spectral factorizations. IEEE Trans.
Autom. Control 45, 2307–2325 (2000)
17. T. Reis, M. Voigt, Inner-outer factorization for differential-algebraic systems. Math. Control
Signals Syst. 30, Art. 15, 19pp. (2018)
18. M. Rosenblum, J. Rovnyak, Hardy Classes and Operator Theory (Oxford University Press,
Oxford, 1985)
19. B. Sz.-Nagy, C. Foias, Harmonic Analysis of Operators on Hilbert Space (North-Holland,
Amsterdam, 1970)
20. B. Sz.-Nagy, C. Foias, H. Bercovici, L. Kérchy, Harmonic Analysis of Operators on Hilbert
Space (Springer, New York, 2010)
21. F.-B. Yeh, L.-F. Wei, Inner-outer factorizations of right-invertible real-rational matrices. Syst.
Control Lett. 14, 31–36 (1990)
Convergence Rates for Solutions
of Inhomogeneous Ill-posed Problems
in Banach Space with Sufficiently Smooth
Data
Matthew A. Fury
M. A. Fury ()
Penn State Abington, Department of Mathematics, Abington, PA, USA
e-mail: [email protected]
1 Introduction
=t
is well-posed with unique solution vβ (t) = etfβ (A) ϕ + 0 e(t −s)fβ (A) h(s)ds. The
sum −A + fβ (A) = −βA2 generates a C0 semigroup as well, and in fact its growth
order does not depend on β. Consequently, regularization is established as vβ (t)
then converges to u(t) for each t ∈ [0, T ] as β → 0 (cf. [24, Section 3.1.1]). As for
fβ (A) = A(I + βA)−1 , regardless of the angle θ , this approximation yields well-
posedness of (1.2) being a bounded operator. The auxiliary operator −A + fβ (A)
generates a C0 semigroup as well, but as shown in the literature, its growth order is
independent of β only when θ ∈ ( π4 , π2 ] (cf. [2, 10, 18, 35]).
Recently, authors have investigated possibilities where the restriction on the
angle θ of the semigroup may be relaxed. Huang and Zheng [17] show that the
approximation fβ (A) = A − βA2 may be modified by use of the fractional
power of A [3]. Here, even if 0 < θ < π4 , then both fβ (A) = A − βAσ and
−A + fβ (A) = −βAσ still generate C0 semigroups for a suitable 1 < σ < 2. Also,
a logarithmic approximation
1
fβ (A) = − ln(β + e−pT A ), 0 < β < 1, p≥1 (1.3)
pT
first introduced by Boussetila and Rebbani [4] in Hilbert space, and later modified
by Tuan and Trong [38], may be applied in Banach space (cf. [16], [6], [12]). The
approximation (1.3) has lately received significant attention since it induces an error
that is less severe than those of A − βA2 and A(I + βA)−1 , both of which satisfy
and etfβ (A) ≤ et C/β . Recently, the author [11] investigated two additional
approximations fβ (A) = Ae−βA and fβ (A) = (ln 2)−1 A Log(1 + e−βA ) which
satisfy these same properties, but in such a way that the calculations do not rely on
the value of the angle θ .
In this paper, we generalize the results of [11] in order to unify all four of the
approximations
and prove convergence estimates between u(t) and vβ (t) without needing to restrict
the angle θ (Proposition 2.6 and Theorem 2.7 below). For this, our method relies on
an assumed smoothness of the data ϕ and h in (1.1). For example, if ϕ ∈ Dom(eQA )
for large enough Q > T then et A ϕ behaves like e−(Q−t )A(eQA ϕ) where e−(Q−t )A
is bounded for each t. We note that such a requirement is not out of the ordinary
since, for example, a solution u(t) of (1.1) with h ≡ 0 exists if and only if u(t) ∈
Dom(et A ) for each t (cf. [17, Introduction]). Also, many of the results associated
with the logarithmic approximation (1.3) demonstrate a stricter property than (1.4),
that is (−A + fβ (A))x ≤ βeτ A x for all x ∈ Dom(eτ A ) where τ may be larger
than T (cf. [37, Definition 1], [13, Lemma 1], [12, Proposition 6]). It is notable
238 M. A. Fury
that our intention, to regularize problem (1.1) when favorable C0 semigroups are
unavailable, likens to deLaubenfels’s motivation of C-semigroups, for example
{Cet A }t ≥0 where C is a bounded, injective operator such that Cet A = et A C is a
bounded linear operator for every t ≥ 0 [7, 8].
Below, B(X) denotes the space of all bounded linear operators on X. For a linear
operator A in X, ρ(A) denotes the resolvent set of A and σ (A), the spectrum of A, is
the complement of ρ(A) in C. Also, a strong solution of (1.1) is a function u which is
differentiable almost everywhere on [0, T ] such that u ∈ L1 ((0, T ) : X), u(0) = ϕ,
and u (t) = Au(t) + h(t) almost everywhere on [0, T ] (cf. [29, Definition 4.2.8]).
/ S π2 −θ1 7⇒ w ∈ ρ(A),
w∈
and
M1
(w − A)−1 ≤ for all w ∈
/ S π2 −θ1 . (2.1)
dist(w, S̄ π2 −θ1 )
M1
(w − A)−1 ≤ for all w ∈
/ S̄ π2 −θ1 (2.2)
|w|
that for each 0 < β < 1, fβ (A) is the infinitesimal generator of a C0 semigroup.
Then fβ (A) satisfies Condition (A+ ) if the following are satisfied:
(i) Dom(fβ (A)) ⊇ Dom(A2 ) and there exists a constant R independent of β such
that
An (et A x − etfβ (A) x) = (I − etfβ (A) e−t A )An et A x
= (I − etfβ (A) e−t A )An (et A e−QA (eQA x))
Solutions of Ill-posed Problems with Sufficiently Smooth Data 241
; t ;
; ∂ τfβ (A) −τ A ;
;
= ;− e e A e e (e x) dτ ;
n t A −QA QA
∂τ ;
0
; t ;
; ;
;
= ;− (−A + fβ (A))e e A e e (e x) dτ ;
τfβ (A) −τ A n t A −QA QA
;
0
t
≤ βR A2+n eτfβ (A) e−τ A et A e−QA (eQA x) dτ. (2.4)
0
Mα (T + &)n+1 − (T − t + &)n+1 QA
= βR (n + 1)!C −(2+n) e x
π (T − t + &)n+1 (T + &)n+1
Mα
≤ βR (n + 1)!C −(2+n) (T − t + &)−(n+1) eQA x.
π
Q = Q(&, α) > T be as in Proposition 2.6. Assume that u(t) and vβ (t) are
strong solutions of (1.1) and (1.2) respectively where h satisfies the conditions
of Theorem 2.2, and as well that initial data ϕ and Ran(h) are contained in
Dom(eQA ) with eQA h ∈ L1 ((0, T ) : X). Then there exist constants C̃ and M
each independent of β and & such that for 0 ≤ t < T ,
where
T
N = eQA ϕ + eQA h(t)dt, (2.7)
0
and ω(ζ ) is a harmonic function which is bounded and continuous on the bent strip
∞
T
1 ¯ 1 1
F (ζ ) = w(ζ ) + ∂w(z) + dξ dη (2.8)
π z−ζ z̄ + 1 + ζ
0 0
where z = ξ +ηe±iγ and ∂¯ denotes the Cauchy-Riemann operator (cf. [32]). Denote
±iγ
M = max e−(re )A .
r≥0
t
−1
+ βK(T − (t − s) + &) e QA
h(s)ds
0
T
−1
≤ M βK& e ϕ +
QA
e QA
h(y)dy
0
= M βK& −1 N. (2.9)
≤ βK& −2 N.
t
≤ βR A2 etfβ (A) ϕ + A2 e(t −s)fβ (A) h(s)ds
0
= βR A2 etfβ (A) e−t A e−(Q−t )A(eQA ϕ)
244 M. A. Fury
t
(t −s)fβ (A) −(t −s)A −(Q−(t −s))A
+ e e e (e QA
h(s))ds
0
≤ βRL& −2 N
¯
∂w(ζ ) ≤ βC & −2 N, (2.10)
By (2.9) and (2.10), together with the fact that 0 < & < 1, then
for 0 ≤ t ≤ T , where
¯
= L max ∂(re ±iγ ∗
)x
r≥0
≤ L βC & −2 Nx ∗ .
Solutions of Ill-posed Problems with Sufficiently Smooth Data 245
for 0 ≤ t ≤ T where C̃ and M are constants each independent of β and &. Then for
0 ≤ t ≤ T,
Here we outline several examples which satisfy Condition (A+ ), so that Theo-
rem 2.7 may be applied. As mentioned in Sect. 1, each of these approximations may
be applied to backward problems defined in Lp (Rn ), 1 < p < ∞. For specifics as
well as further applications, see Remark 4.2 below.
Example (1) fβ (A) = A(I + βA)−1 . Property (2.2) implies (I + βA)−1 ∈ B(X)
and
R
(I + βA)−1 = β −1 (−β −1 − A)−1 ≤ β −1 =R (3.1)
β −1
for each t ≥ 0.
Note Condition (A+ ) (i) is satisfied as
w
Re(−w + fβ (w)) = Re(−w + )
1 + βw
βw2
= −Re( )
1 + βw
2
βr cos(2α) ± iβr 2 sin(2α)
= −Re
(1 + βr cos α) ± iβr sin α
βr 2 cos(2α) + β 2 r 3 (cos(2α) cos α + sin(2α) sin(α))
=−
(1 + βr cos α)2 + (βr sin α)2
βr 2 cos(2α) + β 2 r 3 cos α
=− := g(r). (3.2)
1 + 2βr cos α + β 2 r 2
Define
π
k(t) = 2 cos t + cos(2t), 0≤t ≤ .
2
Solutions of Ill-posed Problems with Sufficiently Smooth Data 247
Then k (t) = −2(sin t +sin(2t)) ≤ 0 since both sin t and sin(2t) are nonnegative on
[0, π2 ]. Consequently, k(t) is a decreasing function, and thus is never smaller than
k( π2 ) = −1. We have
for all x ∈ Dom(A2 ). Condition (A+ ) (ii) is also satisfied with C = 0 since 2ν < π
2
implies
where is a contour similar to α = ∂Sα but avoids both the origin and the negative
real axis [29, Section 2.6]. In fact, since 0 ∈ ρ(A), there exists δ > 0 such that the
closed disk of radius δ, centered at the origin is contained in ρ(A). Hence, we may
248 M. A. Fury
α = 1 ∪ 2 ∪ 3 ,
1 = {reiα : r ≥ δ},
2 = {δeiφ : −α ≤ φ ≤ α},
3 = {re−iα : r ≥ δ},
oriented so that Im(w) decreases as w travels along α (cf. [17]). But 2 is bounded
and does not affect the convergence of the contour integral, so the calculations in
Proposition 2.6 remain fundamentally unchanged. Henceforth, using the properties
(cf. [29, Theorem 2.6.8, Lemma 2.6.3])
Aσ1 (Aσ2 )x = Aσ1 +σ2 x = Aσ2 (Aσ1 )x for x ∈ Dom(Amax{σ1 ,σ2 ,σ1 +σ2 } ),
for all x ∈ Dom(A2 ). Also similar to the case σ = 2, Condition (A+ ) (ii) is satisfied
with C = 0 since σ ν < π2 implies
Finally, we point out that in either case whether σ = 2 or 1 < σ < 2, while
fβ (A) and −A+fβ (A) are unbounded operators, it may be shown that both generate
C0 semigroups. Again, if σ is not an integer, careful revisions to α must be taken
near the origin. In either case, one may show the semigroup generated by fβ (A)
satisfies
−1
β (1−σ )
etfβ (A) ≤ P et P
β (cf. [29, Theorem 2.5.2 (d)]). Therefore, fβ (A) generates a uniformly continuous
semigroup satisfying
−1
etfβ (A) ≤ et Mβ
for all x ∈ Dom(A2 ). Therefore, Condition (A+ ) (i) is satisfied. Also, for w =
re±iα ∈ Sν ,
Example (4) fβ (A) = (ln 2)−1 A Log(1 + e−βA ). Here we define fβ (A) by the
Dunford integral
1
fβ (A) = w Log(1 + e−βw )(w − A)−1 dw
2 ln 2 πi α
where the principal branch of the logarithm is taken [9, 27]. The calculations
for Condition (A+ ) here are involved and so are omitted, but we can refer to
[11] for a complete analysis. There, it is found that fβ (A) is a bounded operator
with fβ (A) ≤ Cβα where Cα is a constant independent of β but dependent
on α. Further, it may be shown that fβ (A) satisfies Condition (A+ ) (i) with a
constant R independent of β and also Condition (A+ ) (ii) with C = 2 lnπ
2 and any
π
2 − θ < ν < π
2 [11, Lemma 2.4 and Proposition 3.1].
Traditionally, regularization arguments are made where u(t) is any solution of (1.1)
with limited assumptions on the data (cf. [17, Definition 4.1]). As is the theme
throughout this paper, our calculations rely on sufficient smoothness of the data ϕ
and h as in (2.7). We outline a regularization-type argument under the assumptions
of Theorem 2.7.
250 M. A. Fury
First, note that in each of the four examples listed in Sect. 3, with the exception
of the case 1 < σ < 2 in Example (2), fβ (A) generates a C0 semigroup satisfying
−1
etfβ (A) ≤ P et P β for some constants P , P independent of β. Therefore, if
δ
vβ (t) is the solution of the approximate well-posed problem with perturbed data
satisfying ϕ − ϕδ ≤ δ, then
; t
; tf (A)
vβ (t) − vβδ (t) =;
; e β ϕ + e (t −s)fβ (A)
h(s)ds
0
t ;
;
− etfβ (A) ϕδ + e(t −s)fβ (A) h(s)ds ;
;
0
=etfβ (A) (ϕ − ϕδ )
β −1
≤P eT P δ.
Now, let u(t) be a strong solution of (1.1) that satisfies the hypotheses of Theo-
rem 2.7. Choosing β = −2T P (ln δ)−1 we have β → 0 as δ → 0, and
for all 0 ≤ t < T . Also, the case for t = T may be addressed separately. Indeed,
similar to the calculation (2.9), we obtain
In the case of Sect. 3 Example (2) where 1 < σ < 2, the only difference is that
etfβ (A) satisfies
−1
β −(σ −1)
etfβ (A) ≤ P et P .
Solutions of Ill-posed Problems with Sufficiently Smooth Data 251
Acknowledgments The author would like to thank the editors for the proceedings of the IWOTA
2019 and the referee for their helpful suggestions.
References
10. M.A. Fury, Nonautonomous ill-posed evolution problems with strongly elliptic differential
operators. Electron. J. Differ. Eq. 2013(92), 1–25 (2013)
11. M.A. Fury, A class of well-posed approximations for ill-posed problems in Banach spaces.
Commun. Appl. Anal. 23(1), 97–14 (2019)
12. M.A. Fury, Logarithmic well-posed approximation of the backward heat equation in Banach
space. J. Math. Anal. Appl. 475, 1367–1384 (2019)
13. M. Fury, B. Campbell Hetrick, W. Huddell, Continuous dependence on modeling in Banach
space using a logarithmic approximation, in Mathematical and Computation Approaches in
Advancing Modern Science and Engineering (Springer, Cham, 2016)
14. A. Gorny, Contribution à l’étude des fonctions dérivables d’une variable réelle. Acta Math. 71,
317–358 (1993)
15. M. Haase, The Functional Calculus for Sectorial Operators (Birkäuser Verlag, Basel, 2006)
16. Y. Huang, Modified quasi-reversibility method for final value problems in Banach spaces. J.
Math. Anal. Appl. 340, 757–769 (2008)
17. Y. Huang, Q. Zheng, Regularization for ill-posed Cauchy problems associated with generators
of analytic semigroups. J. Differ. Eq. 203, 38–54 (2004)
18. Y. Huang, Q. Zheng, Regularization for a class of ill-posed Cauchy problems. Proc. Amer.
Math. Soc. 133–10, 3005–3012 (2005)
19. K. Ito, B. Jin, Inverse Problems: Tikhonov Theory and Algorithms (World Scientific, Singapore,
2014)
20. R. Lattes, J.L. Lions, The Method of Quasi-reversibility, Applications to Partial Differential
Equations (Elsevier, New York, 1969)
21. N.T. Long, A.P.N. Dinh, Approximation of a parabolic non-linear evolution equation back-
wards in time. Inverse Probl. 10, 905–914 (1994)
22. A. Lorenzi, I.I. Vrabie, An identification problem for a linear evolution equation in a Banach
space and applications. Discrete Contin. Dyn. Syst Ser. S 4, 671–691 (2011)
23. I.V. Mel’nikova, General theory of the ill-posed Cauchy problem. J. Inverse Ill-posed Probl. 3,
149–171 (1995)
24. I.V. Mel’nikova, A.I. Filinkov, Abstract Cauchy Problems: Three Approaches. Chapman &
Hall/CRC Monographs and Surveys in Pure and Applied Mathematics, vol. 120 (Chapman &
Hall, Boca Raton, 2001)
25. K. Miller, Stabilized quasi-reversibility and other nearly-best-possible methods for non-well-
posed problems, in Symposium on Non-Well-Posed Problems and Logarithmic Convexity.
Lecture Notes in Mathematics, vol. 316 (Springer, Berlin, 1973), pp. 161–176
26. K. Miller, Logarithmic convexity results for holomorphic semigroups. Pacific J. Math. 58, 549–
551 (1975)
27. V. Nollau, Über den logarithmus abgeschlossener operatoren in Banachschen Räumen. Acta
Sci. Math. 30, 161–174 (1969)
28. L.E. Payne, Improperly Posed Problems in Partial Differential Equations. CBMS Regional
Conference Series in Applied Mathematics, vol. 22 (Society for Industrial and Applied
Mathematics, Philadelphia, 1975)
29. A. Pazy, Semigroups of Linear Operators and Applications to Partial Differential Equations
(Springer, New York, 1983)
30. A.I. Prilepko, D.G. Orlovsky, I.A. Vasin, Methods for Solving Inverse Problems in Mathemat-
ical Physics (Dekker, New York, 2000)
31. M. Reed, B. Simon, Methods of Modern Mathematical Physics, Vol. II: Fourier Analysis, Self-
Adjointness (Academic, New York, 1975)
32. W. Rudin, Real and Complex Analysis, 3rd edn. (McGraw-Hill, New York, 1987)
33. O. Scherzer, M. Grasmair, H. Grossauer, M. Haltmeier, F. Lenzen, Variational Methods in
Imaging. Applied Mathematical Sciences, vol. 167 (Springer, New York, 2009)
34. T. Schuster, B. Kaltenbacher, B. Hofmann, K.S. Kazimierski, Regularization Methods in
Banach Spaces (Walter de Gruyter, Berlin, 2012)
35. R.E. Showalter, The final value problem for evolution equations. J. Math. Anal. Appl. 47, 563–
572 (1974)
Solutions of Ill-posed Problems with Sufficiently Smooth Data 253
36. T.H. Skaggs, Z.J. Kabala, Recovering the release history of a groundwater contaminant. Water
Resour. Res. 30, 71–79 (1994)
37. D.D. Trong, N.H. Tuan, Stabilized quasi-reversibility method for a class of nonlinear ill-posed
problems. Electron. J. Differ. Eq. 2008(84), 1–12 (2008)
38. N.H. Tuan, D.D. Trong, On a backward parabolic problem with local Lipschitz source. J. Math.
Anal. Appl. 414, 678–692 (2014)
39. N.H. Tuan, D.D. Trong, T.H. Thong, N.D. Minh, Identification of the pollution source of a
parabolic equation with the time-dependent heat conduction. J. Inequal. Appl. 2014, 1–15
(2014)
A Closer Look at Bishop Operators
1 Introduction
where { · } denotes the fractional part. Clearly every Bishop operator Tα is the
product of two simple and well-understood operators, namely the multiplication
operator Mt by the independent variable in Lp [0, 1) and the composition operator
Cτα induced by the symbol τα (t) = {t + α}. Nevertheless, the structure of Tα
is largely unknown for every irrational α ∈ (0, 1). In particular, it is unknown
E. A. Gallardo-Gutiérrez ()
Dpto. de Análisis Matemático y Matemática Aplicada, Facultad de Ciencias Matemáticas,
Universidad Complutense de Madrid, Madrid, Spain
e-mail: [email protected]
M. Monsalve-López
Instituto de Ciencias Matemáticas ICMAT (CSIC-UAM-UC3M-UCM), Madrid, Spain
e-mail: [email protected]
Wφ,τ f = φ (f ◦ τ ).
pn 1
α− < n.
qn qn
Observe that by Jarník-Besicovitch Theorem (see [11, Section 5.5], for instance),
Liouville irrationals form a set of vanishing Hausdorff dimension; so for almost
every α ∈ (0, 1), Davie’s Theorem states that Tα has non-trivial closed invariant
subspaces in Lp [0, 1). More indeed, Tα has non-trivial closed hyperinvariant
A Closer Look at Bishop Operators 257
subspaces, that is, closed subspaces invariant under every linear bounded operator
in the commutant of Tα .
In the nineties, extensions strengthening Davie’s Theorem were due to Blecher
and Davie [9] and MacDonald in [22] for Bishop-type operators, that is, weighted
translation operators in Lp [0, 1) where τ = τα :
Once again, the brick wall consisted of Liouville irrationals, and despite of the
efforts, the interesting extensions regarding the non-invertible weighted translation
operators included many weights φ but neither Liouville number.
In 2008, Flattot [18], by refining the functional calculus approach, was able to
provide a large class of irrationals α ∈ (0, 1) including some Liouville numbers for
which Tα has non-trivial closed hyperinvariant ∞ subspaces in Lp [0, 1) -for instance,
the classical example of a Liouville number n=1 10−n! .
Recently, in [12] the authors have extended the class of irrationals α ∈ (0, 1) such
that Tα has non-trivial closed hyperinvariant subspaces in Lp [0, 1) by considering
arithmetical techniques to strengthen the analysis of certain functions associated
to the functional calculus model. Indeed, for these Liouville numbers, Gallardo-
Gutiérrez and Monsalve-López [20] have recently shown the existence of non-trivial
spectral subspaces, revealing, indeed, the spectral nature of the hyperinvariant
subspaces. Moreover, those Liouville numbers α left, that is, those for which it is
open whether Tα has a non-trivial closed invariant subspace, are so extreme that the
functional calculus approach fails to produce invariant subspaces (see [12, Theorem
4.1]).
The aim of this work is to survey the recent results at this regard and show how
techniques in Operator Theory, Analytic Number Theory or Spectral Theory are
linked together to produce, when it succeeds, invariant subspaces for such a simple
family of operators as Bishop operators are. To that end, the rest of the manuscript
is organized as follows. In Sect. 2 we deal with a first approach by studying
the behaviour of the norm of the iterates Tαn (clearly, they tend to 0 by means
of Parrott’s characterization of σ (Tα )). In Sect. 3, we detail the main techniques
and approaches previously mentioned to provide invariant subspaces for Bishop
operators recalling the updated results in this context. In Sect. 4, our approach deals
with local spectral properties fulfilled by Bishop operators Tα , independently of
the irrational α ∈ (0, 1). Note that though all Bishop operators Tα share the same
spectrum, not all of them are known to possess invariant subspaces; and hence, a
deeper insight in σ (Tα ) could lead to study invariant subspaces by considering local
spectral properties like the Dunford property (Property (C)) or the Bishop property
(β). Indeed, in Sect. 5 the local spectral analysis is pushed further to provide spectral
subspaces for all those Bishop operators that, up to now, are known to have non-
trivial closed hyperinvariant subspaces. Finally, in Sect. 6, we deal with Bishop-type
operators and their properties, ending up with some open questions in this context.
258 E. A. Gallardo-Gutiérrez and M. Monsalve-López
In this section, and as a preliminary stage, we determine explicitly the norm of the
iterates of Tα acting on Lp [0, 1) for 1 ≤ p ≤ ∞. In particular, it provides an insight
of the behaviour of Tα depending on the irrational α.
Let n be a positive integer and denote by Tαn the n-th iterate of Tα . A simple
computation shows that
where
n! √
min Sn (α) = ∼ e−(n+1) 2πn.
α∈[0,1) (n + 1) n
1 1
0.75 0.75
0.5 0.5
0.25 0.25
(a) (b)
1 1
0.75 0.75
0.5 0.5
0.25 0.25
(c) (d)
Fig. 1 Sm (α) for m = 1, . . . , 4. (a) S1 (α) for α ∈ [0, 1). (b) S2 (α) for α ∈ [0, 1). (c) S3 (α) for
α ∈ [0, 1). (d) S4 (α) for α ∈ [0, 1)
The first goal of this section will be giving a detailed outline of the main techniques
and approaches used to find invariant subspaces for Bishop operators. This will serve
as a context in order to, thereupon, present in more detail our results, in which we
considerably enlarge the set of known values α such that the Bishop operator Tα has
invariant subspaces on each Lp [0, 1). On the other hand, at the end of the section, we
will show that in some sense, when our approach fails to produce invariant subspaces
for Tα it is actually because the standard techniques cannot be applied anymore.
All the original results appearing in this section are included in the article [12].
260 E. A. Gallardo-Gutiérrez and M. Monsalve-López
1 1
0.75 0.75
0.5 0.5
0.25 0.25
(a) (b)
1 1
0.75 0.75
0.5 0.5
0.25 0.25
(c) (d)
Fig. 2 Sm (α) for m = 5, . . . , 8. (a) S5 (α) for α ∈ [0, 1). (b) S6 (α) for α ∈ [0, 1). (c) S7 (α) for
α ∈ [0, 1). (d) S8 (α) for α ∈ [0, 1)
1/n
lim ρn = 1; (3.2)
|n|→∞
log ρn
< ∞. (3.3)
1 + n2
n∈Z
Suppose further that both sequences (xn )n∈Z and (yn )n∈Z are dominated by a
Beurling sequence, and there is at least a λ ∈ T at which the following functions
Gx and Gy do not both possess analytic continuation into a neighbourhood of λ:
∞
n=1 x−n z
n−1 if |z| < 1;
Gx (z) := 0
− n=−∞ x−n zn−1 if |z| > 1.
∞
n=1 y−n z
n−1 if |z| < 1;
Gy (z) := 0
− n=−∞ y−n zn−1 if |z| > 1.
Applying a novel version of these methods, Davie [15] was the first in obtaining
positive results on the existence of invariant subspaces for some classes of Bishop
operators. In particular, he showed that whenever α ∈ (0, 1) is a non-Liouville
number, the corresponding Bishop operator Tα has a non-trivial hyperinvariant
subspace on each Lp [0, 1).
Further refinements due to Flattot [18] enlarged the class of such irrational
numbers, embracing some Liouville numbers. More specifically, using the language
of continued fractions, if (aj /qj )j ≥0 denote the convergents of α, the limit of
Flattot’s result ensures the existence of invariant subspaces for Tα on each Lp [0, 1)
whenever
1/2−ε
log qj +1 = O qj for every ε > 0. (3.4)
Turning to Bishop operators, up to date, our approach yields the most general
result regarding the existence of invariant subspaces [12]:
Theorem 3.4 (Chamizo et al. [12]) Let α ∈ (0, 1) be any irrational and
(aj /qj )j ≥0 the convergents in its continuous fraction. If the following condition
holds
q
j
log qj +1 = O ; (3.5)
log3 qj
Again, the idea behind our address is exactly the same: a careful application of
Atzmon’s Theorem. For this, it shall be convenient to work with the operators
α := e Tα .
T
This scalar multiple of Tα , far from being arbitrary, is determined by the fact
σ (Tα ) = D. Now, without going into much details, an insight in the proof may
be the following: given any irrational α ∈ (0, 1), in the sequel, consider the real
functions
n−1
Ln (t) := 1 + log({t + j α}) , (n ∈ N).
j =0
It is plain that the functions Ln (t) play a fundamental role in the understanding of
the iterates of both Tα and Tα∗ , since they codify their behaviours via the equations
and, in a similar way, for Tα∗ . Note that Eq. (3.6) makes sense for every Lp -function
despite of the fact that Tα has not a bounded inverse.
In light of (3.6), it is not hard to figure out that, in order to control the growth
of both iterates T αn f and (Tα∗ )n f , it could be a good idea to construct ad hoc a
function which kills each of the singularities arising from each summand of Ln (t).
To accomplish that, we may consider, for instance, the characteristic function of the
set
1 19 1
Bα := <t < : t − nα > 2
for every n ∈ Z \ {0} ,
20 20 20n
where t := min {t}, 1 − {t} denotes the distance from t to the closest integer.
First, note that, since the set Bα has positive Lebesgue measure, it follows that 1Bα
does not vanish identically as an element of Lp [0, 1). On the other hand, it turns
out that |Ln (t)| can be bound properly in terms of the sequence (qj )j ≥0 for t ∈ Bα ,
when |n| is comparable to qj +1 .
We prefer not to deal with technicalities which are out of the scope of this survey.
Anyway, as explained in [12], after some arithmetical estimations, we have:
Proposition 3.5 Let α ∈ (0, 1) be any irrational and (aj /qj )j ≥0 the convergents
3/2
in its continuous fraction. Then, for every n ∈ Z with |n| ≤ qj +1 , we have
; n ; ; ;
log 1 + ;
Tα 1Bα ;∞ + ;(Tα∗ )n 1Bα ;∞
|n| |n| + qj +1 (3.7)
9 qj + log qj + log(|n| + 1).
qj qj +1
264 E. A. Gallardo-Gutiérrez and M. Monsalve-López
and consider Tα on Lp [0, 1) for some 1 ≤ p < ∞. Then, for every non-zero
f ∈ Lp [0, 1), we have
; n ;
log 1 + ;
Tα f ;p
= ∞.
1 + n2
n∈Z
and observe that, α ∈ M means exactly that α is pretty close to some rationals a/q.
Roughly speaking, this implies that whenever n is comparable to q and q | n, we
have a very accurate approximation between
a
Ln (t − nα) ≈ Ln t − n = Ln (t).
q
But, in such a case, it results that the integral in (3.8) must be large unless |Ln (t)| is
small, which turns out to happen rarely.
The aim of the current section is identifying which local spectral properties are
shared by all Bishop operators Tα , independently of the irrational α ∈ (0, 1).
This apparent digression is justified by the facts presented in the preceding
sections: while all Bishop operators Tα share the same spectrum, only a few of
them are known to possess invariant subspaces; hence, might a deeper insight in
σ (Tα ) help us to deal with those cases not covered by Atzmon’s Theorem? Actually,
this naive idea of studying invariant subspaces via spectral subsets rarely succeeds;
nevertheless, as we shall argue at the end of the survey, this will not be completely
senseless in the case of Bishop operators.
All the original results presented in Sect. 4.2 may be found in [19]; on the other
hand, those appearing in Sect. 4.3 are stated in [12].
266 E. A. Gallardo-Gutiérrez and M. Monsalve-López
We begin this part by recalling some preliminaries regarding local spectral theory.
In what follows, X will stand for an arbitrary complex Banach space, L(X) will
denote the class of linear operators on X and B(X) the Banach algebra of linear
bounded operators on X.
Given any operator T ∈ B(X), let σT (x) denote the local spectrum of T at
x ∈ X; i.e., the complement of the set of all λ ∈ C for which there exists an open
neighbourhood Uλ 4 λ and an analytic function f : Uλ → X such that
The complement of σT (x), which is denoted by ρT (x), stands for the local resolvent
of T at x ∈ X; and a function verifying (4.1) is called a local resolvent function
nearby λ ∈ C for x ∈ X. In general, the uniqueness of the local resolvent
function cannot be assumed. Thus, T satisfies the single-valued extension property
(abbreviated as SVEP) if, fixed any x ∈ X and λ ∈ ρT (x), there exists a unique
local resolvent function on a sufficiently small neighbourhood of λ. We highlight
that any operator with σp (T ) = ∅ enjoys the SVEP.
The notion of local spectrum allows us to gain a further knowledge on what con-
stitutes each part of σ (T ), this may be accomplished via local spectral manifolds:
given any subset ⊆ C, the local spectral manifold XT () is defined as
XT () := x ∈ X : σT (x) ⊆ .
σ (T |Y ) ⊆ U and σ (T |Z) ⊆ V ,
(T − zI )fn (z) → 0 as n → ∞
A Closer Look at Bishop Operators 267
X = XT (U ) + XT (V );
Not surprisingly, likewise in Gelfand’s formula, the local spectral radius and the
local spectrum may be related via the inequality
rT (x) ≥ max |λ| : λ ∈ σT (x) ,
= r(Tα ) = e−1 .
1/n
lim Tαn f p
n→∞
Once we know Corollary 4.3, the Proof of Theorem 4.4 is pretty simple: suppose
that Tα is decomposable in Lp [0, 1); then, fixed arbitrary 0 < r < s < e−1 , we may
consider the open cover
By means of Corollary 4.3, the unique closed invariant subspace Y ⊆ Lp [0, 1) such
that σ (Tα |Y ) ⊆ U would be Y = {0}; but this leads us to a contradiction, since
σ (Tα ) ⊆ V .
Furthermore, it is possible to obtain a slightly stronger result by combining
a similar argument to the previous one with the duality principle [2] between
properties (β) and (δ):
Theorem 4.5 (Gallardo-Gutiérrez and Monsalve-López [19]) Let α ∈ (0, 1) be
any irrational and consider Tα on Lp [0, 1) for 1 < p < ∞. Then, Tα does not
satisfy either property (β) or (δ).
As a final remark, note that one could reasonably argue that, in a very vague
sense, Proposition 4.2 and Corollary 4.3 seem to be suggesting that the most
significant part of σ (Tα ) is precisely its boundary. This intuition shall be explored
more deeply and confirmed in more concrete terms throughout the rest of the article.
Throughout this subsection, we shall prove that Bishop operators can neither satisfy
property (C); nevertheless, along the way, something much more stronger will be
shown. More precisely, we shall see that, unlike for subsets ⊆ int(σ (Tα )), for
which Proposition 4.2 ensured that XTα () = {0}; the local spectral manifold
XTα (∂σ (Tα )) turns out to be dense for every irrational α. To do so, we are going
to split the proof into two parts: firstly, using (3.7), we will check that σTα (1Bα )
must be confined into ∂σ (Tα ); on the other hand, an argument with L∞ functions
will let us to elucidate the density of XTα (∂σ (Tα )).
Proposition 4.6 Let α ∈ (0, 1) be any irrational number and consider Tα acting
on Lp [0, 1) for some 1 ≤ p < ∞. Then, both local spectra
In order to prove Proposition 4.6, consider the Lp [0, 1)-valued analytic function
given by
∞
gα (z) := α−n 1Bα · zn−1 .
T
n=1
270 E. A. Gallardo-Gutiérrez and M. Monsalve-López
Now, note that gα must be analytic on the open disk |z| < 1, since, by means of the
asymptotic estimate (3.7), we have
; −n ;1/n
lim sup ;
Tα 1Bα ;p ≤ 1.
n→∞
Finally, it is routine to check that (Tα − zI )gα (z) = 1Bα for every |z| < 1.
An important remark is in order: given an operator with σp (T ) = σp (T ∗ ) = ∅,
it may be seen that if Atzmon’s Theorem may be applied to the sequences
xn := T n x and yn := (T ∗ )n y (n ∈ Z)
for a pair x ∈ X and y ∈ X∗ , the corresponding local spectra σT (x) and σT ∗ (y)
must be inside ∂D. Nevertheless, the converse is not true. This evinces the main
gain of the asymptotic (3.7) with respect to the ones in the previous works [15] and
[18], since Proposition 3.5 can be applied independently of the irrational α.
Now, once we are aware of 1Bα ∈ XTα (∂σ (Tα )), we are in position to take
another step ahead and prove the density of XTα (∂σ (Tα )):
Theorem 4.7 (Chamizo et al. [12]) Let α ∈ (0, 1) be any irrational number. Then,
the local spectral manifold
XTα ∂σ (Tα ) = f ∈ Lp [0, 1) : σTα (f ) ⊆ ∂σ (Tα )
is norm-dense in Lp [0, 1) for each 1 ≤ p < ∞. In particular, Tα does not satify the
Dunford property (C) on Lp [0, 1) for 1 ≤ p < ∞.
In the argument of Theorem 4.7, we repeatedly exploit the fact that 1Bα is a
characteristic function. Firstly, note that standard bounds yield the inclusion
g(t)1Bα (t) : ess sup |g(t)| < ∞ ⊆ XTα (∂σ (Tα )).
t ∈[0,1)
Therefore, by the density of L∞ into Lp for each 1 ≤ p < ∞ and taking into
account that the support of 1Bα is precisely the set Bα , we have
f ∈ Lp [0, 1) : supp(f ) ⊆ Bα ⊆ XTα (∂σ (Tα )).
Now, since Tα 1Bα ∈ XTα (∂σ (Tα )) as well, we deduce a similar inclusion:
tg(t)1Bα ({t + α}) : ess sup |g(t)| < ∞ ⊆ XTα (∂σ (Tα )),
t ∈[0,1)
but, since Bα has strictly positive measure and τα is ergodic, the proof is done.
As it was promised before beginning this subsection, we may clearly appreciate
that the meaning behind each part of σ (Tα ) differs significantly. In this sense, it is
plain that the boundary of σ (Tα ) stores much more information about Tα than the
interior. This idea will be raised into a new level in the following section, showing
that those already known closed invariant subspaces for Tα can be characterized as
the norm-closure of some local spectral manifolds related to ∂σ (Tα ).
It is plain from the preceding section, that we cannot expect to encounter invariant
subspaces for Tα when we consider certain local spectral manifolds. For example,
if we pick an arbitrary set inside in C \ ∂σ (Tα ), we fall too short since, in such a
case, the associated local spectral manifold is trivial:
on the other hand, if ∂σ (Tα ) ⊆ , we go too far since, in such case, the associated
local spectral manifold is dense:
for the corresponding 1 ≤ p < ∞. These two facts restrict significantly our quest
of invariant subspaces for Tα via local spectral manifolds and indicate us that, our
unique hope could be choosing subsets which intersect the boundary of σ (Tα ) but
not covering it entirely. Surprisingly, at least for many values of α, this new attempt
turns out to be successful, bringing about local spectral manifolds verifying
Nevertheless, not everything shall be good news, since our address will depend
on a local spectral variant of Atzmon’s Theorem and the restriction imposed by
Theorem 3.6 still remains. Anyway, considering these ideas, it seems fully justified
to conjecture about the possibility of addressing the invariant subspace problem for
general Bishop operators Tα using techniques borrowed from local spectral theory.
This section is mainly based on the article [20].
272 E. A. Gallardo-Gutiérrez and M. Monsalve-López
As before, X will be any infinite-dimensional Banach space, L(X) will stand for
the class of linear operators on X while B(X) will denote the Banach algebra of
linear bounded operators on X. Such distinction between linear and linear bounded
operators will be of particular interest along the current section.
As the title announces, throughout the first half of this section, our aim is
unravelling the spectral meaning hidden behind the statement of Atzmon’s Theorem
(and related results in [3] and [7]) in order to, further on, apply it in the study of
Bishop operators.
The preliminary step before understanding properly Atzmon’s Theorem from a
spectral perspective should be the following decomposability version of Wermer’s
theorem [29] given by Colojoară and Foiaş [13]:
Theorem 5.1 (Colojoară and Foiaş [13]) Let T ∈ B(X) be an invertible operator
on a complex Banach space X with σ (T ) ⊆ T satisfying
log T n
< ∞. (5.1)
1 + n2
n∈Z
φ : Aρ → B(X)
(5.2)
eınt → T n
for every n ∈ Z. Now, given any open cover {U, V } of σ (T ), due to the regularity
of Aρ , we can find a function h ∈ Aρ such that
h ≡ 1 on (C \ V ) ∩ σ (T ) and h ≡ 0 on (C \ U ) ∩ σ (T );
but, the manner in which h has been chosen causes that the ranges of the operators
φ(h) and I − φ(h) split the spectrum of T in the following way:
σ T | φ(h)(X) ⊆ U and σ T | (I − φ(h))(X) ⊆ V ,
one may rightly conjecture that Atzmon’s Theorem should also involve a sort
of decomposition on the spectrum σ (T ); nevertheless, in this case, such spectral
decomposition shall be much more subtle and less manageable. More specifically,
when the invertibility of T is missing, we are forced to consider a decomposition
of σ (T ) only with respect to a proper linear submanifold of X. For the sake of
completeness, we explain it in more accurate terms: in the sequel, we are intended
to modify the Proof of Theorem 5.1 with the aim of fitting it into the hypothesis
required by Atzmon’s Theorem.
Let ρ := (ρn )n∈Z be an arbitrary Beurling sequence. Since T needs not to be
invertible anymore, the algebraic homomorphism (5.2) must be replaced by
φ : Aρ → L(X)
(n ∈ Z)
eınt → T n
T n x 9 ρn , (n ∈ Z);
allows us to employ again the regularity of the Beurling algebra Aρ . Thus, for any
open cover {U, V } of σ (T ), the regularity of Aρ enables us to pick a function h ∈
Aρ such that
h ≡ 1 on (C \ V ) ∩ σ (T ) and h ≡ 0 on (C \ U ) ∩ σ (T ).
XT (F ) ⊆ ⊥ XT∗ ∗ (G),
valid whenever T has the SVEP and for every pair of disjoint closed sets F, G ⊆ C;
where, as usual, ⊥ N denotes the preannihilator of a subset in N ⊆ X∗ , i.e.
⊥
N := x ∈ X : ϕ(x) = 0 for every ϕ ∈ N .
for some Beurling sequence ρ := (ρn )n∈Z . Then, if σT (x)∪σT ∗ (y) is not a singleton,
for every open subset U ⊆ C such that U ∩ σT (x) = ∅ and σT ∗ (y) \ U = ∅, we
have
{0} = XT (U ) = X.
According to the exposition held in Sect. 4, the lack of any profitable local spectral
property seems to be an evident feature regarding Bishop operators. Nevertheless, as
it shall be discussed along the following lines with the aid of the results exposed in
the preceding subsection, this turns out to be just a kind of illusion (at least in some
cases) caused by a misguided choice of the local spectral manifolds. Of course, the
cornerstone in the development of such richer spectral theory for Bishop operators
shall be the aforementioned local spectral version of Atzmon’s Theorem.
Again, as we aim to apply Atzmon’s Theorem to Bishop operators, we recall that
the condition
q
j
log qj +1 = O , (5.4)
log3 qj
which shall play a prominent role in the sequel. By means of Theorem 3.4, we know
that given an irrational α ∈ (0, 1) satisfying (5.4), there exists a Beurling sequence
ρ = (ρn )n∈Z such that
; n ; ; ;
;
Tα 1Bα ; 9 ρn and ;(Tα∗ )n 1Bα ; 9 ρn .
Thus, due to the irrationality of α, it is plain that provided any open subset U ⊆ C
such that U ∩T = T and T\U = ∅, we may find , m ∈ Z satisfying both conditions
then, for every algebra action φ : Aρ → L(Lp [0, 1)) with φ(1) = I and φ(eınt ) =
αn from a regular Beurling algebra Aρ , the corresponding continuity core must be
T
Dφ = {0}.
On the other hand, it seems reasonable to expect that, for those cases Tα uncov-
ered by Atzmon’s Theorem (despite the absence of a profitable functional calculus),
their local spectral properties remain identical and rich enough to construct closed
invariant subspaces. In this sense, we suggest that one feasible route to solve the
invariant subspace problem for all Bishop operators could be understanding in depth
their local spectral manifolds XTα (U ) arising from open sets U intersecting the
boundary of σ (Tα ). To support our question in the positive, we recall that there exist
decomposable operators T ∈ B(X) whose spectral behaviour cannot be described
in terms of a functional calculus from a suitable algebra [1]. We pose it as an open
question:
Question 1 Let α ∈ (0, 1) be any irrational number and ε > 0 sufficiently small. Is
the local spectral manifold
XTα D(e−1 , ε)
6 Bishop-Type Operators
In order to complete the survey, our purpose throughout this last section is discussing
analogous results to the ones described above in a more general framework. In
particular, we shall show that many Bishop-type operators also admit a rich spectral
theory.
We begin by recalling that MacDonald in [22] was able to determine the
spectrum of many weighted translations operators. In order to state his result,
let us fix (, B, μ) a non-atomic probability space which arises from the Borel
sets of a compact metrizable space. Recall that any invertible measure-preserving
transformation τ : → is called ergodic for (, B, μ) if given any B ∈ B such
that τ −1 (B) = B then either μ(B) = 0 or μ(B) = 1 and, it is called uniquely
ergodic whenever it is continuous and μ is the unique -probability measure for
which τ is ergodic. As an important instance, observe that whenever = [0, 1) is
endowed with the usual Lebesgue measure, having in mind the natural identification
with T = {e2πit : t ∈ [0, 1)}, the transformation τα (t) := {t + α} with α ∈ Q is
uniquely ergodic.
We are now in position to state MacDonald’s result. Recall that, given an arbitrary
φ ∈ L∞ [0, 1) and τ any measure-preserving transformation with measurable
inverse, the Bishop-type operator Tφ,τ is defined as
Tφ,τ f := φ · (f ◦ τ )
Clearly the assumptions on φ and τ determine the spectrum of Tφ,τ , but the
general case is open and the following difficult question arises naturally:
Question 2 Determine the spectrum of weighted translation operators.
On the other hand, following Atzmon’s theorem approach, MacDonald was able
to establish the existence of non-trivial closed hyperinvariant subspaces for a large
class of Bishop-type operators Tφ,τα , for the sake of brevity denoted by Tφ,α (see
[22, Theorem 2.5, Theorem 2.6]). Consequently, in the spirit of Sect. 5, for those
cases covered by MacDonald, it is possible to obtain spectral decompositions similar
to those appearing in Theorems 5.4 and 5.5. We address that in the following
subsection.
In order to state the results accurately, we must dwell on some minor technicalities
for a moment: given a positive real number M, we recall from [22] the class of step
functions
SM = S = rj χIj : rj ∈ R, Ij intervals and |rj | ≤ M .
j =1 j =1
In addition, let L denote the class of all real functions f ∈ L∞ [0, 1) for which there
exists γ > 0 and Kf > 0 (Kf depending exclusively on f ) such that
1
inf {f − S∞ : S ∈ SM } < Kf
M
A Closer Look at Bishop Operators 279
∞
1 q
j +1
log <∞
qj qj
j =0
is satisfied by the convergents (aj /qj )j ≥0 of the irrational α. Note that, as a matter
of fact, the latter condition embraces some Liouville numbers.
In view of the above discussion, as it was previously posed for Bishop operators,
it might happen that addressing of the invariant subspace problem for general
Bishop-type operators based on identifying some of their local spectral manifolds
was a fruitful approach. Similarly, we pose it as an open question:
Question 3 Let α ∈ (0, 1) be any irrational number and U ⊆ C a “sufficiently
small” open set intersecting ∂σ (Tφ,α ). Is the local spectral manifold XTφ,α U non-
trivial and non-dense in Lp [0, 1) for every 1 ≤ p < ∞?
References
1. E. Albrecht, On two questions of I. Colojoară and C. Foiaş. Manuscripta Math. 25, 1–15 (1978)
2. E. Albrecht, J. Eschmeier, Functional models and local spectral theory. Proc. London Math.
Soc. 75, 323–348 (1997)
3. A. Atzmon, Operators which are annihilated by analytic functions and invariant subspaces.
Acta Math. 144, 27–63 (1980)
4. A. Atzmon, On the existence of hyperinvariant subspaces. J. Operator Theor. 11, 4–40 (1984)
5. A. Atzmon, Power-regular operators. Trans. Amer. Math. Soc. 347, 3101–3109 (1995)
6. J.J. Bastian, Decomposition of weighted translation operators. Ph.D. Dissertation. Indiana
University, 1973
7. B. Beauzamy, Sous-espaces invariants de type fonctionnel dans les espaces de Banach. Acta
Math. 144, 65–82 (1980)
8. A. Beurling, Sur les intégrales de Fourier absolument convergentes et leur application à une
transformation fonctionelle, in Ninth Scandinavian Mathematical Congress (1938), pp. 345–
366
9. D.P. Blecher, A.M. Davie, Invariant subspaces for an operator on L2 () composed of a
multiplication and a translation. J. Operator Theory. 23, 115–123 (1990)
10. S.W. Brown, Hyponormal operators with thick spectra have invariant subspaces. Ann. Math.
125, 93–103 (1987)
11. Y. Bugeaud, Approximation by Algebraic Numbers (Cambridge University Press, Cambridge,
2004)
12. F. Chamizo, E.A. Gallardo-Gutiérrez, M. Monsalve-López, A. Ubis, Invariant subspaces for
Bishop operators and beyond. Adv. Math. 375, 107365 (2020)
13. I. Colojoară, C. Foiaş, Theory of Generalized Spectral Operators (CRC Press, Boca Raton,
1968)
14. J. Daneš, On local spectral radius. C̆asopis Pes̆t. Mat. 112, 177–187 (1987)
15. A.M. Davie, Invariant subspaces for Bishop’s operators. Bull. London Math. Soc. 6, 343–348
(1974)
16. J. Eschmeier, B. Prunaru, Invariant subspaces for operators with property (β) and thick
spectrum. J. Funct. Anal. 94, 196–222 (1990)
17. J. Eschmeier, B. Prunaru, Invariant subspaces and localizable spectrum. Integr. Equ. Oper.
Theory 42, 461–471 (2002)
18. A. Flattot, Hyperinvariant subspaces for Bishop-type operators. Acta. Sci. Math. 74, 689–718
(2008)
19. E. Gallardo-Gutiérrez, M. Monsalve-López, Power-regular Bishop operators and spectral
decompositions. J. Operator Theory (in press). https://doi.org/10.7900/jot.2019sep21.2256
20. E. Gallardo-Gutiérrez, M. Monsalve-López, Spectral decompositions arising from Atzmon’s
hyperinvariant subspace theorem. (under review)
21. K. Laursen, M. Neumann, An Introduction to Local Spectral Theory (Clarendon Press, Oxford,
2000)
22. G.W. MacDonald, Invariant subspaces for Bishop-type operators. J. Funct. Anal. 91, 287–311
(1990)
23. G.W. MacDonald, Decomposable weighted rotations on the unit circle. J. Operator Theory 35,
205–221 (1996)
24. V. Müller, Local spectral radius formula for operators in Banach spaces. Czechoslovak Math.
J. 38, 726–729 (1988)
25. M. Neumann, Banach algebras, decomposable convolution operators, and a spectral mapping
property, in Function Spaces. Marcel Dekker Series in Pure and Applied Mathematics, vol.
136 (Dekker, New York, 1992), pp. 307–323
26. S.K. Parrott, Weighted translation operators. ProQuest LLC, Ann Arbor, MI. Thesis (Ph.D)-
University of Michigan, 1965
A Closer Look at Bishop Operators 281
27. K. Petersen, The spectrum and commutant of a certain weighted translation operator. Math.
Scand. 37, 297–306 (1975)
28. W. Rudin, Real and Complex Analysis (Tata McGraw-Hill Education, New York, 2006)
29. J. Wermer, The existence of invariant subspaces. Duke Math. J. 19, 615–622 (1952)
Products of Unbounded Bloch Functions
Daniel Girela
Abstract We give new constructions of pair of functions (f, g), analytic in the unit
disc, with g ∈ H ∞ and f an unbounded Bloch function, such that the product g · f
is not a Bloch function.
Let D = {z ∈ C : |z| < 1} denote the open unit disc in the complex plane C. The
space of all analytic functions in D will be denoted by Hol(D).
For 0 < p ≤ ∞, the classical Hardy space H p is defined as the set of all
f ∈ Hol(D) for which
def
f H p = sup Mp (r, f ) < ∞,
0<r<1
D. Girela ()
Análisis Matemático, Facultad de Ciencias, Universidad de Málaga, Málaga, Spain
e-mail: [email protected]
def
f B = |f (0)| + sup (1 − |z|2 )|f (z)| < ∞.
z∈D
The space of all Bloch functions is denoted by B, it is a Banach space with the just
defined norm · B . It is well known that
H ∞ B.
1
f (z) = log , z ∈ D.
1−z
|f (z)|
sup(1 − |z|2 ) < ∞.
z∈D 1 + |f (z)|2
For simplicity, we shall let N denote the set of all holomorphic normal functions in
D. It is clear that any Bloch function is a normal function, that is, we have B ⊂ N .
We refer to [1, 15] and [16] for the theory of normal functions. In particular, we
remark here that if f ∈ N , ξ ∈ ∂D and f has the asymptotic value L at ξ , (that is,
there exists a curve γ in D ending at ξ such that f (z) → L, as z → ξ along γ ) then
f has the non-tangential limit L at ξ .
Let us recall that if a sequence of points {an } in the unit disc satisfies the Blaschke
condition:
∞
(1 − |an |) < ∞,
n=1
/ an − am
≥ δ, for all n,
1 − a n am
m=n
we say that the sequence {an } is uniformly separated and that B is an interpolating
Blaschke product. Equivalently,
We refer to [7, Chapter 9] and [9, Chapter VII] for the basic properties of inter-
polating Blaschke products. In particular, we recall that an exponential sequence is
uniformly separated and that the converse holds if all the ak ’s are positive.
Lappan [14, Theorem 3] proved that if B is an interpolating Blaschke product
and f is a normal analytic function in D, the product B · f need not be normal.
Lappan used this to show that N is not a vector space.
Lappan’s result is a consequence of the following easy fact: if B is an interpolat-
ing Blaschke product whose sequence of zeros is {an } and G is an analytic function
in D with G(an ) → ∞, then f = B · G is not a normal function (and hence it
is not a Bloch function either). This result has been used by several authors (see
[3, 5, 10, 11, 17, 18]) to construct distinct classes of non-normal functions.
The author and Suárez proved in [12] a result of this kind dealing with Blaschke
products with zeros in a Stolz angle but not necessarily interpolating, improving a
result of [13]. Namely, Theorem 1 of [12] is the following.
Theorem A Let B be an infinite Blaschke product whose sequence of zeros {an } is
contained in a Stolz angle with vertex at 1 and let G be analytic in D with G(z) →
∞, as z → 1. Then the function f = B · G is not a normal function.
It is natural to ask whether it is possible to prove results similar to those
described, substituting “Blaschke products” by some other classes of H ∞ -
functions. Our first result in this paper deals with the atomic singular inner function.
Theorem 1.1 Let S be the atomic singular inner function defined by
1+z
S(z) = exp − , z ∈ D, (1.1)
1−z
lim |f (z)| = ∞.
z→1
Then the function F defined by F (z) = S(z)f (z) is not a normal function (hence, it
is not a Bloch function).
286 D. Girela
and, consequently,
1+r
|f (r)| = o exp , as r → 1− .
1−r
Then the function F defined by F (z) = S(z)f (z) is not a normal function (hence, it
is not a Bloch function).
For g ∈ Hol(D), the multiplication operator Mg is defined by
def
Mg (f )(z) = g(z)f (z), f ∈ Hol(D), z ∈ D.
Let us recall that if X and Y are two spaces of analytic function in D and g ∈ Hol(D)
then g is said to be a multiplier from X to Y if Mg (X) ⊂ Y . The space of all
multipliers from X to Y will be denoted by M(X, Y ) and M(X) will stand for
M(X, X). Brown and Shields [4] characterized the space of multipliers of the Bloch
space M(B) as follows.
Theorem B A function g ∈ Hol(D) is a multiplier of the Bloch space if and only
if g ∈ H ∞ ∩ Blog , where Blog is the Banach space of those functions f ∈ Hol(D)
which satisfy
def 2
f Blog = |f (0)| + sup(1 − |z| ) log
2
|f (z)| < ∞.
z∈D 1 − |z|2
$11 = {f ∈ Hol(D) : f ∈ H 1 }.
Products of Unbounded Bloch Functions 287
On the other hand, Theorem 1 of [8] shows the existence of a Jordan domain with
rectifiable boundary and 0 ∈ , and such that the conformal mapping g from D
onto with g(0) = 0 and g (0) > 0 does not belong to Blog. For this function g we
have that g ∈ $11 but g is not a multiplier of B. Thus we have:
$11 ⊂ M(B).
In view of this and the results involving Blaschke products that we have
mentioned above, it is natural to ask the following question:
Question 1.3 Is it true that for any given f ∈ B \H ∞ there exists a function g ∈ $11
such that g · f ∈
/ B?
We shall show that the answer to this question is affirmative. Actually we shall
prove a stronger result.
We let B 1 denote the minimal Besov space which consists of those functions
f ∈ Hol(D) such that
|f (z)| dA(z) < ∞.
D
∞
f (z) = λ0 + λk ϕak (z), z ∈ D.
k=1
a−z
ϕa (z) = , z ∈ D.
1 − az
It is well known that B 1 ⊂ $11 (see [2, 6]) and then our next result implies that the
answer to question 1.3 is affirmative.
Theorem 1.4 If f ∈ B \ H ∞ then there exists g ∈ B 1 such that g · f ∈
/ B.
The proofs of Theorems 1.2 and 1.4 will be presented in Sect. 2. We close
this section noticing that throughout the paper we shall be using the convention
that C = C(p, α, q, β, . . . ) will denote a positive constant which depends only
upon the displayed parameters p, α, q, β, . . . (which often will be omitted) but
not necessarily the same at different occurrences. Moreover, for two real-valued
functions E1 , E2 we write E1 E2 , or E1 E2 , if there exists a positive constant
C independent of the arguments such that E1 ≤ CE2 , respectively E1 ≥ CE2 . If
288 D. Girela
2 The Proofs
1+z a
Re =
1−z 1−a
and, hence,
−a
|S(z)| = exp , z ∈ a .
1−a
F (z) → ∞, as z → 1 along a .
Hence F has the asymptotic value ∞ at 1. On the other hand, (ii) implies that F has
the radial limit 0 at 1. Then it follows that F is not normal.
Take f ∈ B \ H ∞ . Set
λk = ϕ(rk )−1/2 , k = 1, 2, . . . .
For each k, take ak ∈ D with |ak | = rk and |f (ak )| = ϕ(rk ). Using (iii), it follows
that
∞
λk < ∞. (2.2)
k=1
Define
∞
g(z) = λk ϕak (z), z ∈ D. (2.3)
k=1
Using (2.2) we see that the sum in (2.3) defines an analytic function in D which
belongs to B 1 . Set
where
n−1
1 − |ak |2
I I |f (an )| λk ,
|1 − ak an |2
k=1
∞
1 − |ak |2
I I I |f (an )| λk .
|1 − ak an |2
k=n+1
Clearly,
ϕ(rn )1/2
I = |f (an )|λn |ϕan (an )| $ . (2.6)
1 − rn
Using the definitions, the facts that ϕ and the sequence {rn } are increasing, and (ii),
we obtain
n−1
1 − |ak |2
I I |f (an )| λk
|1 − ak an |2
k=1
n−1
1 − |ak |
ϕ(rn ) ϕ(rk )−1/2
[(1 − |ak |) + (1 − |an )]2
k=1
n−1
1
ϕ(rn )
ϕ(rk )1/2 (1 − rk )
k=1
nϕ(rn )
1 − rn−1
ϕ(rn )1/2 n(1 − rn )
= ϕ(rn )1/2
1 − rn 1 − rn−1
1/2
ϕ(rn )
=o . (2.7)
1 − rn
Likewise, using the definitions, the facts that ϕ and the sequence {rn } are increasing,
(iii), and (iv), we obtain
∞
ϕ(rk )−1/2 (1 − rk )
I I I ϕ(rn )
[(1 − rk ) + (1 − rn )]2
k=n+1
∞
1 − rk
ϕ(rn ) ϕ(rk )−1/2
(1 − rn )2
k=n+1
Products of Unbounded Bloch Functions 291
∞
1 − rn+1
ϕ(rn ) ϕ(rk )−1/2
(1 − rn )2
k=n+1
Using (2.5)–(2.8), and the fact that lim ϕ(rn ) = ∞, we deduce that
Acknowledgments I wish to thank the referees for their careful reading of the article and for their
suggestions to improve it.
This research is supported in part by a grant from “El Ministerio de Economía y Competitivi-
dad”, Spain (PGC2018-096166-B-I00) and by grants from la Junta de Andalucía (FQM-210 and
UMA18-FEDERJA-002).
References
1. J.M. Anderson, J. Clunie, Ch. Pommerenke, On Bloch functions and normal functions. J. Reine
Angew. Math. 270, 12–37 (1974)
2. J. Arazy, S.D. Fisher, J. Peetre, Möbius invariant function spaces. J. Reine Angew. Math. 363,
110–145 (1985)
3. O. Blasco, D. Girela, M.A. Márquez, Mean growth of the derivative of analytic functions,
bounded mean oscillation, and normal functions. Indiana Univ. Math. J. 47, 893–912 (1998)
4. L. Brown, A.L. Shields, Multipliers and cyclic vectors in the Bloch space. Michigan Math. J.
38, 141–146 (1991)
5. D.M. Campbell, Nonnormal sums and products of unbounded normal functions. II. Proc. Amer.
Math. Soc. 74, 202–203 (1979)
6. J.J. Donaire, D. Girela, D. Vukotić, On univalent functions in some Möbius invariant spaces. J.
Reine Angew. Math. 553, 43–72 (2002)
7. P.L. Duren, Theory of H p Spaces (Academic, New York, 1970; Reprint: Dover, Mineola-New
York, 2000)
8. P. Galanopoulos, D. Girela, R. Hernández, Univalent functions, VMOA and related spaces. J.
Geom. Anal. 21, 665–682 (2011)
9. J.B. Garnett, Bounded Analytic Functions (Academic, New York, 1981)
10. D. Girela, On a theorem of Privalov and normal funcions. Proc. Amer. Math. Soc. 125, 433–
442 (1997)
11. D. Girela, Mean Lipschitz spaces and bounded mean oscillation. Illinois J. Math. 41, 214–230
(1997)
12. D. Girela, D. Suárez, On Blaschke products, Bloch functions and normal functions. Rev. Mat.
Complut. 24, 49–57 (2011)
13. D. Girela, C. González, J.A. Peláez, Multiplication and division by inner functions in the space
of Bloch functions. Proc. Amer. Math. Soc. 134, 1309–1314 (2006)
292 D. Girela
14. P. Lappan, Non-normal sums and products of unbounded normal function. Michigan Math. J.
8, 187–192 (1961)
15. O. Lehto, K.I. Virtanen, Boundary behaviour and normal meromorphic functions. Acta Math.
97, 47–65 (1957)
16. Ch. Pommerenke, Univalent Functions (Vandenhoeck und Ruprecht, Göttingen, 1975)
17. S. Yamashita, A nonnormal function whose derivative has finite area integral of order 0 < p <
2. Ann. Acad. Sci. Fenn. Ser. A I Math. 4(2), 293–298 (1979)
18. S. Yamashita, A nonnormal function whose derivative is of Hardy class H p , 0 < p < 1.
Canad. Math. Bull. 23, 499–500 (1980)
Birkhoff–James Orthogonality
and Applications: A Survey
Abstract In the last few decades, the concept of Birkhoff–James orthogonality has
been used in several applications. In this survey article, the results known on the nec-
essary and sufficient conditions for Birkhoff–James orthogonality in certain Banach
spaces are mentioned. Their applications in studying the geometry of normed
spaces are given. The connections between this concept of orthogonality, and the
Gateaux derivative and the subdifferential set of the norm function are provided.
Several interesting distance formulas can be obtained using the characterizations
of Birkhoff–James orthogonality, which are also mentioned. In the end, some new
results are obtained.
1 Introduction
Let (V , ·) be a normed space over the field R or C. For normed spaces V1 , V2 , let
B(V1 , V2 ) denotes the space of bounded linear operators from V1 to V2 endowed
with the operator norm, and let B(V ) denotes B(V , V ). Let K(V1 , V2 ) denotes the
space of compact operators from V1 to V2 . Let H be a Hilbert space over R or C.
If the underlying field is C, the inner product on H is taken to be linear in the first
coordinate and conjugate linear in the second coordinate. The notations Mn (R) and
Mn (C) stand for n × n real and complex matrices, respectively.
Normed spaces provide a natural setting for studying geometry in the context of
vector spaces. While inner product spaces capture the concept of the measure of an
angle, orthogonality of two vectors can be described without knowing the notion of
measure of angle. For example, a vector v is orthogonal to another vector u in Rn if
and only if there exists a rigid motion T fixing the origin such that the union of rays
→ −−−→ −−−→
− →
−
0u, 0T (v), 0T (u) minus the open ray 0v is the one dimensional subspace generated
by u. This description of orthogonality by just using the notion of distance in Rn
1
motivates to try and define orthogonality in normed spaces. In this approach, one can
use the intuition about orthogonality in Rn to guess the results in general normed
spaces and then prove them algebraically. This has been done in [3, 16, 44, 78].
One of the definitions for orthogonality in a normed space suggested by Roberts
[78], known as Roberts orthogonality, is defined as follows: elements u and v are
said to be (Roberts) orthogonal if v + tu = v − tu for all scalars t. In [44,
Example 2.1], it was shown that this definition has a disadvantage that there exist
normed spaces in which two elements are Roberts orthogonal implies that one of the
element has to be zero. In [44], two more inequivalent definitions of orthogonality
in normed spaces were introduced. One of them is isosceles orthogonality which
says that v is isosceles orthogonal to u if v + u = v − u. The other one
is called Pythagorean orthogonality, that is, v is Pythagorean orthogonal to u if
v2 + u2 = v − u2 . Note that if V is an inner product space, all the above
mentioned definitions are equivalent to the usual orthogonality in an inner product
space. Isosceles and Pythagorean orthogonalities have geometric intuitions for the
corresponding definitions. In Rn , two vectors are isosceles perpendicular if and only
if their sum and difference can be sides of an isosceles triangle, and two vectors are
Pythagorean perpendicular if there is a right triangle having the two vectors as legs.
In [44], it was also proved that if u and v are two elements of a normed space,
then there exist scalars a and b such that v is isosceles orthogonal to av + u (see
[44, Theorem 4.4]) and v is Pythagorean orthogonal to bv + u (see [44, Theorem
5.1]). So these definitions don’t have the above mentioned weakness of Roberts
orthogonality.
In an inner product space, the following properties are satisfied by orthogonality.
Let u, u1 , u2 , v, v1 , v2 ∈ V .
1. Symmetry: If v⊥u, then u⊥v.
2. Homogeneity: If v⊥u, then av⊥bu for all scalars a and b.
3. Right additivity: If v⊥u1 and v⊥u2 , then v⊥(u1 + u2 ).
4. Left additivity: If v1 ⊥u and v2 ⊥u, then (v1 + v2 )⊥u.
5. There exists a scalar a such that v⊥av + u. (In Rn , this corresponds to saying
that any plane containing a vector v contains a vector perpendicular to v.)
It is a natural question to study the above properties for any given definition
of orthogonality. All the above definitions clearly satisfy symmetry. James [44,
Theorem 4.7, Theorem 4.8, Theorem 5.2, Theorem 5.3] proved that if isosceles
V such that f = 1, f (w) = 0 for all w ∈ W and f (v) = v (see [89, Theorem
1.1, Ch. I]). This is equivalent to the following.
Theorem 2.1 ([45, Theorem 2.1]) Let W be a subspace of V . Let v ∈ V . Then v
is orthogonal to W if and only if there is a hyperplane H with v orthogonal to H
and W ⊆ H .
By the Hahn–Banach theorem and Theorem 2.1, it is easy to see that any element
of a normed space is orthogonal to some hyperplane (see [45, Theorem 2.2]). The
relation between orthogonality and hyperplanes is much deeper. We first recall some
definitions. For v ∈ V , we say S ⊆ V supports the closed ball D[v, r] := {x ∈ V :
x − v ≤ r} if dist(S, D[v, r]) = 0 and S ∩ Int D[v, r] = ∅. This is also
equivalent to saying that dist(v, S) = r (see [89, Lemma 1.3, Ch. I]). Let v0 be an
element of the boundary of D[v, r]. A hyperplane H is called a support hyperplane
to D[v, r] at v0 if H passes through v0 and supports D[v, r], and it is called a
tangent hyperplane to D[v, r] at v0 if H is the only support hyperplane to D[v, r]
at v0 . A real hyperplane is a hyperplane in V , when V is considered as a real normed
space.
Theorem 2.2 ([89, Theorem 1.2, Ch. I]) Let W be a subspace of V . Let v ∈ V .
Then v is orthogonal to W if and only if there exists a support hyperplane to D[v, r]
at 0 passing through W if and only if there exists a real hyperplane which supports
the closed ball D[v, v] at 0 and passes through W .
A direct consequence follows. If W is a non-trivial subspace of V , then 0 is the
unique best approximation of v in W if and only if there exists a tangent hyperplane
to D[v, r] at 0 passing through W (see [89, Corollary 1.5, Ch. I]).
The above results are related to the questions as to when the orthogonality is (left
or right) additive or symmetric. It was shown in [45, Theorem 5.1] that orthogonality
is right additive in V if and only if for any unit vector v ∈ V , there is a tangent
hyperplane to D[v, v] at 0. There are other interesting characterizations for
(left or right) additivity of orthogonality. To state them, some more definitions are
required. A normed space V is called a strictly convex space if given any v1 , v2 ∈ V ,
whenever v1 +v2 = v1 + v2 and v2 = 0, then there exists a scalar k such that
v1 = kv2 . This is also equivalent to saying that if v1 = v2 = 1 and v1 = v2 ,
then v1 + v2 < 2. The norm · is said to be Gateaux differentiable at v if
v + hu − v
lim
h→0 h
3. For v ∈ V , there exists a unique functional f of norm one on V such that f (v) =
v.
4. For v ∈ V , there is a tangent hyperplane to D[v, v] at 0.
If V is a reflexive space, then the above are also equivalent to the following
statements.
(5) Any bounded linear functional on a given subspace of V has a unique norm
preserving Hahn–Banach extension on V .
(6) The dual space V ∗ is strictly convex.
Proof Equivalence of (1) and (2) is proved in [45, Theorem 4.2] and equivalence
of (1), (3) and (4) is proved in [45, Theorem 5.1]. For a reflexive space, equivalence
of (1) and (5) is given in [45, Theorem 5.7]. Equivalence of (5) and (6) is a routine
exercise in functional analysis.
Characterization of inner product spaces of dimension three or more can be given in
terms of (left or right) additivity or symmetry of orthogonality. Birkhoff [16] gave a
necessary and sufficient condition for a normed space of dimension at least three to
be an inner product space, and examples to justify the restriction on the dimension.
James [45, Theorem 6.1] showed that a normed space of dimension at least three is
an inner product space if and only if orthogonality is right additive and symmetric
if and only if the normed space is strictly convex and orthogonality is symmetric.
Later, James improved his result and proved a much stronger theorem.
Theorem 2.4 ([46, Theorem 1, Theorem 2]) Let V be a normed space of dimen-
sion at least three. Then V is an inner product space if and only if orthogonality is
symmetric or left additive.
A characterization of orthogonality to be symmetric or left additive in a normed
space of dimension two can be found in [2]. Several other necessary and sufficient
conditions for a normed space to be an inner product space are given in [2, 44]. This
problem has also been extensively studied in [3, 89].
An element v is called a smooth point of D[0, v] if there exists a hyperplane
tangent to D[v, v] at 0. We say v is a smooth point if it is a smooth point
of D[0, v]. Equivalently, v is a smooth point if there exists a unique affine
hyperplane passing through v which supports D[0, v] at v (such an affine
hyperplane is called the affine hyperplane tangent to D[0, v] at v). A normed
space is called smooth if all its vectors are smooth points. By Theorem 2.3, we get
that orthogonality in a normed space is right additive if and only if the normed space
is smooth. We also have that v is a smooth point if and only if the norm function is
Gateaux differentiable at v:
Theorem 2.5 Let v ∈ V . The norm function is Gateaux differentiable at v if and
only if there is a unique f ∈ V ∗ such that f = 1 and f (v) = v. In this
case, the Gateaux derivative of the norm at v is given by Re f (u) for all u ∈ V . In
addition, for u ∈ V , we have that v is orthogonal to u if and only if f (u) = 0.
298 P. Grover and S. Singla
Smooth points and this connection with Gateaux differentiability was studied in
[1, 22, 52, 53] and many interesting results can be obtained as their applications. Let
the space of continuous functions on a compact Hausdorff space X be denoted by
C(X) and let the space of bounded continuous functions on a normal space be
denoted by Cb (). Kečkić [53, Corollary 2.2, Corollary 3.2] gave characterizations
of smooth points in C(X) and Cb (). A characterization of smooth points in B(H)
was given in [52, Corollary 3.3]. For H separable, Abatzoglou [1, Corollary 3.1]
showed that the operators in B(H) of unit norm which are also smooth points are
dense in the unit sphere of B(H). In K(H), this result was first proved by Holub [42,
Corollary 3.4]. Heinrich [40, Corollary 2.3] generalized this result for K(V1 , V2 ),
where V1 is a separable reflexive Banach space and V2 is any normed space. He
proved that the operators which attain their norm at a unique unit vector (upto scalar
multiplication) are dense in K(V1 , V2 ).
In this paragraph, H is a separable Hilbert space. Schatten [86] proved that
D[0, 1] in K(H) has no extreme points. In [42], the geometry of K(H) and its
dual B1 (H), the trace class, was studied by characterizing the smooth points and
extreme points of their closed unit balls. It was shown in [42, Corollary 3.1] that
the trace class operators of rank one and unit norm are exactly the extreme points of
D[0, 1] in B1 (H). The space B1 (H) is predual of B(H) and hence is isometrically
isomorphic to a subspace of B(H)∗ . An interesting result in [1, Corollary 3.3] is
that all the trace class operators of rank one and unit norm are also extreme points
of D[0, 1] in B(H)∗ . In [40], this study was continued to understand the geometry
of K(V1 , V2 ), B(V1 , V2 ) and the weak tensor product of V1 and V2 , where V1 and
V2 are Banach spaces. Characterizations of Gateaux differentiability and Fréchet
differentiability of the norm at an operator T in these spaces were obtained. For
Schatten classes of H, this problem was addressed in [1, Theorem 2.2, Theorem
2.3]. In [1, Theorem 3.1], another characterization of Fréchet differentiability of
the norm at T in B(H) was given, an alternative proof of which can be found in
[72, Theorem 4.6]. In [40, Corollary 2.2], a necessary and sufficient condition for
0 = T ∈ K(V1 , V2 ) to be a smooth point is obtained, where V1 is a reflexive Banach
space and V2 is any Banach space. It is shown that such a T is a smooth point if and
only if T attains its norm on the unique unit vector x0 (up to a scalar factor) and
T x0 is a smooth point. (This was proved for K(H) in [42, Theorem 3.3].) Recently,
as an application of orthogonality, it was shown in [72, Theorem 4.1, Theorem 4.2]
that this characterization also holds when V2 is any normed space (not necessarily
complete).
If T ∈ B(V1 , V2 ) attains its norm on the unique unit vector x0 (up to a scalar
factor) and T x0 is a smooth point of V2 , then T is said to satisfy Holub’s condition
(see [39]). Then Theorem 4.1 and Theorem 4.2 in [72] say that for a reflexive
Banach space V1 and any normed space V2 , smooth points of K(V1 , V2 ) are exactly
those operators which satisfy Holub’s condition. This characterization may not hold
if T is not compact (see [39, Example (a)]) or when V1 is not a reflexive space
(see [39, Example (b), Example (c)]). In the case when V1 is not a reflexive space,
usually some extra condition is needed along with Holub’s condition to characterize
smooth points. For example, Corollary 1 in [37] states that for 1 < p, r < ∞,
Birkhoff–James Orthogonality and Applications: A Survey 299
v + tu − v
lim = max{Re f (u) : f ∈ V ∗ , f = 1, f (v) = v}.
t →0+ t
Using this, Watson [97, Theorem 4] gave a characterization of ∂||| · ||| in the space
(Mn (R), ||| · |||). Zi˛etak [109, Theorem 3.1, Theorem 3.2] improved this result and
showed the following.
Theorem 2.8 ([109, Theorem 3.1, Theorem 3.2]) For A ∈ Mn (C),
∗
: A = U U ∗ is a singular value
∂|||A||| ={U diag(d1 , . . . , dn )U
decomposition of A, si (A)di = |||A||| = ((s1 , . . . , sn )),
∗ ((d1 , . . . , dn )) = 1}.
In [98, Theorem 1], the above result was proved using a different approach. For the
operator norm · on Mn (C), we have the following.
Corollary 2.9 ([98, Example 3]) For A ∈ Mn (C),
Actually the right hand derivative has a deeper connection with orthogonality
as explored by Kečkić [51], where the author introduced the notion of ϕ-Gateaux
derivatives: for u, v ∈ V and ϕ ∈ [0, 2π), the ϕ-Gateaux derivative of norm at v in
the direction u is defined as
v + teιϕ u − v
Dϕ,v (u) = lim .
t →0+ t
These always exist for any two vectors u and v (see [51, Proposition 1.2]). A
characterization of orthogonality follows.
Theorem 2.10 ([51, Theorem 1.4]) Let u, v ∈ V . Then v is orthogonal to u if and
only if
In [52, Theorem 2.4], the expression for the ϕ-Gateaux derivative of the norm on
B(H) was obtained. Using the above proposition, a characterization of orthogonal-
ity in B(H) was given in [52, Corollary 3.1], which was first proved in [14] using a
completely different approach. This characterization of orthogonality and many of
its generalizations are the main content of the next section.
It was noted in [14] that Theorem 3.1 is equivalent to saying that for A, B ∈
Mn (C),
dist(A, CB) = max Ax|y : x = y = 1 and y⊥Bx . (3.1)
It is natural to expect that in the infinite dimensional case, we should have for A, B ∈
B(H),
dist(A, CB) = sup Ax|y : x = y = 1 and y⊥Bx . (3.2)
This was indeed shown to be true in [14] by using the approach given in [5, p.
207]. We would like to point out that the book [5] deals with only separable spaces.
However the arguments can be modified by replacing the sequence of finite rank
operators converging pointwise to the identity operator by a net with this property.
Since the same proof as in [5, p. 207] was used in the proof of Theorem 2.4 of [96],
a similar modification is required there too.
Later, several authors have used different methods to prove Theorem 3.2. One of
these techniques was given in [7, Remark 2.2] using a different distance formula [7,
Proposition 2.1]. Another approach in [52, Corollary 3.1] uses Theorem 2.10 and
the expression for the ϕ-Gateaux derivative of the norm on B(H) (which is given
Birkhoff–James Orthogonality and Applications: A Survey 303
in [52, Theorem 2.4]). Using Theorem 2.10, Wójcik [99] extended Theorem 3.1 for
compact operators between two reflexive Banach spaces over C:
Theorem 3.3 ([99, Theorem 3.1]) Let V1 and V2 be reflexive Banach spaces over
C. Suppose A, B ∈ K(V1 , V2 ) and A = 0. Then A is orthogonal to B if and only if
If A is a complex (or real) unital C ∗ -algebra, then the triple (H, π, ξ ) denotes
a cyclic representation of A, where H is a complex (or real) Hilbert space and
π : A → B(H) is a ∗ -algebra map such that π(1A ) = 1B (H) and {π(a)ξ : a ∈ A}
is dense in B(H). For φ ∈ SA , there exists a cyclic representation (H, π, ξ ) such
that φ(a) = π(a)ξ |ξ for all a ∈ A (see [21, p. 250], [31, Proposition 15.2]).
Theorem 3.5 ([35, Corollary 1.3]) Let a ∈ A. Let B be a subspace of A. Then the
following are equivalent.
1. a is orthogonal to B.
2. There exists φ ∈ SA such that φ(a ∗ a) = a2 and φ(a ∗ b) = 0 for all b ∈ B.
3. There exists a cyclic representation (H, π, ξ ) such that π(a)ξ = a and
π(a)ξ |π(b)ξ = 0 for all b ∈ B.
When A = C(X), Theorem 3.5 and Riesz Representation Theorem yield the
following theorem by Singer [89, Theorem 1.3, Ch. I].
Corollary 3.6 ([89, Theorem 1.3, Ch. I]) Let f ∈ C(X). Let B be a subspace of
C(X). Then f is orthogonal to B if and only if there exists a probability measure μ
on X such that
dist(a, B)2 = 2
|f | dμ and f hdμ = 0
X X
for all h ∈ B.
The condition
f 2∞ 2
= dist(a, B) = |f |2 dμ
X
This was proved in [14, Theorem 1.2] and the discussion after that. The operator
δA (T ) = AT − T A on Mn (C) is called an inner derivation. So this gives that
δA = 2 dist(A, C1Mn (C) ). This was also extended to the infinite dimensional case
in [14, Remark 3.2]. These results were first proved by Stampfli [91] using a com-
pletely different approach. Since all the derivations on B(H) are inner derivations
(see [49, Theorem 9]), the norm of any derivation on B(H) is 2 dist(A, C1B (H) ),
for some A ∈ B(H). For A, B ∈ B(H), let δA,B (T ) = AT − T B for all
T ∈ B(H). In [91, Theorem 8], an expression for the norm of the elementary
operator δA,B is given. In [91, Theorem 5], a distance formula was obtained in an
irreducible unital C ∗ -algebra as follows.
Theorem 4.1 ([91, Theorem 5]) Let H be a complex Hilbert space. Let B be an
irreducible unital C ∗ -subalgebra of B(H). Let A ∈ B. Then
By the Russo–Dye theorem [17, II.3.2.15], under the assumptions of the above
theorem, we obtain
Expressions for the norm of a derivation on von Neumann algebras can be found in
[29]. The most important fact used here is that all the derivations on von Neumann
algebras are inner. This was a conjecture by Kadison for a long time and was proved
in [85]. More on derivations on a C ∗ -algebra can be found in [47, 48, 67, 110]. A
lot of work has been done to answer the question when the range of a derivation
is orthogonal to its kernel. It was proved in [4, Theorem 1.7] that if N is a normal
operator in B(H), then the kernel of δN is orthogonal to the range of δN . In [54,
Theorem 1], it was shown that the Hilbert–Schmidt operators in the kernel of δN are
orthogonal to the Hilbert–Schmidt operators in the range of δN , in the usual Hilbert
space sense. In [59, Theorem 3.2(a)], the Schatten p-class operators in the kernel
of δN were shown to be orthogonal to the Schatten p-class operators in the range
of δN , in the Schatten p-norm. A similar result for the orthogonality in unitarily
invariant norms defined on the norm ideals of K(H) is given in [55, Theorem 1].
For related study on derivations, elementary operators and orthogonality in these
normed spaces, see [27, 50, 56, 62–66, 92–94].
Similar to (4.2), an expression for the distance of an element of a general C ∗ -
algebra from a C ∗ -subalgebra can be obtained from the below theorem of Rieffel
[76].
Theorem 4.2 ([76, Theorem 3.2]) Let A be a C ∗ -algebra. Let B be a C ∗ -
subalgebra of A which contains a bounded approximate identity for A. Let a ∈ A.
Then there exists a cyclic representation (H, π, ξ ) of A and a Hermitian as well
as a unitary operator U on H such that π(b)U = U π(b) for all b ∈ B and
dist(a, B) = 12 π(a)U − U π(a).
308 P. Grover and S. Singla
We note that it is not possible to prove (4.3) by proceeding along the lines of the
proof of Theorem 4.1 given in [14], which uses (4.1). In particular, the following
does not hold true in Mn (C) :
dist(A, B) = max Ax|y : x = y = 1 and y⊥Bx for all B ∈ B .
For example, take A = 1Mn (C) and B = {X ∈ Mn (C) : tr(X) = 0}. Then 1Mn (C) is
orthogonal to B. Now if the above is true, then we would get unit vectors x, y such
that x|y = 1 and Bx|y = 0 for all B ∈ B. Let P = xy ∗. Then rank P = 1
and tr(BP ) = 0 for all B ∈ B. But tr(BP ) = 0 for all B ∈ B gives P = λ1Mn (C)
for some λ ∈ C (see [32, Remark 3]), which contradicts the fact that rank P = 1.
So this approach to prove (4.3) does not work. However it would be interesting to
know if (4.3) is true or not. This is an open question.
The above example contradicts Theorem 5.3 of [101], which says that for Hilbert
spaces H and K, if A ∈ K(H, K) and B is a finite dimensional subspace of
K(H, K), then
dist(A, B) = sup Ax|y : x = y = 1 and y⊥Bx for all B ∈ B .
The proof of this theorem has a gap, after invoking Theorem 5.2, in [101].
As an application of Theorem 3.5, we obtain the following distance formula.
Theorem 4.3 Let a ∈ A. Let B be a subspace of A. Suppose there is a best
approximation to a in B. Then
dist(a, B) = max π(a)ξ |η : (H, π, ξ ) is a cyclic representation of A,
η ∈ H, η = 1 and π(b)ξ |η = 0 for all b ∈ B . (4.4)
φ((a − b0 )∗ (a − b0 )) = a − b0 2
When A = Mn (C), another proof of (4.5) was given by Audenaert [12, Theorem
9]. Rieffel [77, Theorem 3.10] obtained (4.5), using a different method. In [77], it
was also desired to have a generalization of (4.5) with C1A replaced by a unital
C ∗ -subalgebra. For A = Mn (C), a formula in this direction was obtained in
[32, Theorem 2]. An immediate application of Theorem 3.5 gives the following
generalization of (4.5), when C1A is replaced by a subspace B of A and there is a
best approximation to a in B.
Theorem 4.4 ([35, Corollary 1.2]) Let a ∈ A. Let B be a subspace of A. Let b0
be a best approximation to a in B. Then
For details, see [35]. Geometric interpretations of Theorems 3.5 and 4.4 have also
been explained in [35].
Henceforward, C ∗ -algebras are assumed to be complex C ∗ -algebras. Another
distance formula, which is a generalization of [7, Proposition 2.3], is given below.
Some notations are in order. Given φ ∈ SA , let L = {c ∈ A : φ(c∗ c) = 0}, and let
a1 + L|a2 + LA/L = φ(a1∗ a2 ), for all a1 , a2 ∈ A. Then A/L is an inner product
space. For a ∈ A, let b0 be a best approximation to a in B. Let
Ma,B (φ) = sup{φ((a − b0 )∗ (a − b0 )) − |φ((a − b0 )∗ bα )|2 },
α
where the supremum is taken over all orthonormal bases {bα + L} of B/L in A/L.
310 P. Grover and S. Singla
And equality occurs because by Theorem 3.5, there exists φ ∈ SA such that φ((a −
b0 )∗ (a − b0 )) = dist(a, B)2 and φ((a − b0 )∗ b) = 0 for all b ∈ B.
Now along the lines of the proof of [7, Theorem 2.4] and using Theorem 4.5, we get
the next result. For a Hilbert C ∗ -module E over A and φ ∈ SA , let L = {e ∈ E :
φ(e|e) = 0}. On E /L, define an inner product as e1 +L|e2 +LE /L = φ(e1 |e2 )
for all e1 , e2 ∈ E . For e ∈ E , let f0 be a best approximation to e in F . Let
Me,F (φ) = sup{φ(e − f0 |e − f0 ) − |φ(e − f0 |fα )|2 },
α
where the supremum is taken over all orthonormal bases {fα + L} of F /L in E /L.
Theorem 4.6 Let E be a Hilbert C ∗ -module over A. Let e ∈ E . Let F be a
subspace of E . Let f0 be a best approximation to e in F . Then
Rieffel [77, p. 46] had questioned to have expressions of distance formulas in terms
of conditional expectations. We end the discussion on distance formulas with our
progress in this direction. For a C ∗ -algebra A and a C ∗ -subalgebra B of A, a
conditional expectation from A to B is a completely positive map E : A → B
of unit norm such that E(b) = b, E(ba) = bE(a) and E(ab) = E(a)b, for all
a ∈ A and b ∈ B [17, p. 141]. In fact any projection E : A → B of norm
one is a conditional expectation and vice-a-versa (see [17, Theorem II.6.10.2]). An
interesting fact is that a map E : A → B is a conditional expectation if and only
if E is idempotent, positive and satisfies E(b1 ab2) = b1 E(a)b2, for all a ∈ A and
b1 , b2 ∈ B (see [17, Theorem II.6.10.3]). Thus conditional expectations from A to B
are also determined completely by the algebraic structure. A Banach space V1 is said
to be injective if for any inclusion of Banach spaces V3 ⊆ V2 , every bounded linear
mapping f0 : V3 → V1 has a linear extension f : V2 → V1 with f = f0 . A
Banach space is injective if and only if it is isometrically isomorphic to C(X), where
X is a compact Hausdorff space in which closure of any open set is an open set (see
[28, p. 70]). For v ∈ V and W a subspace of V , let v, W denote the subspace
generated by v and W .
Birkhoff–James Orthogonality and Applications: A Survey 311
This happens in the special case when B = C1A , because for any c ∈ A, the norm
of the best approximation of c to C1A is less than or equal to c, and thus the norm
of the projection E in Theorem 4.7 is one.
Acknowledgments The authors would like to thank Amber Habib and Ved Prakash Gupta for
many useful discussions. The authors would also like to acknowledge very helpful comments by
the referees.
The research of P. Grover is supported by INSPIRE Faculty Award IFA14-MA-52 of DST,
India, and by Early Career Research Award ECR/2018/001784 of SERB, India.
312 P. Grover and S. Singla
References
1. T.J. Abatzoglou, Norm derivatives on spaces of operators. Math. Ann. 239, 129–135 (1979)
2. J. Alonso, Some properties of Birkhoff and isosceles orthogonality in normed linear spaces,
in Inner Product Spaces and Applications. Pitman Research Notes in Mathematical Series,
vol. 376 (Longman, Harlow, 1997), pp. 1–11
3. D. Amir, Characterizations of Inner Product Spaces. Operator Theory: Advances and
Applications, vol. 20 (Birkhäuser Verlag, Basel, 1986)
4. J. Anderson, On normal derivations. Proc. Am. Math. Soc. 38, 135–140 (1973)
5. C. Apostol, L.A. Fialkow, D.A. Herrero, D. Voiculescu, Approximation of Hilbert Space
Operators II. Research Notes in Mathematics, vol. 102 (Pitman (Advanced Publishing
Program), Boston, 1984)
6. L. Arambašić, R. Rajić, On some norm equalities in pre-Hilbert C ∗ -modules. Linear Algebra
Appl. 414, 19–28 (2006)
7. L. Arambašić, R. Rajić, The Birkhoff-James orthogonality in Hilbert C ∗ -modules. Linear
Algebra Appl. 437, 1913–1929 (2012)
8. L. Arambašić, R. Rajić, A strong version of the Birkhoff-James orthogonality in Hilbert C ∗ -
modules. Ann. Funct. Anal. 5, 109–120 (2014)
9. L. Arambašić, R. Rajić, On three concepts of orthogonality in Hilbert C ∗ -modules. Linear
Multilinear Algebra 63, 1485–1500 (2015)
10. L. Arambašić, R. Rajić, On symmetry of the (strong) Birkhoff-James orthogonality in Hilbert
C ∗ -modules. Ann. Funct. Anal. 7, 17–23 (2016)
11. J. Arazy, On the geometry of the unit ball of unitary matrix spaces. Integr. Equ. Oper. Theory
4, 151–171 (1981)
12. K.M.R. Audenaert, Variance bounds, with an application to norm bounds for commutators.
Linear Algebra Appl. 432, 1126–1143 (2010)
13. C. Benítez, M. Fernández, M.L. Soriano, Orthogonality of matrices. Linear Algebra Appl.
422, 155–163 (2007)
14. R. Bhatia, P. Šemrl, Orthogonality of matrices and some distance problems. Linear Algebra
Appl. 287, 77–85 (1999)
15. T. Bhattacharyya, P. Grover, Characterization of Birkhoff-James orthogonality. J. Math. Anal.
Appl. 407, 350–358 (2013)
16. G. Birkhoff, Orthogonality in linear metric spaces. Duke Math. J. 1, 169–172 (1935)
17. B. Blackadar, Operator Algebras-Theory of C ∗ -Algebras and von Neumann Algebras
(Springer, Berlin, 2006)
18. J. Chmieliński, Linear mappings approximately preserving orthogonality. J. Math. Anal.
Appl. 304, 158–169 (2005)
19. J. Chmieliński, On an ε-Birkhoff orthogonality. J. Inequal. Pure Appl. Math. 6, 1–7 (2005)
20. J. Chmieliński, T. Stypula, P. Wójcik, Approximate orthogonality in normed spaces and its
applications. Linear Algebra Appl. 531, 305–317 (2017)
21. J.B. Conway, A Course in Functional Analysis (Springer, New York, 1990)
22. J. Diestel, Geometry of Banach Spaces. Lecture Notes in Mathematics, vol. 485 (Springer,
Berlin, 1975)
23. S.S. Dragomir, On Approximation of Continuous Linear Functionals in Normed Linear
Spaces. An. Univ. Timişoara Ser. Ştiint. Mat. 29, 51–58 (1991)
24. S.S. Dragomir, Continuous linear functionals and norm derivatives in real normed spaces.
Univ. Beograd. Publ. Elektrotehn. Fak. Ser. Mat. 3, 5–12 (1992)
25. H.-K. Du, Another generalization of Anderson’s theorem. Proc. Am. Math. Soc. 123, 2709–
2714 (1995)
26. B.P. Duggal, A remark on normal derivations. Proc. Am. Math. Soc. 126, 2047–2052 (1998)
n
27. B.P. Duggal, Range-kernel orthogonality of the elementary operator X → Ai XBi − X.
i=1
Linear Algebra Appl. 337, 79–86 (2001)
Birkhoff–James Orthogonality and Applications: A Survey 313
28. E.G. Effros, Z.-J. Ruan, Operator Spaces (Oxford University Press, New York, 2000)
29. P. Gajendragadkar, Norm of a derivation on a von Neumann algebra. Trans. Am. Math. Soc.
170, 165–170 (1972)
30. P. Ghosh, D. Sain, K. Paul, On symmetry of Birkhoff-James orthogonality of linear operators.
Adv. Oper. Theory 2, 428–434 (2017)
31. K.R. Goodearl, Notes on Real and Complex C ∗ -Algebras (Shiva, Cambridge, 1982)
32. P. Grover, Orthogonality to matrix subspaces, and a distance formula. Linear Algebra Appl.
445, 280–288 (2014)
33. P. Grover, Some problems in differential and subdifferential calculus of matrices. Ph.D.
Thesis, Indian Statistical Institute (2014)
34. P. Grover, Orthogonality of matrices in the Ky Fan k-norms. Linear Multilinear Algebra 65,
496–509 (2017)
35. P. Grover, S. Singla, Best approximations, distance formulas and orthogonality in C ∗ -
algebras. J. Ramanujan Math. Soc. (to appear)
36. R. Grza̧ślewicz, R. Younis, Smooth points of some operator spaces. Arch. Math. 57, 402–405
(1991)
37. R. Grza̧ślewicz, R. Younis, Smooth points and M-ideals. J. Math. Anal. Appl. 175, 91–95
(1993)
38. P. Halmos, A Hilbert Space Problem Book (D. Van Nostrand, Princeton, 1967)
39. J. Hennefeld, Smooth, compact operators. Proc. Am. Math. Soc. 77, 87–90 (1979)
40. S. Heinrich, The differentiability of the norm in spaces of operators. Funct. Anal. Appl. 9,
360–362 (1975)
41. J.B. Hiriart-Urruty, C. Lemarèchal, Fundamentals of Convex Analysis (Springer, Berlin, 2000)
42. J.R. Holub, On the metric geometry of ideals of operators on Hilbert space. Math. Ann. 201,
157–163 (1973)
43. T. Jahn, Orthogonality in generalized Minkowski spaces. J. Convex Anal. 26, 49–76 (2019)
44. R.C. James, Orthogonality in normed linear spaces. Duke Math. J. 12, 291–302 (1945)
45. R.C. James, Orthogonality and linear functionals in normed linear spaces. Trans. Am. Math.
Soc. 61, 265–292 (1947)
46. R.C. James, Inner products in normed linear spaces. Bull. Am. Math. Soc. 53, 559–566 (1947)
47. B. Johnson, Characterization and norms of derivations on von Neumann algebras, in Algèbres
d’Opérateurs. Lecture Notes in Mathematics, vol. 725 (Springer, Berlin, 1979), pp. 228–236
48. R.V. Kadison, Derivations of operator algebras. Ann. Math. 83, 280–293 (1966)
49. I. Kaplansky, Modules over operator algebras. Am. J. Math. 75, 839–858 (1953)
50. D.J. Kečkić, Orthogonality of the range and the kernel of some elementary operators. Proc.
Am. Math. Soc. 128, 3369–3377 (2000)
51. D.J. Kečkić, Orthogonality in S1 and S∞ spaces and normal derivations. J. Oper. Theory 51,
89–104 (2004)
52. D.J. Kečkić, Gateaux derivative of B(H ) norm. Proc. Am. Math. Soc. 133, 2061–2067 (2005)
53. D.J. Kečkić, Orthogonality and smooth points in C(K) and Cb (). Eur. Math. J. 3, 44–52
(2012)
54. F. Kittaneh, On normal derivations of Hilbert-Schmidt type. Glasg. Math. J. 29, 245–248
(1987)
55. F. Kittaneh, Normal derivations in norm ideals. Proc. Am. Math. Soc. 123, 1779–1785 (1995)
56. F. Kittaneh, Operators that are orthogonal to the range of a derivation. J. Math. Anal. Appl.
203, 868–873 (1996)
57. F. Kittaneh, R. Younis, Smooth points of certain operator spaces. Integr. Equ. Oper. Theory
13, 849–855 (1990)
58. C.K. Li, H. Schneider, Orthogonality of matrices. Linear Algebra Appl. 347, 115–122 (2002)
59. P.J. Maher, Commutator approximants. Proc. Am. Math. Soc. 115, 995–1000 (1992)
60. A. Mal, K. Paul, T.S.S.R.K. Rao, D. Sain, Approximate Birkhoff-James orthogonality and
smoothness in the space of bounded linear operators. Monatsh. Math. 190, 549–558 (2019)
61. A. Mal, D. Sain, K. Paul, On some geometric properties of operator spaces. Banach J. Math.
Anal. 13, 174–191 (2019)
314 P. Grover and S. Singla
62. A. Mazouz, On the range and the kernel of the operator X → AXB − X. Proc. Am. Math.
Soc. 127, 2105–2107 (1999)
p
63. S. Mecheri, On minimizing S − (AX − XB)p . Serdica Math. J. 26, 119–126 (2000)
64. S. Mecheri, Some versions of Anderson’s and Maher’s inequalities I. Int. J. Math. Math. Sci.
52, 3281–3297 (2003)
65. S. Mecheri, Some versions of Anderson’s and Maher’s inequalities II. Int. J. Math. Math. Sci.
53, 3355–3372 (2003)
66. S. Mecheri, M. Bounkhel, Some variants of Anderson’s inequality in C1 -classes. JIPAM. J.
Inequal. Pure Appl. Math. 4, Article 24 (2003)
67. P. Miles, Derivations on B ∗ algebras. Pac. J. Math. 14, 1359–1366 (1964)
68. M.S. Moslehian, A. Zamani, Norm-parallelism in the geometry of Hilbert C ∗ -modules. Indag.
Math. 27, 266–281 (2016)
69. R. Nakamoto, S. Takahasi, Norm equality condition in triangular inequality. Sci. Math. Jpn.
55, 463–466 (2002)
70. K. Paul, Translatable radii of an operator in the direction of another operator. Sci. Math. 2,
119–122 (1999)
71. K. Paul, A. Mal, P. Wójcik, Symmetry of Birkhoff-James orthogonality of operators defined
between infinite dimensional Banach spaces. Linear Algebra Appl. 563, 142–153 (2019)
72. K. Paul, D. Sain, P. Ghosh, Birkhoff-James orthogonality and smoothness of bounded linear
operators. Linear Algebra Appl. 506, 551–563 (2016)
73. T.S.S.R.K. Rao, Very smooth points in spaces of operators. Proc. Indian Acad. Sci. Math. Sci.
113, 53–64 (2003)
74. T.S.S.R.K. Rao, Smooth points in spaces of operators. Linear Algebra Appl. 517, 129–133
(2017)
75. T.S.S.R.K. Rao, Adjoints of operators as smooth points in spaces of compact operators. Linear
Multilinear Algebra 66, 668–670 (2018)
76. M.A. Rieffel, Leibniz seminorms and best approximation from C ∗ -subalgebras. Sci. China
Math. 54, 2259–2274 (2011)
77. M.A. Rieffel, Standard deviation is a strongly Leibniz seminorm. New York J. Math. 20,
35–56 (2014)
78. B.D. Roberts, On geometry of abstract vector spaces. Tohoku Math. J. 39, 42–59 (1934)
79. D. Sain, K. Paul, Operator norm attainment and inner product spaces. Linear Algebra Appl.
439, 2448–2452 (2013)
80. D. Sain, K. Paul, S. Hait, Operator norm attainment and Birkhoff-James orthogonality. Linear
Algebra Appl. 476, 85–97 (2015)
81. D. Sain, P. Ghosh, K. Paul, On symmetry of Birkhoff-James orthogonality of linear operators
on finite-dimensional real Banach spaces. Oper. Matrices 11, 1087–1095 (2017)
82. D. Sain, K. Paul, A. Mal, A complete characterization of Birkhoff-James orthogonality in
infinite dimensional normed space. J. Oper. Theory 80, 399–413 (2018)
83. D. Sain, K. Paul, A. Mal, On approximate Birkhoff-James orthogonality and normal cones in
a normed space. J. Convex Anal. 26, 341–351 (2019)
84. D. Sain, K. Paul, A. Mal, A. Ray, A complete characterization of smoothness in the space
of bounded linear operators. Linear Multilinear Algebra (2019). https://doi.org/10.1080/
03081087.2019.1586824
85. S. Sakai, Derivations of W ∗ -algebras. Ann. Math. 83, 273–279 (1966)
86. R. Schatten, The space of completely continuous operators on a Hilbert space. Math. Ann.
134, 47–49 (1957)
87. A. Seddik, Rank one operators and norm of elementary operators. Linear Algebra Appl. 424,
177–183 (2007)
n
88. A. Seddik, On the injective norm of Ai ⊗ Bi and characterization of normaloid operators.
i=1
Oper. Matrices 2, 67–77 (2008)
89. I. Singer, Best Approximation in Normed Linear Spaces by Elements of Linear Subspaces
(Springer, New York, 1970)
Birkhoff–James Orthogonality and Applications: A Survey 315
90. W. So, Facial structures of Schatten p-norms. Linear Multilinear Algebra 27, 207–212 (1990)
91. J. Stampfli, The norm of a derivation. Pac. J. Math. 33, 737–747 (1970)
92. A. Turnšek, Elementary operators and orthogonality. Linear Algebra Appl. 317, 207–216
(2000)
93. A. Turnšek, Orthogonality in Cp classes. Monatsh. Math. 132, 349–354 (2001)
94. A. Turnšek, Generalized Anderson’s inequality. J. Math. Anal. Appl. 263, 121–134 (2001)
95. A. Turnšek, On operators preserving James’ orthogonality. Linear Algebra Appl. 407, 189–
195 (2005)
96. A. Turnšek, A remark on orthogonality and symmetry of operators in B (H ). Linear Algebra
Appl. 535, 141–150 (2017)
97. G.A. Watson, Characterization of the subdifferential of some matrix norms. Linear Algebra
Appl. 170, 33–45 (1992)
98. G.A. Watson, On matrix approximation problems with Ky Fan k norms. Numer. Algorithms
5, 263–272 (1993)
99. P. Wójcik, Gateaux derivative of norm in K(X; Y ). Ann. Funct. Anal. 7, 678–685 (2016)
100. P. Wójcik, Norm-parallelism in classical M-ideals. Indag. Math. 28, 287–293 (2017)
101. P. Wójcik, Orthogonality of compact operators. Expo. Math. 35, 86–94 (2017)
102. W. Werner, Smooth points in some spaces of bounded operators. Integr. Equ. Oper. Theory
15, 496–502 (1992)
103. J.P. Williams, Finite operators. Proc. Am. Math. Soc. 26, 129–136 (1970)
104. A. Zamani, The operator-valued parallelism. Linear Algebra Appl. 505, 282–295 (2016)
105. A. Zamani, Characterizations of norm-parallelism in spaces of continuous functions. Bull.
Iranian Math. Soc. 45, 557–567 (2019)
106. A. Zamani, M.S. Moslehian, Exact and approximate operator parallelism. Can. Math. Bull.
58, 207–224 (2015)
107. A. Zamani, M.S. Moslehian, M.-T. Chien, H. Nakazato, Norm-parallelism and the Davis-
Wielandt radius of Hilbert space operators. Linear Multilinear Algebra 67, 2147–2158 (2019)
108. K. Zi˛etak, On the characterization of the extremal points of the unit sphere of matrices. Linear
Algebra Appl. 106, 57–75 (1988)
109. K. Zi˛etak, Subdifferentials, faces, and dual matrices. Linear Algebra Appl. 185, 125–141
(1993)
110. L. Zsidó, The norm of a derivation in a W ∗ -algebra. Proc. Am. Math. Soc. 38, 147–150 (1973)
The Generalized ∂-Complex on the
Segal–Bargmann Space
Friedrich Haslinger
1 Introduction
F. Haslinger ()
Fakultät für Mathematik, Universität Wien, Wien, Austria
e-mail: [email protected]
write pj (u) for pj ( ∂z∂ 1 , . . . , ∂z∂ n )u, where u ∈ A2 (Cn , e−|z| ), and consider the
2
n
Du = pj (u) dzj , (1.1)
j =1
where u ∈ A2 (Cn , e−|z| ) and pj ( ∂z∂ 1 , . . . , ∂z∂ n ) are polynomial differential opera-
2
n
Du = pk (uJ ) dzk ∧ dzJ ,
|J |=p k=1
2 n −|z|2 ),
where u = |J |=p uJ dzJ is a (p, 0)-form with coefficients in A (C , e
here J = (j1 , . . . , jp ) is a multiindex and dzJ = dzj1 ∧ · · · ∧ dzjp and the
summation is taken only over increasing multiindices.
It is clear that D 2 = 0 and that we have
where
u ∈ dom(D) = u ∈ A2(p,0)(Cn , e−|z| ) : Du ∈ A2(p+1,0)(Cn , e−|z| )
2 2
and
n
∗
D v= pj∗ vj K dzK
|K|=p−1 j =1
∗
for v = |J |=p vJ dzJ and where pj (z1 , . . . , zn ) is the polynomial pj with
complex conjugate coefficients, taken as multiplication operator.
Now the corresponding D-complex has the form
D D
A2(p−1,0)(Cn , e−|z| ) −→ A2(p,0)(Cn , e−|z| ) −→ A2(p+1,0)(Cn , e−|z| ).
2 2 2
←− ←−
D∗ D∗
Similarly to the classical ∂-complex (see [3]) we consider the generalized box
operator ˜ D,p := D ∗ D + DD ∗ as a densely defined self-adjoint operator on
A2(p,0)(Cn , e−|z| ) with
2
The (p, 0)-forms with polynomial components are dense in the space
A2(p,0)(Cn , e−|z| ). In addition we have the following.
2
Lemma 1.1 The (p, 0)-forms with polynomial components are also dense in
dom(D) ∩ dom(D ∗ ) endowed with the graph norm
where
zα
ϕα (z) = √ and |uJ,α |2 < ∞
n
π α! α
A2(p−1,0)(Cn , e−|z| ). Hence the partial sums of the Fourier series of the components
2
We want to find conditions under which ˜ D,1 has a bounded inverse. For this
purpose we have to consider the graph norm (u2 + Du2 + D ∗ u2 )1/2 on
dom(D) ∩ dom(D ∗ ). We refer to [4, Theorem 5.1] here in a slightly different
improved form.
Theorem 2.1 Suppose that there exists a constant C > 0 such that
n
u2 ≤ C ([pk , pj∗ ]uj , uk ), (2.1)
j,k=1
n
for any (1, 0)-form u = j =1 uj dzj with polynomial components. Then
n
Du = (pj (uk ) − pk (uj )) dzj ∧ dzk and D ∗ u = pj∗ uj ,
j <k j =1
hence
Du2 + D ∗ u2 = |pk (uj ) − pj (uk )|2 e−|z| dλ
2
Cn j <k
n
pj∗ uj pk∗ uk e−|z| dλ
2
+
Cn j,k=1
n
|pk (uj )|2 e−|z| dλ
2
=
j,k=1 Cn
n
(pj∗ uj pk∗ uk − pk (uj )pj (uk )) e−|z| dλ
2
+
j,k=1 Cn
n
|pk (uj )|2 e−|z| dλ
2
=
j,k=1 C
n
n
[pk , pj∗ ]uj uk e−|z| dλ,
2
+
j,k=1 C
n
j,k=1 Cn
is finite, since the components uj are polynomials, and it follows that the expression
n ∗
j,k=1 ([pk , pj ]uj , uk ) is a real number.
Now the assumption (2.1) implies that (2.2) holds for (1, 0)-forms with poly-
nomial components and, by Lemma 1.1, we obtain (2.2) for any u ∈ dom(D) ∩
dom(D ∗ ).
Remark 2.2 In Theorem 2.1 we implicitly suppose that the expression
n
([pk , pj∗ ]uj , uk )
j,k=1
is nonnegative.
The Generalized ∂-Complex on the Segal–Bargmann Space 321
First we consider the one-dimensional case. Let pm denote the polynomial differen-
tial operator
∂ ∂m
pm = a0 + a1 + · · · + am m ,
∂z ∂z
∗ denote the polynomial
with constant coefficients a0 , a1 , . . . , am ∈ C, and let pm
∗
pm (z) = a 0 + a 1 z + · · · + a m zm ,
Du = pm (u) dz,
It is clear that D 2 = 0, as all (2, 0)-forms are zero if n = 1, and that we have
D ∗ v dz = pm
∗
v.
˜ D,1 := DD ∗
m m
2
k
e−|z| dλ(z).
2
= ! ak u(k−)(z) (2.3)
C k=
=1
m m
k j
! ak a j u(k−) u(j −) . (2.4)
=1 k,j =
322 F. Haslinger
∂k
m m m m
∗
[pm , pm ]u = ak k
a j zj u − a j zj aj u(j ) .
∂z
k=0 j =0 j =0 j =0
k
notice that = 0, in the case k < . Hence we obtain
m
j
∗ k j (k−) j −
[pm , pm ]u = ak a j ! u z .
j,k=1 =1
m
j
k j
u(k−)u(j −) e−|z| dλ(z).
2
= ak a j ! (2.5)
C
j,k=1 =1
Now it is easy to show that integration of (2.4) coincides with (2.5) and we are done.
Lemma 2.4 There exists a constant C > 0 such that, for each u ∈ dom(D ∗ ),
Proof First let u be a polynomial and note that the last term in (2.3) equals
|am u(z)|2 e−|z| dλ(z) = m! |am |2 u2
2
m!
C
and all the other terms are non-negative, see Lemma 2.3. Now we get that
1
D ∗ u2 = (pm
∗ ∗
u, pm ∗
u) = ([pm , pm ]u, u) + (pm (u), pm (u)) ≥ u2 ,
C
where
1
C= ,
m! |am |2
The Generalized ∂-Complex on the Segal–Bargmann Space 323
if we suppose that am = 0, and we are done. Finally apply Lemma 1.1 to obtain the
desired result.
˜ D,1 = DD ∗ has a bounded inverse
Theorem 2.5 Let D be as in (1.1). Then
˜ D,1 ).
ÑD,1 : A2(1,0)(C, e−|z| ) −→ dom(
2
D ∗ ÑD,1 α ≤ Cα
˜ D,1 : dom(
˜ D,1 ) −→ A2 (C, e−|z|2 )
(1,0)
is bijective and has the bounded inverse ÑD,1 , see [4, Theorem 5.1]. The rest follows
from [4, Theorem 5.2].
3 Commutators
and
[Aj , Bj ] = I, j = 1, . . . , n.
respect to zj on A2 (Cn , e−|z| ) and its adjoint, the creation operator Bj , by the
2
multiplication by zj , both operators being unbounded densely defined (see [1, 2]).
One can show that A2 (Cn , e−|z| ) with this action of the Bj and Aj is an irreducible
2
n
1
H = (Aj Bj + Bj Aj ).
2
j =1
Remark 3.1 If we apply (3.1) for the one-dimensional case of Lemma 2.3, we get
m
2
∗
]u(z) u(z) e−|z| dλ(z) = e−|z| dλ(z),
2 2
[pm , pm ()
pm u(z)
C =1 C
where (e1 ) and (e2 ) denote the derivatives with respect to z1 and z2 respectively. In
addition suppose that for all derivatives (α) of order 2 we have
(α)∗ (α)
pj pk = δj,k cj,α j, k = 1, 2, (3.3)
where
1 1
C1 = c1,α > 0 and C2 = c2,α > 0.
α! α!
|α|=2 |α|=2
Then
1
2
u2 ≤ ([pk , pj∗ ]uj , uk ), (3.4)
min(C1 , C2 )
j,k=1
2
for any (1, 0)-form u = j =1 uj dzj with polynomial components.
The Generalized ∂-Complex on the Segal–Bargmann Space 325
Now we use (3.2) and get for the first order derivatives
2
(e1 )∗ (e1 ) (e2 )∗ (e2 )
[(pj pk uj , uk ) + (pj pk uj , uk )]
j,k=1
(e )∗ (e ) (e )∗ (e )
=(p1 1 p1 1 u1 , u1 ) + (p2 1 p2 1 u2 , u2 )
(e )∗ (e ) (e )∗ (e )
± (p2 2 p1 2 u1 , u2 ) ± (p1 2 p2 2 u2 , u1 )
(e )∗ (e ) (e )∗ (e )
+ (p1 2 p1 2 u1 , u1 ) + (p2 2 p2 2 u2 , u2 )
(e )∗ (e ) (e )∗ (e )
± (p2 1 p1 1 u1 , u2 ) ± (p1 1 p2 1 u2 , u1 )
(e ) (e ) (e ) (e )
=(p1 1 u1 ± p2 1 u2 , p1 1 u1 ± p2 1 u2 )
(e ) (e ) (e ) (e )
+ (p1 2 u1 ± p2 2 u2 , p1 2 u1 ± p2 2 u2 )
(e ) (e ) (e ) (e )
=p1 1 u1 ± p2 1 u2 2 + p1 2 u1 ± p2 2 u2 2 .
2 1 (α)∗ (α)
(p pk uj , uk ) = C1 u1 2 + C2 u2 2 .
α! j
j,k=1 |α|=2
Hence we get
2
([pk , pj∗ ]uj , uk ) =C1 u1 2 + C2 u2 2
j,k=1
(e ) (e ) (e ) (e )
+ p1 1 u1 ± p2 1 u2 2 + p1 2 u1 ± p2 2 u2 2 ,
(e )∗ (e ) (e )∗ (e ) (e )∗ (e ) (e )∗ (e )
p2 1 p1 1 = ± p1 1 p2 1 , p1 2 p2 2 = ± p2 2 p1 2 , (3.5)
326 F. Haslinger
where (e1 ) and (e2 ) denote the derivatives with respect to z1 and z2 respectively. In
addition suppose that for all derivatives (α) of order 2 we have
(α)∗ (α)
pj pk = δj,k cj,α j, k = 1, 2, (3.6)
where
1 1
C1 = c1,α > 0 and C2 = c2,α > 0.
α! α!
|α|=2 |α|=2
Then
1
2
u2 ≤ ([pk , pj∗ ]uj , uk ),
min(C1 , C2 )
j,k=1
2
for any (1, 0)-form u = j =1 uj dzj with polynomial components.
Finally we exhibit some examples, where conditions (3.2) and (3.3), or (3.5)
and
n (3.6) are∗ checked. Examples (a) and (c) are taken from [4], where
j,k=1 ([pk , pj ]uj , uk ) was directly computed.
Example
∂2 ∂2 ∂2
(a) We take p1 = and p2 = + . Then p1∗ (z) = z1 z2 and p2∗ (z) =
∂z1 ∂z2 ∂z12 ∂z22
z12 + z22 and we see that (3.2) and (3.3) are satisfied:
and we obtain
2
([pk , pj∗ ]uj , uk )
j,k=1
2 2
∂u1 ∂u2 ∂u1 ∂u2
e−|z| dλ,
2
= |u1 |2 + 4|u2 |2 + +2 + +2
C2 ∂z1 ∂z2 ∂z2 ∂z1
2
for u = j =1 uj dzj with polynomial components.
2 ∂2 ∂2
(b) Taking p1 = i ∂z∂1 ∂z2 and p2 = ∂z 2 + ∂z2 we have that p1∗ (z) = −iz1 z2 and
1 2
p2∗ (z) = z12 + z22 and that (3.2) and (3.3) are satisfied:
(e )∗ (e ) (e )∗ (e ) (e )∗ (e ) (e )∗ (e )
p2 1 p1 1 = −p1 2 p2 2 , p1 1 p2 1 = −p2 2 p1 2 ,
The Generalized ∂-Complex on the Segal–Bargmann Space 327
and we obtain
2
([pk , pj∗ ]uj , uk )
j,k=1
2 2
∂u1 ∂u2 ∂u1 ∂u2
e−|z| dλ,
2
= |u1 |2 + 4|u2 |2 + + 2i + + 2i
C2 ∂z1 ∂z2 ∂z2 ∂z1
2
for u = j =1 uj dzj with polynomial components.
∂2
(c) Let pk = ,k = 1, 2. Then pj∗ (z) = zj2 , j = 1, 2 and we see that (3.5)
∂zk2
and (3.6) are satisfied and we have
2
2
2
∂uj
([pk , pj∗ ]uj , uk ) = (2δj,k uj , uk ) + (4δj k zj , uk )
∂zk
j,k=1 j,k=1 j,k=1
2 ;
;
; ∂uj ;2
= 2u + 4
2 ; ;
; ∂zj ; .
j =1
∂2 ∂2
(d) For p1 = + ∂
and p2 = ∂
+ we have p1∗ (z) = z12 + z2 and p2∗ (z) =
∂z12 ∂z2 ∂z1 ∂z22
z1 + z22 and we see that (3.2) and (3.5) are not satisfied. In particular,
; ; ; ;
2
; ∂u1 ;2 ; ∂u2 ;2
∗ ;
([pk , pj ]uj , uk ) =3(u1 + u2 ) + 4 ;
2 2 ; ;
+; ;
∂z1 ; ∂z2 ;
j,k=1
Acknowledgments The author thanks the referees for several useful suggestions.
This project was partially supported by the Austrian Science Fund (FWF) project P28154.
References
1. L.D. Fadeev, O.A. Yakubovskii, Lectures on Quantum Mechanics for Mathematics Students.
Student Mathematical Library, vol. 47 (American Mathematical Society, Providence, 2009)
2. G.B. Folland, Harmonic Analysis in Phase Space. Annals of Mathematics Studies, vol. 122
(Princeton University Press, Princeton, 1989)
3. F. Haslinger, The ∂-Neumann Problem and Schrödinger Operators. de Gruyter Expositions in
Mathematics, vol. 59 (Walter de Gruyter, Berlin, 2014)
328 F. Haslinger
4. F. Haslinger, The ∂-complex on the Segal-Bargmann space. Ann. Polon. Mat. 123, 295–317
(2019)
5. D.J. Newman, H.S. Shapiro, Certain Hilbert spaces of entire functions. Bull. Am. Math. Soc.
72, 971–977 (1966)
6. D.J. Newman, H.S. Shapiro, Fischer spaces of entire functions, in Entire Functions and Related
Parts of Analysis. Proceedings of Symposium in Pure Mathematics La Jolla, 1966 (American
Mathematical Society, Providence, 1968), pp. 360–369
7. D.G. Quillen, On the representation of Hermitian forms as sums of squares. Invent. Math. 5,
237–242 (1968)
8. F. Treves, Linear Partial Differential Equations with Constant Coefficients (Gordon and Breach,
New York, 1966)
The Inverse Characteristic Polynomial
Problem for Trees
1 Introduction
During the last 20+ years, a theory of the possible multiplicities of the eigenvalues,
of Hermitian matrices with a given graph, has developed. This is the most developed,
and most interesting, in the case of trees, but much is now known for general graphs.
C. R. Johnson ()
The College of William and Mary, Williamsburg, VA, USA
e-mail: [email protected]
E. Gruner
Gettysburg College, Gettysburg, PA, USA
e-mail: [email protected]
Recently the theory has expanded to geometric multiplicities for the eigenvalues of
a general (combinatorially symmetric) matrix; there are strong similarities and some
notable differences. Though ongoing, much of this work is summarized in the recent
book [6].
The question then naturally arises: “What about algebraic multiplicities of the
eigenvalues of a matrix with a given graph?” Here, the field makes a big difference.
In the case of the complex numbers, there is a complete answer because of the
“additive inverse eigenvalue problem.” Recall that the diagonal entries of a matrix
are independent of its undirected graph. Over the complex numbers, for a given
n-by-n matrix A and given desired complex numbers λ1 , λ2 , . . . , λn , there is a
diagonal matrix D such that A + D has eigenvalues λ1 , λ2 , . . . , λn [3].
This is not true over the reals. There it is natural to ask which real polynomials
occur for real matrices with a given undirected graph. There is good reason to guess
that any real polynomial may occur when the graph is connected. But, this is not
easy to prove. Initially, it is natural to consider trees. Recently, it has been shown [2]
that for the path (tridiagonal matrices), any real polynomial can occur. But, the proof
is existential. It is shown that a real matrix may be constructed for a given monic,
degree n real polynomial when there is a degree n − 1 polynomial with a certain
relationship to the former. Existence may be shown, but construction is difficult.
For the star (all vertices pendent from a single vertex), it has also recently been
shown [5] that any real polynomial occurs over R. The proof is constructive and
much more is shown, just for the star. We benefit here from the technology therein,
although the technology does not easily transfer to other trees.
Here, we consider general trees and real matrices; for a tree T , R(T ) denotes
the set of real matrices whose graph is T (not necessarily symmetric, but combi-
natorially symmetric, and no restriction besides reality on the diagonal entries). A
method is developed that, for each tree, realizes many characteristic polynomials in
R(T ), and for many trees, realizes all real characteristic polynomials. The method
is constructive and gives all real polynomials for the path (the first time that has
occurred), as well as many other trees. The smallest tree for which not all real
polynomials are realized has ten vertices, and this is not because of the number
of vertices, but because of a notion of balance of the tree.
2 Preliminaries
We assume that the reader is familiar with the basic terminology from linear algebra
and graph theory: see [4] and [6]. However, we shall briefly elaborate on the concept
of the graph of a matrix discussed in the introduction.
Given an n-by-n matrix A = (aij ) over a field F, we let G (A) denote the graph
of A. This graph consists of vertices {1, 2, . . . , n} with an edge {i, j }, i = j , if and
only if aij = 0. For G (A) to be undirected, A must be combinatorially symmetric;
that is, aij = 0 if and only if aj i = 0. This paper will only be concerned with real
matrices; given an undirected graph G, we let R(G) denote all real matrices whose
The Inverse Characteristic Polynomial Problem for Trees 331
graph is G. If v is a vertex of G (A), then we let A(v) denote the principal submatrix
of A resulting from the deletion of the row and column corresponding to v.
The matrices we construct will all be of a specific form. Suppose we have a tree
T on n vertices, and let v be any vertex with degree s. If s ≥ 1, let u1 , u2 , . . . , us be
the neighboring vertices of v. Additionally, let T1 , T2 , . . . , Ts be the branches of T
at v, in which each Ti contains the vertex ui , and let 1 , 2 , . . . , s be the respective
number of vertices in T1 , T2 , . . . , Ts . From this point forward, we will refer to the
number of vertices in a tree or a branch of a tree as its weight.
We say that A ∈ R(T ) is centered at v if it is 1-by-1, or if it has the following
form:
⎡ ⎤
b v1 · · · vs
⎢e1,1 A1 0 0⎥
⎢ ⎥
A=⎢ . .. ⎥ .
⎣ .. 0 . . . . ⎦
e1,s 0 · · · As
In this form, b ∈ R, e1,i = e1 ∈ Ri , and vi = (ai e1,i )T , with ai ∈ R \ {0}. (Note
that e1 represents the basic unit vector with a 1 in the first entry and 0’s elsewhere.)
Each Ai is a matrix in R(Ti ) that is centered at ui .
Define A to be pseudosymmetric if it is 1-by-1, or if it has the above form and
each Ai is diagonally similar to a symmetric matrix. We will usually be constructing
matrices that have this property.
The neighbors formula [6] provides a convenient recursive representation of the
characteristic polynomial for a matrix in this form. Let pA denote the characteristic
polynomial of the matrix A. If A = [b], then pA = (x − b). Otherwise, we have:
/
s
s /
s
pA = (x − b) pAi − ai pAi (ui ) pAj
i=1 i=1 j =1,j =i
s /
s
= (x − b)pA(v) − ai pAi (ui ) pAj . (1)
i=1 j =1,j =i
We define the empty product, and the characteristic polynomial of the empty
matrix, to be 1.
Given any vertex v of T and any matrix A ∈ R(T ), there is a diagonal and/or
permutation similarity that takes A to a matrix centered at v. Since the characteristic
polynomial is similarity invariant, if we are trying to realize a particular polynomial
for a particular tree, we can take our matrix to be centered at whatever vertex is most
convenient for us.
Finally, we will provide an overview of the algebraic technique known as the
partial fraction decomposition, or PFD.
The following theorem was adapted from a more general statement in [1]. The
proof is straightforward and is thus omitted.
332 C. R. Johnson and E. Gruner
E
Theorem 2.1 ([1]) Let g(x) = ki=1 (x − γi ), in which γ1 , γ2 , . . . , γk are real
E
numbers such that γ1 > γ2 > · · · > γk . Let Qi (x) = kj =1,j =i (x − γj ). Finally,
let f (x) be a real polynomial of degree less than k. Then we have
f (x) a1 a2 ak
= + + ···+
g(x) x − γ1 x − γ2 x − γk
p(x) a1 a2 an−1
= (x − b) − − − ··· − ,
g(x) x − γ1 x − γ2 x − γn−1
with b ∈ R and ai = −p(γi )/Qi (γi ). Furthermore, ai and ai+1 are of the same
sign for any i ∈ {1, 2, . . . , n − 2} if and only if p has an odd number of roots lying
strictly between γi and γi+1 .
Proof By the division algorithm for polynomials, we can write
in which q(x) and r(x) are real polynomials and deg r(x) < deg g(x) = n −
1. (While the division algorithm normally allows for the possibility of a zero
remainder, this cannot happen in our case because p and g are relatively prime.)
Since p and g are both monic and they differ by only one degree, we know that
q(x) = x − b for some unique real number b. So we may write
Since deg r(x) < deg g(x), we can apply Theorem 2.1 to write
r(x) a1 a2 an−1
= + + ···+ ,
g(x) x − γ1 x − γ2 x − γn−1
with ai = r(γi )/Qi (γi ). Since each γi is a root of g, we have r(γi ) = −p(γi ) for
all i ∈ {1, 2, . . . , n − 1}, so we can just as well write ai = −p(γi )/Qi (γi ). The last
statement of the lemma follows from a proof similar to that of Lemma 2.2.
3 Main Results
The ease of constructing a matrix for a particular polynomial and tree hinges
largely on the possible signs of the coefficients in a partial fraction decomposition.
Therefore, we would like to find a way to classify a polynomial according to this
characteristic.
Suppose we have a monic real polynomial p of degree n. As x ranges from −∞
to +∞, p(x) undergoes n − d strict sign changes, in which d is an even integer at
least 0 and at most n (if n is even) or n − 1 (if n is odd). We shall call this integer d
the root deficiency of p.
The root deficiency of p is directly related to its complete factorization over the
real polynomials. Suppose we have
t
p = w1s1 w2s2 · · · wαsα q1t1 q2t2 · · · qββ ,
Observe that d = 0 if and only if p has n distinct linear factors. On the other
hand, d has maximal value if and only if 1) p is composed entirely of irreducible
quadratics, or 2) at most one si is odd.
Let us also define a signed root to be a root at which p undergoes a strict sign
change. These are precisely the roots whose corresponding linear terms have odd
exponents, and there are n − d of them.
Example 3.1 The polynomial p1 (x) = x 3 + x 2 − 4x − 4 = (x + 1)(x − 2)(x + 2)
has root deficiency 0. The signed roots of p1 are −1, 2, and −2.
Example 3.2 The polynomial p2 (x) = x 2 (x −5)3 (x +1)(x 2 +1) has root deficiency
6. The signed roots of p2 are 5 and −1.
Let p be a monic real polynomial of degree n, and let γ1 > γ2 > · · · > γn−1
be real numbers with p(γi ) = 0 for each i ∈ {1, 2, . . . , n − 1}. Suppose
E we apply
the combined division algorithm/PFD procedure to p and g = n−1 i=1 (x − γi ) as
in Lemma 2.3. The potential positive/negative breakdown of the ai ’s we obtain is
connected to the root deficiency of p.
Proposition 3.3 Let p be a monic real polynomial of degree n and root deficiency
d. Let k1 ≥ k2 be nonnegative integers with k1 + k2 = n − 1. We can choose real
p(x)
numbers γ1 > γ2 > · · · > γn−1 such that the representation of En−1 shown in
i=1 (x−γi )
Lemma 2.3 yields k1 negative ai ’s and k2 positive ai ’s if and only if k2 ≥ (d − 2)/2.
Proof For the “only if” direction, assume that we have a selection of (n − 1) γi ’s
that yields k1 negative ai ’s and k2 positive ai ’s for a particular polynomial p, with
k1 ≥ k2 . We shall prove that d must be at most 2k2 +2, implying that k2 ≥ (d −2)/2.
Let z = k1 − k2 . We claim that we must have at least z − 1 distinct consecutive
pairs of negative ai ’s. We consider two consecutive pairs distinct if they differ in at
least one element: for example, the set {a1 , a2 , a3 } consists of two consecutive pairs.
If we ignore all the positive ai ’s, then we have k1 negative ai ’s, which consists
of k1 − 1 distinct consecutive pairs. Each positive ai that we introduce can break
up at most one of these consecutive pairs (none if it is terminal or adjacent to an
already introduced ai ). When we have introduced all k2 of these ai ’s, we are left
with at least (k1 − 1) − k2 = z − 1 consecutive pairs of negative ai ’s. Since each
consecutive pair of same-sign ai ’s necessitates at least one signed root of p between
the corresponding γi ’s (see Lemma 2.3), p must have at least z − 1 signed roots.
Recall that p has n−d distinct signed roots, so z−1 ≤ n−d. Substituting k1 −k2
for z and k1 + k2 + 1 for n, we determine that d ≤ 2k2 + 2, or k2 ≥ (d − 2)/2, as
claimed.
For the “if” direction, suppose we have k1 and k2 in mind such that k1 +k2 = n−1
and k2 ≥ (d − 2)/2. When we choose γi ’s in the following steps, we assume that
no γi is a root of p, signed or “unsigned.”
Let R1 , R2 , . . . , Rn−d be the distinct signed roots of p, with R1 > R2 > · · · >
Rn−d . If n − d > 0, then take R = R1 ; otherwise, take R to be any real number.
If k2 = k1 = (n − 1)/2, then choose all γi ’s to be larger than R. Then
by Lemma2.3, the ai ’s will alternate in sign, and with n − 1 being even we
The Inverse Characteristic Polynomial Problem for Trees 335
will have equal numbers of positive and negative ai ’s. Otherwise, choose only
γ1 , γ2 , . . . , γ2k2 +1 to be larger than R. This choice will ensure that the correspond-
ing ai ’s alternate in sign. Note that a1 = −p(γ 1)
Q1 (γ1 ) , as defined in Lemma 2.3; since
both p(γ1 ) and Q1 (γ1 ) are positive, a1 will be negative, as will a3 , a5 , . . . , a2k2 +1 .
Likewise, a2 , a4 , . . . , a2k2 will be positive. In total, we will have k2 +1 negative ai ’s
and k2 positive ai ’s.
Since we now have all the positive ai ’s we want, we’d like the remaining ai ’s, of
which there are (k1 + k2 ) − (2k2 + 1) = k1 − k2 − 1 = z − 1, to all be negative.
But since (d − 2)/2 ≤ k2 , we have that z − 1 ≤ n − d. Therefore, we can choose
the remaining γi ’s such that
R1 > γ2k2 +2 > R2 > γ2k2 +3 > R3 > · · · > Rz−1 > γn−1 .
Since exactly one signed root occurs between γi and γi+1 for i ∈ {2k2 + 1, 2k2 +
2, . . . , n−2}, by Lemma 2.3 a sign change will not occur between the corresponding
ai and ai+1 , so the remaining ai ’s will all be negative, as desired.
Arguably the most relevant previous result to our work is the following theorem,
first proved by Leal-Duarte in [7]. We present a slightly different proof, but the basic
argument is the same.
Theorem 3.4 ([7]) Let T be a tree on n vertices, and let v be any vertex of T .
Suppose p and g are monic polynomials of degrees n and n − 1 respectively,
that both polynomials have all distinct, real roots, and that the roots of g strictly
interlace the roots of p. Then there exists a matrix A ∈ R(T ), centered at v, such
that A has characteristic polynomial p and A(v) has characteristic polynomial g.
Furthermore, this matrix is diagonally similar to a symmetric matrix.
Proof We proceed by induction on n. Suppose first that n = 1; then g = 1, p =
(x − b) for some real number b, and A = [b] is the matrix we seek. Note that A
is diagonally similar to a symmetric matrix: namely, itself, via the similarity matrix
D = [1]. Now suppose n > 1, and the claim holds for t with n > t ≥ 1. Then
v must have degree s ≥ 1; let u1 , u2 , . . . , us be the neighboring vertices of v, in
which each ui is E
contained in the branch Ti with weight i .
Suppose g = n−1 i=1 (x − μi ) for μ1 > μ2 > · · · > μn−1 , and write
! "
p(x) a1 a2 an−1
= (x − b) − + + ···+ ,
g(x) x − μ1 x − μ2 x − μn−1
as in Lemma 2.3.
Since p and g have strictly interlacing roots, we can use the last statement of
Lemma 2.3 to conclude that all ai ’s are of the same sign. In this case, they are all
positive, since a1 = −p(μ1 )/Q1 (μ1 ) is positive.
Suppose we partition the n − 1 terms of the PFD into s partial sums, with the
ith partial sum containing i terms of the original expression—the exact grouping
336 C. R. Johnson and E. Gruner
does not matter here. Then we combine the terms within each partition into a single
rational expression hi (x)/Pi (x), with deg Pi (x) = i and deg hi (x) < i . We now
have:
! "
p(x) h1 (x) h2 (x) hs (x)
= (x − b) − + + ··· + .
g(x) P1 (x) P2 (x) Ps (x)
/
s
s /
s
pA = (x − b) pAi − ai pAi (ui ) pAj
i=1 i=1 j =1,j =i
s /
s
= (x − b)P1 (x) · · · Ps (x) − αi ĥi (x) Pj (x)
i=1 j =1,j =i
s /
s
= (x − b)g(x) − αi ĥi (x) Pj (x) = p(x).
i=1 j =1,j =i
The Inverse Characteristic Polynomial Problem for Trees 337
/
s
pA(v) = pAi = P1 (x) · · · Ps (x) = g(x),
i=1
as claimed.
By our inductive hypothesis, we have that for each i ∈ {1, 2, . . . , s}, Ai =
Di−1 Bi Di , in which Di is a diagonal matrix of appropriate size and Bi is symmetric.
Since each αi is positive, we can construct the diagonal matrix
⎡ ⎤
1 0 0 ··· 0
⎢0 √α1 D1 0 ··· 0 ⎥
⎢ √ ⎥
⎢0 ··· ⎥
D=⎢ 0 α 2 D2 0 ⎥.
⎢. . . .. .. ⎥
⎣ .. .. .. . . ⎦
√
0 0 0 ··· αs Ds
Proof Let v be a vertex of T that meets the above specifications. Without loss of
generality, assume that 1 + 2 + · · · + t = C, with t ≤ s and C ∈ [, k − ]. (Note
that since d ≤ n, we know < k/2 and therefore the range [, k−] contains at least
two integers.) Since k − C would also be in [, k − ], we can also assume without
loss of generality that CE≤ k − C. By Proposition 3.3, we can choose γ1 , γ2 , . . . , γk
such that, if we let g = ki=1 (x − γi ) and represent p/g as in Lemma 2.3, we obtain
C positive coefficients and k − C negative coefficients among the ai ’s.
Relabeling the γi ’s and ai ’s if necessary, write
p(x)
=(x − b)
g(x)
! "
a1 a2 aC aC+1 ak
− + + ··· + + + ··· + ,
x − γ1 x − γ2 x − γC x − γC+1 x − γk
/
s
s /
s
pA = (x − b) pAi − ai pAi (ui ) pAj
i=1 i=1 j =1,j =i
s /
s
= (x − b)P1 (x) · · · Ps (x) − αi ĥi (x) Pj (x)
i=1 j =1,j =i
s /
s
= (x − b)g(x) − αi ĥi (x) Pj (x) = p(x).
i=1 j =1,j =i
4 Constructive Examples
We present two examples that demonstrate the use of this algorithm. The first
example uses a polynomial with real, distinct roots, which can be realized using
the method presented in Theorem 3.4. The second example uses a polynomial with
repeated and complex roots, which can be realized using the method presented in
Theorem 3.5.
Example 4.1 Let p1 be the polynomial
Since there are no restrictions on the central vertex in the real, distinct root case,
let v be the far left vertex indicated in Fig. 1.
We need our characteristic polynomial for A(v) to be of degree 7 and have real,
distinct roots that strictly interlace those of p1 . Any such polynomial will suffice, so
let us choose
Observe that all the numerators in the PFD are positive, as the algorithm claims.
In this case T has only one branch at v - call it T1 - so we may combine all partial
fraction terms into a single rational expression:
p1
= (x − 0.5)
g
7x 6 − 21x 5 − 48.125x 4 + 131.25x 3 + 60.183x 2 − 129.938x − 6.152
− .
g
By pulling out the leading coefficient of the PFD numerator, and factoring the
resulting monic polynomial (call it ĥ1 ), we have:
p1
= (x − 0.5)
g
7(x + 2.261)(x + 1.144)(x + 0.046)(x − 1.046)(x − 2.144)(x − 3.261)
− .
(x + 2.5)(x + 1.5)(x + 0.5)(x − 0.5)(x − 1.5)(x − 2.5)(x − 3.5)
Observe that ĥ1 is of degree 6 and has real, distinct roots that strictly interlace
the roots of g. Letting u denote the sole neighboring vertex of v, we can construct a
The Inverse Characteristic Polynomial Problem for Trees 341
One can verify with the neighbors formula (1) that A has characteristic polyno-
mial p1 .
Now, we can follow the same process to construct A1 . Dividing g by ĥ1 as in
Lemma 2.3 yields:
g 0.458 0.588 0.641
=(x − 0.5) − + +
ĥ1 x + 2.261 x + 1.144 x + 0.046
0.641 0.588 0.458
− + + .
x − 1.046 x − 2.144 x − 3.261
Now, T1 has two branches at u, with weights 5 and 1; call these branches S1 and
S2 . We would like to combine the new PFD into two rational expressions such that
the denominators have degrees 5 and 1. There are several ways we could do this, but
let’s consolidate the first five terms of the PFD and leave the last term alone.
g
=(x − 0.5)
hˆ1
2.917(x + 1.941)(x + 0.687)(x − 0.535)(x − 1.767)
−
(x + 2.261)(x + 1.144)(x + 0.046)(x − 1.046)(x − 2.144)
0.458
− .
x − 3.261
While we will not perform the calculation here, one can use a similar inductive
process to confirm that A is diagonally similar to a symmetric matrix, since all off-
diagonal entries are positive.
Example 4.2 Let T be the same 8-vertex tree used in the previous example, and let
p2 be the following polynomial:
negative and two positive numerators, as shown in the proof of Proposition 3.3. So
let
We see that T has three branches at v, which have weights 3, 2, and 2. Call these
branches T1 , T2 , and T3 respectively, and suppose they each contain the respective
neighbors u1 , u2 , and u3 of v.
We would like to consolidate the PFD terms into three rational expressions such
that the denominators have degrees 3, 2, and 2. However, in order to apply our
algorithm, we must only consolidate terms whose numerators have the same sign.
So let’s combine the first, third, and fifth terms into one expression, the second and
fourth terms into another, and the last two terms into our final expression.
p2 −21397.6(x − 7.55)(x − 6.01) 21108.4(x − 6.63)
=(x + 20) − +
g (x − 8)(x − 7)(x − 6) (x − 7.5)(x − 6.5)
−0.051(x + 1.33)
−
(x − 1)(x + 2)
−21397.6ĥ1 21108.4ĥ2 −0.051ĥ3
=(x + 20) − + + .
P1 P2 P3
Since each ĥi and Pi have strictly interlacing roots, we can construct a matrix
Ai ∈ R(Ti ), centered at ui , such that Ai has characteristic polynomial Pi , Ai (ui )
344 C. R. Johnson and E. Gruner
One can use the neighbors formula to verify that A does indeed have character-
istic polynomial p2 .
Finally, we can construct A1 , A2 , and A3 using the algorithm in Theorem 3.4 to
yield the matrix
⎡ ⎤
−20 −21397.6 0 0 21108.4 0 −0.051 0
⎢ 1 7.44 0.249 0.012 0 0 0 0 ⎥
⎢ ⎥
⎢ 0 0 ⎥
⎢ 1 7.55 0 0 0 0 ⎥
⎢ ⎥
⎢ 0 1 0 6.01 0 0 0 0 ⎥
A=⎢ ⎥.
⎢ 1 0 0 0 7.38 0.109 0 0 ⎥
⎢ ⎥
⎢ 0 0 0 0 1 6.63 0 0 ⎥
⎢ ⎥
⎣ 1 0 0 0 0 0 0.327 1.57 ⎦
0 0 0 0 0 0 1 −1.33
To help narrow the scope of Theorem 3.5, and to pave the way for useful corollaries,
we present a few entirely graph-theoretic results. These theorems will aid us in
choosing the most appropriate vertex of the graph on which to center our constructed
matrix.
Unless otherwise specified, let T be an arbitrary tree on n vertices. We let k =
n − 1, be a nonnegative integer with < k/2, and m = k − 2 + 1, the number of
integers in the range [, k − ]. Observe that since < k/2, we must have m ≥ 2.
With respect to , we say that a vertex v of T is ideal if the largest branch of T
at v has weight k − or smaller.
If T is our tree and p is our desired polynomial with root deficiency d ≥ 2, then
the only vertices upon which we could center our constructed matrix A are the ideal
vertices of T with respect to = (d − 2)/2. If a vertex v is not ideal - that is, if T
has a branch at v with weight greater than k − - then any partial sum of the branch
weights of T at v will either be smaller than or larger than k − .
Proposition 5.1 T has at least one ideal vertex with respect to .
Proof Let v be any vertex of T , and let M be the weight of the largest branch of T
at v. We proceed by induction on M.
The Inverse Characteristic Polynomial Problem for Trees 345
Fig. 3 Images (a), (b), and (c) denote the ideal vertices of this 13-vertex tree with respect to 5, 4,
and 3 respectively
i < for each i, which also means that sj =1,j =i j > k − for each i. Thus for
any ui , the branch containing v would have weight greater than k − , so ui cannot
be ideal.
Therefore, if our desired polynomial p has root deficiency d, and our tree T has
a deficient vertex v with respect to = (d −2)/2, then any pseudosymmetric matrix
A would have to be centered at v. Fortunately, even in this case, we can still usually
come up with an appropriate partial sum. We will present some sufficient conditions
for that later.
We will conclude this graph theory discussion with a theorem that places an
upper bound on the number of ideal vertices a tree can have with respect to a given
.
This may not be the most useful result from a theoretical perspective—we only
need to find one appropriate ideal vertex to know that a matrix can be constructed—
but from a computational standpoint certain vertices may be better for centering
than others. An upper bound—along with the fact that all ideal vertices must be
connected—allows the algorithm user to know when he has found all possible ideal
vertices, allowing him to make the best judgment on which one to use.
First, we introduce one more term and lemma. We call an ideal vertex v of T
ideal pendent (with respect to ) if it is a pendent vertex on the subtree of ideal
vertices with respect to . It is well known that a tree on two or more vertices must
have two or more of those vertices pendent. So as long as T has more than one ideal
vertex, we are guaranteed at least two pendent ideal vertices.
The Inverse Characteristic Polynomial Problem for Trees 347
Fig. 5 A linear tree on 12 vertices with depth 2, with one deficient ideal vertex v with respect to
= 5. Note that the branch weights 4 and 1 sum to 5 ∈ [5, 6] = [, k − ]
had two or fewer branches at v, each with weight less than , then the sum of their
weights would be less than k, a contradiction.
Since T is a linear tree, at most two branches of T at v can contain additional
high-degree vertices, which means the remaining branches are all paths of length m
or smaller. The result follows from Proposition 5.8.
The previous result yields the following specific case:
Corollary 5.10 Suppose T is a linear tree with depth at most 2. Then p has a
pseudosymmetric realization A ∈ R(T ).
Proof Since < k/2, we know that is strictly less than k−, so the range [, k−]
must contain at least two integers. Therefore, m = k − 2 + 1 ≥ 2. See the previous
result (Fig. 5).
This corollary includes the case of paths, which can be considered linear trees
with depth 0. While the existence of arbitrary polynomial realizations for paths had
already been proven in the real number case, this is the first proof that uses an
explicit, constructive argument.
The next few results also follow from Proposition 5.8.
Proposition 5.11 If m ≥ − 1, or, equivalently, if n ≥ (3/2)(d − 2) − 1, then p
has a pseudosymmetric realization A ∈ R(T ).
Proof Let v be an ideal vertex of T . If v is not deficient, then the result follows from
Theorem 5.6. If v is deficient, then by definition all branches of T at v have weight
less than , which in this case also means they have weight less than or equal to m.
The result now follows from Proposition 5.8 (Fig. 6).
conditions of Proposition 5.11), S has depth greater than m when viewed as a linear
tree (violating the conditions of Proposition 5.9), and we have three branches with
weight greater than m = 2 (violating the conditions of Proposition 5.8).
Acknowledgement This work was supported by the 2019 National Science Foundation grant
DMS #1757603.
References
1. W. Adkins, M. Davidson, Synthetic partial fraction decompositions. Math. Mag. 81(1), 16–26
(2008)
2. R.S. Cuestas-Santos, C.R. Johnson, Spectra of tridiagonal matrices over a field (2018). Preprint.
arXiv:1807.08877
3. S. Friedland, Inverse eigenvalue problems. Linear Algebra Appl. 17, 15–51 (1977)
4. R.A. Horn, C.R. Johnson, Matrix Analysis, 2nd edn. (Cambridge University Press, New York,
2013)
5. C.R. Johnson, A. Leal-Duarte, Complete spectral theory for matrices over a field whose graph
is a star. Manuscript
6. C.R. Johnson, C.M. Saiago, Eigenvalues, Multiplicities, and Graphs (Cambridge University
Press, New York, 2018)
7. A. Leal-Duarte, Construction of acyclic matrices from spectral data. Linear Algebra Appl. 113,
173–182 (1989)
A Note on the Fredholm Theory
of Singular Integral Operators
with Cauchy and Mellin Kernels, II
1 Introduction
In this paper we consider linear integral operators, which are made up by multipli-
cation operators, the Cauchy singular integral operator and Mellin type operators.
More precisely, the operators under consideration are given by
1 1
b(x) u(y) dy x u(y) dy
(Au)(x) :=a(x)u(x) + + c+ (x) k+
πi 0 y−x 0 y y
(1.1)
1 1−x u(y) dy
+ c− (x) k− , 0 < x < 1,
0 1−y 1−y
P. Junghanns ()
Chemnitz University of Technology, Faculty of Mathematics, Chemnitz, Germany
e-mail: [email protected]
R. Kaiser
TU Bergakademie Freiberg, Faculty of Mathematics and Computer Science, Freiberg, Germany
e-mail: [email protected]
p p
and are considered in weighted Lp -spaces Lρ,σ := Lv ρ,σ (0, 1), where a, b, c± :
[0, 1] → C are piecewise continuous functions,
2 Preliminaries
and
1 R
FR− u (t) = eiηt u(η) dη, t ∈ R.
2π −R
Let 1 < p ≤ 2, p1 + q1 = 1 and u ∈ Lp (R). Then (cf. [9, p. 96, Theor. 74]) FR u
u and FR−
converges for R → ∞ in Lq (R) to a function u converges in Lp (R) to u.
Moreover, we have
1
uq,R ≤ (2π) q up,R .
F : Lp (R) → Lq (R), u →
u
the Fourier transformation and the function F u the Fourier transform of u ∈ Lp (R).
Of course, F − : Lp (R) → Lq (R) is also well defined, where for u ∈ Lp (R),
we set F − u = limR→∞ FR− u and take the limit in the Lq (R)-sense. Note that,
F v and F − v are also defined for v = F − u and v = F u, respectively, where
F v := limR→∞ FR v as well as F − v := limR→∞ FR− v and the limits are taken in
the Lp (R)-sense. Moreover, with these definitions we have
F −F u = F F −u = u ∀ u ∈ Lp (R), 1 < p ≤ 2.
356 P. Junghanns and R. Kaiser
1 ∗
F = F −1 = F − . (2.3)
2π
Let f, g : R → C be measurable functions. We assume that there is a set N ⊂ R of
measure zero such that f (t − ·)g(·) is integrable for all t ∈ R\N. In this case, we
define the convolution f ∗ g : R → C of these functions by
⎧ ∞
⎨ f (t − s)g(s) ds : t ∈ R\N
(f ∗ g)(t) := −∞
⎩
0 : t ∈ N.
If p ∈ (1, ∞), 1
p + 1
q = 1, f ∈ Lp (R), and g ∈ Lq (R), then
This relation is generalized by the well known Young’s inequality for convolutions
F (f ∗ g) = F f · F g and F − (f ∗ g) = 2π F − f · F − g
as well as
f ∗ g = F − (F f · F g) and f ∗ g = 2π F (F − f · F − g) (2.5)
∞ 1
p
uρ,p,R+ ,∼ = |u(t)| t dt
p ρ
.
0
where again the integral is considered as a Cauchy principal value integral. The
operator SR+ is a linear and bounded operator in Lρ (R+ ) (see, for example, [3,
p
ξ :
Z Lpρ (R+ ) → Lp (R), f → e−ξ · f (e−· ),
which is an isometric isomorphism with Z −1 f (y) = y −ξ f (− ln y). Now, a
ξ
consequence of relation (2.6) is the formula
ξ SR+ Z
Z −1 u = F aF − u ∀u ∈ Lp (R), (2.7)
ξ
is well defined, linear, and bounded (cf. (2.3)). In particular, for bγ (t) = eiγ t , γ ∈ R,
and aγ (t) = −sgn(t − γ ), due to (2.6), we have
Let p ∈ (1, ∞), −1 < σ < p − 1, and a ∈ L∞ (R). If there is a finite constant
M = M(a, p, σ ) such that
; ;
; 0 ;
;Wa u; ≤ M uσ,p,R ∀ u ∈ L2 (R) ∩ Lpσ (R),
σ,p,R
p
then the uniquely defined linear and continuous extension of Wa0 onto Lσ (R) is
again denoted by Wa0 : Lσ (R) → Lσ (R) and a ∈ L∞ (R) is called a (p, σ )-
p p
multiplier. Let us denote the set of all (p, σ )-multipliers by Mp,σ . In view of
p
the above example given by (2.8) and the fact that SR ∈ L(Lσ (R)), we have
p
Waγ ∈ L(Lσ (R)) for every γ ∈ R. Since, for the characteristic function χ(a,b]
0
is true, where the constant Cp,σ does only depend on p and σ and not on a ∈ V1 .
In what follows, R = R ∪ {±∞} and Ṙ denote the two-point compactification
and the one-point compactification of R, respectively. By PC(R) we refer to the
algebra of all piecewise continuous functions a : R → C (i.e., f ∈ PC(R) if and
only if all limits a(t ±0), t ∈ Ṙ, where a(∞±0) := a(∓∞) = limt →∓∞ a(t), exist
and are finite). Moreover, by PC p,σ we denote the completion of the; set ;PC(R) ∩
V1 (R) in the Banach algebra Mp,σ , ·Mp,σ , where aMp,σ = ;Wa0 ;L(Lp (R))
; ; σ
(cf. [1, p. 254]). Since a∞ = ;Wa0 ;2,R (cf.[2, Proposition 2.3]), we have PC2,0 =
(PC(R), ·∞ ). Note that PCp,σ ⊂ PC(R) (cf. [1]).
We introduce the space Lσ (R+ ) as the compression of Lσ (R) to the positive
p p
+ p +
half line R . Thus, Lσ (R ) may be identified with the image of the projection
p p
P : Lσ (R) → Lσ (R), which is defined by
u(t) : t ∈ R+ ,
(Pu)(t) =
0 : t ∈ R \ R+ .
∞ 1
tσ p
uσ,p,R+ = |u(t)| p
dt ,
0 (1 + t)σ
Fredholm Theory of Singular Integral Operators 359
where we will use the abbreviation up,R+ := u0,p,R+ . It is easily seen that
∞ 1
p
−t σ
up,σ,∼ := |u(t)| (1 − e ) dt
p
0
defines an equivalent norm on Lσ (R+ ), i.e., there are positive constants C1 and C2
p
such that
defined as the restriction of PWa0 P to the space Lσ (R+ ). For 1 < p < ∞, note
p
p q p
the dual space of Lσ (R) is equal to Lμ (R) with μ = (1 − q)σ and q = p−1 .
∗
Analogously, Lσ (R+ ) = Lμ (R+ ).
p q
Lemma 2.1 Let 1 < p < ∞ and p1 + q1 = 1. Then Mp,σ = Mq,μ . Moreover, if
∗
a ∈ Mp,σ , then Wa0 = Wa0 as well as (Wa )∗ = Wa .
p p
Proof Define R : Lσ (R) → Lσ (R) by (Ru)(t) = u(−t). Then, RFu = F Ru =
2πF − u for u ∈ Lp (R) and 1 < p ≤ 2. Let a ∈ Mp,σ and p ∈ (1, ∞). Taking into
p q
account (2.3), for u ∈ L2 (R) ∩ Lσ (R) and v ∈ L2 (R) ∩ Lμ (R), we have
Wa0 u, v = u, F aF −1 v = u, Wa0 v .
R R R
p
Since Wa0 is continuous on L2 (R) ∩ Lσ (R), .p,σ,R , the operator Wa0 is contin-
q
uous on L2 (R) ∩ Lμ (R), .q,μ,R , which means a ∈ Mq,μ . Using
we get
which implies a ∈ Mq,μ . Hence, due to symmetry reasons, Mp,σ = Mq,μ . By the
∗ ∗
way Wa0 = Wa0 and, consequently, (Wa )∗ = P Wa0 P = Wa , since P ∗ = P.
360 P. Junghanns and R. Kaiser
Let us finish this section with two lemmata on the triviality of the nullspace of a
Wiener-Hopf integral operator or of its adjoint.
Lemma 2.2 ([2], Prop. 2.8) Let us assume that 1 < p < ∞, a ∈ Mp,0 , and
a = 0 almost everywhere on R. Then the nullspace of Wa : Lp (R+ ) → Lp (R+ ) or
the nullspace of the adjoint operator Wa : Lq (R+ ) → Lq (R+ ) are trivial, where
p + q = 1.
1 1
The following lemma generalizes the previous one to the case of weighted spaces
using a stronger assumption on a(t). Here, by Cp,σ we refer to the closure of the
set of piecewise
functions on Ṙ (with finitely many jumps) in the Banach
constant
algebra Mp,σ , .Mp,σ .
Lemma 2.3 ([8], Prop. 1.8) If a ∈ Cp,σ and inft ∈R |a(t)| > 0, then the
homogeneous equation Wa u = in the space Lσ (R+ ) or the adjoint equation
p
3 Boundedness
In this section we introduce the Mellin transformation and the Mellin type operators
as well as the Cauchy singular integral operator on the finite interval (0, 1).
Useful properties of the Mellin transformation as well as a relation between the
convolution operators from the previous section and the Mellin type operators are
given. Furthermore, we formulate conditions for the Mellin operators to be bounded
in weighted Lp -spaces.
For z ∈ C and a measurable function f : (0, ∞) → C, for which t z−1 f (t)
is integrable on each compact subinterval of (0, ∞), the Mellin transform f(z) is
defined as
R
f(z) = lim t z−1 f (t) dt,
R→∞ R −1
Let ξ := {z ∈ C : #z = ξ } and
C0 (ξ ) = u ∈ C(ξ ) : lim u(ξ + iη) = 0 .
|η|→∞
Fredholm Theory of Singular Integral Operators 361
Since, for u ∈ L1 (R), F u ∈ C(R) and lim|η|→∞ (F u)(η) = 0 (due to the Riemann-
Lebesgue theorem), by (3.1) we see that, for ξ ∈ R, the Mellin transformation
Mξ :
L1ξ −1 (R+ ) → C0 (ξ ), f → f
is well defined. Moreover, if f(ξ + i·) ∈ L1 (R) and f ∈ C(0, ∞), then (cf. [4,
Lemma 2.8])
∞
1
f (t) = t −ξ −iη f(ξ + iη) dη , 0 < x < ∞. (3.2)
2π −∞
∞ 1
p
gξ ,p = |g(ξ + iη)| dη
p
.
−∞
Taking into account relation (3.1), we are able to define the Mellin transform for
f ∈Lpξ −1 (R+ ) and 1 < p ≤ 2 by
p
eR
f(ξ + i·) := F Z ξ f = lim
ξ f = lim FR Z t ξ +i ·−1 f (t) dt, (3.3)
R→∞ R→∞ e−R
Mξ :
Lpξ −1 (R+ ) → Lq (ξ ), f → f,
p
of the complex plane. The following lemma modifies [4, Cor. 2.9].
Lemma 3.2 Let 1 < p ≤ 2, p1 + q1 = 1, and α, β ∈ R with α < β. Moreover, let
k ∈ Lpξ −1 (R+ )∩C(R+ ) for every ξ ∈ (α, β). By Corollary 3.1 we have
p
k ∈ Lq (ξ )
for all ξ ∈ (α, β). If, additionally,
k is holomorphic in the strip α,β satisfying
Mα0 ,β0 := sup (1 + |z|)1+δ
k (z) : z ∈ α0 ,β0 < ∞ (3.4)
362 P. Junghanns and R. Kaiser
for all intervals [α0 , β0 ] ⊂ (α, β) and some δ = δ(α0 , β0 ) > 0, then there hold
∞
tμ 4 5
k(t) = t −ξ −iη k(ξ − μ + iη) −
k(ξ + μ + iη) dη (3.5)
2π(1 − t 2μ ) −∞
g (ξ + i·) =
k(ξ − μ + i·) −
k(ξ + μ + i·)
to L1 (R). Relation (3.2) yields (3.5), and relation (3.6) is a consequence of g(1) = 0.
if t → +0. Analogously, we can choose ξ ∈ (α, β) and μ > 0 such that 0 < μ <
β − ξ < ξ − α and β − ξ − μ < ε. Again by (3.5) and (3.6) we get, for t > 2,
; ;
t β−ε |k(t)| ≤ const t β−ε−μ−ξ ; k(ξ + μ + i·);L1 (R) → 0
k(ξ − μ + i·) −
if t → ∞.
The following Lemma modifies Lemma 3.10 in [5].
Lemma 3.4 Let p ∈ [1, ∞), −∞ < α < β < ∞, and
Lpα (R+ ) ∩
k ∈ Lβ (R+ ),
p
Then we have k ∈
L11+ρ (R+ ) for all ρ ∈ (α, β).
p −1
1
1+ρ−α
q1
−1 q
≤ t p
dt f α,p,R+ ,∼
0
∞ 1+ρ−β q1
−1 q
+ t p
dt f β,p,R+ ,∼ ,
1
where
1+ρ −α 1+ρ −β
− 1 q > −1 and − 1 q < −1.
p p
≤ f α,1,R+ ,∼ + f β,1,R+,∼ ,
where the integral is considered as a Cauchy principal value one. It is well known
p p
that S : Lρ,σ → Lρ,σ is a linear and bounded operator if and only if 1 < p < ∞
and −1 < ρ, σ < p − 1 (see, for example, [3, Sections 1.2, 1.5]). For a measurable
function k : R+ → C we define the Mellin operator Bk by
1
x f (y)
(Bk f )(x) = k dy, x ∈ (0, 1).
0 y y
p p
Recall that, for p ∈ [1, ∞) and ρ ∈ R, the integral operator Bk : Lρ,0 → Lρ,0 is
bounded, if k ∈ L11+ρ (R+ ) (cf. [5, Lemma 3.7]).
p −1
1+ρ
Let p ∈ [1, ∞), ρ, σ ∈ R and ξ = p . We introduce the mapping
Lemma 3.6 For p ∈ (1, ∞), ρ, σ ∈ (−1, p − 1), and a(t) = −i cot(πξ − iπt),
the relation Zξ SZξ−1 = Wa holds true in Lσ (R+ ), where ξ = 1+ρ
p
p .
Proof From (2.7) and the boundedness of SR+ : Lσ (R+ ) → Lσ (R+ ), we infer
p p
; ;
; ;
;F aF −1 u; ≤ SR+ L p
Lσ (R+ )
u
σ,p,R
σ,p,R
for all u ∈ L2 (R) ∩ Lσ (R). Hence, a ∈ Mp,σ . On the other hand, if u ∈ Lσ (R+ )
p p
and then
u = Pu,
e−ξ t ∞ y −ξ
u(− ln y) dy
Zξ SZξ−1 u (t) = , t ∈ R+ ,
πi 0 y − e−t
Fredholm Theory of Singular Integral Operators 365
On the other hand we have, for f ∈ Lp (R+ ) and for almost all x ∈ (0, 1),
∞
(− ln x) =
x −ξ PKPf k es x eξ s f (s) ds
0
1
x dy
= k y −ξ f (− ln y)
0 y y
1 x dy
= k Zξ−1 f (y) ,
0 y y
p
and the lemma is proved, since every g ∈ Lρ,0 can be represented in the form
g = Zξ−1 f with f ∈ Lp (R).
Lemma 3.8 ([5], Prop. 3.13) Let p ∈ (1, ∞), ρ ∈ (−1, p − 1), and k ∈ C(R+ ).
Moreover, we assume that there are real numbers α, β with α < β such that 1+ρ
p ∈
(α, β) and such that
p p
Then, for all σ ∈ (−1, p − 1), the integral operator Bk : Lρ,σ → Lρ,σ is bounded.
Corollary 3.9 Let p ∈ (1, ∞), ρ ∈ (−1, p − 1), and k ∈ C(R+ ). Furthermore, we
assume that there are real numbers α, β with α < β such that ξ := 1+ρ p ∈ (α, β)
and k ∈ Lqη−1 (R+ ) for some q ∈ (1, 2] and all η ∈ (α, β). Moreover, we suppose
q
that the Mellin transform k is holomorphic in the strip α,β and fulfils (3.4). Then
p p
the integral operator Bk : Lρ,σ → Lρ,σ is bounded for all σ ∈ (−1, p − 1).
Proof Choose ε > 0 such that α + ε < ξ < β − ε. Corollary 3.3 yields
Bk f = Zξ−1 Wa Zξ f,
p
f ∈ Lρ,0 ∩ Lpρ,σ ,
4 Fredholm Properties
Here we derive necessary and sufficient conditions for the Fredholmness of the
operators of the form (1.1) and also an index formula based on the winding number
of a certain closed curve in C associated with such operators.
Lemma 4.1 Let p ∈ (1, ∞), p1 + q1 = 1 and ρ, σ ∈ (−1, p − 1) as well as
k ∈ C(R+ ). For A ∈ L Lρ,σ ) we define the adjoint operator by
p
1 1
q
(Au)(x)v(x) dx = u(x)(A∗ v)(x) dx, ∀ u ∈ Lpρ,σ , ∀ v ∈ Lρ ,σ
0 0
p
with ρ = (1 − q)ρ and σ = (1 − q)σ . If Bk ∈ L Lρ,σ ) then
Proof The proof is straightforward if one takes into account the commutation
formula for Cauchy principal value and usual integrals (cf.[6, Chapter II, Prop. 4.4]).
Lpα−1 (R+ ) ∩
We recall that the assumption k ∈ Lpβ−1 (R+ ) implies that the Mellin
p p
(cf. [5, equation (3.11)]) by (3.4), which is a weaker condition regarding the Mellin
transform k.
Lemma 4.2 Let a, b ∈ C, p ∈ (1, ∞), ρ ∈ (−1, p−1), p1 + q1 = 1, and k ∈ C(R+ )
satisfy condition (A). Then, for all σ ∈ (−1, p − 1), we have the representations
and
−1
Z1−ξ (aI + bS + Bk )∗ Z1−ξ = Wa = (Wa )∗ , (4.2)
ξ Bk Z −1 f = PF
PZ
k(ξ − i·)F −1 Pf, f ∈ L2 (R+ ) ∩ Lpσ (R+ ).
ξ
p
Lρ,σ is Fredholm if and only if a ± b = 0 and there exists an integer κ satisfying
where
a+b
= e2πiν . (4.4)
a−b
(Note that κ can only take the values 0, −1, and 1). In this case, we have ind A = κ
and the operator A is invertible, invertible from the left, or invertible from the right,
if the index of A is zero, −1, or 1, respectively. The corresponding (one-sided)
inverse is given by
1 ν
1 b x 1 − y ν+κ f (y)
A(−1) f (x) = af (x) − dy ,
a 2 − b2 πi 0 y 1−x y−x
−1 4 5
i.e., A(−1) = a 2 − b 2 aI − bv ν,−ν−κ Sv −ν,ν+κ I .
a−b = e
Moreover, for a+b 2πiμ , μ = 0, and 0 ≤ #μ < 1,
1
b v μ−1,−μ (y) dy
av μ−1,−μ
(x) + = 0, 0 < x < 1, (4.5)
πi 0 y−x
p
i.e., in case κ = 1, we have μ = 1+ν and the nullspace of the operator A : Lρ,σ →
p
Lρ,σ is spanned by v ν,−ν−1 .
Corollary 4.4 In (4.4) we can choose ν in such a way that 0 ≤ #ν < 1. Then,
a−b
= e2πi(1−ν)
a+b
Moreover,
b 1 v −ν,ν (y) dy
a v −ν,ν (x) − = γ0 , 0 < x < 1, (4.7)
πi 0 y−x
and
1
1−ν,ν b v 1−ν,ν (y) dy
av (x) − = δ0 + δ1 x , 0 < x < 1, (4.8)
πi 0 y−x
Fredholm Theory of Singular Integral Operators 369
with certain constants γ0 , δ0 , δ1 ∈ C, where (4.7) remains true also in case of −1 <
#ν < 0.
Proof It remains to prove relations (4.7) and (4.8). By using (4.6), we get
b 1 v −ν,ν (y) dy
a v −ν,ν (x) −
πi 0 y−x
! 1 −ν,ν−1 " 1
b v (y) dy b
= (1 − x) a v −ν,ν−1 (x) − + υ −ν,ν−1 (y) dy
πi 0 y−x πi 0
1
b
= υ −ν,ν−1 (y) dy , 0 < x < 1,
πi 0
which proves (4.8). To prove (4.7) in case −1 < #ν < 0, we use a−b
a+b = e2πi(−ν)
and get from (4.5) the relation
−ν−1,ν b 1 υ −ν−1,ν (y) dy
aυ (x) − = 0, 0 < x < 1,
πi 0 y−x
14 5
ap,σ (t, x) = a(t + 0) + a(t − 0)
2
(4.9)
i4 5 4 5
− a(t + 0) − a(t − 0) cot π σ∞ (t) − ix .
2
Then, the image of ap,σ defines a closed curve in the complex plane, which we
denote by ap,σ . Under the assumption
inf |ap,σ (t, x)| : t ∈ R, x ∈ R > 0 (4.10)
the winding number wind ap,σ is well defined, where the orientation of ap,σ is
given by its inherent parametrization.
Lemma 4.6 ([1], Theorem 5.7, cf. also [8], Theorem 1.2) Let p ∈ (1, ∞), σ ∈
(−1, p − 1) and a ∈ PCp,σ . Then Wa is a Fredholm operator on Lσ (R+ ) if
p
A = Zξ−1 Wa Zξ in Lpρ,σ ,
Then
Zξ−1 Wa Zξ u = (aI + bS + Bk )u
p
∀u ∈ Lρ,0 ∩ Lpρ,σ
and Zξ−1 Wa Zξ is Fredholm on the space Lρ,σ if and only if the curve (4.11) does
p
not run through the zero point. In this case, the Fredholm index of Zξ−1 Wa Zξ :
p p
Lρ,σ → Lρ,σ is equal to the negative winding number of this curve, where the
orientation is given by its inherent parametrization.
4 5n 4 5n
For a = aj j =1 ∈ PC(R)n and b = bj j =1 ∈ PCnp,σ , now we consider
operators of the form
n
"a,b := aj Wb0j . (4.12)
j =1
Moreover, let
n
n
a± (t) := aj (t)bj (±∞) and b± (t) := aj (±∞)bj (t).
j =1 j =1
inf {|a+ (t)| : t ∈ R} > 0 and inf {|b− (t)| : t ∈ R} > 0 , (4.13)
372 P. Junghanns and R. Kaiser
a− (±∞)
Ap,σ (±∞, x) := ,
a+ (±∞)
and
! "
1 b+ (t + 0) b+ (t − 0)
Bp,σ (t, x) := +
2 b− (t + 0) b− (t − 0)
! "
i b+ (t + 0) b+ (t − 0) 1
− − cot π − ix , t ∈ R,
2 b− (t + 0) b− (t − 0) p
b+ (±∞)
Bp,σ (±∞, x) := ,
b− (±∞)
respectively, where
1
p : t ∈ R\{0}
σ0 (t) := 1+σ
p : t = 0.
If the conditions
inf |Ap,σ (t, x)| : t, ∈ R, x ∈ R > 0 (4.14)
and
inf |Bp,σ (t, x)| : t ∈ R, x ∈ R > 0 (4.15)
1 4 5
argAp,σ (t, x) + argBp,σ (t, x) ,
2π
a− b+
where t runs over R, and at the points of discontinuity of a+ or b− , the variable x
runs over R.
Fredholm Theory of Singular Integral Operators 373
Lemma 4.10 ([1], Theorem 5.7, cf. also [8], Theorem 3.2) Let n ∈ N, p ∈
(1, ∞), σ ∈ (−1, p − 1) and aj ∈ PC(R), bj ∈ PCp,σ for j = 1, · · · , n. Then
p
"a,b is a Fredholm operator on Lσ (R) if and only if (4.13), (4.14), and (4.15) hold
p p
true. In this case, the Fredholm index of "a,b : Lσ (R) → Lσ (R) is equal to −κp,σ .
A function a : [0, 1] → C is called piecewise continuous, if it is continuous
at 0 and 1, if the one-sided limits a(x ± 0) exist for all x ∈ (0, 1), and if at least
one of them coincides with a(x). The set of all piecewise continuous functions a :
[0, 1] → C having only a finite number of jumps is denoted by PC[0, 1].
Corollary 4.11 Let p ∈ (1, ∞), ρ, σ ∈ (−1, p − 1) and a, b ∈ PC[0, 1], and
assume that k ∈ C(R+ ) satisfies condition (A). Then the integral operator A :=
p
aI + bS + Bk is Fredholm on the space Lρ,σ if and only if a(x ± 0) + b(x ± 0) = 0
for all x ∈ (0, 1), if a(x) + b(x) = 0 for x ∈ {0, 1}, and if the closed curve
d
A :=
0d ∪ 1d ∪ 1d ∪ . . . ∪ N
d
∪ Nd ∪ N+1
d
∪
1d
does not contain the point 0. Here N stands for the number of discontinuity points
xj , j = 1, . . . , N, of the function
a(x) − b(x)
d(x) =
a(x) + b(x)
chosen in such way that x0 := 0 < x1 < · · · < xN < xN+1 := 1. Using these xj ,
the curves jd , j = 1, . . . , N + 1 and jd , j = 1, . . . , N are given by
jd := d(x) : xj −1 < x < xj
and
14 5
jd := d(xj + 0) + d(xj − 0)
2
i4 5 1
+ d(xj + 0) − d(xj − 0) cot π + it :t ∈R ,
2 p
and
14 5 i4 5 1+σ
1d := 1 + d(1) + 1 − d(1) cot π + it :t∈R .
2 2 p
In this case, the Fredholm index of A is equal to the winding number of the curve
A , where the orientation of A is due to the above given parametrization.
Proof Due to Lemma 4.2, we have the representation
A = Zξ−1
aI +
bWa + Wb Zξ in Lpρ,σ
= a1 I + b1 Wa0 + χ+ W 0 + χ− I
A in Lpσ (R),
b
where
a (t) : t ∈ R+ ,
b(t) : t ∈ R+ ,
a1 (t) = b1 (t) =
0 : otherwise, 0 : otherwise,
and
Due to our assumptions, condition (4.13) is fulfilled, and hence we have, for 0 <
t < ∞,
! "
1 a (t + 0) −
b(t + 0) a (t − 0) −
b(t − 0)
Ap,σ (t, x) = +
2 a (t + 0) +
b(t + 0) a (t − 0) +
b(t − 0)
! "
i a (t + 0) −
b(t + 0) a (t − 0) −
b(t − 0) 1
− − cot π − ix .
2 a (t + 0) +
b(t + 0) a (t − 0) +
b(t − 0) p
Fredholm Theory of Singular Integral Operators 375
Moreover,
! " ! "
1 a(1) − b(1) i a(1) − b(1) 1+σ
Ap,σ (0, x) = +1 − − 1 cot π − ix ,
2 a(1) + b(1) 2 a(1) + b(1) p
a(0) − b(0)
Ap,σ (+∞, x) = ,
a(0) + b(0)
Finally,
and, for t ∈ R,
a (t) +
inf | b(t)| : t ∈ R+ > 0,
inf |a(0) − b(0)i cot π(ξ − it) +
k(ξ − it)| : t ∈ R > 0,
and
inf |Ap,σ (t, x)| : t, ∈ R, x ∈ R > 0.
But this is obviously equivalent to our assumptions. In this case, the Fredholm index
is equal to −κp,σ , where κp,σ is defined as the increment of the function
of A
1 Bp,σ (t, x)
arg Ap,σ (t, x) + arg ,
2π a(0) + b(0)
where t runs over R, and at the points of discontinuity of a −b /a +b , the
variable x runs over R. But κp,σ is equal to the negative winding number of the
curve A . This completes the proof.
p p
We define the operator R : Lρ,σ → Lσ,ρ by (Rf )(x) = f (1 − x). Moreover,
we set
Theorem 4.12 Let p ∈ (1, ∞), σ± ∈ (−1, p − 1) and a, b, c± ∈ PC[0, 1]. If the
functions k± ∈ C(R+ ) satisfy condition (A) for ξ = ξ± := 1+σ ±
p , then the operator
p
A defined in (4.16) is Fredholm on Lσ+ ,σ− if and only if a(x ± 0) − b(x ± 0) = 0
for all x ∈ (0, 1), if a(x) − b(x) = 0 for x ∈ {0, 1}, and if the closed curve
c
A :=
0c ∪ 1c ∪ 1c ∪ . . . ∪ N
c
∪ Nc ∪ N+1
c
∪
1c
does not contain the point 0. Here N stands for the number of discontinuity points
xj , j = 1, . . . , N, of the function
a(x) + b(x)
c(x) =
a(x) − b(x)
chosen in such way that x0 := 0 < x1 < · · · < xN < xN+1 := 1. Using these xj ,
the curves jc , j = 1, . . . , N + 1 and jc , j = 1, . . . , N are given by
jc := c(x) : xj −1 < x < xj
and
14 5
jc := c(xj + 0) + c(xj − 0)
2
i4 5 1
− c(xj + 0) − c(xj − 0) cot π − it :t ∈R .
2 p
The curves j , j ∈ {0, 1} connecting the point 1 with one of the endpoints of 1c
c
and N+1 , respectively, are given by the formulas
a(0) − b(0)i cot π(ξ+ − it) + c+(0)
k+ (ξ+ − it)
0c := :t ∈R
a(0) − b(0)
and
a(1) + b(1)i cot π(ξ− − it) + c− (1)
k− (ξ− − it)
1c := :t ∈R .
a(1) − b(1)
In this case, the Fredholm index of A is equal to the negative winding number of the
c c
curve A , where the orientation of A is due to the above given parametrization.
Proof At first, we consider the case c+ = 1 and c− = 0. Setting ρ = σ+ , σ = σ− ,
ξ = ξ+ = 1+ρp , and ξ− = p . Having regard to Corollary 4.11, we have to show
1+σ
0 ∈ A
c
is equivalent to 0 ∈ Ad c
and that in this case wind A = −wind A d
.
Fredholm Theory of Singular Integral Operators 377
if one compares the increments of the argument along the respective pieces of the
c d
curves A and A .
This section deals with some specific properties of the Mellin operators and the
smoothness of the solutions of the equation
In addition, their asymptotic behaviour near the end point 1 is investigated. Let
n ∈ N0 . For k ∈ Cn (R+ ) and f : (0, 1) → C we define the operators ∂n Bk by
1
x f (y)
(∂n Bk f )(x) := k (n) dy.
0 y y n+1
Lemma 5.1 Let p ∈ (1, ∞), σ < p−1, ρ ∈ R, and n ∈ N0 as well as k ∈ Cn (R+ ).
We assume that there is a β > 1+ρ
p such that the functions
x
k := [a, 1] × [0, 1] → C, (x, y) → y −(β+) k () (5.1)
y
378 P. Junghanns and R. Kaiser
p
are continuous for all a ∈ (0, 1) and all ∈ {0, 1, . . . , n}. Then, for every f ∈ Lρ,σ ,
the function Bk f is n-times continuously differentiable on (0, 1], where
d
(Bk f )(x) = (∂ Bk f )(x), x ∈ (0, 1] (5.2)
dx
holds true.
Proof Let ε > 0. Then, there exists a δ = δ(ε) > 0, such that
1
p
p−1
p
β−1− pρ p−1 σ
,− p−1
≤ε υ (y) dy f ρ,σ,p
0
≤ const · ε
all x, x ∈ [a, 1], which satisfy |x − x | < δ. Here we took into account that
for
β − 1 − pρ p−1 p
> −1 is equivalent to β > 1+ρ p and that σ < p − 1. Hence
∂ Bk f is continuous on (0, 1]. Analogously, one can show that, for x ∈ [a, 1], the
relations
are true, where the constant does only depend on a ∈ (0, 1). The differentiability of
Bk f follows now from
x x 1
ξ f (y)
(∂1 Bk f )(ξ ) dξ = k dy dξ
c c 0 y y2
x 1 x ε
ξ f (y) ξ f (y)
= lim k 2
dy dξ + lim k dy dξ
ε→0 c ε y y ε→0 c 0 y y2
1 x x ε
ξ f (y) ξ f (y)
= lim k 2
dξ dy + lim k dy dξ
ε→0 ε c y y ε→0 c 0 y y2
1 ! " x ε
x f (y) c f (y) ξ f (y)
= lim k −k dy + lim k dy dξ,
ε→0 ε y y y y ε→0 c 0 y y2
Fredholm Theory of Singular Integral Operators 379
where x, c ∈ (0, 1]. With regard to (5.3), we can apply Lebesgue’s dominated
convergence theorem and get
x 1!
x f (y)
c f (y)
"
(∂1 Bk f )(ξ ) dξ = k −k dy = (Bk f )(x) + const.
c 0 y y y y
d
Hence, dx (Bk f )(x) = (∂1 Bk f )(x), x ∈ (0, 1]. The general case follows now by
induction.
Remark 5.2 Note, that the function k defined in (5.1) is continuous for all a ∈
(0, 1) if the limit lim t β+ k () (t) exists and is finite.
t →∞
Let ψ, ζ ≥ 0. By BCψ,ζ = BCψ,ζ (0, 1) we denote the set of all continuous
functions f : (0, 1) → C, for which the function υ ψ,ζ f is bounded on (0, 1). If we
introduce the norm
f ψ,ζ,∞ = sup υ ψ,ζ (x)|f (x)| : 0 < x < 1 ,
then BCψ,ζ becomes a Banach space. Moreover by Cψ,ζ = Cψ,ζ (0, 1) we denote
the set of all continuous functions f : (0, 1) → C, for which the limits
exist and if these limits are equal to zero if ψ > 0 or ζ > 0, respectively. The space
Cψ,ζ is a closed subspace of BCψ,ζ and, consequently, also a Banach space.
Lemma 5.3 Let p ∈ (1, ∞), 1
p + 1
q = 1, σ < p − 1, and ρ ∈ R, as well as
k∈ C(R+ ). Furthermore, we assume that there is a β > 1+ρ
p such that the function
x
k := [a, 1] × [0, 1] → C, (x, y) → y −β k
y
is continuous for all a ∈ (0, 1). If
∞
ρ
+1 q−2
|k(t)|q t p dt < ∞
0
then
⎧
⎨ L Lpρ,σ , BCψ,0 : max 0, 1+ρ
p , χ ≤ψ,
Bk ∈
⎩ L Lp , C 1+ρ
ρ,σ ψ,0 : max 0, p , χ < ψ .
380 P. Junghanns and R. Kaiser
p
Proof First we note that, due to Lemma 5.1, for f ∈ Lρ,σ , the function (Bk f ) (x)
is continuous on (0, 1]. Moreover, for x ∈ (0, 1), using Hölder’s inequality we can
estimate
1
x f (y) dy
v ψ,0 (x) (Bk f ) (x) = x ψ k
0 y y
− ρq σq 1
p ,− p
1 q q
x v (y) dy
≤x ψ
k f ρ,σ ,p
0 y yq
=: N(x) f ρ,σ ,p ,
=: N1 (x) + N2 (x) .
t0
Setting M0 = max |k(t)| : 2 ≤ t ≤ 2 and taking (5.4) into account, we get
⎧ x − ρq −q x − p
2x σq
⎪
⎪ q p
t −χq−2
1− dt : 2x < t0 ,
⎨ c0
t t
N1 (x) ≤ x ψq+1 x
2x − ρq −q
⎪ x − p
σq
⎪
⎩ Mq −2 x p
0 t 1 − dt : t0 ≤ 2x ,
x t t
⎧ 2
⎪
⎪ q (ψ−χ)q ρq σq
p +q−2+ p −χq (s − 1)
− σpq
ds : 2x < t0 ,
⎨ c0 x s
t =xs
= 1 2
⎪
⎪ ρq σq
− σpq
⎩ q
M0 x ψq s p +q−2+ p (s − 1) ds : t0 ≤ 2x ,
1
and
∞
ρ
p +1
σq
ψq+1− ρq
p −q
q−2
N2 (x) ≤ max 1, 2 p x |k(t)| t q
dt .
0
σq ρq
Since p = σ
p−1 < 1 and since ψq + 1 − p − q ≥ (>) 0 is equivalent to
1+ρ
ψ ≥ (>) p , we get the assertion.
Fredholm Theory of Singular Integral Operators 381
Writing
1
− x f (y) dy
(∂ Bk f ) (x) = x k
0 y y
then
⎧
⎨ L Lpρ,σ , BCψ,0 : + max 0, 1+ρ
p , χ ≤ψ,
∂ Bk ∈
⎩ L L ,C
p 1+ρ
ρ,σ ψ,0 : + max 0, p , χ < ψ .
for t → +0, and that the finite limit lim t β+ k ()(t) exists, then
t →∞
⎧
⎨ L Lpρ,σ , BCψ,0 : + max 0, 1+ρ
p ≤ψ,
∂ Bk ∈
⎩ L L ,C
p 1+ρ
ρ,σ ψ,0 : + max 0, p <ψ.
∞
ρ
p +1 q+q−2
|k ()(t)|q t dt
0
1 ρ ∞ ρ
p +1 q−αq−2 p +1 q−βq−2
≤ c1 t dt + c2 t dt
0 1
and
ρ 1+ρ 1
+ 1 q − αq − 2 = −α− q > −1
p p q
382 P. Junghanns and R. Kaiser
as well as
ρ 1+ρ 1
+ 1 q − βq − 2 = −β − q < −1 .
p p q
Corollary 5.6 Let a, c, f ∈ Cn (0, 1] and let the assumptions of Lemma 5.1 be
p
fulfilled. If the operator equation au + cBk u = f has a solution u ∈ Lρ,σ and if
a(x) = 0 for all x ∈ (0, 1], then u ∈ C (0, 1].
n
Proof With the help of Lemma 5.1, we get u = a −1 (f − cBk )u ∈ Cn (0, 1].
Corollary 5.7 Let a, c, f ∈ Cn [0, 1] and a(x) = 0 for all x ∈ [0, 1]. If the
conditions of Corollary 5.5 are satisfied and if the equation au + cBk u = f has
p
a solution u ∈ Lρ,σ , then
Proof Due to the assumptions, Corollary 5.5 in combination with (5.2) deliver
4 5()
u() = a −1 (f − cBk )u ∈ C 1+ρ ++ε,0
p
Moreover , let
< < m,γ
Cm,0 [a, b] = Cm,γ [a, b] and Cm,0
[a,b] (c) = C[a,b] (c) .
0<γ <1 0<γ <1
For γ ∈ [0, ∞) and −∞ < a < b < ∞, the function class Hγ (a, b) is defined as
the set of all functions f : (a, b) → C belonging to C[γ ],γ −[γ ] [c, d] for all intervals
[c, d] ⊂ (a, b), where [γ ] is the integer, which fulfills γ − 1 < [γ ] ≤ γ . In the
same manner we define the classes Hγ (a, b], Hγ [a, b), and Hγ [a, b]. Of course,
Hγ [a, b] = C[γ ],γ −[γ ] [a, b].
Lemma 5.8 Let p ∈ (1, ∞), σ < p − 1, n ∈ N0 and ρ, β ∈ R with
1+ρ
p < β. For k ∈ C(R+ ), assume that t β k(t) is bounded for t → ∞ (i.e.,
Fredholm Theory of Singular Integral Operators 383
ma := sup t β k(t) : a ≤ t < ∞ < ∞ for all a > 0) and that
s β k(s) − t β k(t)
Ma := sup : s, t ∈ [a, ∞), s = t <∞
|s − t|γ
1+ρ
for some γ ∈ (0, 1) satisfying p +γ < β and for all a > 0. Then Bk f ∈ Hγ (0, 1]
p
for all f ∈ Lρ,σ .
p
Proof Let f ∈ Lρ,σ and 0 < a ≤ x1 < x2 ≤ 1. Then
⎤
1
q1
−1+β− pρ q,− p−1
σ
+ υ (y) dy ⎦ f ρ,σ,p
0
≤ const f ρ,σ,p ,
384 P. Junghanns and R. Kaiser
where
the constant does not depend on x1 and x2 and where we took into account
that −1 − γ + β − pρ q > −1 is equivalent to 1+ρ p + γ < β.
Lemma 5.9 ([7], Theorem in §19) Let 0 ≤ a < b ≤ 1 and v ∈ C0,γ [a, b] for
some γ ∈ [0, 1). Then we have
where χ[a,b] is the characteristic function of the interval [a, b]. Moreover, if v(a) =
0 or v(b) = 0 are satisfied, then we even get
respectively.
Corollary 5.10 Let 0 ≤ a < b ≤ 1 and v ∈ Hγ (a, b) for some γ ∈ [0, ∞) as well
as χ[a,b] v ∈ L10,0 . Then Sχ[a,b] v ∈ Hγ (a, b).
Proof Without loss of generality, we can assume that a = 0 and b = 1. Let [c, d] ⊂
(0, 1) and choose ε > 0 such that ε < c and d < 1 − ε. Write
Sv = I1 + I2 + I3
with
ε 1−ε
1 v(y) dy 1 v(y) dy
I1 (x) = , I2 (x) = ,
πi 0 y−x πi ε y−x
and
1
1 v(y) dy
I3 (x) = .
πi 1−ε y−x
for some n ∈ N. We show that then I2 ∈ C[γ ],γ −[γ ] [c, d] also holds if γ ∈ [n, n+1).
Indeed, in that case v ∈ Hγ −1 (0, 1) and, by [6, Lemma 6.1],
v(ε) v(1 − ε) 1−ε v (y) dy
I2 (x) = − − , ε < x < 1−ε.
ε−x 1−ε−x ε y−x
which implies I2 ∈ C[γ ],γ −[γ ] [c, d]. The corollary is proved.
The following corollary can be proved analogously.
Corollary 5.11 Let 0 ≤ a < b ≤ 1, c ∈ {a, b} and v ∈ Hγ (a, b) ∪ {c} with
v(c) = v (c) = . . . = v [γ ] (c) = 0 for some γ ∈ [0, ∞) as well as χ[a,b] v ∈ L10,0 .
Then
Sχ[a,b] v ∈ Hγ (a, b) ∪ {c} .
u∗ (x)
u(x) = and u∗ ∈ C0,δ
[0,1] (1) ,
(1 − x)μ
where, in case α = 0,
u∗∗ ∈ C0,δ
[0,1] (1)
1 1 p1
−ν,ν+κ
|(v f )(x)| dx ≤ |f (x)| v p ρ,σ
(x) dx
0 0
1
p
p
p−1
p
−#ν− pρ p−1 , #ν+κ− σp p−1
× v (x) dx <∞
0
p
for f ∈ Lρ,σ .
Remark 5.14 ([7], §7) For α > 0 and β ∈ R, the function x → (1−x)α+iβ belongs
to C0,α 0,δ
[0,1] (1). Hence, if f ∈ C[0,1] (1) for some δ ∈ [0, 1), then
g ∈ Cmin{α,δ}
[0,1] (1) ,
where v −ν,ν−1 f ∈ L10,0 in view of Remark 5.13. Moreover, from (5.6) and (4.6) we
deliver
1 bυ ν,1−ν (x) 1 y −ν [f (y) − f (1)] dy
u(x) = 2 a[f (x) − f (1)] − .
a − b2 πi 0 (1 − y)1−ν y−x
In case δ > 1 − #ν, Lemma 5.12 delivers that the integral in the last equation
belongs to υ 0,α0 C0,λ
[0,1] (1) for 0 < α0 < 1 − #ν and 0 < λ < α0 . In particular,
u(1) = 0. Choosing α0 = 1 − #ν − ε2 and λ = 1 − #ν − ε, we get, taking into
account Remark 5.14, that u ∈ C0,δ [0,1] (1) for δ1 = 1−#ν −ε and all ε ∈ (0, 1−#ν).
1
Analogously, in case δ ≤ 1 − #ν, we obtain u ∈ C0,δ [0,1] (1) for δ1 = δ − ε and all
1
Now, let us consider the case κ = 0. Due to Lemma 4.3, the operator aI + bS :
p p
Lρ,σ → Lρ,σ is invertible and
−1 4 5
u = a 2 − b2 af − bv ν,−ν Sv −ν,ν f ,
0,min{#ν,δ}
u1 ∈ Hγ (0, 1) ∩ υ 0,−ν C[0,1] (1)
by using Corollary 5.11 and Remark 5.14 together with Corollary 5.10. Note that
ν = 0 due to b = 0. If #ν = 0, then we apply Lemma 5.12 in case α = 0.
p p
In case κ = 1, due to Lemma 4.3, the operator aI + bS : Lρ,σ → Lρ,σ is
invertible from the right and its null space is spanned by υ ν,−ν−1 , such that
−1
u = a 2 − b2 af − bυ ν,−ν−1 Sυ −ν,ν+1 f + γ0 υ ν,−ν−1
1
u(x) = a[f (x) − f (1)]
a 2 − b2
bυ ν,−ν−1(x) 1 y −ν (1 − y)ν+1 [f (y) − 1]
− dy
πi 0 y−x
4 5
+ γ0 + f (1)(δ0 + δ1 x) υ ν,−ν−1 (x) .
Now, we can again make use of Corollary 5.11 and Remark 5.14 together with
Corollary 5.10 to get the assertion also for this case.
For ρ, σ > −1, we define the space
>
L∞
ρ,σ = Lpρ,σ .
1<p<∞
Fredholm Theory of Singular Integral Operators 389
Corollary 5.16 Let ρ, σ > −1, 0 ≤ γ < ∞, and a ∈ C, b ∈ C\{0} with a±b = 0.
If u ∈
L∞
ρ,σ and
then
u ∈ Hγ (0, 1) ∩ C0,δ
[0,1] (1) and u(1) = 0 ,
1
Remark 5.17 Let p, ρ, σ , a, b, ν, and κ fulfil the conditions of Proposition 5.15, let
c ∈ Hγ (0, 1], and let k(t) satisfy the conditions of Lemma 5.8 for some γ ∈ (0, 1).
If u ∈
p
Lρ,σ and
and we can apply Proposition 5.15 with min {γ , δ} instead of δ to deduce a regularity
property for u.
Analogously, using Corollary 5.16, we get the following corollary.
Corollary 5.18 Let ρ, σ , a, and b satisfy the conditions of Corollary 5.16, let c ∈
Hγ (0, 1], and let k(t) satisfy the conditions of Lemma 5.8 for some γ ∈ (0, 1). If
u∈ L∞ρ,σ and
u ∈ Hγ (0, 1) ∩ C0,δ1
[0,1] (1)
6 One-Sided Invertibility
1
p + q = 1, have only the
1
trivial solution. In the remaining part of this section, let
1+ρ
ξ= p and p + q = 1.
1 1
Proposition 6.1 Let p ∈ (1, ∞), ρ, σ ∈ (−1, p − 1), a ∈ C \ {0}, and k ∈ C(R+ )
satisfy condition (A). Then the homogeneous equations (aI +Bk )u = 0 in the space
Lρ,σ or (aI + Bk )∗ v = 0 in the space L(1−q)ρ,(1−q)σ have only the trivial solution.
p q
p q
Proof Let u ∈ Lρ,σ , v ∈ L(1−q)ρ,(1−q)σ and
(aI + Bk )∗ v
Lemma 4.1
(aI + Bk )u = 0 , = (aI + Bk1 )v = 0 , (6.1)
This implies
1+(1−q)ρ
Since α0 < 1+ρ p is equivalent to q < 1 − α0 , we get, by using Remark 5.2
p q
together with Corollary 5.6 (both in case n = 0), that u ∈ Lρ,0 and v ∈ L(1−q)ρ,0 .
p
By Lemma 3.8, Bk ∈ L Lρ,σ ) is true for all σ ∈ (−1, p−1). Hence we can consider
p q
the equations in (6.1) in the spaces Lρ,0 and L(1−q)ρ,0 , respectively. Because of the
relations (4.1) and (4.2), it only remains to apply Lemma 2.2.
The relations (4.1) and (4.2) together with Lemma 2.2 also immediately deliver
the following proposition.
Proposition 6.2 Let p ∈ (1, ∞), ρ ∈ (−1, p − 1), a ∈ C, b ∈ C \ {0}, and
k ∈ C(R+ ) satisfy condition (A). Then the homogeneous equations
(aI + bS + Bk )u = 0
Fredholm Theory of Singular Integral Operators 391
p
in the space Lρ,0 or
(aI + bS + Bk )∗ v = 0
q
in the space L(1−q)ρ,0 have only the trivial solution.
Proposition 6.3 Let p ∈ (1, ∞), ρ, σ ∈ (−1, p − 1), a ∈ C, b ∈ C \ {0}, and
k ∈ C(R+ ) satisfy condition (A). Moreover, we assume that k(t) fulfils
s β k(s) − t β k(t)
Mβ,a,γ1 (k) := sup : s, t ∈ [a, ∞) , s = t <∞
|s − t|γ1
and
|s α k(s) − t α k(t)|
Nα,a,γ2 (k) := sup : s, t ∈ (0, a] , s = t < ∞ (6.2)
|s − t|γ2
for all a > 0 and for some γi ∈ (0, 1), i ∈ {1, 2}, with ξ + γ1 < β and α < ξ − γ2 .
p
Let aI + bS be invertible in Lρ,σ , i.e., there is a ν ∈ C satisfying (4.3) and (4.4)
for κ = 0. Moreover, we assume that
1 1
#ν < and − #ν < . (6.3)
p q
(aI + bS + Bk )u = 0
p
in the space Lρ,σ or
(aI + bS + Bk )∗ v = 0
q
in the space L(1−q)ρ,(1−q)σ have only the trivial solution.
p q
Proof Let u ∈ Lρ,σ , v ∈ L(1−q)ρ,(1−q)σ , and
(aI + bS + Bk )u = 0 , (aI + bS + Bk )∗ v
Lemma 4.1
= (aI + bS + Bk1 )v = 0 ,
where k1 (t) = k(t −1 )t −1 . Since the function k(t) fulfils (6.2), we get that k1 (t)
satisfies M1−α,a,γ2 (k1 ) < ∞ for all a > 0. Thus, Lemma 5.8 delivers Bk u ∈
Hγ1 (0, 1] and Bk1 v ∈ Hγ2 (0, 1]. From Proposition 5.15 (in case κ = 0), we derive
p q
u ∈ Lρ,0 and v ∈ L(1−q)ρ,0 , where we took into account assumption (6.3). Thus, it
only remains to apply (4.1) in combination with Lemma 2.2.
392 P. Junghanns and R. Kaiser
(aI + bS + Bk )u = 0
p
in the space Lρ,σ or the adjoint equation
(aI + bS + Bk )∗ v = 0
q
in the space L(1−q)ρ,(1−q)σ have only the trivial solution.
Proof Relation (4.1) in combination with Lemma 2.3 immediately delivers the
assertion.
References
1. A. Böttcher, I.M. Spitkovsky, Pseudodifferential operators with heavy spectrum. Integr. Equ.
Oper. Theory 19, 251–269 (1994)
2. R. Duduchava, Integral Equations with Fixed Singularities (BSB B. G. Teubner Verlagsge-
sellschaft, Leipzig, 1979)
3. I. Gohberg, N. Krupnik, One-Dimensional Linear Singular Integral Equations. Vol I: Introduc-
tion. Operator Theory: Advances and Applications, vol. 53 (Birkhäuser Verlag, Basel, 1992)
4. P. Junghanns, R. Kaiser, A numerical approach for a special crack problem. Dolomites Res.
Notes Approx. 10, 56–67 (2017)
5. P. Junghanns, R. Kaiser, A note on the Fredholm theory of singular integral operators with
Cauchy and Mellin kernels, in In Operator Theory, Analysis and the State Space Approach. In
Honor of Rien Kaashoek. Operator Theory : Advances and Applications, vol. 271 (Birkhäuser,
Basel, 2018), 291–325
6. S.G. Mikhlin, S. Prössdorf, Singular Integral Operators (Akademie-Verlag, Berlin, 1986)
7. N.I. Muskhelishvili, Singular Integral Equations. Boundary Problems of Function Theory and
Their Application to Mathematical Physics (P. Noordhoff N. V., Groningen, 1953)
8. R. Schneider, Integral equations with piecewise continuous coefficients in Lp -spaces with
weight. J. Integr. Equ. 9, 135–152 (1985)
9. E.C. Titchmarsh, Introduction to the Theory of Fourier Integrals, 3rd edn. (Chelsea Publishing
Co., New York, 1986)
A Note on Group Representations,
Determinantal Hypersurfaces
and Their Quantizations
The first author was supported by the Slovenian Research Agency grants J1-8132, N1-0057, P1-
0222, and partially supported by the Marsden Fund Council of the Royal Society of New Zealand.
I. Klep
Department of Mathematics, Faculty of Mathematics and Physics, University of Ljubljana,
Ljubljana, Slovenia
e-mail: [email protected]
J. Volčič ()
Department of Mathematics, Texas A&M University, College Station, TX, USA
e-mail: [email protected]
1 Introduction
This is a classical object in algebraic geometry [1, 6, 10, 11], where a key question
asks which hypersurfaces admit determinantal representations. When Aj are real
symmetric matrices, determinantal hypersurfaces pertain to hyperbolic and stable
polynomials [2, 15, 18, 23, 24]. The geometry of the hypersurface (1.1) is also
explored in multivariate operator theory [3, 4, 26]. If Aj are bounded operators
on a Hilbert space and the determinant in (1.1) is replaced with the condition that
ξ0 A0 + · · · + ξ A is not invertible, then (1.1) is known as the projective joint
spectrum of A0 , . . . , A (cf. Taylor spectrum [22] for ensembles of commuting
operators).
Through the work of Frobenius [13] and Dedekind [7] on group determinants
(see also [9]), determinantal hypersurfaces also pertain to representation theory.
Several fascinating developments in this direction [5, 14, 21] have been recently
made. This note addresses certain limitations for extensions of these results.
Let G be a finitely generated group. If T = (g1 , . . . , g ) is a finite sequence of
generators for G and ρ : G → GLd (C) is a representation of G, then denote
Z1 (T , ρ) = ξ ∈ C : det (Id + ξ1 ρ(g1 ) + · · · + ξ ρ(g )) = 0 . (1.2)
D∞ = a, t | a 2 = t 2 = 1
with respect to the generating set (1, a, t), and analyzed its properties through
the representation theory of D∞ . Determinantal hypersurfaces also have a strong
connection with representation theory in the case of finite Coxeter groups [5]. A
Coxeter group is a finitely generated group on generators g1 , . . . , g satisfying
(gi gj )mij = 1
where mii = 1 and mij ≥ 2 for i = j . In [5] the authors first showed that if G is a
finite Coxeter group, λ is its left regular representation, and T = (g1 , . . . , g ) are the
generators as above, then Z1 (T , λ) determines G up to isomorphism. Furthermore,
Group Representations, Determinantal Hypersurfaces and Quantizations 395
if G is not of exceptional type (in the Coxeter diagram sense) and ρ is an arbitrary
finite-dimensional representation of G, then Z1 (T , ρ) determines ρ.
These theorems were presented during the Multivariable Spectral Theory and
Representation Theory workshop at the Banff International Research Station in
April 2019. Several problems about extending these results beyond Coxeter groups
were posed by the speakers; among them were the following.
Questions 1.1 Let G be a finite group, T a fixed generating set for G, and ρ1 , ρ2
irreducible complex representations of G.
(1) Is Z1 (T , ρ1 ) a reduced and irreducible hypersurface?
(2) If Z1 (T , ρ1 ) = Z1 (T , ρ2 ), are ρ1 and ρ2 equivalent?
As usual, ρ1 : G → GLd1 (C) and ρ2 : G → GLd2 (C) are equivalent if d1 = d2
and ρ2 = Pρ2 P −1 for some P ∈ GLd1 (C). A representation ρ1 is irreducible if
its image does not admit a nontrivial common invariant subspace; equivalently, it
generates Md1 (C) as a C-algebra by Burnside’s theorem [17, Corollary 1.17]. The
hypersurface Z1 (T , ρ1 ) is reduced and irreducible (in the scheme-theoretic sense)
if its defining determinant in (1.2) is an irreducible polynomial. The main result of
this note is the following.
Theorem 1.2 Questions 1.1(1) and (2) have negative answers in general.
See Sects. 2.1 and 2.2 for concrete examples. On a more positive side, in
Sect. 3 we show that representation theory aligns well with a quantization of
the determinantal hypersurface, the free locus; see Theorem 3.1. Furthermore,
Proposition 3.4 determines whether a free locus arises from a representation of
a finite group, and Proposition 3.7 characterizes finite abelian groups from the
perspective of determinantal hypersurfaces. We conclude this note with an open
question.
In this section we give negative answers to Questions 1.1. The representations were
found with the help of the computer algebra system GAP and the online repository
ATLAS of Finite Group Representations. Verifying equivalence and irreducibility
of representations was sometimes done symbolically with the computing system
Mathematica.
Let
⎛ ⎞
1 0 0 0 0 0 0 0 0
⎜ 0 0 1 0 0 0 0 0 0⎟
⎜ ⎟
⎜ 0 1 0 0 0 0 0 0 0⎟
⎜ ⎟
⎜ 0 0 0 0 1 0 0 0 0⎟
⎜ ⎟
⎜ ⎟
A1 = ⎜ 0 0 0 1 0 0 0 0 0 ⎟
⎜ ⎟
⎜ 0 0 0 0 0 0 0 1 0⎟
⎜ ⎟
⎜ 0 0 0 0 0 0 1 0 0⎟
⎜ ⎟
⎝ 0 0 0 0 0 1 0 0 0⎠
−1 −1 −1 −1 −1 −1 −1 −1 −1
and
⎛ ⎞
010 00 000 0
⎜0 0 0 10 000 0⎟
⎜ ⎟
⎜0 0 0 0⎟
⎜ 01 000 ⎟
⎜0 0 0 0⎟
⎜ 00 100 ⎟
⎜ ⎟
A2 = ⎜ 0 0 0 00 010 0⎟.
⎜ ⎟
⎜1 0 0 00 000 0⎟
⎜ ⎟
⎜0 0 0 00 000 1⎟
⎜ ⎟
⎝0 0 0 00 001 0⎠
001 00 000 0
by cofactor expansion along the rows. The reader will have no difficulty verifying
that det(I + x1 ρ(g1 ) + x2 ρ(g2 )) equals
1 + x1 − 4x12 − 4x13 + 6x14 + 6x15 − 4x16 − 4x17 + x18 + x19 + x2 + 2x1x2 − 2x12 x2
− 6x13x2 + 6x15 x2 + 2x16 x2 − 2x17 x2 − x18 x2 + x12 x22 + x13 x22 − 2x14 x22 − 2x15x22
+ x16 x22 + x17 x22 − x12 x23 + 2x14x23 − x16 x23 − 2x24 + x12 x24 − x13 x24 + x14 x24 + x15 x24
− 2x25 − 2x1 x25 − x12 x25 + x14 x25 − x12 x26 − x13 x26 + x12 x27 + x28 − x1 x28 + x29
1 + 2x1 − 2x13 − x14 + x1 x2 + 2x12 x2 + x13 x2 − x1 x22 − x12 x22 + x1 x23 − x24 ,
1 − x1 − 2x12 + 2x13 + x14 − x15 + x2 − x1 x2 − x12 x2 + x13 x2 − x24 − x25 .
Some of the subsequent examples are presented in a more terse way to maintain the
focus on their intent.
Note that the above irreducible representation of A6 has dimension 9, which
is not the minimum among nontrivial complex representations of A6 ; namely,
A6 admits a representation σ of minimal dimension 5, and Z1 ((g1 , g2 ), σ ) is
irreducible. One might thus be tempted to suggest that for a group G generated by
a finite set T and its (irreducible) representation σ of minimal dimension, Z1 (T , σ )
is irreducible. However, even this weaker conjecture fails. The counterexample is
given by the Janko group J2 ,
J2 = g1 , g2 | g12 , g23 , (g1 g2 )7 , (g1 g2 g1−1 g2−1 )12 , (g1 g2 (g1 g2 g1 g2−1 )2 )6 .
This sporadic simple group of order 604800 admits two non-isomorphic complex
representations σ1 , σ2 of minimal dimension 14, courtesy of ATLAS of Finite Group
Representations. As in the previous example (albeit with slightly longer calcula-
tions), one can explicitly check that the curve Z1 ((g1 , g2 ), σ1 ) = Z1 ((g1 , g2 ), σ2 )
has two irreducible components.
It is easy to check that ρ+ and ρ− are not equivalent. On the other hand,
In this section we will see how representations of a finitely generated group are
determined by a noncommutative relaxation of (1.2). To A ∈ Md (C) we associate
the monic matrix pencil LA = I + A1 x1 + · · · + A x of size d in freely
noncommuting variables x = (x1 , . . . , x ). Thus L is an affine matrix over the
free algebra C<x>. At a matrix point X ∈ Mn (C) it evaluates as
By the definition of the free locus we see that (3.1) is indeed a quantization of (1.2).
The existing results on free loci [16, 19] readily apply to group representations.
Theorem 3.1 For i = 1, 2 let Gi be a group generated by a finite sequence Ti and
let ρi be a complex representation of Gi . Assume |T1 | = |T2 |.
(1) If ρi is irreducible, then there exists n0 ∈ N such that Zn (T1 , ρ1 ) is a reduced
and irreducible hypersurface for all n ≥ n0 .
Group Representations, Determinantal Hypersurfaces and Quantizations 399
X = (P1 C, · · · , P C) ∈ Mn (Z)
tX ∈ Z (LA ) 7⇒ t ∈ μ∞ .
Thus the matrix Ai1 · · · Ain has finite order if and only if tX ∈ Z (LA ) implies
t ∈ μ∞ , which holds by (ii).
(⇒) If A1 , . . . , A generate a finite group G, then Cd admits a G-invariant inner
product
u, v = (gu)∗ (gv).
g∈G
We also show how the free locus certifies whether its defining coefficients
generate a finite abelian group. The degree of an affine variety of codimension m
is the number of intersection points of the variety with m hyperplanes in general
position; in the case of a hypersurface, it is simply the degree of its square-free
defining polynomial.
Proposition 3.7 Let G be a finite group generated by A1 , . . . , A ∈ Md (C). Then
G is abelian if and only if the irreducible components of Zn (LA ) have degree n for
all n ∈ N.
Proof Let A be the C-algebra generated by A1 , . . . , A . As in the proof of
Theorem 3.1(3) we see that A is semisimple. After a basis change (which does
not affect the structure of G or Z (LA )) we can thus assume that
Aj = A(1) (s)
j ⊕ · · · ⊕ Aj
(k) (k)
where A1 , . . . , A ∈ Mdk (C) determine an irreducible representation of G for
every k = 1, . . . , s. For X ∈ Mn (C)d let us view det LA(k) (X) as a polynomial
in the entries of X. If dk = 1, then det LA(k) (X) is up to an affine change of
coordinates equal to the determinant of a generic n × n matrix, and hence an
irreducible polynomial of degree n. On the other hand, if dk > 1, then det LA(k) (X)
is a polynomial of degree dk n > n for all n, and irreducible for all large enough n
by [16, Theorem 3.4]. Since G is abelian if and only if d1 = · · · = ds = 1, and
it follows that G is abelian if and only if the irreducible components of Zn (LA ) are
hypersurfaces of degree n.
Remark 3.8 If = 2 and A1 , A2 are hermitian, then Z1 (LA ) alone determines
whether G is abelian, cf. [20].
The last two propositions offer some directions for future research. Theorem 3.1
implies that the linear group G generated by a tuple A is determined by Z (LA ).
It would be interesting to know which properties of G can be deduced from the
geometry of Z (LA ). For example, intersections of Z (LA ) with certain lines and
hyperplanes determine whether G is finite or abelian. An open problem is how to
decide whether a finite group G is nilpotent/solvable/simple (or any other group-
theoretic property) by considering the geometry of the hypersurfaces Zn (LA ).
Acknowledgments The authors thank Banff International Research Station for the hospitality
during the Multivariable Spectral Theory and Representation Theory workshop, and participants
for sharing their ideas.
402 I. Klep and J. Volčič
References
1 Introduction
The paper continues the study of algebras generated by Toeplitz operators, acting on
the multidimensional Hardy space H 2 (S 2n−1 ), which was started in [12]. Recall in
this connection the principal difference in algebraic properties of Toeplitz operators
acting on the one-dimensional Hardy H 2 (S 1 ) and Bergman A2 (D) spaces.
The classical result by Brown and Halmos [9] implies that there is no nontrivial
commutative C ∗ -algebra generated by Toeplitz operators acting on the Hardy space
H 2 (S 1 ), while there are only two commutative Banach algebras. One of them
is generated by Toeplitz operators with analytic symbols, and the other one is
generated by Toeplitz operators with conjugate analytic symbols. Of course, such
two algebras remain to be commutative for H 2 (S 2n−1 ), for all n > 1.
At the same time, at the turn of this century, it was observed [15, 16] (see also
[17]) that there are many nontrivial commutative C ∗ -algebras, generated by Toeplitz
operators acting on the Bergman space A2 (D) over the unit disk D ⊂ C. These
results were extended then to the case of Toeplitz operators acting on weighted
Bergman spaces on the unit ball Bn , see [13]. Further on many nontrivial Banach
algebras generated by Toeplitz operators, that are commutative on each standard
weighted Bergman space over Bn , were discovered and studied.
In this connection a rather challenging question appeared: what is the situation
with both C ∗ and Banach algebras generated by Toeplitz operators on the multidi-
mensional Hardy space H 2 (S 2n−1 ). A first step in this direction was made in [1]
(see as well [2]), where Z. Akkar, following all the reasonings of [13], described the
commutative C ∗ -algebras generated by Toeplitz operators on H 2 (S 2n−1 ).
In [12], we developed an alternative approach to the problem. Therein we
represented the Hardy space H 2 (S 2n−1 ) as a direct sum of weighted Bergman
spaces A2p (Bn−1 ), with p ∈ Z+ , which permitted us to represent Toeplitz operators,
acting on H 2 (S 2n−1 ), as direct sums of Toeplitz operators, acting on corresponding
Bergman spaces. The benefit of such an approach is that we can use now all the
results on Toeplitz operators on Bergman spaces in their full power. In particular, we
showed in [12] how to recover the results of [1] just as simple and straightforward
corollaries of [13], and explained how to unhide and describe a wide variety
of nontrivial commutative Banach algebras generated by Toeplitz operators on
H 2 (S 2n−1 ).
Following the same approach, in the present paper we pass to the description
of non-commutative algebras, and present the detailed description of two non-
commutative C ∗ -algebras generated by Toeplitz operators, acting on the Hardy
space H 2 (S 2n−1 ), based on the already obtained results [4, 6–8] for Toeplitz
operators, acting on the Bergman space.
We recall here the results on the representation of the multidimensional Hardy space
in terms of the Bergman spaces as well as the corresponding representation of
Toeplitz operators, acting on the Hardy space, in terms of Toeplitz operators, acting
on Bergman spaces. All proofs and details can be found in [12].
Denote by
the unit sphere, being the boundary of Bn . The following standard notations will
be used throughout the paper. For α = (α1 , . . . , αn ) ∈ Zn+ , where Z+ =
{0, 1, 2, . . .} ⊂ Z,
zα = z1α1 . . . znαn ,
α! = α1 ! . . . αn !,
|α| = α1 + · · · + αn .
(n + λ + 1) λ
dvλ (z) = 1 − |z| 2
dV (z).
π n (λ + 1)
The weighted Bergman space A2λ (Bn ) is the closed subspace of L2 (Bn , dvλ ) that
consists of all analytic functions. The Hardy space H 2 (S 2n−1 ) is defined as the set
of all holomorphic functions f , defined in Bn , such that
f 22 := sup |f (rζ )|2 dσ (ζ ) < ∞,
0<r<1 S 2n−1
where dσ denotes the normalized measure for S 2n−1 . This space can be defined,
alternatively, as the closed subspace of L2 (S 2n−1 , dσ ) that consists of all functions
f satisfying the tangential Cauchy-Riemann equations:
∂ ∂
Lk,j f = zk − zj f = 0, 1 ≤ k < j ≤ n. (2.1)
∂zj ∂zk
where
J
(n − 1)!
z = (z1 , . . . , zn−1 ), zn = 1 − |z |2 tn , dv(z ) = dV (z ),
π n−1
1 dtn
and dμ(tn ) = 2π 1
itn is the normalized arc-length measure on S .
Equations (2.1) are not independent. The equations that are independent and
equivalent to (2.1) have the form
! "
∂ 1 zk ∂
Dk (f) = + tn f = 0, k = 1, 2, . . . , n − 1.
∂zk 2 1 − |z |2 ∂tn
Recall that the discrete Fourier transform F : L2 (S 1 , dμ(tn )) → 2 (Z), has the
form
−p
F : g → cn = g(tn )tn dμ(tn ) .
S1 p∈Z
L2 (S 2n−1 , dσ ) ∼
= L2 (Bn−1 , dv(z )) ⊗ L2 (S 1 , dμ(tn ))
onto
where
−p
cp (z ) = f(z , tn )tn dμ(tn ).
S1
Observe that
K
2 (Z, L2 (Bn−1 , dv(z ))) = L2 (Bn−1 , dv(z )).
p∈Z
(n+|p|−1)!
where dvp (z ) = π n−1 |p|!
(1 − |z |2 )p dV (z ), which acts as follows
*
(n − 1)!|p|! p
up :
cp (z ) −→ cp (z ) = cp (z )(1 − |z |2 )− 2 .
(n + |p| − 1)!
Toeplitz Operators on the Unit Sphere 407
L2 (S 2n−1 , dσ ) ∼
= L2 (Bn−1 , dv(z )) ⊗ L2 (S 1 , dμ(tn ))
onto
K
L2 (Bn−1 , dvp (z ))
p∈Z
7
under which H 2 (S 2n−1 ) is mapped onto p∈Z+ A2p (Bn−1 ).
For each p ∈ Z+ , we denote by Bp the Bergman orthogonal projection from
L2 (Bn−1 , dvp (z )) onto the weighted Bergman space A2p (Bn−1 ), and let PS 2n−1 be
the Szegő projection from L2 (S 2n−1 , dσ ) onto the Hardy space H 2 (S 2n−1 ).
Corollary 2.2 ([12, Corollary 2.2]) The Szegő projection and the weighted
Bergman projections are connected as follows
K
U PS 2n−1 U −1 = Bp .
p∈Z+
Ta f = PS 2n−1 (af ).
where (z1 , . . . , zn−1 ) = z ∈ Bn−1 . Note that each function a ∈ L∞ (Bn−1 ) defines
in its turn the function a of the form (2.2) by a(z , |zn |) = a(z ).
p
We denote then by Ta the Toeplitz operator, with symbol a acting on the
weighted Bergman space A2p (Bn−1 ), with p ∈ Z+ .
Theorem 2.3 ([12, Theorem 3.1]) Given a bounded measurable symbol
a(z , |zn |), defined in S 2n−1 . Under the isomorphism U = U2 U1 , the Toeplitz
operator 7 Ta , acting on the Hardy space H 2 (S 2n−1 ), is unitarily equivalent to the
p
operator p∈Z+ Ta , acting on
K
A2p (Bn−1 ),
p∈Z+
Theorems 2.1 and 2.3 state then that for each generator T a of the algebra
T (S), the operator U T a U −1 leaves invariant each subspace in the direct sum
decomposition
K
U (H 2 (S 2n−1 )) = A2p (Bn−1 )
p∈Z+
and
K
U T a U −1 =
p
Ta . (2.4)
p∈Z+
We consider the case when S = VO∂ (Bn−1 ) with the corresponding class S VO ⊂
L∞ (S 2n−1 ). Recall [8], that a bounded continuous function a belongs to VO∂ (Bn−1 )
(has a vanishing oscillation at the boundary) if
4 5
lim sup {|a(z ) − a(w )| : β(z , w ) ≤ 1} = 0,
z →∂Bn−1
here β(·, ·) is the Bergman metric in Bn−1 , and that VO∂ (Bn−1 ) is a norm closed
C ∗ -subalgebra of L∞ (Bn−1 ).
Our choice of S = VO∂ (Bn−1 ) is motivated by the following observation. C.
Berger, L. Coburn, and K. Zhu described in [8] the largest C ∗ -subalgebra Q of
L∞ (Bn−1 ) which possesses the compact semi-commutator property, i.e.,
p p p p p
[Ta , Tb ) := Ta Tb − Tab is compact for every a, b ∈ Q.
n−1 n−1
Then, C(B ) ⊂ VO∂ (Bn−1 ) ⊂ Q and the algebra Tp (C(B )) contains all
compact operators of A2p (Bn−1 ), which, in particular, implies that the algebra
Tp (VO∂ (Bn−1 )) coincides with Tp (Q). At the same time it is more pleasant to
deal with (bounded uniformly continuous) symbols from VO∂ (Bn−1 ) than with
(generically discontinuous) symbols from Q.
Note, in this connection, that the C ∗ -algebra Tp (VO∂ (Bn−1 )) = Tp (Q) is
irreducible, contains the ideal Kp of all compact operators, and each its element
p
admits the representation T p = Ta + K p , with a ∈ VO∂ (Bn−1 ) and compact K p .
We recall as well that the Berezin transform Bp : L(A2p (Bn−1 )) → L∞ (Bn−1 ) is
p p p
given by Bp [A](z ) := Akz , kz p , where kz denotes the normalized reproducing
kernel function of A2p (Bn−1 ) defined by
n+p
p (1 − |z |2 ) 2
kz (w )= , z , w ∈ Bn−1 .
(1 − w · z )n+p
p
For the special case of a Toeplitz operator Ta ,
p p p
Bp [Ta ](z ) = Bp [a](z ) = a kz , kz p .
410 M. Loaiza and N. Vasilevski
The aim of this section is to study the algebra T (S VO ) generated by all Toeplitz
operators T a acting on H 2 (S 2n−1 ) and having symbols a ∈ S VO , associated to
S = VO∂ (Bn−1 ).
7
Lemma 3.1 For each T $ p∈Z+ T p we have
T = sup T p ,
p∈Z+
T a = a(z )∞ .
Proof The first equality easily follows by standard arguments from the direct sum
p
representation of the operator T . The second follows from Ta ≤ a(z )∞ and
[4, Proposition 4.4], implying
p p
a(z )∞ = lim Bp Ta ∞ ≤ sup Ta = T a .
p→∞ p∈Z
with K p → 0 when p → ∞.
Toeplitz Operators on the Unit Sphere 411
p
with Km → 0 when p → ∞. Using Lemma 3.1, the convergence of the sequence
p
{Tm }m∈N and that Km → 0 when p → ∞, it is easy to prove that the sequence
{am }m∈N is convergent. Denote by a the limit of the last sequence. Then, uniformly
p p
on p, {Tam }m∈N converges to Ta . This fact implies that the sequence of compact
p
operators {Km }m∈N converges to a compact operator K p . Then,
K p
T $ Ta + K p .
p∈Z+
p p
where a, b ∈ VO∂ (Bn−1 ), K1 , K2 are compact operators, for each p ∈ Z+ , and
p p
where limp→∞ K1 = limp→∞ K2 = 0. Then,
p p p
Ta−b + K1 − K2 = 0, for each p ∈ Z+ . (3.2)
p p
Then a = b and, from (3.2), K1 = K2 .
Corollary 3.3 The algebra S VO possesses the compact semi-commutator property,
i.e.,
T = T a + K,
ιp : T (S VO ) −→ Tp (VO∂ (Bn−1 ))
K p
T $ (Tak + Kk ) −→ Ta + K p ,
k∈Z+
p
and ιp (Tc ) = Tc , which is a diagonal operator with respect to the standard basis of
A2p (Bn−1 ). By [13, Theorem 10.1] and [3],
p p+1 q +1 q
T1−|z |2 = = = T1−|z |2 ,
n+p n+q
which is defined by
where M(VO∂ (Bn−1 )) denotes the maximal ideal space of VO∂ (Bn−1 ).
This leads to the following family of one-dimensional irreducible representa-
tions:
Corollary 3.5 For every (η, p) ∈ M(VO∂ (Bn−1 )) \ Bn−1 × Z+ the map
K ιp p
πη,p : T $ (Tak + K k ) −→ Ta + K p −→ a(η)
k∈Z+
defines a one-dimensional irreducible representation. Moreover, πη1 ,p1 and πη2 ,p2
are unitarily equivalent if and only if η1 = η2 .
Further we have the following variant of [5, Lemma 4.8] and [7, Lemma 5.4].
Lemma 3.6 The mapping
defined by
K
ν : T $ T p −→ lim Bp (T p ),
p→∞
p∈Z+
p
a = ν(T ) = lim Bp (T p ) ∈ VO∂ (Bn−1 ) and K p = T p − Ta .
p→∞
7
Then p∈Z+ K p is compact
7 and, as a consequence, K 7 is compact. Consider now
a compact operator T $ p∈Z+ T p in T (S VO ). Since p∈Z+ T p is compact we
have that
lim T p = 0.
p→∞
Then
then T ∈ KVO .
This implies that the Calkin algebra
Proof The proof of this theorem is exactly the same of [7, Theorem 5.7]. We
include it here for the sake of completeness. According to [11, Proposition 2.11.2]
each irreducible representation of T (S VO ) is either induced by an irreducible
representation of
Using Corollary 3.3 we give the following extended version of Corollary 3.10.
Proposition 3.11 The Calkin algebra
(S VO ) = T (S VO )/(T (S VO ) ∩ K(H 2 (S 2n−1 )))
T
(S VO ) ∼
π : T (S VO ) −→ T = VO∂ (Bn−1 )
is given by
π : T = T a + K −→ a(η),
Each common invariant subspace for all the operators from T (S VO ) is of the
form
⎛ ⎞
K
U −1 ⎝ A2p (Bn−1 )⎠ ,
p∈N
n−1
here r = |z | and ζ ∈ S 2n−3 . Then, obviously, as (rζ ) ∈ Matq (C(B )), for all
s ∈ (0, 1).
Toeplitz Operators on the Unit Sphere 417
In particular,
ess-sp T = λ : λ ∈ sp{a(η)}, η ∈ M(VO∂ (Bn−1 )) .
Proof Only the index formula needs to be justified. Let T be Fredholm, then the
matrix-valued function a is invertible, and
p
Ind T = Ind (Ta + K p ).
p∈Z+
For each p ∈ Z+ , Theorem 2.6 of [19], ensures that there is sρ,0 ∈ (0, 1) such that
p
for every s ∈ (sρ,0 , 1), each operator Tas + K p is Fredholm, and
p p
Ind (Ta + K p ) = Ind (Tas + K p ).
n−1
The matrix-valued function as is continuous on the closed unit ball B . The ball
n−1
B is retractable to a point, thus the matrix-valued function as is homotopic
n−1
to a constant matrix, say ap , in a class of invertible continuous on B matrix-
p
functions. This implies that the operator Tas + K is homotopic to the scalar-matrix
p
p
Note that, starting from some p0 , all operators Ta + K p are invertible, so that
both ker T and coker T are finite dimensional (see [5, p. 730] for details).
One, of course, easily makes a version of Proposition 3.11 for matrix-valued
symbols a ∈ Matq (VO∂ (Bn−1 )).
418 M. Loaiza and N. Vasilevski
In this section we consider the case when, for each p ∈ Z+ , the C ∗ -algebra
consisting of all operators T p of decomposition (2.5) coincides with the full
Toeplitz algebra, i.e., with the algebra Tp (L∞ (Bn−1 )), which is generated by all
p
Toeplitz operators Ta , with a ∈ L∞ (Bn−1 ), acting on the weighted Bergman space
Ap (B ).
2 n−1
T = sup T p ,
p∈Z+
T a = a(z )∞ .
T a with a from S ∞ \ S BUC . That is, the algebra T (S BUC ) is a proper subalgebra
of T (S ∞ ). This can be justified literally following the arguments of the proof of [7,
Lemma 4.5].
We describe now the irreducible representations of the C ∗ -algebra
T (S BUC ). First, for each p ∈ Z+ , the representation
defined by
K
ν : T $ T p −→ lim Bp (T p ),
p→∞
p∈Z+
Then we have
Corollary 4.4 For each η ∈ M(BUC(Bn−1 )), the mapping
νη : T (S BUC ) −→ C,
defined by
π : T (S VO ) −→ T
(S VO ) ∼
= BUC(Bn−1 )
Toeplitz Operators on the Unit Sphere 421
is given by
K p
π: T $ (Ta + N p ) −→ a(η),
p∈Z+
Acknowledgement This work was partially supported by CONACYT Project 238630, México.
References
1. Z. Akkar, Zur Spektraltheorie von Toeplitzoperatoren auf dem Hardyraum H 2 (Bn ). Ph.D.
Dissertation, Universität des Saarlandes, 2012
2. Z. Akkar, E. Albrecht, Spectral properties of Toeplitz operators on the unit ball and on the unit
sphere, in The Varied Landscape of Operator Theory. Theta Series in Advanced Mathematics,
vol. 17 (Theta, Bucharest, 2014), pp. 1–22
3. H. Alzer, Inequalities for the Beta function of n variables. ANZIAM J. 44, 609–623 (2003)
4. W. Bauer, L.A. Coburn, Heat flow, weighted Bergman spaces and real analytic Lipschitz
approximation. J. Reine Angew. Math. 703, 225–246 (2015)
5. W. Bauer, N. Vasilevski, On algebras generated by Toeplitz operators and their representations.
J. Funct. Anal. 272, 705–737 (2017)
6. W. Bauer, R. Hagger, N. Vasilevski, Uniform continuity and quantization on bounded
symmetric domains. J. Lond. Math. Soc. 96, 345–366 (2017)
7. W. Bauer, R. Hagger, N. Vasilevski, Algebras of Toeplitz operators on the n-dimensional unit
ball. Complex Anal. Oper. Theory 13, 493–524 (2019)
8. C.A. Berger, L.A. Coburn, K.H. Zhu, Function theory on Cartan domains and the Berezin-
Toeplitz symbol calculus. Amer. J. Math. 110, 921–953 (1988)
9. A. Brown, P.R. Halmos, Algebraic properties of Toeplitz operators. J. Reine Angew. Math.
213, 89–102 (1964)
422 M. Loaiza and N. Vasilevski
10. L. Coburn, Singular integral operators and Toeplitz operators on odd spheres. Indiana Univ.
Math. J. 23, 433–439 (1973)
11. J. Dixmier, Les C ∗ -algebres et leurs représensations (Gauthier-Villars, Paris 1964)
12. M. Loaiza, N. Vasilevski, Commutative algebras generated by Toeplitz operators on the unit
sphere. Intgr. Equ. Oper. Theory 92, 25 (2020). https://doi.org/10.1007/s00020-020-02580-x
13. R. Quiroga-Barranco, N.L. Vasilevski, Commutative algebras of Toeplitz operators on the unit
ball I: Bargmann type transforms and spectral representations of Toeplitz operators. Integr.
Equ. Oper. Theory 59, 379–419 (2007)
14. D. Suárez, The essential norm of operators in the Toeplitz algebra Ap (Bn ). Indiana Univ. Math.
J. 56, 2185–2232 (2007)
15. N.L. Vasilevski, Toeplitz operators on the Bergman spaces: inside-the-domain effects. Con-
temp. Math. 289, 79–146 (2001)
16. N.L. Vasilevski, Bergman space structure, commutative algebras of Toeplitz operators and
hyperbolic geometry. Integr. Equ. Oper. Theory 46, 235–251 (2003)
17. N.L. Vasilevski, Commutative Algebras of Toeplitz Operators on the Bergman Space. Operator
Theory: Advances and Applications, vol. 185 (Birkhäuser Verlag, Boston, 2008)
18. J. Xia, Localization and the Toeplitz algebra on the Bergman space. J. Funct. Anal. 269, 781–
814 (2015)
19. J. Xia, D. Zheng, Toeplitz operators and Toeplitz algebra with symbols of vanishing oscillation.
J. Operator Theory 76, 107–131 (2016)
d-Modified Riesz Potentials on Central
Campanato Spaces
Katsuo Matsuoka
Abstract Recently, we defined the d-modified Riesz potentials I˜α,d and proved
several estimates of boundedness of I˜α,d on the central Morrey spaces B p,λ (Rn ),
(d)
using the central Campanato spaces $p,λ (Rn ), the generalized σ -Lipschitz spaces
(d)
Lipβ,σ (Rn ) and so on. In this paper, we will consider the results of the boundedness
for I˜α,d on the λ-central mean oscillation spaces CMOp,λ (Rn ).
This work was supported by Grant-in-Aid for Scientific Research (C) (Grant No. 17K05306),
Japan Society for the Promotion of Science.
K. Matsuoka ()
College of Economics, Nihon University, Misaki-cho, Kanda, Chiyoda-ku, Tokyo, Japan
e-mail: [email protected]
1 Introduction
Let Iα and I˜α , 0 < α < n, be the Riesz potential and the modified Riesz potential,
respectively, which are defined for f ∈ L1loc (Rn ) by
f (y)
Iα f (x) = dy
Rn |x − y|n−α
and
1 1 − χQ1 (y)
I˜α f (x) = f (y) − dy
R n |x − y| n−α |y|n−α
(as for the notation χQ1 , see Sect. 2). Then, the following boundedness results on
Lp (Rn ) are well-known : For 0 < α < n, 1 ≤ p < n/α and 1/q = 1/p − α/n,
(i) Iα : Lp (Rn ) → Lq (Rn ), 1 < p < n/α (see [7, 25]);
(ii) Iα : L1 (Rn ) → W Lq (Rn ), p = 1 (see [26]).
And for 0 < α < n, n/α ≤ p < ∞ and β = α − n/p < 1,
(iii) I˜α : Lp (Rn ) → Lipβ (Rn ), n/α < p < ∞ (cf. [23]);
(iv) I˜α : Ln/α (Rn ) → BMO(Rn ), p = n/α, i.e., β = 0 (cf. [23, 26]).
Here (i) is the so-called Hardy–Littlewood–Sobolev theorem.
After that, in [4], for Iα on the (non-homogeneous) central Morrey space
B p,λ (Rn ), i.e.,
p
B p,λ (Rn ) = {f ∈ Lloc (Rn ) : f B p,λ < ∞},
−n/p−λ
(see [1, 4]; cf. [6]), which is the (non-homogeneous) Herz space Kp,∞ (Rn ) (cf.
[3, 8]), the following boundedness result was obtained : For 0 < α < n, 1 < p <
n/α, −n/p + α ≤ μ = λ + α < 0 and 1/q = 1/p − α/n,
(i’-1) Iα : B p,λ (Rn ) → B q,μ (Rn ).
On the other hand, in [11, 19] (cf. [10]), from the Bσ -Morrey–Campanato
estimates for Iα and I˜α , the following boundedness results on B p,λ (Rn ) were
obtained as the corollaries : For 0 < α < n, −n + α ≤ μ = λ + α < 0 and
1/q = 1 − α/n,
(ii’-1) Iα : B 1,λ (Rn ) → W B q,μ (Rn ).
d-Modified Riesz Potentials on Central Campanato Spaces 425
(see [11]). Also for 0 < α < n, 1 ≤ p < n/α, −n/p + α ≤ μ = λ + α < 1 and
1/q = 1/p − α/n,
(i’-2) I˜α : B p,λ (Rn ) → CMOq,μ (Rn ), 1 < p < n/α;
(ii’-2) I˜α : B 1,λ (Rn ) → W CMOq,μ (Rn ), p = 1.
And for 0 < α < n, n/α ≤ p < ∞, −n/p + α ≤ λ + α < 1 and β = α − n/p,
(iii’) I˜α : B p,λ (Rn ) → Lipβ,λ+n/p (Rn ), n/α < p < ∞;
(iv’) I˜α : B n/α,λ (Rn ) → BMOλ+n/p (Rn ), p = n/α, i.e., β = 0.
As for the spaces CMOp,λ (Rn ), W CMOp,λ (Rn ) and Lipβ,σ (Rn ), BMOσ (Rn ), refer
to Definitions 2.1, 2.2 and 2.11. Furthermore, the following boundedness results on
CMOp,λ (Rn ) were also gotten excepting (ii”) (as for (ii”), see Corollary 3.5): For
0 < α < 1, 1 ≤ p < n/α, −n/p + α ≤ λ + α = μ < 1 and 1/q = 1/p − α/n (see
[19]; cf. [16]),
(i”) I˜α : CMOp,λ (Rn ) → CMOq,μ (Rn ), 1 < p < n/α;
(ii”) I˜α : CMO1,λ (Rn ) → W CMOq,μ (Rn ), p = 1.
And for 0 < α < 1, n/α ≤ p < ∞, −n/p + α ≤ μ = λ + α < 1 and β =
α − n/p,
(iii”) I˜α : CMOp,λ (Rn ) → Lipβ,λ+n/p (Rn ), n/α < p < ∞;
(iv”) I˜α : CMOn/α,λ (Rn ) → BMOλ+n/p (Rn ), p = n/α, i.e., β = 0.
Recentry, for the whole of μ = λ + α such that 1 ≤ μ < ∞, we
extended the boundedness of I˜α for B p,λ (Rn ). In order to do so, in [17, 18],
we introduced the central Campanato spaces $(d) n
q,μ (R ) and the generalized σ -Lip
(d)
spaces Lipβ,σ (Rn ) (see Definitions 2.3 and 2.11), and also we defined the “ higher-
degree ” modification of Iα , i.e., the d-modified Riesz potentials I˜α,d , 0 < α < n
and d ∈ N ∪ {0}. Then, the following boundedness results were shown : For
0 < α < n, 1 < p < n/α, d ∈ N ∪ {0}, −n/p + α ≤ μ = λ + α < d + 1
and 1/q = 1/p − α/n,
(I) I˜α,d : B p,λ (Rn ) → $q,μ (Rn ), 1 < p < n/α;
(d)
We start by explaining the notation used in the present paper. The symbol A B
stands for A ≤ CB for some constant C > 0. If A B and B A, we then write
A ∼ B. For r > 0, by Qr , we mean the following: Qr = {y ∈ Rn : |y| < r} or
Qr = {y = (y1 , y2 , · · · , yn ) ∈ Rn : max1≤i≤n |yi | < r}. Further, for x ∈ Rn , we
set Q(x, r) = x + Qr = {x + y : y ∈ Qr }. For a measurable set G ⊂ Rn , we
denote by |G| and χG the Lebesgue measure of G and the characteristic function
of G, respectively. And also, for a function f ∈ L1loc (Rn ) and a measurable set
G ⊂ Rn with 0 < |G| < ∞, let
1
fG = − f (y) dy = f (y) dy,
G |G| G
where
1/p
1
f CMOp,λ = sup λ − |f (y) − fQr | dy
p
.
r≥1 r Qr
d-Modified Riesz Potentials on Central Campanato Spaces 427
Particularly
where
1/p
1 1
f W CMOp,λ = sup sup t p |{y ∈ Qr : |f (y) − fQr | > t}| ,
r≥1 rλ |Qr | t >0
Particularly
Here remark that CMOp (Rn ) and W CMOp (Rn ), so-called the central mean
oscillation (CMO) space and the weak CMO space, are defined by
p
CMOp (Rn ) = {f ∈ Lloc (Rn ) : f CMOp < ∞},
where
1/p
f CMOp = sup − |f (y) − fQr |p dy ,
r≥1 Qr
and
p
W CMOp (Rn ) = {f ∈ Lloc (Rn ) : f W CMOp < ∞},
where
1/p
1
f W CMOp = sup sup t p |{y ∈ Qr : |f (y) − fQr | > t}| ,
r≥1 |Qr | t >0
Particularly
(0)
$p,λ (Rn ) = CMOp,λ (Rn ).
Particularly
W $(0)
p,λ (R ) = W CMO
n p,λ
(Rn ).
and
1/p
1 1
f W $p,λ ∼ sup inf sup t p |{y ∈ Qr : |f (y) − P (y)| > t}| ,
r≥1 P ∈P (R ) r
d n λ |Qr | t >0
1
f Lip(d) (U ) = sup |Ad+1
h f (x)| < ∞,
β x,x+h∈U,h=0 |h| β
d-Modified Riesz Potentials on Central Campanato Spaces 429
In particular, we define
(d)
BMO(d)(U ) = Lip0 (U ),
f Lip(d) (U ) ∼ f L(d) (U ) ,
β p,β
where L(d)
p,β (U ) is the Campanato space on U as defined below.
Definition 2.9 Let U = Rn or U = Qr with r > 0. For 1 ≤ p < ∞, d ∈ N0
p
and −n/p ≤ λ < d + 1, a function f ∈ Lloc (U ) will be said to belong to the
(d)
Campanato space on U , i.e., Lp,λ (U ) if and only if for every Q(x, s) ⊂ U , there is
d
a polynomial PQ(x,s) f of degree at most d such that
1/p
1
f L(d) (U ) = sup − |f (y) − PQ(x,s)
d
f (y)|p dy < ∞.
p,λ Q(x,s)⊂U sλ Q(x,s)
Lp,λ (U ) = L(0)
p,λ (U ).
1
f Lip(d) = sup f Lip(d) (Q ) < ∞.
β,σ r≥1 rσ β r
In particular, we define
(d)
σ (R ) = Lip0,σ (R )
BMO(d) n n
and
(0)
Lipβ,σ (Rn ) = Lipβ,σ (Rn ), BMOσ (Rn ) = BMO(0) n
σ (R ),
which we call the generalized σ -BMO space and the σ -Lip space, the σ -BMO
space, respectively.
Identifying functions which differ by a polynomial of degree at most d, a.e., we
(d) (d)
see that Lipβ (Rn ) and Lipβ,σ (Rn ) are Banach spaces (see [12] and [22]).
Remark 2.12 ([17]; cf. [10]) We note that particularly
and
(0)
Lipβ (Rn ) = Lipβ (Rn ), BMO(0) (Rn ) = BMO(Rn ).
I˜α,d f (x)
⎧ ⎛ ⎞ ⎫
⎨ xl ⎬
= f (y) Kα (x − y) − ⎝ (D l Kα )(−y)⎠ (1 − χQ1 (y)) dy,
Rn ⎩ l! ⎭
{l:|l|≤d}
d-Modified Riesz Potentials on Central Campanato Spaces 431
1
Kα (x) =
|x|n−α
I˜α,0 = I˜α
In the above theorems, if d = 0, then we can get the following strong and weak
estimates for a modified Riesz potential I˜α,d on CMOp,λ (Rn ).
Corollary 3.4 (Corollary 2.7 of [19]; cf. [17]) Let 0 < α < 1, 1 < p < n/α,
−n/p + α ≤ μ = λ + α < 1 and q = pn/(n − pα), i.e., 1/q = 1/p − α/n. Then
432 K. Matsuoka
I˜α is bounded from CMOp,λ (Rn ) to CMOq,μ (Rn ), that is, there exists a constant
C > 0 such that
Corollary 3.5 Let 0 < α < 1, −n + α ≤ μ = λ + α < 1 and q = n/(n − α), i.e.,
1/q = 1 − α/n. Then I˜α is bounded from CMO1,λ (Rn ) to W CMOq,μ (Rn ), that is,
there exists a constant C > 0 such that
Next, for a d-modified Riesz potential I˜α,d , we prove our second result, i.e., the
following estimates on CMOp,λ (Rn ), where n/α ≤ p < ∞.
Theorem 3.6 Let 0 < α < 1, n/α ≤ p < ∞, d ∈ N0 , −n/p + α + d ≤ λ + α <
d +1 and β = α −n/p. Then I˜α,d is bounded from CMOp,λ (Rn ) to Lip(d) n
β,λ+n/p (R ),
that is, there exists a constant C > 0 such that
(i) when n/α < p < ∞,
4 Proofs of Theorems
First of all, we state the following well-definedness of I˜α,d for CMOp,λ (Rn ), which
is shown by the same argument as in the proof of Theorem 3.6 of [16].
Lemma 4.1 Let 0 < α < 1, 1 ≤ p < ∞, d ∈ N0 and −n/p + α ≤ λ + α < d + 1.
Then for f ∈ CMOp,λ (Rn ), I˜α,d f is well-defined.
Proof Let f ∈ CMOp,λ (Rn ), r ≥ 1 and x ∈ Qr , and let
I˜α,d f (x) = I˜α,d (f˜χQ2r )(x) + I˜α,d (f˜(1 − χQ2r ))(x) + fQ4r I˜α,d 1(x)
xl
˜
= Iα (f χQ2r )(x) − f˜(y)(D l Kα )(−y) dy
l! Q2r \Q1
{l:|l|≤d}
⎛ ⎞
xl
+ f˜(y) ⎝Kα (x − y) − (D l Kα )(−y)⎠ dy
Rn \Q2r l!
{l:|l|≤d}
where f˜ = f − fQ4r . Then, since f˜χQ2r ∈ Lp (Rn ), the first term is well-defined.
The second term is also well-defined, because (D l Kα )(χQ2r − χQ1 ) ∈ Lp (Rn ).
Here we note that the second term is a polynomial of degree at most d. For the
third term, the integral converges absolutely in virtue of Lemmas
= 4.2 and 4.4, which
are shown later, and so the present term is well-defined. As Rn \Q1 (D l Kα )(−y)dy
converges absolutely under the assumption 0 < α < 1, I˜α,d 1 ∈ P d (Qr ), and then
the forth term is well-defined.
Further, if we let for 1 ≤ s < r,
xl |x|d+1
Kα (x − y) − (D l Kα )(−y) ≤ C n−α+d+1 . (4.2)
l! |y|
{l:|l|≤d}
Lemma 4.4 (Lemma 4.1 of [16]; cf. Lemma 4.2 of [19]) Let 1 ≤ p < ∞ and
λ ∈ R. If β < 0 and β + λ < 0, then there exists a positive constant C such that
|f (y) − fQ2r |
dy ≤ Cr β+λ f CMOp,λ
Rn \Q r
|y|n−β
for all f ∈ CMOp,λ (Rn ) and r ≥ 1.
where
xl
Rrd f˜(x) = − f˜(y)(D l Kα )(−y) dy,
l! Q2r \Q1
{l:|l|≤d}
and
⎛ ⎞
xl
J˜α,d,r f (x) = f˜(y) ⎝Kα (x − y) − (D l Kα )(−y)⎠ dy.
Rn \Q2r l!
{l:|l|≤d}
d-Modified Riesz Potentials on Central Campanato Spaces 435
Then we have
1/q
|I˜α,d f (x) − R̃rd f (x)|q dx
Qr
1/q 1/q
≤ |Iα (f˜χQ2r )(x)|q dx + |J˜α,d,r f (x)|q dx
Qr Qr
=: I1 + I2 . (4.4)
xl |x|d+1 r d+1
Kα (x − y) − (D l Kα )(−y) ≤ .
l! |y|n−α+d+1 |y|n−α+d+1
{l:|l|≤d}
Consequently, we obtain by Lemma 4.4 and the assumptions 0 < α < 1 and μ+σ <
d + 1,
|f˜(y)|
|J˜α,d,r f (x)| r d+1 dy r λ+α f CMOp,λ
Rn \Q2r |y|n−α+d+1
= r μ f CMOp,λ , (4.5)
and then
1/q
1 q
I˜α,d f $(d) sup μ − I˜α,d f (y) − R̃rd f (y) dy
q,μ
r≥1 r Qr
1/q
1 1
sup · r μ+n/q f CMOp,λ ∼ f CMOp,λ .
r≥1 rμ |Qr |
Proof of Theorem 3.3 This proof is similar to that of Theorem 3.6 of [16]. Hence,
in the same way as (4.4), we have that for f ∈ CMO1,λ (Rn ), r ≥ 1 and x ∈ Qr ,
sup(2t)q x ∈ Qr : |I˜α,d f (x) − R̃rd f (x)| > 2t
t >0
≤ 2q sup t q x ∈ Qr : |Iα (f˜χQ2r )(x)| > t
t >0
=: 2 (I3 + I4 ) .
q
1/q
I3 r λ+n f CMO1,λ = r μ+n/q f CMO1,λ ,
1/q
I4 r μ+n/q f CMO1,λ .
Thus, we obtain
xl
Rrd f˜(x) = − f˜(y)(D l Kα )(−y) dy
l! Q2r \Q1
{l:1≤|l|≤d}
d-Modified Riesz Potentials on Central Campanato Spaces 437
that
=: I5 + I6 .
We firstly estimate I5 . When n/α < p < ∞, which implies 0 < β < 1, if we
apply the (Lp , Lipβ ) boundedness of I˜α , then we get
= r λ+n/p f CMOp,λ .
Similarly, when p = n/α, by using the (Ln/α , BMO) boundedness of I˜α , we obtain
1
I6 ∼ J˜α,d,r f Lip(d) (Q ) = sup |Ad+1 ˜
h Jα,d,r f (x)|,
β r
x,x+h∈Qr ,h=0 |h| β
we estimate Ad+1 ˜
h Jα,d,r f (x). To do so, if we use (4.3), Lemma 4.4 and the
assumptions 0 < α < 1 and λ + α < d + 1, then we have for x ∈ Qr and
y ∈ Rn \ Q2r ,
|Ad+1 ˜
h Jα,d,r f (x)|
⎧ ⎛ ⎞⎫
⎨ xl ⎬
= f˜(y) Ad+1 ⎝Kα (x − y) − (D l
Kα )(−y) ⎠ dy
Rn \Q2r ⎩ h l! ⎭
{l:|l|≤d}
|f˜(y)|
|h|d+1 dy |h|d+1 r α−d−1+λf CMOp,λ .
Rn \Q2r |y|n−α+d+1
438 K. Matsuoka
Hence
1
I6 sup · |h|d+1 r α−d−1+λf CMOp,λ
x,x+h∈Qr ,h=0 |h| β
1
I˜α,d f Lip(d) = sup I˜α,d f Lip(d) (Q ) f CMOp,λ .
β,λ+n/p r≥1 r λ+n/p β r
Acknowledgments The author would like to express his deep gratitude to the anonymous referees
for their careful reading and fruitful comments.
References
14. T. Kurokawa, Weighted norm inequalities for Riesz potentials. Jpn. J. Math. 14, 261–274
(1988)
15. S. Lu, D. Yang, The central BMO spaces and Littlewood–Paley operators. Approx. Theory
Appl. (N.S.) 11, 72–94 (1995)
16. K. Matsuoka, Bσ -Morrey–Campanato estimates and some estimates for singular integrals on
central Morrey spaces and λ-CMO spaces, in Banach and Function Spaces IV (Kitakyushu
2012) (Yokohama Publishers, Yokohama, 2014), pp. 325–335
17. K. Matsuoka, Generalized fractional integrals on central Morrey spaces and generalized λ-
CMO spaces, in Function Spaces X, Banach Center Publications, vol. 102 (Institute of
Mathematics, Polish Academy of Sciences, Warsawa, 2014), pp. 181–188
18. K. Matsuoka, Generalized fractional integrals on central Morrey spaces and generalized σ -
Lipschitz spaces, in Current Trends in Analysis and its Applications: Proceedings of the
9th ISAAC Congress, Kraków 2013. Springer Proceedings in Mathematics and Statistics
(Birkhäuser, Basel, 2015), pp. 179–189
19. K. Matsuoka, E. Nakai, Fractional integral operators on B p,λ with Morrey–Campanato norms,
in Function Spaces IX, Banach Center Publishers, vol. 92 (Institute of Mathematics, Polish
Academy of Sciences, Warsawa, 2011), pp. 249–264
20. Y. Mizuta, On the behaviour at infinity of superharmonic functions. J. Lond. Math. Soc. 27,
97–105 (1983)
21. Y. Mizuta, Potential Theory in Euclidean Spaces (Gakkōtosho, Tokyo, 1996)
22. E. Nakai, Y. Sawano, Hardy spaces with variable exponents and generalized Campanato spaces.
J. Funct. Anal. 262, 3665–3748 (2012)
23. J. Peetre, On the theory of Lp,λ spaces. J. Funct. Anal. 4, 71–87 (1969)
24. A. Pietsch, History of Banach Spaces and Linear Operators (Birkhäuser, Boston, 2007)
25. S.L. Sobolev, On a theorem in functional analysis. Mat. Sbornik 4, 471–497 (1938) (in Russian)
26. A. Zygmund, On a theorem of Marcinkiewicz concerning interpolation of operations. J. Math.
Pures Appl. 35, 223–248 (1956)
On Some Consequences of the Solvability
of the Caffarelli–Silvestre Extension
Problem
1 Introduction
Since the well-known Caffarelli–Silvestre paper [4] from 2007 various authors
considered an abstract version of the there presented problem by having a look on
the abstract (incomplete) ODE problem
1 − 2α
u (t) + u (t) = Au(t) (t > 0), u(0) = x (1.1)
t
J. Meichsner ()
Technische Universität Hamburg, Institut für Mathematik, Hamburg, Germany
e-mail: [email protected]
C. Seifert
Technische Universität Hamburg, Institut für Mathematik, Hamburg, Germany
Technische Universität Clausthal, Institut für Mathematik, Clausthal-Zellerfeld, Germany
e-mail: [email protected]; [email protected]
in some Hilbert or more general Banach space X, see [2, 5, 8, 9]. The abstract
setting made it necessary to introduce new tools and viewpoints completely different
from the techniques used in [4] for the rather concrete problem considered there.
This development was first started by Stinga–Torrea in [9] by the introduction
of explicit formulas for solutions of (1.1) and further continued by Galé–Miana–
Stinga in [5]. Typically, the operator A appearing on the right-hand side of (1.1)
is assumed to be sectorial (with or without dense domain). The basic question
of interest is to establish existence and uniqueness results. For the special choice
of α = 1/2 the singular first order term in (1.1) vanishes and the problem
reduces to an incomplete second order Cauchy problem as already considered by
Balakrishnan in [3, Theorem 6.1], where the well-posedness for a sectorial operator
A is shown under the additional assumption of a globally bounded solution u
of (1.1). Concerning generalisations of Balakrishnan’s result, in [9] the authors
considered the situation for a self-adjoint, positive, so in particular sectorial,
operator on a Hilbert space. The authors derived two explicit formulas for a solution
to (1.1) including the Stinga-Torrea formula, a Poisson formula for the solution.
The uniqueness of the solution is also shown under the assumption of A having
a pure point spectrum. The corresponding Banach space setting was considered
in [5] where the authors constructed explicitly solutions for both, generators of
tempered integrated semigroups and tempered integrated cosine families. These
results particularly cover generators of bounded C0 -semigroups. Combined with
the uniqueness result from [8], (1.1) is a well-posed problem and this fact actually
extends to arbitrary sectorial right-hand sides. Given an initial condition for the
first derivative one can also tackle the problem by cosine functions, see e.g. [1,
Chapter 3]. The connection between the various results was explored a bit in [8],
where one can see that the boundedness assumption appearing in Balakrishnan’s
paper ‘couples’ the two initial values and thus is suitable to replace one of them.
In [6], the authors consider the problem from a slightly different point of view.
They assume the ODE (1.1) to be given for α = 1/2 but merely assume A to be
closed with non-empty resolvent set. As a consequence of the well-posedness of
the problem, the existence of a generator B of a C0 -semigroup is deduced which
satisfies B 2 = A. Further, analyticity of √ the solution u of (1.1) on a sector is shown
to imply sectoriality of A and then B = A in the functional calculus sense.
This is the starting point of this paper. More precisely, we attempt to state the
right setting for proving similar results as explained above in the more general case
α ∈ (0, 1). Making use of similar strategies as in [6, Theorem 6.3.3], generalisations
of the first result on factorizing A (as B 2 ) can be obtained; cf. Corollary 2.13. For the
second result that analyticity of the solution u of (1.1) on a sector yields sectoriality
of the operator A, we formulate a conjecture.
Consequences of Solvability of the Extension Problem 443
In this paper X will always denote a complex Banach space with norm ·. Further
let A be a closed linear operator in X with non-empty resolvent set ρ(A) and with
dense domain D(A). We will interpret D(A) as a Banach space equipped with the
graph norm
1 − 2α
u (t) + u (t) = Au(t) (t > 0).
t
For the whole paper, we may assume as a standing assumption that Pα (x) has a
unique solution, denoted by ux , for every x ∈ D(A). This fact will not be mentioned
explicitly anymore but the reader should be always aware of it.
Remark 2.1 As noted in the introduction, there are various cases where Pα (x) has
a unique solution for all x ∈ D(A), e.g., when A is the generator of a bounded C0 -
semigroup ([5, Theorem 2.1]) or when A is the generator of a tempered integrated
cosine family ([5, Theorem 1.3] combined with [8, Theorem 5.8]).
For x ∈ D(A) and t ≥ 0 we may then define U x(t) := ux (t).
Lemma 2.2 We have U ∈ L D(A), Cb [0, ∞); D(A) .
Proof The proof essentially works as the one for the special case α = 1/2 as
presented in [6, Theorem 6.3.3] with the help of the closed graph theorem. So let
(xn ) be a sequence in D(A) which converges w.r.t. ·D(A) towards x ∈ D(A).
Further, assume that (U xn ) is convergent in Cb [0, ∞); D(A) towards some
element v. Let n, m ∈ N. From the fact that U xn and U xm solve Pα (xn ) and Pα (xm ),
respectively, we get (for t ≥ 0)
; ;
; 2α−1 d 1−2α d 2α−1 d 1−2α d
;
;t t U xn (t) − t t U xm (t);
; dt dt dt dt ;
t2
d d
t11−2α U xk (t1 ) − t21−2α U xk (t2 ) = s 1−2α AU xk (s)ds.
dt dt
t1
d d
t 1−2α U xn (t) − t 1−2α U xm (t)
dt dt
t
= yn − ym + s 1−2α A(U xn − U xm )(s)ds. (2.1)
0
Now, we multiply (2.1) by t 2α−1 , integrate again, take norms and solve for
yn − ym . Then we obtain
t 2α
yn − ym
2α
; ;
; t s ;
; ;
;
= ;U (xn − xm )(t) − (xn − xm ) − s 2α−1
r 1−2α
AU (xn − xm )(r)drds ;
;
; ;
0 0
t2
≤ 2+ U xn − U xm Cb ([0,∞);D(A)) .
4 − 4α
t 2−2α
≤ yn − ym + U xn − U xm Cb ([0,∞);D(A)) .
2 − 2α
Consequences of Solvability of the Extension Problem 445
Therefore, there exists u1 ∈ C (0, ∞); X such that
d
t → t 1−2αU xn (t) → t → t 1−2α u1 (t)
dt
uniformly on compacts in [0, ∞). Thus, v ∈ C 1 (0, ∞); X .
Furthermore, for t > 0 we have
; 2 ;
;d d2 ;
; U x (t) − U x (t) ;
; dt 2 n dt 2
m ;
; ;
; 2α−1 d 1−2α d 2α−1 d 1−2α d
;
≤; ; t t U x n (t) − t t U x m (t) ;
;
dt dt dt dt
; ;
|1 − 2α| ; ; d U xn (t) − d U xm (t);
;
+ ; ;
t dt dt
→ 0,
uniformly for t in a compact set K ⊂ (0, ∞). Therefore, v ∈ C 2 (0, ∞); X . Since
For x ∈ X let PLα (x) be the problem of finding u ∈ Cb [0, ∞); X with u(0) = x
and such that for all φ ∈ Cc∞ (0, ∞) we have
∞ ∞ ∞
d 1−2α d 2α−1
u(t)φ(t)dt ∈ D(A) and u(t) t t φ(t)dt = A u(t)φ(t)dt.
dt dt
0 0 0
U xCb ([0,∞);X)
; ;
; ;
= ;(λ − A)U (λ − A)−1 x ;
Cb ([0,∞);X)
; ; ; ;
; ; ; ;
≤ (|λ| + 1) ;U (λ − A)−1 x ; + ;AU (λ − A)−1 x ;
Cb ([0,∞);X) Cb ([0,∞);X)
; ;
; ;
≤ C1 ;(λ − A)−1 x ; ≤ C2 x
D (A)
∞ ∞
d d d 1−2α d 2α−1
U x(t) t 1−2α t 2α−1 φ(t)dt = lim U xn (t) t t φ(t)dt
dt dt n→∞ dt dt
0 0
∞
d 1−2α d
= lim t 2α−1 t U xn (t)φ(t)dt
n→∞ dt dt
0
∞
= lim AU xn (t)φ(t)dt
n→∞
0
∞
= lim A U xn (t)φ(t)dt,
n→∞
0
Consequences of Solvability of the Extension Problem 447
∞ ∞
U xn (t)φ(t)dt → U x(t)φ(t)dt,
0 0
by closedness of A we observe
∞
U x(t)φ(t)dt ∈ D(A)
0
and
∞ ∞
d d
U x(t) t 1−2α t 2α−1 φ(t)dt = A U x(t)φ(t)dt.
dt dt
0 0
Also
Lα (x) as claimed.
So U x is a solution to P
Lα (x) is unique. Indeed, if u is a solution to P
Note that the solution of P Lα (0) then
v := (λ − A)−1 u is a solution to Pα (0), hence v = 0 and, therefore, u = 0.
The uniqueness of a solution to P Lα (x) yields the extension of the already found
relationships between U , A and its resolvent (λ−A)−1 to U (λ−A)−1 = (λ−A)−1 U
and AU x = U Ax for x ∈ D(A).
Remark 2.5 In view of Lemma 2.4, the authors came to the conclusion that defining
a solution to the considered problem in [8] would have been better in the way as in
Lα (x).
P
Next we show that the smoothness of u and its scaled derivatives tells us precisely
about the smoothness of the initial datum x.
Definition 2.6 We define the operator Dα in Cb [0, ∞); X by
D(Dα ) := u ∈ Cb [0, ∞); X ∩ C 1 (0, ∞); X |
t → t 1−2α u (t) ∈ Cb [0, ∞); X ,
Dα u := t → t 1−2α u (t) .
D1−α Dα U x = AU x = U Ax
(D1−α Dα )k U x = U Ak x.
Conversely, let k ≥ 1 and U x ∈ D (D1−α Dα )k . Consider again
first the case
k = 1 and choose for s > 0 a sequence (φk ) in Cc∞ (0, ∞) which converges
Lα (x). Hence,
towards δs in Cc (0, ∞) . The function U x solves P
∞ ∞
D1−α Dα U x(t)φk (t)dt = A U x(t)φk (t)dt
0 0
d
Bα x := lim t 1−2α U x(t).
t →0+ dt
Bα (λ − A)−1 x = (λ − A)−1 Bα x
for x ∈ D(A). From the proof of Lemma 2.2 one can see the existence of a constant
C > 0, depending on a fixed parameter T ∈ [0, ∞) (let us take T = 1, say), such
that Bα x ≤ C xD(A) . Commutativity with the resolvent of A and continuity
with respect to the graph norm imply that Bα is closable as an operator in X with
domain D(A) ⊆ X. To see this, consider (xn ) in D(A) convergent towards 0 in X
Consequences of Solvability of the Extension Problem 449
which shows the closability since the above inequality implies y = 0. Let us denote
the closure of Bα in X again by Bα . The next statement will be an auxiliary lemma
for complex-valued functions.
Lemma 2.8 Let r ∈ [0, 1), u ∈ Cb [0, ∞) ∩ C 2 (0, ∞) such that t →
t r dtd t −r u (t) ∈ Cb [0, ∞) . Then also t → t −r u (t) ∈ Cb [0, ∞) .
Proof For r = 0 this is a standard result. The integrability of u near t = 0 implies
the continuity of u at t = 0 while the boundedness of u for large values of t
follows from Taylor’s theorem. For the general case consider the function v given
1
by v(s) := u s 1+r . By assumption v ∈ Cb [0, ∞) ∩ C 2 (0, ∞) and with the
1
relationship t := s 1+r we can write v (s) = u (t)t −r and
d −r
v (s) = t −r t u (t).
dt
The claim follows now by applying the result for r = 0 to the function v.
Corollary 2.9 Let x ∈ D(A) and α ∈ [1/2, 1). Then
d
t → t 1−2α U x(t) ∈ Cb [0, ∞); X .
dt
d
v := t → t 1−2α
Uα x(t) .
dt
By Corollary 2.9 we have v ∈ Cb [0, ∞); X and by the definition of Bα we
have v(0) = Bα x. Let us finally check that v solves the ODE for P1−α (Bα x).
Then v = U1−α Bα x. Let t > 0. Then
d d d
t 2α−1 v(t) = t 2α−1 t 1−2α Uα x(t) = AUα x(t) = Uα Ax(t)
dt dt dt
and
d 2α−1 d d
t 1−2α t v(t) = t 1−2α Uα Ax(t) = Av(t),
dt dt dt
where we used the closedness of A for the last equality.
(ii) Let us turn to the case α ∈ (0, 1/2). In this case, consider the function ṽ given
by
where for the boundedness of w̃ we used Lemma 2.8 with r := 1 − 2α. Note
that w̃(0) = B1−α Bα x. If we apply (i) to 1 − α instead of α we find
d 2α−1 d d
t 1−2α t U1−α x(t) = U1−α Ax(t) = t 1−2α Uα B1−α x(t).
dt dt dt
Consequences of Solvability of the Extension Problem 451
L
w̃(0) = Ax. Finally, we show that w̃ solves Pα (Ax).
Therefore, we observe
Let φ ∈ Cc∞ (0, ∞) . Since w solves the ODE in (1.1) we obtain
∞ ∞
d d d 2α−1
w̃(t) t 1−2α t 2α−1 φ(t)dt = − AU1−α Bα x(t) t φ(t)dt.
dt dt dt
0 0
∞ ∞
d 2α−1
w̃(t)φ(t)dt = − U1−α Bα x(t) t φ(t)dt ∈ D(A)
dt
0 0
and
∞ ∞
d
− AU1−α Bα x(t) t 2α−1 φ(t)dt = A w̃(t)φ(t)dt.
dt
0 0
Proof Let x ∈ D(Bα ). Then there exists a sequence (xn ) in D(A) which is
convergent towards x in X such that Bα xn → Bα x. By Lemma 2.4, we have
Uα xn → Uα x and by Theorem 2.10 we obtain
Dα Uα xn = U1−α Bα xn → U1−α Bα x.
∞
∀f ∈ C (0, ∞) : lim f (t)tφk (t)dt = f (s).
k→∞
0
d
t 1−2α Uα xn (t) = U1−α Bα xn (t) = Bα U1−α xn (t).
dt
Using the closedness of Bα , by Hille’s theorem we obtain
∞ ∞
1−2α d
t Uα (t)xn φk (t)dt = Bα U1−α (t)xn φk (t)dt.
dt
0 0
Sending n → ∞ we get
∞ ∞
1−2α d
t Uα (t)x φk (t)dt = Bα U1−α (t)x φk (t)dt.
dt
0 0
d
s 1−2α Uα (s)x = Bα U1−α x(s)
ds
Sending s → 0+ and using a last time the closedness of Bα finally yields x ∈
D(Bα ).
Corollary 2.13 We have Bα B1−α = B1−α Bα = A.
Proof Note, that we already showed Bα B1−α x = B1−α Bα x = Ax for x ∈ D(A2 ).
So, let now x ∈ D(A). Then x ∈ D(Bα ), essentially by definition and the fact that
Consequences of Solvability of the Extension Problem 453
References
A. Merzon ()
Instituto de Física y Matemáticas, Universidad Michoacana de San Nicolás de Hidalgo, Morelia,
Michoacán, México
P. Zhevandrov
Facultad de Ciencias Físico-Matemáticas, Universidad Michoacana de San Nicolás de Hidalgo,
Morelia, Michoacán, México
J. E. De la Paz Méndez · T. J. Villalba Vega
Universidad Autonoma de Guerrero, Chilpancingo, Guerrero, México
1 Introduction
The main goal of this paper is to prove the uniqueness of a solution to the Som-
merfeld half-plane problem [23, 32, 33] with a real wave number, proceeding from
the uniqueness of the corresponding time-dependent problem in a certain functional
class. The existence and uniqueness of solutions to this problem was considered in
many papers, for example in [8, 12, 25]. However, in our opinion, the problem of
uniqueness is still not solved in a satisfactory form from the point of view of the
boundary value problems (BVPs). The fact is that this problem is a homogeneous
BVP boundary value problem which admits various nontrivial solutions. Usually the
“correct” solutions are chosen by physical reasoning [23, 25, 32, 33], for example,
using the Sommerfeld radiation conditions and regularity conditions at the edge.
The question is: from where do the radiation and regularity conditions arise,
from the mathematical point of view?
Our goal is to show that they arise automatically from the non-stationary
problem. This means the following: we prove that the Sommerfeld solution is a
limiting amplitude of a solution to the corresponding non-stationary problem which
is unique in an appropriate functional class. Since the Sommerfeld solution, as is
well-known, satisfies the radiation and regularity conditions, our limiting amplitude
also satisfies them. Of course, the limiting amplitude principle (LAP) is very well-
known for the diffraction by smooth obstacles, see e.g. [28, 29], but we are unaware
of its rigorous proof in the case of diffraction by a half-plane.
The literature devoted to diffraction by wedges including the Sommerfeld
problem is enormous (see e.g. the review in [20]), and we will only indicate some
papers where the uniqueness is treated. In paper [25] a uniqueness theorem was
proven for the Helmholtz equation (! + 1)u = 0 in two-dimensional regions D of
half-plane type. These regions can have a finite number of bounded obstacles with
singularities on their boundaries. In particular, the uniqueness of solution u to the
Sommerfeld problem was proven by means of the decomposition of the solution into
the sum u = g + h, where g describes the geometrical optics incoming and reflected
waves and h satisfies the Sommerfeld radiation condition (clearly, u should also
satisfy the regularity conditions at the edge).
In paper [8] exact conditions were found for the uniqueness in the case of
complex wave number. The problem was considered in Sobolev spaces for a wide
class of generalized incident waves, and for DD and NN boundary conditions. In
paper [12] the same problem was considered also for the complex wave number and
for DN boundary conditions. In both papers the Wiener-Hopf method has been used.
Time-dependent scattering by wedges was considered in many papers although their
number is not so large as the number of papers devoted to the stationary scattering
by wedges. We indicate here the following papers: [1–4, 13, 14, 24, 26–31]. The
detailed description of these papers is given in [19].
In [6, 7, 10, 17–20, 22], the diffraction by a wedge of magnitude φ (which can be
a half-plane in the case φ = 0 as in [20]) with real wavenumber was considered as
a stationary problem which is the “limiting case” of a non-stationary one. More
Time-Dependent Approach to Uniqueness of the Sommerfeld Solution 457
all what follows.) The non-stationary incident plane wave in the absence of obstacles
reads
where
f (s) = 0, s < 0, sup(1 + |s|)p |f (s)| < ∞ for some p ∈ R, lim f (s) = 1.
s→+∞
(1.3)
In this case the front of the incident wave ui reaches the half-plane W 0 for the first
time at the moment t = 0 and at this moment the reflected wave ur (x, t) is born
(see Fig. 1). Thus
ur (x, t) ≡ 0, t < 0.
Note that for t → ∞ the limiting amplitude of ui is exactly equal to the Sommerfeld
incident wave [33] by (1.3), cf. also (2.1) below.
The time-dependent scattering with the Dirichlet boundary conditions is
described by the mixed problem
u(x, t) := (∂t2 − !)u(x, t) = 0, x ∈ Q
t ∈ R, (1.5)
u(x1 , ±0, t) = 0, x1 > 0
Denote
ϕ± := π ± α. (1.11)
Let us define the nonstationary incident wave in the presence of the obstacle W 0 ,
which is the opaque screen,
ui (ρ, ϕ, t), 0 < ϕ < ϕ+ ,
u0i (ρ, ϕ, t) := (1.13)
0, ϕ+ < ϕ < 2π.
460 A. Merzon et al.
Remark 1.2 The function us has no physical sense, since ui = u0i . The wave us
coincides with the scattered wave u0s := u − u0i in the zone {(ρ, ϕ) : 0 < ϕ < ϕ+ },
but in the zone {(ρ, ϕ) : ϕ+ < ϕ ≤ 2π} we have u0s = us + ui .
The goal of the paper is to prove that the Sommerfeld solution of half-plane
diffraction problem is the limiting amplitude of the solution to time-dependent
problem (1.5), (1.6) (with any f satisfying (1.3)) and this solution is unique in an
appropriate functional class.
The paper is organized as follows. In Sect. 2 we recall the Sommerfeld solution.
In Sect. 3 we reduce the time-dependent diffraction problem to a “stationary” one
and define a functional class of solutions. In Sect. 4 we give an explicit formula
for the solution of time-dependent problem and prove that the Sommerfeld solution
is its limiting amplitude. In Sect. 5 we prove that the solution belongs to a certain
functional class. Finally, in Sect. 6 we prove the uniqueness.
2 Sommerfeld’s Diffraction
Let us recall the Sommerfeld solution [23, 33]. The stationary incident wave (rather,
the incident wave limiting amplitude) in the presence of the obstacle is
e−iω0 ρ cos(ϕ−α) , ϕ ∈ (0, ϕ+ ),
A0i (ρ, ϕ) = (2.1)
0, ϕ ∈ (ϕ+ , 2π).
We denote this incident wave as A0i since it is the limiting amplitude of the non-
stationary incident wave u0i given by (1.13):
in view of formula (1.1), see Remark 1.2. The Sommerfeld half-plane diffraction
problem can be formulated as follows: find a function A(x), x ∈ Q, such that
(! + ω02 )A(x) = 0, x ∈ Q,
(2.2)
A(x1 , ±0) = 0, x1 > 0,
where
−1 −1
ζ (γ , ϕ) := 1 − ei(−γ +ϕ−α)/2 − 1 − ei(−γ +ϕ+α)/2 , γ ∈C (2.6)
and C is the Sommerfeld contour (see [20, formula (1.1) and Fig. 3]).
In the rest of the paper we prove that this solution is the limiting amplitude of the
solution of time-dependent problem (1.5) and is unique in an appropriate functional
class.
The Sommerfeld diffraction problem can also be considered for NN and DN
half-plane. The corresponding formulas for the solution can be found in [19].
Sommerfeld obtained his solution using an original method of solutions of the
Helmholtz equation on a Riemann surface. Note that a similar approach was used
for the diffraction by a wedge of rational angle [9], where well-posedness in suitable
Sobolev space was proved.
Let
h(ω), ω ∈ C+ , denote the Fourier–Laplace transform Ft →ω of h(t),
∞
h(ω) = Ft →ω [h(t)] = eiωt h(t) dt, h ∈ L1 (R+ ); (3.1)
0
Let us calculate
ui (x, ω). Changing the variable t − n · x = τ , and using the fact
that suppf ⊂ R+ we obtain from (1.1) and (1.2) that
Hence,
We are going to prove the existence and uniqueness of solution to problem (1.5),
(1.6) such that us given by (1.7) belongs to the space M, which is defined as follows:
Definition 3.1 M is the space of functions u(x, t) ∈ S (R2 × R+ ) such that its
u(x, ω) is a holomorphic function on ω ∈ C+ with
Fourier–Laplace transform
2
values in C (Q) and
u(·, ·, ω) ∈ H 1 (Q)
(3.5)
for any ω ∈ C+ .
Remark 3.2 We use the classical definition [11] of the space H 1 (Q) as the
completion of the space of smooth functions on Q with respect to the corresponding
norm. This definition does not coincide with the frequently used definition of
H 1 (Q) as the space restrictions of distributions from H 1 (R) to Q. In our case these
definitions lead to different spaces; in particular, the latter definition does not allow
for functions which are discontinuous across W 0 . In [34], another space allowing for
the same class of functions was introduced; the proof of uniqueness of the solution
to our problem in that space is an open question.
Remark 3.3 Note that ui (x, t) R2 ×R+
∈
/ M, where for ϕ ∈ D(R2 ),
ui (x, t) R2 ×R+
, ϕ := u(x, t)ϕ(x, t) dx dt.
R2 ×R+
In fact, eiωn·x = eω2 ρ cos(ϕ−α) and, for α − π/2 < ϕ < α + π/2, ω ∈ C+ it grows
exponentially as ρ → ∞, and hence does not satisfy (3.5); because of this we use
system (1.8)–(1.10) instead of (1.5) (they are equivalent by (1.6)) since (1.8)–(1.10)
Time-Dependent Approach to Uniqueness of the Sommerfeld Solution 463
involves only the values of ui on the boundary and the latter possess the Fourier–
Laplace transforms which do not grow exponentially.
Remark 3.4 Since for a (weak) solution of the Helmholtz equation us ∈ H 1 (Q)
the Dirichlet and Neumann data exist in the trace sense and in the distributional
sense, respectively (see, e.g., [5]), problem (3.4) is well-posed. Hence, problem
(1.8)–(1.10) is well-posed too.
In paper [20] we solved problem (1.5) and (1.6). Let us recall the corresponding
construction. First we define the non-stationary reflected wave [20, formula (26)]:
−e−iω0 (t −n·x) f (t − n · x), ϕ ∈ (0, ϕ− )
ur (x, t) = t ≥ 0, (4.1)
0, ϕ ∈ (ϕ− , 2π)
and
i
ud (ρ, ϕ, t) = Z(β, ϕ)F (t − ρ cosh β) dβ, (4.3)
8π
R
Obviously, the condition supp F ⊂ [0, ∞) (see (3.1)) implies that supp ud (·, ·, t) ⊂
[0, +∞).
Remark 4.1 The function U (γ + ϕ) essentially coincides with the Sommerfeld
kernel (2.6). This is for a reason. In paper [17] it was proven that the solution to the
corresponding time-dependent diffraction problem by an arbitrary angle φ ∈ (0, π]
belonging to a certain class similar to M necessarily has the form of the Sommerfeld
type integral with the Sommerfeld type kernel.
Finally, we proved [20, Th. 3.2, Th 4.1] the following.
Theorem 4.2
(i) For f ∈ L1loc (R) the function
This BVP (as well as (2.2)) is ill-posed since the homogeneous problem admits
many solutions (i.e., the solution is nonunique).
Time-Dependent Approach to Uniqueness of the Sommerfeld Solution 465
Remark 4.5 As can be decomposed similarly to (2.3). Namely, by (4.8) and (2.3),
we have
where A1i (x) = Ai (x) − A0i (x). Obviously, problems (4.9), (4.10) and (2.2), (2.3)
with condition (2.5) are equivalent, but the first problem is more convenient as we
will see later.
In this section we will obtain an explicit formula for the solution of (3.4) and prove
that it belongs to H 1 (Q) for all ω ∈ C+ .
Let Z(β, ϕ) be given by (4.2). First, we will need the Fourier–Laplace transforms
of the reflected and diffracted waves (4.1), (4.3).
Lemma 5.1 The Fourier–Laplace transforms of ur and ud are
−f(ω − ω0 )e−iωρ cos(ϕ+α) , ϕ ∈ (0, ϕ− ),
ur (x, ω) = (5.1)
0, ϕ ∈ (ϕ− , 2π),
i
ud (ρ, ϕ, ω) = f (ω−ω0 ) Z(β, ϕ) eiωρ cosh β dβ, ω ∈ C+ , ϕ = ϕ± . (5.2)
8π
R
Further,
∞
−iω0 (t −n·x)
−Ft →ω e f (t − n · x) = −e iω0 (n·x)
ei(ω−ω0 )t f (t − n · x) dt.
0
∞
iω n·x
ur (x, ω) = −e ei(ω−ω0 )τ f (τ ) dτ, ϕ ∈ (0, ϕ− ).
−n·x
466 A. Merzon et al.
Moreover, by (4.1),
since π/2 < α < ϕ + α < π by (1.4) and (1.11). Hence, we obtain (5.1), since
suppf ⊂ R+ . The second formula in (5.1) follows from definition (4.1) of ur .
Let us prove (5.2). Everywhere below we put ω = ω1 + iω2 , ω1,2 ∈ R, ω2 > 0,
for ω ∈ C+ . By Lemma 8.1(i), (1.3) and (4.4) we have
We have
∞
G(ρ, β, ω) := Ft →ω F (t − ρ cosh β) = eiωt F (t − ρ cosh β) dt, ω ∈ C+ .
0
Making the change of the variable τ = t − ρ cosh β in the last integral and
using the fact that supp F ⊂ [0, ∞) and F (ω) = f(ω − ω0 ) by (4.4), we get
G(ρ, β, ω) = e iωρ cosh β
f (ω − ω0 ). Substituting this expression into (5.3) we obtain
(5.2). Lemma 5.1 is proven.
Lemma 5.2 For any ω ∈ C, there exist C(ω), c(ω) > 0, such that both functions
ur and ∂ρ
ur admit the same estimate
ur (ρ, ϕ, ω) ≤ C(ω)e−c(ω)ρ
ρ > 0, ϕ ∈ (0, 2π), ϕ = ϕ± . (5.4)
ur (ρ, ϕ, ω) ≤ C(ω)e−c(ω)ρ
∂ρ
and ∂ϕ
ur (ρ, ϕ, ω) admits the estimate
by (1.4). Therefore (5.4) holds for ur . Hence, differentiating (5.1) we obtain (5.4)
for ∂ρ
ur and (5.5) for ∂ϕ
ur , for ϕ = ϕ− .
Proposition 5.3 There exist C(ω), c(ω) > 0 such that the function
ud , and ∂ρ
ud ,
∂ϕ
ud admit the estimates
ud (ρ, ϕ, ω) ≤ C(ω)e−c(ω)ρ ,
where
A(ρ, ϕ, ω) := Z(β, ϕ)eiωρ cosh β dβ, ϕ = ϕ± . (5.8)
R
Represent A as A = A1 + A2 , where
1
A1 (ρ, ϕ, ω) := Z(β, ϕ)eiωρ cosh β dβ
−1 ϕ ∈ (0, 2π), ϕ = ϕ± .
A2 (ρ, ϕ, ω) := Z(β, ϕ)eiωρ cosh β dβ
|β|≥1
(5.9)
468 A. Merzon et al.
The estimate (5.7) for A2 follows from (8.1) (see Appendix 1). It remains to
prove the same estimate for the function A1 . Let
ε± := ϕ± − ϕ. (5.10)
Representing A1 as
1
A1 (ρ, ϕ, ω) = −4K0 (ρ, w, ε+ ) + 4K0 (ρ, w, ε− ) + Ž(β, ϕ)eiωρ cosh β dβ,
−1
where K0 is defined by (8.7), we obtain (5.7) for A1 from Lemma 8.2 (i) and
(8.3).
(II) Let us prove (5.6) for ∂ρ
ud . By (5.2) it suffices to prove that
where
B(ρ, ϕ, ω) := Z(β, ϕ) cosh β eiωρ cosh β dβ.
R
1
B1 (ρ, ϕ, ω) := Z(β, ϕ) cosh βeiωρ cosh β dβ,
−1
B2 (ρ, ϕ, ω) := Z(β, ϕ) cosh βeiωρ cosh β dβ, ϕ = ϕ± .
|β|≥1
∞
1
eβ/2 e− 2 ω2 ρe dβ.
β
|B2 (ρ, ϕ, ω)| ≤ C1
1
Since for ρ ≥ 1,
∞
e−ω2 ξ/2 2 −ω2 ρ/2
1/2
dξ ≤ e ,
ξ ω2
ρ
1
B1 (ρ, ϕ, ω) = −4K1 (ρ, ω, ε+ ) + 4K1 (ρ, ω, ε− ) + Ž (β, ϕ) · cos β e iωρ cosh β dβ.
−1
Hence, B1 satisfies (5.7) (and, therefore, (5.11)) by Lemma 8.2 (i) and (8.3).
(III) Let us prove (5.6) for ∂ϕ
ud . By (5.2) it suffices to prove this estimate for ∂ϕ A,
where A is given by (5.8). From (9.3) we have
Similarly to the proof of estimate (5.11) for B, we obtain the same estimate
for A3 , so, by (5.12), the estimate (5.6) follows. Proposition 5.3 is proven.
u0s (ρ, ϕ, ω) =
ur (ρ, ϕ, ω) +
ud (ρ, ϕ, ω), ϕ = ϕ± , ω ∈ C+ , (5.15)
ur and
where ud are defined by (5.1) and (5.2), respectively. Hence the statement
follows from Lemma 5.2 and Proposition 5.3.
470 A. Merzon et al.
To estimate
us it is convenient to introduce one more “part” u1i of the non-stationary
incident wave ui , namely the difference between ui and u0i .
From (1.7) and (5.13) it follows that
where u1i (ρ, ϕ, t) := ui (ρ, ϕ, t) − u0i (ρ, ϕ, t). From (1.1) and (1.13) it follows that
0, 0 < ϕ < ϕ+ ,
u1i (ρ, ϕ, t) = (5.17)
−ui (ρ, ϕ, t), ϕ+ < ϕ < 2π.
By (3.3),
0, 0 < ϕ < ϕ+ ,
u1i (ρ, ϕ, ω) = (5.18)
−f(ω − ω0 ) eiωn·x , ϕ+ < ϕ < 2π.
u1i , ∂ρ
Lemma 5.5 There exist C(ω), c(ω) > 0 such that u1i satisfy (5.4) and ∂ϕ
u1i
satisfies (5.5) for ϕ ∈ (0, 2π), ϕ = ϕ± .
Proof By (3.3) it suffices to prove the statement for eiωn·x when ϕ ∈ (ϕ+ , 2π).
Since |eiωn·x | = eω2 ρ cos(ϕ−α) , ϕ ∈ (ϕ+ , 2π) we have
and for ϕ ∈ (ϕ+ , 2π), we have |eω2 ρ cos(ϕ−α) | ≤ e−cω2 ρ , c > 0, ϕ ∈ (ϕ+ , 2π),
because cos(ϕ − α) ≤ −c < 0 by (1.4). Hence the statement follows from (5.19).
us , ∂ρ
Corollary 5.6 The functions us and ∂ϕ
us satisfy (5.6) for ϕ ∈ (0, 2π), ϕ =
ϕ± .
Proof From (5.16) it follows that
us (ρ, ϕ, ω) =
u0s (ρ, ϕ, ω) −
u1i (ρ, ϕ, ω). (5.20)
Thus the statement follows from Corollary 5.4 and Lemma 5.5.
It is possible to get rid of the restriction ϕ = ϕ± in Corollary 5.6.
Let l± = {(ρ, ϕ) : ρ > 0, ϕ = ϕ± }.
Time-Dependent Approach to Uniqueness of the Sommerfeld Solution 471
(! + ω2 )
us (ρ, ϕ, ω) = 0, (ρ, ϕ) ∈ Q, ω ∈ C+ . (5.21)
us =
ur +
ud −
u1i . (5.22)
The function ur satisfies (5.21) for ϕ = ϕ± , u1i satisfies (5.21) for ϕ = ϕ± by
(5.17) and (3.3) and ud satisfies (5.21) for ϕ = ϕ± by (5.2), see Appendix 2. It
remains only to prove that us ∈ C 2 (Q), because this will mean that (5.6) holds by
Corollary 5.6 (and continuity) and (5.21) holds in Q including l± .
Let us prove this for ϕ close to ϕ− . The case of ϕ close to ϕ+ is analyzed
similarly.
Let h(s) be defined in (C \ R) ∩ B(s ∗ ), where B(s ∗ ) is a neighborhood of s ∗ ∈ R.
Define the jump of h at the point s ∗ as
= −J (
ur (ρ, ϕ, ω), ϕ− ) . (5.23)
Further, by (8.4),
ud (ρ, ϕ, ω), ϕ− = 0 = −J ∂ϕ
J ∂ϕ ur (ρ, ϕ, ω), ϕ− .
Finally, consider
M := J ∂ϕϕ
ud (ρ, ϕ, ω), ϕ− .
472 A. Merzon et al.
Similarly to (5.23), expanding eiωρ cos β in the Taylor series in β (at 0) and noting
= β k dβ
that all the terms (β+iε )3
, k = 2, are continuous, we obtain
−
1 ε− =−0
i eiωρ cos β
M = − f(ω − ω0 ) dβ
π (β + iε− )3 ε− =+0
−1
1 ε− =−0
−i f(ω − ω0 )(iωρ)eiωρ β2
= dβ .
2π (β + iε− )3 ε− =+0
−1
Hence,
M = f(ω − ω0 )(iωρ)eiωρ = −J (
ur (ρ, ϕ, ω), ϕ− ) .
| sin ϕ|2
|∂x1 us (·, ·, ω)|2 ≤ | cos ϕ|2 |∂ρ us (·, ·, ω)|2 + |∂ϕ us (·, ·, ω)|2 .
ρ2
This implies that ∂x1 us ∈ L2 (Q), since c(ω) > 0. Similarly, ∂x2 us (·, ·, ω) ∈
L2 (Q). (5.24) is proven.
(ii) The statement follows from Definition 3.1.
Time-Dependent Approach to Uniqueness of the Sommerfeld Solution 473
6 Uniqueness
From the real and imaginary parts of the last identity, we obtain
2
s |2 + Im ω |
|∇ w ws |2 dx = Re s w
∂n w s dSR (6.1)
QR ∂B(R)∩Q
for Re ω = 0 and
− 2 Re ω Im ω |
ws |2 dx = Im ∂n ws ws dSR (6.2)
QR ∂B(R)∩Q
474 A. Merzon et al.
for Re ω = 0. Recall that we consider the case Im k = 0. Now, note that since
ωs ∈ H 1 (Q), there exist a monotonic sequence of positive numbers {Rj } such that
Rj → ∞ as j → ∞ and
lim s
∂n w s dSRj = 0.
w (6.3)
j →∞
∂B(Rj )∩Q
are finite. This fact, in particular, implies that there exist a monotonic sequence of
positive numbers Rj such that Rj → ∞ as j → ∞ and
2π 2π
|
ws (Rj , ϕ)| dϕ = 2
o(Rj−1 ), s (Rj , ϕ)|2 dϕ = o(Rj−1 ) as j → ∞.
|∂n w
0 0
2π 2π
∂n w ws (Ri , ϕ)dϕ ≤
s (Ri , ϕ) |∂n w
s (Ri , ϕ)
ws (Ri , ϕ)|dϕ
0 0
⎛ 2π ⎞1/2 ⎛ 2π ⎞1/2
s (Ri , ϕ)|2 dϕ ⎠ ⎝ |
≤ ⎝ |∂n w ws (Ri , ϕ)|2 dϕ ⎠
0 0
= o(Rj−1 ) as j → ∞,
Fig. 2 Uniqueness
for Re ω = 0 and
|
ws | dϕ = lim
2
|
ws |2 dϕ = 0
R→∞
Q QR
7 Conclusion
Acknowledgments The authors are grateful to CONACYT (México) and CIC (UMSNH) for
partial financial support. We are also grateful to anonymous referees for valuable comments.
8 Appendix 1
Lemma 8.1
(i) The functions Z (given by (4.2)) and ∂ϕ Z admit uniform with respect to ϕ ∈
[0, 2π] estimates
4 4
Z(β, ϕ) = − + + Ž(β, ϕ), ε± = 0 (8.2)
β + iε+ β + iε−
476 A. Merzon et al.
with
4i 4i
∂ϕ Z = − + + Ž1 (β, ϕ), ε± = 0, (8.4)
(β + iε+ ) 2 (β + iε− )2
with
Ž1 (β, ϕ) ∈ C ∞ (R×[0, 2π]), |Ž1 (β, ϕ)| ≤ C, β ∈ R×[0, 2π]. (8.5)
Proof
(i) For a = im, b = in, we have
− sinh(α/2)
coth a − coth b = .
sinh(b) sinh(a)
Hence for m = −π/8 + a/4 and n = −π/8 − a/4 we obtain the estimate (8.1)
for U (ζ ) given by (4.6) with respect to ζ . So (8.1) for Z follows from (4.5)
and (4.2).
(ii) From (4.5) and (4.6) it follows that the function Z admits the representation
where
β + i(ϕ± − ϕ)
Z± (β, ϕ) = ± coth ,
4
(8.6)
β − i ϕ± + ϕ
Z ± (β, ϕ) = ± coth .
4
4
Z± (β, ϕ) = ± + ޱ (β, ϕ), ϕ = ϕ± ,
β + iε±
and
ޱ (β, ϕ) ∈ C ∞ (R × [0, 2π]), |ޱ (β, ϕ)| ≤ C, (β, ϕ) ∈ R × [0, 2π].
Time-Dependent Approach to Uniqueness of the Sommerfeld Solution 477
Finally, by (1.4),
Moreover, since
1
eiωρ cosh β
K0 (β, ρ, ω, ε) := , K0 (ρ, ω, ε) := K(β, ρ, ω, ε) dβ, (8.7)
β + iε
−1
1
K1 (β, ρ, ω, ε) := cosh β · eiωρ cosh β , K1 (ρ, ω, ε):= K1 (β, ρ, ω, ε) dβ,
−1
(8.8)
1
eiωρ cosh β
K2 (β, ρ, ϕ, ε) := , K2 (ρ, ω, ε) := K2 (β, ρ, ω, ε) dβ dβ.
(β + iε)2
−1
Lemma 8.2 There exist C(ω) > 0, c(ω) > 0 such that the functions K0 , K1 , and
K2 satisfy the estimates
Proof It suffices to prove (8.9) for 0 < ε < ε0 , since the functions K0 , K1 , K2 are
odd with respect to ε, and for ε ≥ ε0 > 0 they satisfy the estimate
1
K0,1,2 (β, ρ, ω, ε) ≤ C(ε0 ) e−ω2 ρ dβ ≤ 2C(ε0)e−ω2 ρ .
−1
478 A. Merzon et al.
1 ω2
|h(β)| < , |ω1 ||h(β)| ≤ for |β| ≤ 2ε0 := r, (8.11)
4 4
and define the contour
where
−r 1
I1 (ρ, ω, ε) = K0 (β, ρ, ω, ε) dβ, I2 (ρ, ω, ε) = + K0 (β, ρ, ω, ε) dβ
γr −1 r
I1 (ρ, ω, ε) ≤ |dβ|
|β + iε|
γr
1 −ω2 ρ
≤ e e−ω2 ρ h(β)+iω1 ρ h(β) dβ , (8.14)
ε0
γr
Fig. 3 Contour γr
by (8.11). Finally,
e−ω2 ρ cosh β+iω1 ρ cosh β e−ω2 ρ
|I2 (ρ, ω, ε)| ≤ dβ ≤ ,
β + iε 2ε0 (ω)
[−1,−r]∪[r,1]
(8.16)
since |β + iε| ≥ 2ε0 , β ∈ [−1, −r] ∪ [r, 1]. From (8.14)–(8.16), we obtain
(8.9) for K0 .
(II) Let us prove (8.9) for K1 . Let h(β), ε0 (ω), γr be defined by (8.10)–(8.12).
Then we have by the Cauchy Theorem
K1 (ρ, ω, ε) := K1 (β, ρ, ω, ε) dβ
γr ∪[−1,r]∪[r,1]
by (8.11). Further, by (8.11) similarly to the proof of (8.14), (8.15), and using
(8.10), we get
|ω| 5 −ω2 ρ
K1 (β, ρ, ω, ε) dβ ≤ · e |e−ω2 ρ h(β) eiω1 ρ h(β) | |dβ|
ε0 4
γr γr
ω ρ
− 22
≤ C(ω)e . (8.18)
480 A. Merzon et al.
9 Appendix 2
Since ω ∈ C+ the integral (9.2) converges after differentiation with respect to ρ and
ϕ. We have
∂ρ Ad (ρ, ϕ, ω) = (iω) Z(β, ϕ) cosh β eiωρ cosh β dβ,
R
∂ρ2 Ad (ρ, ϕ, ω) = −ω2 Z(β, ϕ) cosh2 β eiωρ cosh β dβ.
R
and
1
(! + ω2 )ud (ρ, ϕ, ω) =∂ρ2 Ad (ρ, ϕ, ω) + ∂ρ Ad (ρ, ϕω)
ρ
1 2
+ ∂ Ad (ρ, ϕ, ω) + ω2 Ad (ρ, ϕ, ω) = 0.
ρ2 ϕ
References
1. J. Bernard, Progresses on the diffraction by a wedge: transient solution for line source
illumination, single face contribution to scattered field, anew consequence of reciprocity on
the spectral function. Rev. Tech. Thomson 25(4), 1209–1220 (1993)
2. J. Bernard, On the time domain scattering by a passive classical frequency dependent-shaped
region in a lossy dispersive medium. Ann. Telecommun. 49(11–12), 673–683 (1994)
3. J. Bernard, G. Pelosi, G. Manara, A. Freni, Time domains scattering by an impedance wedge
for skew incidence, in Proceeding of the Conference ICEAA (1991), pp. 11–14
4. V. Borovikov, Diffraction at Polygons and Polyhedrons (Nauka, Moscow, 1996)
5. L.P. Castro, D. Kapanadze, Wave diffraction by wedges having arbitrary aperture angle. J.
Math. Anal. Appl. 421(2), 1295–1314 (2015)
6. J. De la Paz Méndez, A. Merzon, DN-Scattering of a plane wave by wedges. Math. Methods
Appl. Sci. 34(15), 1843–1872 (2011)
7. J. De la Paz Méndez, A. Merzon, Scattering of a plane wave by “hard-soft” wedges, in
Recent Progress in Operator Theory and Its Applications. Operator Theory: Advances and
Applications, vol. 220 (2012), pp. 207–227
8. A. Dos Santos, F. Teixeira, The Sommerfeld problem revisited: solution spaces and the edges
conditions. Math. Anal. Appl. 143, 341–357 (1989)
9. T. Ehrhardt, A. Nolasco, F.-O. Speck, A Riemann surface approach for diffraction from rational
wedges. Oper. Matrices 8(2), 301–355 (2014)
10. A. Esquivel Navarrete, A. Merzon, An explicit formula for the nonstationary diffracted wave
scattered on a NN-wedge. Acta Appl. Math. 136(1), 119–145 (2015)
11. P. Grisvard, Elliptic Problems in Nonsmooth Domains (Pitman Publishing, London, 1985)
12. A. Heins, The Sommerfeld half-plane problem revisited I: the solution of a pair of complex
Wiener–Hopf integral equations. Math. Methods Appl. Sci. 4, 74–90 (1982)
13. I. Kay, The diffraction of an arbitrary pulse by a wedge. Commun. Pure Appl. Math. 6, 521–546
(1953)
14. J. Keller, A. Blank, Diffraction and reflection of pulses by wedges and corners. Commun. Pure
Appl. Math. 4(1), 75–95 (1951)
15. A. Komech, Elliptic boundary value problems on manifolds with piecewise smooth boundary.
Math. USSR-Sb. 21(1), 91–135 (1973)
16. A. Komech, Elliptic differential equations with constant coefficients in a cone. Mosc. Univ.
Math. Bull. 29(2), 140–145 (1974)
482 A. Merzon et al.
The author (S.F.) was partially supported by JSPS KAKENHI Grant Number 16K05257.
H. R. Moradi
Young Researchers and Elite Club, Mashhad Branch, Islamic Azad University, Mashhad, Iran
e-mail: [email protected]
S. Furuichi ()
Department of Information Science, College of Humanities and Sciences, Nihon University,
Setagaya-ku, Tokyo, Japan
e-mail: [email protected]
M. Sababheh
Department of Basic Sciences, Princess Sumaya University for Technology, Amman, Jordan
e-mail: [email protected]
1 Introduction
It is evident that a convex function is not necessarily operator convex, and the
function f (x) = x 4 provides such an example. Thus, a convex function does not
necessarily satisfy the operator Jensen inequality (1.3). However, it turns out that
a convex function satisfies the following operator version of the Mercer inequality
(1.2).
Theorem 1.2 ([5, Theorem 1]) Let A1 , . . . , An ∈ B (H ) be self-adjoint operators
in [m, M] and let 1 , . . . , n : B (H ) → B (K ) be positive linear
with spectra
maps with ni=1 i (1H ) = 1K . If f : [m, M] ⊆ R → R is a convex function,
then
n
f (M + m) 1K − i (Ai )
i=1
n
≤ (f (M) + f (m)) 1K − i (f (Ai )). (1.4)
i=1
Further, in the same reference, the following series of inequalities was proved
n
f (M + m)1K − i (Ai )
i=1
≤ (f (M) + f (m)) 1K
n
i=1 i(Ai ) − M1K m1K − ni=1 i (Ai )
+ f (m) + f (M)
M−m M −m
n
≤ (f (M) + f (m))1K − i (f (Ai )).
i=1
Later, related and analogous results have been established in [3, 4, 6].
Our main goal of this article is to present a refinement of the operator inequality
(1.4) without using convexity of f . Rather, using the idea by Mićić et al. [8], we
assume a boundedness condition on f . Then a discussion of log-convex version of
Mercer’s operator inequality will be presented.
486 H. R. Moradi et al.
2 Main Results
In this section we present our main results in two parts. In the first part, we discuss
the twice differentiable case, then we discuss the log-convex case.
We begin with the non-convex version of Theorem 1.2. We use the following symbol
in this paper.
(i) Asa = (A1 , . . . , An ), where Ai ∈ B (H ) are self-adjoint operators with
σ (Ai ) ⊆ [m, M] for some scalars 0 < m < M.
(ii) + = (1 , . . . , n ), where i : B (H ) → B (K ) are positive linear maps.
Theorem 2.1 Let A1 , . . . , An ∈ B (H ) be self-adjoint operators with spectra in
n M] and let 1 , . . . , n : B (H ) → B (K ) be positive linear maps with
[m,
i=1 i (1H ) = 1K . If f : [m, M] ⊆ R → R is a continuous twice differentiable
function such that α ≤ f ≤ β with α, β ∈ R, then
n
(f (M) + f (m)) 1K − i (f (Ai )) − βJ (m, M, Asa , + )
i=1
n
≤f (M + m)1K − i (Ai ) (2.1)
i=1
n
≤ (f (M) + f (m)) 1K − i (f (Ai )) − αJ (m, M, Asa , + ), (2.2)
i=1
where
n
J (m, M, Asa , + ) := (M + m) i (Ai ) − Mm1K
i=1
⎛ ⎞
1⎝
n 2
n
− i (Ai ) + i A2i ⎠ ≥ 0.
2
i=1 i=1
where
M −t t −m
Lf (t) := f (m) + f (M). (2.4)
M −m M −m
Letting
α 2
gα (t) := f (t) − t (m ≤ t ≤ M),
2
α
f (t) ≤ Lf (t) − (M + m) t − Mm − t 2 . (2.5)
2
Since m ≤ M + m − t ≤ M, we can replace t in (2.5) with M + m − t, to get
α
f (M + m − t) ≤ L0 (t) − (M + m) t − Mm − t 2 ,
2
where
n
m1K ≤ i (Ai ) ≤ M1K ,
i=1
we infer that
n
f (M + m)1K − i (Ai )
i=1
n
≤ L0 i (Ai )
i=1
⎧ n ⎫
α⎨
2⎬
n
− (M + m) i (Ai ) − Mm1K − i (Ai ) . (2.6)
2⎩ ⎭
i=1 i=1
m1H ≤ Ai ≤ M1H
488 H. R. Moradi et al.
in (2.5), we get
α
f (Ai ) ≤ Lf (Ai ) − (M + m) Ai − Mm1H − A2i .
2
Applying the positive linear maps i and adding in the last inequality yield
n
i (f (Ai ))
i=1
n
≤ L0 i (Ai )
i=1
α
n
n
− (M + m) i (Ai ) − Mm1K − i (A2i ) . (2.7)
2
i=1 i=1
m1H ≤ Ai ≤ M1H ,
we have
which implies
Thus we have
n
n
(M + m) i (Ai ) − Mm1K − (A2i ) ≥ 0. (2.8)
i=1 i=1
n
Further, noting that m ≤ i=1 i (Ai ) ≤ M, we also have
2
n
n
(M + m) i (Ai ) − mM1K − i (Ai ) ≥ 0. (2.9)
i=1 i=1
On the Operator Jensen-Mercer Inequality 489
β
Lf (t) − (M + m)t − Mm − t 2 ≤ f (t), m ≤ t ≤ M.
2
The details are left to the reader. This completes the proof.
In the following example, we present the advantage of using twice differentiable
functions in Theorem 2.1.
Example 2.2 Let f (t) = sin t (0 ≤ t ≤ 2π),
π
0
A= 4
π
0 2
and (A) = 12 T r [A]. Actually the function f (t) = sin t is concave on [0, π].
Letting m = π4 and M = π2 , we obtain
That is, (1.4) may fail without the convexity assumption. However, by considering
the weaker assumptions assumed in Theorem 2.1, we get
for any a, b ∈ I . f is called log-concave if the inequality above is reversed (that is,
when f1 is log-convex). By virtue of the arithmetic-geometric mean inequality, we
have
n
≤ (f (M) + f (m)) 1K − i (f (Ai )).
i=1
On the Operator Jensen-Mercer Inequality 491
3 Applications
In this section, we present some applications of the main results that we have shown
so far. First, we review and introduce the notations.
(i) A+ = (A1 , . . . , An ), where Ai ∈ B (H ) are positive invertible operators with
σ (Ai ) ⊆ [m, M] for some scalars 0 < m < M.
(ii) + = (1 , . . . , n ), where i : B (H ) → B (K ) are positive linear maps.
(iii) C ([m, M]) is the set of all real valued continuous functions on an interval
[m, M].
We also need to remind the reader that a function f ∈ C([m, M]) is called operator
monotone increasing (or operator increasing for short) if f (A) ≤ f (B) whenever
A, B are self-adjoint operators with spectra in [m, M] and such that A ≤ B. That
is, when f preserves the order of self-adjoint operator. A function f ∈ C([m, M])
is said to be operator decreasing if −f is operator monotone.
The so called operator quasi-arithmetic mean of Mercer’s type was defined in [5]
as follows:
n
Mϕ A+ , + := ϕ −1 (ϕ (M) + ϕ (m)) 1K − i (ϕ (Ai )) .
i=1
where
K m, M, ϕ, A+ , +
n
:= (ϕ(M) + ϕ(m)) i (ϕ(Ai )) − ϕ(M)ϕ(m)1K
i=1
⎛ ⎞
1⎝
n 2
n
− i (ϕ(Ai )) + i ϕ(Ai )2 ⎠ .
2
i=1 i=1
(ii) If ψ ◦ ϕ −1 ≤ β with β ∈ R and ψ −1 is operator monotone, then the reverse
inequality is valid in (3.2) with β instead of α.
Proof Let f = ψ ◦ ϕ −1 in (2.2) and replace Ai , m and M with ϕ (Ai ), ϕ (m) and
ϕ (M) respectively. This implies
ϕ A+ , +
ψ M ψ A+ , +
≤ψ M − αK m, M, ϕ, A+ , + .
Since ψ −1 is operator monotone, the first conclusion follows immediately. The other
case follows in a similar manner from (2.1).
Similarly, Theorem 2.4 implies the following version.
Theorem 3.3 Let ϕ, ψ ∈ C([m, M]) be two strictly monotonic functions. If ψ ◦ϕ −1
is log-convex function and ψ −1 is operator increasing, then
ϕ (A+ , + )
M
n
ϕ(M)1K − n
i=1 i (ϕ(Ai ))−ϕ(m)1K i=1 i (ϕ(Ai ))
−1
≤ψ [ψ(m)] ϕ(M)−ϕ(m) [ψ(M)] ϕ(M)−ϕ(m)
ψ A+ , + .
≤M
Example 3.5 It is sufficient to compare (2.5) and the first inequality of (2.11). We
take m = 1 and M = 3. Setting
Some calculations show that g(2) ≈ −0.0052909 when p = −0.2, while g(2) ≈
0.0522794 when p = −1. We thus conclude that there is no ordering between the
RHS of inequality in (2.5) and the RHS of first inequality of (2.11).
Acknowledgments The authors would like to thank the referees for their careful and insightful
comments to improve our manuscript. The authors are also grateful to Dr. Trung Hoa Dinh for
fruitful discussion and revising the manuscript.
References
Somayeh Nemati
1 Introduction
S. Nemati ()
Department of Applied Mathematics, Faculty of Mathematical Sciences, University of
Mazandaran, Babolsar, Iran
Center for Research and Development in Mathematics and Applications (CIDMA),
Department of Mathematics, University of Aveiro, Aveiro, Portugal
e-mail: [email protected]; [email protected]
many real world phenomena in physics, mechanics, control, and signal processing
have been described by this approach [2, 13–15].
Hat functions are defined on the interval [0, 1] and are continuous with shape hats
[17]. In the last few years, a modification of hat functions has been introduced and
used for solving some different kinds of problems. For instance: two-dimensional
linear Fredholm integral equations [5], integral equations of Stratonovich–Volterra
[6] and Volterra–Fredholm type [7], fractional integro-differential equations [8],
fractional pantograph differential equations [9], fractional optimal control problems
[10] and systems of fractional differential equations [11]. These functions are
defined on the interval [0, T ], with T > 0. In this paper, we consider a general
definition of these basis functions on [a, b], with b > a, called generalized modified
hat functions (GMHFs). These basis functions are utilized to present a numerical
method to approximate the variable-order fractional integral of a given function. For
y : [a, b] → R and a real function α(t) such that 0 < α(t) ≤ 1 for all t ∈ [a, b],
the left Riemann–Liouville fractional integral operator of order α(t) of y is defined
by (see, e.g., [1])
t
α(t ) 1
a It y(t) = (t − s)α(t )−1y(s)ds, t > a,
(α(t)) a
if i is odd and 1 ≤ i ≤ m − 1:
⎧
⎪ −1
⎨ h2 (t − (a + (i − 1)h)) (t − (a + (i + 1)h)) ,
ψi (t) = if a + (i − 1)h ≤ t ≤ a + (i + 1)h, (2.2)
⎪
⎩ 0, otherwise,
if i is even and 2 ≤ i ≤ m − 2:
⎧ 1
⎪
⎪ (t − (a + (i − 1)h)) (t − (a + (i − 2)h)) ,
⎪
⎪ 2h2
⎪
⎨ it a + (i − 2)h ≤ t ≤ a + ih,
ψi (t) = 2h1 2 (t − (a + (i + 1)h)) (t − (a + (i + 2)h)) , (2.3)
⎪
⎪
⎪
⎪ if a + ih ≤ t ≤ a + (i + 2)h,
⎪
⎩
0, otherwise,
and
⎧
⎪
⎨ 2h2 (t − (b − h)) (t − (b − 2h)) ,
1
ψm (t) = if b − 2h ≤ t ≤ b, (2.4)
⎪
⎩ 0, otherwise.
The GMHFs build a set of linearly independent functions in the space L2 [a, b].
Figure 1 displays a set of the GMHFs obtained by m = 4 and defined on the interval
[−1, 1].
The following properties can be easily investigated using the definition of the
GMHFs:
m
1, i = j,
ψi (a + j h) = ψi (t) = 1, t ∈ [a, b],
0, i = j,
i=0
0, if i is even and |i − j | ≥ 3,
ψi (t)ψj (t) =
0, if i is odd and |i − j | ≥ 2.
m
y(t) 2 ym (t) = yi ψi (t) = Y T "(t), (2.5)
i=0
where
and
Y = [y0 , y1 , . . . , ym ]T ,
with
We have the following theorem for the error of the function approximation using
the GMHFs.
Theorem 2.1 If a function y ∈ C 3 ([a, b]) is approximated by the family of first
(m + 1) GMHFs as defined by (2.1)–(2.4), then
where h = b−a
m , and ym is defined by (2.5).
Proof It can be proved in a same way as done in [8, Theorem 4.1].
3 Numerical Method
In this section, we propose a numerical method based on the GMHFs for computing
the left Riemann–Liouville variable-order integral of a given function.
α(t )
m
a It ψi (t) 2 α
a Pij ψj (t), i = 0, 1, . . . , m,
j =0
500 S. Nemati
α(t )
wherein according to (2.7), a Pijα is the value of a It ψi (t) at a + j h. That is
a+j h
1
α
a Pij = (a + j h − s)α(a+j h)−1 ψi (s)ds, i, j = 0, 1, . . . , m.
(α(a + j h)) a
(3.1)
hα(a+j h)
α
a P0j =
2(α(a + j h) + 3)
2hα(a+ih)
α
a Pii = (1 + α(a + ih)), (3.6)
(α(a + ih) + 3)
2hα(a+j h)
α
a Pij =
(α(a + j h) + 3)
hα(a+(i−1)h)
α
a Pi,i−1 = [−α(a + (i − 1)h)] , (3.9)
2(α(a + (i − 1)h) + 3)
hα(a+ih)
α
a Pii = 2α(a+ih)+1(2 − α(a + ih)) , (3.10)
2(α(a + ih) + 3)
hα(a+(i+1)h)
α
a Pi,i+1 =
2(α(a + (i + 1)h) + 3)
hα(a+j h)
α
a Pij =
2(α(a + j h) + 3)
α(t )
where Pa is an upper Hessenberg matrix of dimension (m + 1) × (m + 1)
called operational matrix of variable-order integration of order α(t) in the Riemann-
Liouville sense, and is given using (3.2)–(3.12) as follows:
⎡ α Pα α α α α ⎤
0 a P01 a 02 a P03 a P04 ... a P0(m−1) a P0m
⎢0 Pα Pα α α α α ⎥
⎢ a 11 a 12 a P13 a P14 ... a P1(m−1) a P1m ⎥
⎢ ⎥
⎢ 0 a P21
α Pα
a 22
α
a P23
α
a P24 ... α
a P2(m−1)
α
a P2m ⎥
⎢ ⎥
⎢0 0 0 α
a P33
α
a P34 ... α
a P3(m−1)
α
a P3m ⎥
⎢ ⎥
Paα(t ) = ⎢0 0 α α α α ⎥. (3.14)
⎢ 0 a P43 a P44 ... a P4(m−1) a P4m ⎥
⎢. . ⎥
⎢. . .. .. .. .. .. ⎥
⎢. . . . . . . ⎥
⎢ ⎥
⎣0 0 0 0 0 α
. . . a P(m−1)(m−1) a P(m−1)m ⎦
α
0 0 0 0 0 α
. . . a Pm(m−1) α
a Pmm
502 S. Nemati
The aim of this section is to give an error bound for the numerical estimation of the
left Riemann–Liouville variable-order integral operator of an arbitrary function in
the sense of L2 -norm. To this aim, we consider the following L2 -norm for a function
f ∈ L2 [a, b]:
b 12
2
f 2 = |f (t)| dt .
a
Proof By assuming 0 < α(t) ≤ 1, for all t ∈ [a, b], we consider the following
α(t )
definition of the norm for the operator a It ,
where we have used f 2 = 1. Since we have (t) > 0.8 for all t > 0, we can
write (α(t) + 1) > 45 . Therefore, we get
; ;
; (t − a)α(t ) ; 5; ;
; ; ; ;
; ; < ;(t − a)α(t ); ,
; (α(t) + 1) ; 4 2
2
which together with 0 < α(t) ≤ 1 help us to have one of the following statements:
1. If 0 < t − a < 1, then
; ;
; (t − a)α(t ) ;
; ; 5 5 1
; ; < 12 = (b − a) 2 .
; (α(t) + 1) ; 4 4
2
2. If 1 ≤ t − a ≤ b − a, then
; ;
; (t − a)α(t ) ;
; ; 5 5 3
; ; < t − a2 = √ (b − a) 2 .
; (α(t) + 1) ; 4 4 3
2
a Itα(t ) 2 < C.
α(t )
We use the boundedness property of a It to continue the proof. Using
Theorem 2.1, we have
y − ym 2 = O(h3 ) (3.15)
and
; ;
; T α(t ) ;
;Y a It " − Y T Paα(t ) " ; = O(h3 ). (3.16)
2
= O(h ),
3
In this section, in order to illustrate the accuracy and applicability of the proposed
method, we consider two functions and employ the method for computing the
left Riemann–Liouville variable-order integral of these functions. We note that the
method was carried out using Mathematica 12. To show the accuracy, the l 2 norm
of the error and the convergence order are defined, respectively, by
1
1
m 2
Em = (J (ti ) − Jm (ti ))2 , &m = log2 (Em /E2m ) ,
m
i=1
∞
(a1 )k . . . (ap )k zk
p Fq a1 , . . . , ap ; b1 , . . . , bq ; z = ,
(b1 )k . . . (bq )k k!
k=0
Fig. 2 The l 2 norm of the error for some selected values of m in logarithmic scale
In Table 1, the l 2 norm of the error and the convergence order are presented which
confirm the O(h3 ) accuracy of this method. Furthermore, in Fig. 2, the results for
the l 2 norm of the error are plotted in a logarithmic scale.
506 S. Nemati
Let y(t) ∈ C 1 [a, b] be the exact solution of (4.1), then an approximation of y (t)
based on the GMHFs can be considered as follows:
where
C α(t ) 1−α(t )
a Dt y(t) = a It y (t) 2 C T Pa1−α(t ) "(t). (4.4)
A Numerical Approach for Approximating Variable-Order Integral 507
n
n
f (t, y(t)) 2 f (ti , y(ti ))ψi (t) = f (ti , yi )ψi (t) = F (, Y )"(t), (4.5)
i=0 i=0
Here, we employ the proposed method for solving three examples to demonstrate
the applicability and accuracy of our new method.
Example 4.3 As the first example, consider the problem (4.1) with 0 ≤ t ≤ 1 and
[4]
(4) (3)
f (t, y) = t 3 + t 3−α(t ) + t 2 + t 2−α(t ) − y.
(4 − α(t)) (3 − α(t))
Fig. 3 Plots of the error with m = 4, 8, 16, 32 for Example 4.3 with α(t) = sin t
S. Nemati
A Numerical Approach for Approximating Variable-Order Integral 509
Table 2 Numerical results for the L2 -error and convergence order for Example 4.3 with α = 0.75
Present method Method of [4]
m L2 -error Convergence order CPU time Convergence order
10 2.76e−4 3.00 0.000 1.79
20 3.45e−5 3.00 0.016 2.07
40 4.31e−6 3.00 0.016 2.17
80 5.39e−7 2.99 0.062 2.21
160 6.74e−8 3.00 0.234 2.23
320 8.42e−9 3.00 0.906 2.24
640 1.05e−9 – 2.828 –
RAM using Mathematica 12. The Mathematica function FindRoot was used for
solving the resulting systems.
The exact solution of this problem is t 7/2 . By setting α(t) = 1 − 0.5 exp(−t), the
numerical results of employing the proposed method for solving this example are
given in Fig. 4 and Table 3.
Example 4.5 As the last example, consider the following Riccati fractional differ-
ential equation [3, 12]
C α(t )
0 Dt y(t) = 2y(t) − y 2 (t) + 1, 0 < α ≤ 1, 0 ≤ t ≤ 1,
Fig. 4 Plots of the error with m = 4, 8, 16, 32 for Example 4.4 with α(t) = 1 − 0.5 exp(−t)
S. Nemati
A Numerical Approach for Approximating Variable-Order Integral 511
Fig. 5 Numerical solutions with m = 10 and different values of α together with the exact solution
with α = 1 for Example 4.5
512 S. Nemati
5 Concluding Remarks
A new numerical technique has been introduced for computing the left Riemann–
Liouville variable-order integral of a given function. A generalized class of the
modified hat functions (GMHFs) has been considered and used to suggest our new
scheme. The given function is easily expanded using the GMHFs. An operational
matrix of variable-order integral of the basis vector is computed and used to
approximate the integral of the function under consideration. The convergence
order of our numerical method is proved and confirmed by the results of two
illustrative examples. Finally, this new technique is employed to solve variable-
order differential equations and the numerical results demonstrate the efficiency of
the method.
Acknowledgement The author is grateful to two anonymous referees for several positive and
constructive comments, which helped her to improve the manuscript.
References
13. T. Odzijewicz, A.B. Malinowska, D.F.M. Torres, Fractional variational calculus of variable
order, in Advances in Harmonic Analysis and Operator Theory. Operator Theory: Advances
and Applications, vol. 229 (Birkhäuser, Basel, 2013)
14. P.W. Ostalczyk, P. Duch, D.W. Brzeziński, D. Sankowski, Order functions selection in the
variable, fractional-order PID controller, in Advances in Modelling and Control of Non-integer-
Order Systems. Lecture Notes in Electrical Engineering, vol. 320 (2015), pp. 159–170.
15. M.R. Rapaić, A. Pisano, Variable-order fractional operators for adaptive order and parameter
estimation. IEEE Trans. Autom. Control 59(3), 798–803 (2014)
16. S.G. Samko, B. Ross, Integration and differentiation to a variable fractional order. Integral
Transform. Spec. Funct. 1, 277–300 (1993)
17. M.P. Tripathi, V.K. Baranwal, R.K. Pandey, O.P. Singh, A new numerical algorithm to solve
fractional differential equations based on operational matrix of generalized hat functions.
Commun. Nonlinear Sci. Numer. Simul. 18, 1327–1340 (2013)
Langlands Reciprocity for C ∗-Algebras
Igor V. Nikolaev
1 Introduction
The Langlands conjectures say that all zeta functions are automorphic [9]. In
this note we study (one of) the conjectures in terms of the C ∗ -algebras [5].
Namely, denote by G(AK ) a reductive group G over the ring of adeles AK of a
number field K and by G(K) its discrete subgroup over K. The Banach algebra
L1 (G(K)\G(AK )) consists of the integrable complex-valued functions endowed
with the operator norm. The addition of functions f1 , f2 ∈ L1 (G(K)\G(AK )) is
defined pointwise and the multiplication is given by the convolution product:
(f1 ∗ f2 )(g) = f1 (gh−1 )f2 (h)dh. (1.1)
G(K)\G(AK )
I. V. Nikolaev ()
Department of Mathematics, St. John’s University, New York, NY, USA
where Mod is the category of graded left modules over the graded ring B(V , L, σ ),
T ors the full subcategory of Mod of the torsion modules and Coh the category of
quasi-coherent sheaves on the variety V [18, p. 180]. The norm-closure of a self-
adjoint representation of the ring B(V , L, σ ) by the linear operators on a Hilbert
space is called the Serre C ∗ -algebra of V [10]. In what follows, we shall focus on
the case when V is defined over a number field K, i.e. V is an arithmetic variety.
The corresponding Serre C ∗ -algebra is denoted by AV . The Hasse-Weil L-function
of V was calculated in [10] in terms of the K-theory of algebra AV .
It is known that the Langlands philosophy does not distinguish between the
arithmetic and automorphic objects [9]. Therefore one can expect a regular map
between the C ∗ -algebras AV and AG , provided V is a G-coherent variety, see
Definition 1.1. We prove that such a map is an embedding AV +→ AG . To give
an exact statement, we shall need the following notions. The i-th trace cohomology
{Htir (V ) | 0 ≤ i ≤ 2 dimC V } of an arithmetic variety V is an additive abelian
subgroup of R obtained from a canonical trace on the Serre C ∗ -algebra of V [10].
Likewise, the group K0 (AG ) of the stationary AF-algebra AG is an additive abelian
subgroup of R [6, Chapter 6].
Definition 1.1 The arithmetic variety V is called G-coherent, if
2 Preliminaries
This section is a brief account of preliminary facts involved in our paper; we refer
the reader to [2, 5, 9, 18].
2.1 AF-Algebras
Recall that B(H) is endowed with a ∗-involution; the involution comes from the
scalar product on the Hilbert space H. We shall call representation (2.1) ∗-coherent,
if (i) ρ(t) and ρ(t −1 ) are unitary operators, such that ρ ∗ (t) = ρ(t −1 ) and (ii) for
all b ∈ R it holds (ρ ∗ (b))σ (ρ) = ρ ∗ (bσ ), where σ (ρ) is an automorphism of ρ(R)
induced by σ . Whenever B = R[t, t −1 ; σ ] admits a ∗-coherent representation, ρ(B)
Langlands Reciprocity for C ∗ -Algebras 519
i∗
H even (V ) ⊗ Q K0 (AV ⊗ K) ⊗ Q H odd (V ) ⊗ Q
τ∗
where |V (Fpr )| is the number of points of variety V (Fpr ) defined over the field with
pr elements. It is known that:
E
where Li (s, V ) = p Pi (p−s ); the L(s, V ) is called the Hasse-Weil (or motivic)
L-function of V .
On the other hand, if K is a number fieldE then the adele ring AK of K is a
locally compact subring of the direct product Kv taken over all places v of K;
the AK is endowed with a canonical topology. Consider a reductive group G(AK )
over AK ; the latter is a topological group with a canonical discrete subgroup G(K).
Denote by L2 (G(K)\G(AK )) the Hilbert space of all square-integrable complex-
valued functions on the homogeneous space G(K) \ G(AK ) and consider the right
regular representation R of the locally compact group G(AK ) by linear operators
on the space L2 (G(K)\G(AK )) given by formula (1.1). It is well known, that each
irreducible component π of the unitary representation R can be written in the form
π = ⊗πv , where v are all unramified places of K. Using the spherical functions,
one gets an injection πv → [Av ], where [Av ] is a conjugacy class of matrices in the
group GLn (C). The automorphic L-function is given by the formula:
/ 4 5−1
L(s, π) = det In − [Av ](Nv)−s , s ∈ C, (2.5)
v
where Nv is the norm of place v; we refer the reader to [9, p. 170] and [8, p. 201]
for details of this construction.
The following conjecture relates the Hasse-Weil and automorphic L-functions.
Conjecture 2.2 ([9]) For each 0 ≤ i ≤ 2n there exists an irreducible representa-
tion πi of the group G(AK ), such that Li (s, V ) ≡ L(s, πi ).
Langlands Reciprocity for C ∗ -Algebras 521
3 Proofs
K × \A×
K /CK −→ Gal (K |K),
ab
(3.1)
lm
/
K × \A× ∼ Z/piki Z ,
K /CK = lim
← −
(3.2)
i=1
k
where m → ∞. Notice that the cyclic group Z/pi i Z can be embedded into the
k k
finite field Fqi , where qi = pi i . Thus the group GLn (Z/pi i Z) is correctly defined
and from (3.2) one gets an isomorphism
/
lm
GLn (K × \A× ∼
K /CK ) = lim
← −
GLn (Fqi ), (3.3)
i=1
E j
where GLn (Fqi ) is a finite group of order n−1 j =0 (qi − qi ) and such a group is no
n
longer abelian. In particular, it follows from (3.3) that the GLn (K × \A× K /CK ) is a
profinite group.
(i) Let us show that the group GLn (K × \A× K /CK ) being profinite implies that the
AG is an AF-algebra. Indeed, if G is a finite group then the group algebra C[G]
has the form
C[G] ∼
= Mn1 (C) ⊕ · · · ⊕ Mnh (C),
522 I. V. Nikolaev
GLn (K × \A× ∼
K /CK ) = lim
←
G,
− i
(3.4)
C[Gi ] ∼
= Mn(i)
1
(C) ⊕ · · · ⊕ Mn(i)
h
(C) (3.5)
AG Z ∼
= OB ⊗ K,
to the action of continuous symmetry group GLn (AK ) on the homogeneous space
GLn (K)\GLn (AK ). This observation can be applied to prove Weil’s conjecture on
the Tamagawa numbers.
Lemma 3.3 The algebra AV embeds into the AF-algebra AG , where V is a G-
coherent variety.
Proof We shall use the Pimsner’s Theorem [13, Theorem 7] about an embedding
of the crossed product algebra AV into an AF-algebra. It will develop that the G-
coherence of V implies that the AF-algebra is Morita equivalent to the algebra AG
of Lemma 3.1. We pass to a detailed argument.
Let V be a complex projective variety. Following [13] we shall think of V as
a compact metrizable topological space X. Recall that for a homeomorphism ϕ :
X → X the point x ∈ X is called non-wandering if for each neighborhood U of x
and every N > 0 there exists n > N such that
ϕ n (U ) ∩ U = ∅.
(In other words, the point x does not “wander” too far from its initial position in the
space X.) If each point x ∈ X is a non-wandering point, then the homeomorphism
ϕ is called non-wandering.
Let σ : V → V be an automorphism of finite order of the G-coherent variety V ,
such that the representation (2.1) is ∗-coherent. Then the crossed product
AV = C(V ) σ Z
AV +→ A, (3.7)
On the other hand, the trace cohomology of a G-coherent variety V must satisfy an
inclusion
K0 (A) ∼
= K0 (AG ). (3.11)
In other words, the AF-algebras A and AG are Morita equivalent. The embedding
AV +→ AG follows from formulas (3.7) and (3.11). Lemma 3.3 is proved.
Theorem 1.3 follows from Lemma 3.3.
Corollary 1.5 follows from an observation that the Frobenius action σ (F rpi ) :
Htir (V ) → Htir (V ) extends to a Hecke operator Tp : K0 (AG ) → K0 (AG ),
whenever Htir (AV ) ⊆ K0 (AG ). Let us pass to a detailed argument.
Recall that the Frobenius map on the i-th trace cohomology of variety V is given
by an integer matrix σ (F rpi ) ∈ GLbi (Z), where bi is the i-th Betti number of V ;
moreover,
2n
|V (Fp )| = (−1)i tr σ (F rpi ), (3.12)
i=0
where V (Fp ) is the reduction of V modulo a good prime p [10]. (Notice that
(3.12) is sufficient to calculate the Hasse-Weil L-function L(s, V ) of variety V via
Eq. (2.2); hence the map σ (F rpi ) : Htir (V ) → Htir (V ) is motivic.)
Definition 3.4 Denote by Tpi an endomorphism of K0 (AG ), such that the diagram
in Fig. 2 is commutative, where ι is the embedding (1.2). By Hi we understand the
algebra over Z generated by the Tpi ∈ End (K0 (AG )), where p runs through all but
a finite set of primes.
ι ι
? Tpi ?
K0 (AG ) - K0 (AG )
Langlands Reciprocity for C ∗ -Algebras 525
Remark 3.5 The algebra Hi is commutative. Indeed, the endomorphisms Tpi cor-
respond to multiplication of the group K0 (AG ) by the real numbers; the latter
commute with each other. We shall call the {Hi | 0 ≤ i ≤ 2n} an i-th Hecke algebra.
Lemma 3.6 The algebra Hi defines an irreducible representations πi of the group
G(AK ).
Proof Let f ∈ L2 (G(K)\G(AK )) be an eigenfunction of the Hecke operators
Tpi ; in other words, the Fourier coefficients cp of the function f coincide with the
eigenvalues of the Hecke operators Tp up to a scalar multiple. Such an eigenfunction
is defined uniquely by the algebra Hi .
Let Lf ⊂ L2 (G(K)\G(AK )) be a subspace generated by the right translates
of f by the elements of the locally compact group G(AK ). It is immediate (see
e.g. [8, Example on p. 197]), that Lf is an irreducible subspace of the space
L2 (G(K)\G(AK )); therefore it gives rise to an irreducible representation πi of the
locally compact group G(AK ). Lemma 3.6 follows.
Lemma 3.7 L(s, πi ) ≡ Li (s, V ).
Proof Recall that the function Li (s, V ) can be written as
/ −1
Li (s, V ) = det In − σ (F rpi )p−s , (3.13)
p
where σ (F rpi ) ∈ GLbi (Z) is a matrix form of the action of F rpi on the trace
cohomology Htir (V ).
On the other hand, from (2.5) one gets
/ −1
L(s, πi ) = det In − [Aip ]p−s , (3.14)
p
for all but a finite set of primes p. Comparing formulas (3.13)–(3.15), we get that
L(s, πi ) ≡ Li (s, V ). Lemma 3.7 follows.
Corollary 1.5 follows from Lemma 3.7 and formula (2.4).
526 I. V. Nikolaev
4 Example
We shall illustrate Theorem 1.3 and Corollary 1.5 for the group
G∼
= GL2 (AK ),
√
where K = Q( D) is a real quadratic field.
Proposition 4.1 K0 (AG ) ∼
= Z + Zω, where
√
1+ D
, if D ≡ 1 mod 4,
ω= √ 2 (4.1)
D, if D ≡ 2, 3 mod 4.
Proof By Lemma 3.1 and Remark 3.2, the AG is a stationary AF-algebra given by
partial multiplicity matrix B ∈ SL2 (Z). In particular, K0 (AG ) ∼
= Z + Zω, where
ω ∈ Q(λB ), where λB is the Perron-Frobenius eigenvalue of matrix B. Moreover,
by the construction End (K) ∼ = End (K0 (AG )), where End is the endomorphism
ring of the corresponding Z-module. But End (K) ∼ = OK , where OK is the ring
of integers of K. Thus, λB ∈ K and ω is given by formula (4.1). Proposition 4.1
follows.
Proposition 4.2 Let ECM ∼ = C/Ok be an elliptic curve with complex
√ multiplication
by the ring of integers of the imaginary quadratic field k = Q( −D). Then ECM is
a G-coherent variety of the group G ∼ = GL2 (AK ).
Proof The noncommutative torus Aθ is a C ∗ -algebra generated by the unitary
operators u and v satisfying the commutation relation vu = e2πiθ uv for a constant
θ ∈ R [15]. The Serre C ∗ -algebra of an elliptic curve Eτ ∼ = C/(Z + Zτ ) is
isomorphic to Aθ for any {τ | I m τ > 0}, see [12, Theorem 1.3.1]. In particular
[11], if τ ∈ Ok then
Ht0r (ECM ) = Ht2r (ECM ) ∼
= Z,
(4.2)
Ht1r (ECM ) ∼
= Z + Zω.
E −1
p det (I2 − σ (F rp1 )p−s
L(s, ECM ) = , s ∈ C, (4.3)
ζ(s)ζ(s − 1)
where ζ (s) is the Riemann zeta function and the product is taken over the set of
good primes; we refer the reader to formula (3.13). It is immediate that
L(s, π0 ) = ζ(s),
L(s, π2 ) = ζ(s − 1),
But formula (3.15) says that [A1p ] = σ (F rp1 ) and therefore the numerator of (4.3)
coincides with the L(s, π1 ). Proposition 4.4 is proved.
Remark 4.5 Proposition 4.4 can be proved in terms of the Grössencharacters [17,
Chapter II, §10].
Acknowledgments I thank the referees for their interest and helpful comments on the draft of this
paper.
References
9. R.P. Langlands, L-functions and automorphic representations, in Proceedings of the ICM 1978,
Helsinki (1978), pp. 165–175
10. I.V. Nikolaev, On traces of Frobenius endomorphisms. Finite Fields Appl. 25, 270–279 (2014)
11. I.V. Nikolaev, On a symmetry of complex and real multiplication. Hokkaido Math. J. 45, 43–51
(2016)
12. I.V. Nikolaev, Noncommutative Geometry. De Gruyter Studies in Mathematics, vol. 66 (De
Gruyter, Berlin, 2017)
13. M.V. Pimsner, Embedding some transformation group C ∗ -algebras into AF-algebras. Ergodic
Theory Dyn. Syst. 3, 613–626 (1983)
14. M.V. Pimsner, D.V. Voiculescu, Imbedding the irrational rotation C ∗ -algebra into an AF-
algebra. J. Oper. Theory 4, 201–210 (1980)
15. M.A. Rieffel, Non-commutative tori – a case study of non-commutative differentiable mani-
folds. Contemp. Math. 105, 191–211 (1990)
16. J.-P. Serre, Représentations Linéaires des Groupes Finis (Hermann, Paris, 1967)
17. J.H. Silverman, Advanced Topics in the Arithmetic of Elliptic Curves. GTM, vol. 151 (Springer,
Berlin, 1994)
18. J.T. Stafford, M. van den Bergh, Noncommutative curves and noncommutative surfaces. Bull.
Amer. Math. Soc. 38, 171–216 (2001)
Compact Sequences in Quasifractal
Algebras
Steffen Roch
1 Introduction
Let H be a Hilbert space and (Pn )n∈N a sequence of orthogonal projections of finite
rank which converges strongly to the identity operator on H . Let F denote the set
of all bounded sequences (An )n≥1 of operators An ∈ L(im Pn ) and G the set of all
sequences (An ) ∈ F with An → 0. Provided with the operations
and the norm (An ) := sup An , F becomes a unital C ∗ -algebra and G a closed
ideal of F . Let δ(n) denote the rank of Pn . Then one can identify L(im Pn ) with
the C ∗ -algebra Cn := Mδ(n) (C) of the complex δ(n) × δ(n) matrices, and F and G
can be identified with the direct product and the direct sum of the sequence (Cn ),
S. Roch ()
Department of Mathematics, Technical University of Darmstadt, Darmstadt, Germany
e-mail: [email protected]
respectively. In what follows we will think of F and G in this way (although some
results hold for products and sums of sequences of general C ∗ -algebras as well).
The quotient algebra F /G plays a significant role in numerical analysis, which
stems from the observation that asymptotic properties of the sequence (An ) ∈ F
can often be rephrased as a property of the coset (An ) + G. To mention only two
examples, it is
for every sequence (An ) ∈ F , and the coset (An )+G is invertible in F /G if and only
if the An are invertible for all sufficiently large n and if the norms of their inverses
are uniformly bounded, which is equivalent to saying that (An ) is a stable sequence.
Fractal and quasifractal algebras The correspondence between asymptotic prop-
erties of the sequence (An ) ∈ F and properties of the coset (An ) + G is particularly
close when the sequence (An ) belongs to a fractal subalgebra of F . For example, if
(An ) is a sequence in a fractal subalgebra of F , then the limit lim An exists, and
the sequence (An ) is stable if one of its (infinite) subsequences is stable.
The perhaps simplest way to define fractal algebras is the following (which is
equivalent to the original definition in [15]):
A C ∗ -subalgebra A of F is fractal if every partial zero sequence in A is a zero sequence,
i.e., if every sequence (An ) ∈ A with lim inf An = 0 satisfies lim An = 0.
Not every subalgebra of F is fractal, but it turns out that every separable
subalgebra of F has a fractal restriction. Here is the precise statement of the fractal
restriction theorem from [9] (see [12] for a shorter proof):
If A is a separable C ∗ -subalgebra of F , then there is an infinite subset M of N such that the
algebra A|M of all restricted sequences (An )n∈M is a fractal subalgebra of F |M .
Compact sequences We will need several notions of the compactness and of the
rank of an element in a general C ∗ -algebra and, in particular, of a sequence in F .
These notions can be subsumed under the following simple scheme.
Let A be a unital C ∗ -algebra and J a non-empty self-adjoint subset of A such
that AJ ⊆ J and J A ⊆ J . We call the subsets with these properties the semi-
ideals of A. Clearly, 0 ∈ J . Every semi-ideal J induces a rank function on A, i.e.,
a function r : A → N ∪ {∞} which satisfies
(a) r(a) = 0 if and only if a = 0,
(b) r(a + b) ≤ r(a) + r(b),
(c) r(ab) ≤ min{r(a), r(b)},
(d) r(a) = r(a ∗ )
for all a, b ∈ A, as follows: Set r(0) := 0. If a nonzero a ∈ A is a finite sum of
elements of J , then r(a) is the smallest positive integer such that a can be written
as a sum of r(a) elements from J . Finally, r(a) := ∞ if a is not a finite sum of
elements of J . The closure of the set of the elements with finite rank is a closed
ideal of A, called the ideal of the compact elements (relative to J ). We will reify
this scheme in several settings:
1. Let A be a unital C ∗ -algebra and J the set of all elements k ∈ A with the
property that for every a ∈ A there is a number α ∈ C such that kak = αk.
Then J is a semi-ideal; the associated rank function is called the algebraic rank
and denoted by alg rank a, and the associated ideal of the compact elements is
denoted by C(A).
2. For A = F , consider the set J of all sequences (Kn ) ∈ F such that rank Kn ≤
1 for all n ∈ N. Then J is a semi-ideal of F ; the associated rank function
is called the sequential rank and denoted by seq rank (An ), and the associated
ideal of the compact elements is denoted by K. The elements of K are called the
compact sequences. It is not hard to show that G ⊆ K and that seq rank (An ) =
supn rank An for every sequence (An ) ∈ F .
3. If A is a C ∗ -subalgebra of F and J is the set of all sequences (Kn ) in A such
that rank Kn ≤ 1 for all n ∈ N, then we denote the corresponding ideal of the
compact sequences by K(A). Clearly, K = K(F ).
4. Let J be the set of all cosets (Kn ) + G of sequences (Kn ) ∈ F with
seq rank (Kn ) ≤ 1. Then J is a semi-ideal of F /G and the corresponding ideal
of the compact elements is nothing but K/G. If r denotes the rank function
associated with J then we call ess rank (An ) := r((An )+G) the essential rank of
(An ) ∈ F . In particular, the sequences of essential rank 0 are just the sequences in
G. The essential rank of a sequence (An ) ∈ F can be characterized as the smallest
integer r ≥ 0 such that (An ) can be written as (Gn ) + (Kn ) with (Gn ) ∈ G and
supn rank Kn = r. The advantage of the essential rank of a sequence (An ) over
its sequential rank is that it depends on the coset of (An ) modulo G only.
See Sections 4.1–4.4 in [11] for some basic facts on compact sequences.
532 S. Roch
for every sequence (An ) ∈ A of finite essential rank. One can show that a fractal
algebra A has local weight 1 if and only if
A ∩ K = K(A). (1.2)
Proof We shall employ a criterion from [11] which characterizes the compactness
of a sequence (Kn ) in terms of the singular values of the matrices Kn . Thus, let
1 (A) ≥ . . . ≥ n (A) ≥ 0 denote the singular values of a matrix A ∈ Mn (C).
If K = (Kn ) is a compact sequence, then every (fractal or not) restriction of K is
compact. If K fails to be compact, then the negation of property (a) in Theorem 4.5
in [11] yields a positive constant C and strictly increasing sequences (nr ) and (kr )
with nr ≥ kr such that
The same theorem then implies that no restriction of the subsequence (Knr ) of (Kn )
is compact. This proves assertion (a).
Let now K be a sequence of finite essential rank r. The estimate ≥ in (2.2) comes
from (2.1). For the reverse estimate, we have to show that there is a fractal restriction
K|M of K of essential rank r. Let C := lim sup r (Kn ). By Corollary 4.6 in [11],
C is positive, and the set
M := {n ∈ N : r (Kn ) ≥ C/2}
for all M ∈ fr A.
534 S. Roch
There are several conditions which ensure the equality (2.3). Our goal is to show
(2.3) for sequences K which are stably regularizable. Recall that an element a of a
C ∗ -algebra A is Moore-Penrose invertible if there is a b ∈ A such that aba = a
and bab = b and such that ab and ba are self-adjoint. The element b is uniquely
determined by these conditions; it is called the Moore-Penrose inverse of a and
denoted by a † . All we need on Moore-Penrose invertibility is in Section 2.2.1 in
[5]. A sequence K ∈ F is then called stably regularizable if it is the sum of a
Moore-Penrose invertible sequence and a sequence in G. Stable regularizability of
a sequence K is equivalent to the Moore-Penrose invertibility of its coset K + G in
F /G.
So let K = (Kn ) ∈ A be a sequence of finite essential rank r which is stably
regularizable. Then, by definition, there are sequences (Ln ) ∈ F and (Gn ) ∈ G
such that (Kn ) = (Ln ) + (Gn ) and (Ln ) is Moore-Penrose invertible in F (and, if
G ⊆ A, even in A by inverse closedness). In particular, every Ln is Moore-Penrose
invertible and sup L†n < ∞, such that (Ln )† = (L†n ). Clearly, (Ln ) is a compact
sequence and
Proof From k = kk † k and the properties of the rank function we conclude that
which implies that r(k) = r(k † k) and r(k) ≤ r(k † ). Since (k † )† = k, we obtain
r(k) = r(k † ). Further we infer from Theorem 2.15 in [5] that k † k = (k ∗ k+q)−1 k ∗ k,
where q is the Moore-Penrose projection of k. Hence,
Proof Let k ∈ pAp be of algebraic rank 1 in pAp and a ∈ A. Then k = pkp, and
from kak = pkp pap pkp we conclude that k has algebraic rank 1 in A.
Compact Sequences in Quasifractal Algebras 535
Proof Let ess rank P = r. Since P ∈ K ∩A = K(A), there are sequences (Kni ) ∈ A
of spatial rank one and (Gn ) ∈ G such that
We may even assume that (Kni ) ∈ PAP (otherwise multiply (2.5) from both sides
by P to get a decomposition of P into r sequences P(Kni )P ∈ PAP of essential rank
one and a sequence in PGP). Let M ∈ fr A and A ∈ PAP. Then
A|M = (PA)|M = (Kn1 ) + . . . + (Knr ) + (Gn ) A |M
= (Kn1 )A |M + . . . + (Knr )A |M + (Gn )A |M
with sequences (Kn1 )A |M of spatial rank one in (PAP)|M , which implies that
(PAP)|M ⊆ K (PAP)|M .
Since P is compact, we also have (PAP)|M ⊆ K|M , which results in the inclusion
⊇ in (2.4). The reverse inclusion is evident.
Thus, if the algebra A satisfies the local weight one condition, then so does the
algebra (PAP)|M . Since M ∈ fr A, the latter algebra is also fractal, and we conclude
from (1.1) that
ess rank K|M = alg rank(PAP)|M /G |M K|M + G|M (2.6)
for all sequences K ∈ PAP. To get rid of the restriction to the algebra PAP on the
right-hand side of (2.6), we employ Lemma 2.3 which states in the present context
536 S. Roch
that
alg rank(PAP)|M /G |M K|M + G|M = alg rankA|M /G |M K|M + G|M .
C3 is the only one with continuous trace. For C1 , the identity matrix diag (1, 1)
is a projection of algebraic rank one at 0, but there is no rank one projection
close to it in a neighborhood of 0, whereas for C2 , the Hausdorff property is
violated.
(b) Let (sn ) be a dense subsequence of [0, 1]. For i ∈ {1, 2, 3}, let S(Ci ) stand
for the smallest closed subalgebra of F which contains the ideal G and all
sequences (f (sn )) with f ∈ Ci where Ci is as in (a). (Here we
assume
that
δ ≥ 2 and identify a 2 × 2-matrix A with the δ(n) × δ(n)-matrix A0 00 .) Each of
the algebras S(Ci ) is quasifractal, S(Ci ) ⊆ K, and the mapping
Here we recall from [13] the definition of a topology on (fr A)∼ which makes
(fr A)∼ to a compact Hausdorff space. For A as C ∗ -subalgebra of F , let Lmin (A)
denote the smallest closed complex subalgebra1 of l ∞ := l ∞ (N) which contains all
sequences (An ) where (An ) ∈ A. Clearly, Lmin (A) is a commutative C ∗ -algebra,
which is unital if A is unital.
For a C ∗ -subalgebra L of l ∞ , we let cr L stand for the set of all infinite subsets
M of N such that all sequences in the restriction L|M converge. The algebra L is
called quasiconvergent if every infinite subset of N has an infinite subset in cr L.
Then, for every C ∗ -subalgebra A of F , fr A = cr Lmin (A), and A is quasifractal if
and only if Lmin (A) is quasiconvergent.
Let L be a unital C ∗ -subalgebra of l ∞ . Then, for every M ∈ cr L, the mapping
Proof We first show that Lmin (L) = L for every C ∗ -subalgebra L of l ∞ . If (αn ) ∈
L, then (|αn |) = ((αn )(αn )∗ )1/2 ∈ L; hence, Lmin (L) ⊆ L. Conversely, every
sequence in L can be written as a linear combination of four non-negative sequences.
Since (βn ) = (|βn |) ∈ Lmin (L) for each of these sequences, the reverse inclusion
L ⊆ Lmin (L) follows.
Now to the proof of (4.2). The inclusion Lmin (A) ⊆ Lmax (A) holds trivially, as
already mentioned. To get the reverse inclusion, note that
where the equality comes from the remark at the beginning of the proof and the
inclusion holds because Lmax (A) ⊆ A and the functor Lmin is increasing. This
settles the first equality in (4.2).
540 S. Roch
Let H be a Hilbert space and K ∈ L(H ) an operator with rank K ≤ 1. Then, for
every A ∈ L(H ), there is an α ∈ C such that KAK = αK. The number α is
uniquely determined and satisfies |α| ≤ A K whenever K = 0. If K = 0, then
KAK = αK holds for every α ∈ C, whereas the condition |α| ≤ A K only
holds when α = 0. Thus, both conditions together determine α uniquely.
Let now A be a C ∗ -subalgebra of F and let (Kn ) ∈ A be a sequence with
maxn rank Kn ≤ 1. Then, as we have just seen, for every sequence (An ) ∈ A, there
is a unique sequence (αn ) ∈ l ∞ such that
If M ∈ supp (An ), then the restriction A|M is fractal and (An )|M is not a partial zero
sequence. Thus,
Note also that if one of two equivalent sets in fr A belongs to supp (An ), then so does
the other. In this sense, the support of a sequence is compatible with the equivalence
relation ∼.
Let now (An ), (Kn ) ∈ A with maxn rank Kn ≤ 1, and let (αn ) be the sequence
uniquely determined by (5.1). Further let M ∈ fr A. We distinguish two cases.
Case A M ∈ supp (An ). Then (An )|M ∈ G|M by (5.2). The second condition in
(5.1) then implies that (αn )|M is a zero sequence; hence, convergent.
Case B M ∈ supp (An ). Suppose (αn )|M is not convergent. Then, since (αn ) is a
bounded sequence, there are complex numbers α = β and disjoint infinite subsets
Mα , Mβ of M such that αn → α on Mα and αn → β on Mβ .
Compact Sequences in Quasifractal Algebras 541
and the fact that the sequence (Kn )|M has a positive limit, k, by (5.2), which
implies that the right hand side of (5.3) converges to k|α − β| > 0.
Hence, (αn Kn −αKn )|M is a partial zero sequence but not a zero sequence, which
contradicts the fractality of A|M . This contradiction shows that the sequence (αn )|M
converges for every M ∈ fr A, whence (αn ) ∈ Lmax (A).
Let A be a unital C ∗ -algebra and C a unital C ∗ -subalgebra in the center of
A, i.e., the elements of C commute which each element of A. Then C is a unital
commutative C ∗ -subalgebra of A. Let J stand for the set of all elements k ∈ A
with the following property: For every a ∈ A there is a c ∈ C such that kak = ck.
Then J forms a semi-ideal of A. The associated rank function is called the C-rank,
and the associated ideal of the C-compact elements is denoted by JC (A). In case C
is the complete center of A, the C-rank is also called the central rank and denoted
by cen rank a, and the associated ideal of the centrally compact elements is denoted
by Jcen (A). Note also that JC (A) = C(A), with the notation introduced in the
introduction.
Theorem 5.2 Let A be a unital C ∗ -subalgebra of F with G ⊆ A, and assume A is
an Lmax -algebra. Then
The theory of compact sequences in fractal algebras A is most satisfying when the
algebra A is of local weight 1, which happens if and only if A ∩ K = K(A). It is
only natural to impose an analogous condition when studying compact sequences in
542 S. Roch
A ∩ K = K(A).
Then, by Theorem 5.2, A ∩ K = K(A) = Jcen (A) where Cen A = Lmax (A) =
A ∩ l ∞ (N) by Theorem 4.4.
We are going to localize the algebra B := A/G over its central subalgebra C :=
(Lmax (A) + G)/G ∼ = Lmax (A)/c0 via the local principle by Allan/Douglas (see
Theorems 2.2.2 and 2.2.11 in [14]). By Theorem 4.2,
Let Ix stand for the smallest closed ideal of B which contains the maximal ideal x
of C and let x denote the quotient map B → B/Ix . Then the local principle states
(among other things) that an element b ∈ B is invertible in B if and only if x (b) is
invertible in B/Ix for every x ∈ Max B and that the function
is upper semi-continuous for every b ∈ B. In the present context on can say more:
If x = M∼ ∈ (fr A)∼ then, by Theorem 4.9 in [13] (note that Lmax (A) = Lmin (A)
by Theorem 4.4), the local algebra A/Ix is ∗ -isomorphic to the fiber A(M∼ ), and
the function (6.1) is continuous for every b ∈ B. (Roughly speaking, the continuity
results from including Lmax (A) = Lmin (A) into A.)
If (Kn ) ∈ A is a sequence of central rank one, then its coset k := (An ) + G
has the property that, for every b ∈ B, there is a c ∈ C such that kbk = ck.
Thus, k ∈ B has C-rank 1. The ideal Jcen (A)/G of B we are interested in will be
denoted by J . As we have just seen, J is generated by certain elements of C-rank
1. Hence, J ⊆ JC (B). Note also that C is a subalgebra of the center of B, hence
JC (B) ⊆ Jcen (B). We cannot claim that equality holds in either of these inclusions.
In this section, we are going to show that Fell’s condition is satisfied for the algebras
J and JC (B). We prepare the next steps with a few lemmas. Since now C stands for
a central subalgebra, we adopt the notation introduced in the previous section and
write JC (A) instead of C(A) for the closed ideal of a C ∗ -algebra A generated by
its elements of C-rank = algebraic rank 1 to avoid confusion.
Compact Sequences in Quasifractal Algebras 543
hence |qn (x) − g(x)| ≤ pn − f ∞ . This estimate holds for x ∈ [a, 0) and x = 0
as well. Thus,
The next results state that projections of algebraic rank one in x (J ) can be
lifted both to elements of C-rank one in J and to local projections.
Proposition 7.3 Let B be a unital C ∗ -algebra, C a central C ∗ -subalgebra of B
which contains the unit element, and J ⊆ JC (B) a closed ideal of B which is
generated by elements of C-rank one. Let a ∈ J , x ∈ Max C, and suppose that
x (a) is a projection of algebraic rank one in B/Ix . Then there is an element k ∈ J
of C-rank one such that x (k) = x (a).
Proof Set p := x (a). Being of algebraic rank one, p = 0. Since a ∈ J , a is a
limit of sums of elements rin ∈ J with C-rank one:
n
a = lim rin .
n→∞
i=1
Then
n
p = x (a) = lim x (rin )
n→∞
i=1
n
p = lim px (rin )p. (7.1)
n→∞
i=1
Since p is of algebraic rank one, there are numbers λin ∈ C such that px (rin )p =
λin p. Not all of these numbers can be zero (otherwise (7.1) would imply that p = 0).
Let λ := λi0 n0 = 0. Then, with r := ri0 n0 , px (r)p = λp, whence
1 1
p= px (r)p = x (a)x (r)x (a) = x (λ−1 ara).
λ λ
is continuous for every j ∈ J , then U can be chosen such that y (a) = 0 for all
y ∈ U.
Compact Sequences in Quasifractal Algebras 545
Proof Let b ∈ J be such that x (b) = p. Without loss we may assume that b is
positive (otherwise first replace b by (b + b∗ )/2 and then by max(0, b), using the
continuous functional calculus and Lemma 7.2 (a)). From
and the upper semi-continuity of the function (7.2) we conclude that, given ε > 0,
there is an open neighborhood Uε of x such that
One may consider Theorem 7.5 as stating Fell’s condition for J with respect to
the base space Max C. To get from here Fell’s condition for J with respect to the
primitive ideal space Prim J , we still need a ‘transfer lemma’ which relates the two
settings. For, we need some facts on primitive ideal spaces. First, the mapping
ϕ : Prim A → Max C, L → L ∩ C
is onto, and it is continuous with respect to the hull-kernel topology on Prim A (and
it is open if and only if the function Max C 4 y → y (j ) is continuous for every
a ∈ A) (Section C1 in [16]) and, second, the mapping
Ix0 ∩ J ⊆ λ−1 −1
J (L0 ) ∩ J = λJ (λJ (L0 )) = L0 .
Summarizing, we obtain
kn
b = lim jkn .
n→∞
k=1
Compact Sequences in Quasifractal Algebras 547
kn
b + L0 = lim (b + L0 ) (jkn + L0 ) (b + L0 ). (7.4)
n→∞
k=1
(b + L0 ) (jkn + L0 ) (b + L0 ) = αkn (b + L0 ).
By (7.4) and since p = 0, at least one of the αkn is not zero; say αk n . Then
j := jk n /αk n is in J , is of C-rank 1, and satisfies
(b + L0 ) (j + L0 ) (b + L0 ) = b + L0 .
whence, by (7.3),
2
b/x0 (b) − b/x0 (b) ∈ L0 .
The following theorem settles Fell’s condition (= condition (c) in Definition 3.1)
for ideals J which are generated by elements of C-rank 1.
Theorem 7.7 Let B be a unital C ∗ -algebra, C a central C ∗ -subalgebra of B which
contains the unit element, and J ⊆ JC (B) a closed ideal of B which is generated
by elements of C-rank 1. Assume that the function (7.2) is continuous for every
j ∈ J . Then J satisfies Fell’s condition, i.e, for every L0 ∈ Prim J , there are a
neighborhood U of L0 in Prim J and an a ∈ J such that a + L is a projection of
algebraic rank 1 for all L ∈ U .
Proof Let L0 ∈ Prim J and x0 := ϕJ (L0 ). Let p ∈ J /L0 be a projection of
algebraic rank 1. Then, by part (a) of the Transfer Lemma 7.6, there is a a ∈ J of
C-rank 1 such that a + L0 = p and a + Ix0 is a projection of algebraic rank one
in x0 (J ). By Theorem 7.5, we can choose the element a positive and of C-rank
1 and such that x (a) is a projection of algebraic rank one for all x in an open
neighborhood V ⊆ Max C of x0 . Part (b) of the transfer lemma then implies that,
for every x ∈ V , there is a unique Lx ∈ Prim J such that ϕJ (Lx ) = x and that
a + Lx is a projection of algebraic rank 1 in J /Lx .
−1
Since ϕJ is continuous, U1 := ϕJ (V ) is open in Prim J . Further, by Lemma
A.30 in [8], U2 := {L ∈ Prim J : a + L > 1/2} is open (note that Spec J and
Prim J are naturally homeomorphic). Thus, U := U1 ∩ U2 is open in Prim J . The
assertion will follow once we have shown that U = {Lx : x ∈ V }. The inclusion ⊇
follows by the definition of U .
The reverse inclusion ⊆ is a consequence of the transfer lemma again. Indeed, let
L ∈ U . Since L ∈ U1 , ϕJ (L) = x0 ∈ V . By Lemma 7.6, a + L is a projection of
algebraic rank ≤ 1, implying that the norm of a + L is either 0 or 1. Since L ∈ U2 ,
we conclude that a + L = 1. Hence, a + L is a projection of rank 1, whence
L = Lx0 by the uniqueness assertion in the transfer lemma.
is continuous on U .
Compact Sequences in Quasifractal Algebras 549
Proof Let x0 ∈ U and s := alg rank (a + (Ix0 ∩ J )). Then there are finitely many
elementary components E1 , . . . , Er of the dual algebra J /(Ix0 ∩ J ) and s mutually
orthogonal projections πji ∈ Ei , j = 1, . . . , ni , of algebraic rank 1 such that
r
ni
a + (Ix0 ∩ J ) = πji .
i=1 j =1
Now we continue the locally defined elements πji , αji and βi to elements of J .
By Theorem 7.5, there are positive elements bi ∈ J of central rank 1 such that
bi + (Ix0 ∩ J ) = βi and bi + (Ix ∩ J ) is a projection of algebraic rank 1 for all x
in a neighborhood Wi of x0 .
Next, it follows as in Lemmas 3.1 and 3.2 of [4], there are elements pji , aji , bi ∈
J and a neighborhood V ⊆ ∩i Wi of x0 such that
and that, for all x ∈ V , bi + (Ix ∩ J ) is a projection and the pji + (Ix ∩ J ) are
mutually orthogonal projections with
(aji )∗ aji +(Ix ∩J ) = pji +(Ix ∩J ), aji (aji )∗ +(Ix ∩J ) = bi +(Ix ∩J ). (8.1)
References
5. R. Hagen, S. Roch, B. Silbermann, C ∗ -Algebras and Numerical Analysis (Dekker, New York,
2001)
6. V. Nistor, N. Prudhon, Exhaustive families of representations and spectra of pseudodifferential
operators. J. Oper. Theory 78, 247–279 (2017)
7. G.K. Pedersen, C ∗ -Algebras and Their Automorphism Groups (Academic, London, 1979)
8. I. Raeburn, D.P. Williams, Morita Equivalence and Continuous Trace C ∗ -Algebras (American
Mathematical Society, Providence, 1998)
9. S. Roch, Algebras of approximation sequences: Fractality, in in Problems and Methods in
Mathematical Physics. Operator Theory: Advances and Applications, vol. 121 (Birkhäuser,
Basel, 2001), pp. 471–497
10. S. Roch, Algebras of approximation sequences: fredholm theory in fractal algebras. Studia
Math. 150, 53–77 (2002)
11. S. Roch, Finite Sections of Band-Dominated Operators, vol. 191. (Memoirs of the American
Mathematical Societ, Providence, 2008), p. 895
12. S. Roch, Extension-restriction theorems for algebras of approximation sequences, in Operator
Theory, Operator Algebras, ans Matrix Theory. Operator Theory: Advances and Applications,
vol. 267 (Birkhäuser, Basel 2018), pp. 261–284
13. S. Roch, Beyond fractality: Piecewise fractal and quasifractal algebras, in The Diversity and
Beauty of Applied Operator Theory. Operator Theory: Advances and Applications, vol. 268
(Birkhäuser, Basel 2018), pp. 413–428
14. S. Roch, P.A. Santos, B. Silbermann, Non-commutative Gelfand Theories. A Tool-kit for
Operator Theorists and Numerical Analysts (Universitext, Springer, London, 2011)
15. S. Roch, B. Silbermann, C ∗ -algebra techniques in numerical analysis. J. Oper. Theory 35,
241–280 (1996)
16. D. Williams, Crossed Products of C ∗ -Algebras (American Mathematical Society, Providence,
2007)
Dilation Theory: A Guided Tour
O. M. Shalit ()
Faculty of Mathematics, Technion - Israel Institute of Technology, Haifa, Israel
e-mail: [email protected]
There are other ways to present dilation theory, and by the end of the first section
the reader will find references to several alternative sources. Either because of my
ignorance, or because I had to make choices, some things were left out. I have not
been able to cover all topics that could fall under the title, nor did I do full justice to
the topics covered. After all this is just a survey, and that is the inevitable nature of
the genre.
The purpose of this introductory section is to present the notion of dilation, and to
give a first indication that this notion is interesting and can be useful.
Let H be a Hilbert space, and T ∈ B(H) be a contraction, that is, T is an
operator
√ such that T ≤ 1. Then I − T ∗ T ≥ 0, and so we can define DT =
I − T ∗ T . Halmos [72] observed that the simple construction
T DT ∗
U=
DT −T ∗
T = PH U H , (1.1)
by the spectral theorem, where σ (U ) denotes the spectrum of U . Since for every
unitary U , the spectrum σ (U ) is contained in the unit circle T = {z ∈ C : |z| = 1},
the proof is complete.
Remark 1.2 The above proof is a minor simplification of the proof of von Neu-
mann’s inequality due to Sz.-Nagy [163], which uses the existence of a unitary
power dilation (see the next section). The simplification becomes significant when
dim H < ∞, because then U is a unitary which acts on a finite dimensional space,
and the spectral theorem is then a truly elementary matter. Note that even the case
dim H = 1 is not entirely trivial: in this case von Neumann’s inequality is basically
the maximum modulus principle in the unit disc; as was observed in [145], this
fundamental results can be proved using linear algebra!
Remark 1.3 A matrix valued polynomial is a function z → p(z) ∈ Mn where
p(z) = N k=0 Ak z , and A1 , . . . , Ak ∈ Mn = Mn (C) (the n × n matrices over C).
k
we may evaluate a matrix valued polynomial p at T by setting p(T ) = N k=0 Ak ⊗
T k ∈ Mn ⊗ B(H), or, equivalently, p(T ) is the n × n matrix over B(H) with ij th
entry equal to pij (T ) (the operator p(T ) acts on the direct sum of H with itself n
times). It is not hard to see that if p is a matrix valued polynomial with values in Mn
and T is a contraction, then the inequality (1.5) still holds but with |p(z)| replaced
by p(z)Mn : first one notes that it holds for unitary operators, and then one obtains
it for a general contraction by the dilation argument that we gave.
The construction (1.2) together with Theorem 1.1 illustrate what dilation theory
is about and how it can be used: every object in a general class of objects (here,
contractions) is shown to be “a part of” an object in a smaller, better behaved class
(here, unitaries); the objects in the better behaved class are well understood (here,
by the spectral theorem), and thus the objects in the general class inherit some
good properties. The example we have just seen is an excellent one, since proving
von Neumann’s inequality for non-normal contractions is not trivial. The simple
construction (1.2) and its multivariable generalizations have other applications, for
example they lead to concrete cubature formulas and operator valued cubature
formulas [95, Section 4.3]. In Sect. 4 we will examine in detail a deeper application
of dilation theory—an operator theoretic proof of Pick’s interpolation theorem.
Additional applications are scattered throughout this survey.
Before continuing I wish to emphasize that “dilation theory” and even the word
“dilation” itself mean different things to different people. As we shall see further
down the survey, the definition changes, as do the goals and the applications. Besides
the expository essay [95], the subject is presented nicely in the surveys [8] and [13],
and certain aspects are covered in books, e.g., [2, 118], and [121] (the forthcoming
book [26] will surely be valuable when it appears). Finally, the monograph [23] is
an indispensable reference for anyone who is seriously interested in dilation theory
of contractions.
It is quite natural to ask whether one can modify the construction (1.2) so that (1.3)
holds for all k ∈ N, and not just up to some power N. We will soon see that the
answer is affirmative. Let us say that an operator U ∈ B(K) is a power dilation
of T ∈ B(H), if H is a subspace of K and if T k = PH U k H for all k ∈ N =
{0, 1, 2, . . .}. The reader should be warned that this is not the standard terminology
used, in the older literature one usually finds the word dilation used to describe what
we just called a power dilation, whereas the concept of N-dilation does not appear
much (while in the older-older literature one can again find power dilation).
In this and the next sections I will present what I and many others refer to as
classical dilation theory. By and large, this means the theory that has been pushed
556 O. M. Shalit
and organized by Sz.-Nagy and Foias (though there are many other contributors),
and appears in the first chapter of [23]. The book [23] is the chief reference for
classical dilation theory. The proofs of most of the results in this and the next section,
as well as references, further comments and historical remarks can be found there.
Theorem 2.1 (Sz.-Nagy’s Unitary Dilation Theorem [163]) Let T be a contrac-
tion on a Hilbert space H. Then there exists a Hilbert space K containing H and a
unitary U on K, such that
Moreover, K can be chosen to be minimal in the sense that the smallest reducing
subspace for U that contains H is K. If Ui ∈ B(Ki ) (i = 1, 2) are two minimal
unitary dilations, then there exists a unitary W : K1 → K2 acting as the identity on
H such that U2 W = W U1 .
One can give a direct proof of existence of the unitary dilation by just writing
down an infinite operator matrix U acting on 2 (Z, H) = ⊕n∈Z H, similarly to
(1.2).
6 A nminimal dilation is then obtained by restricting U to the reducing nsubspace
n∈Z U H (the notation means the closure of the span of the subspaces U H). The
uniqueness of the minimal unitary dilation is then a routine matter. We will follow
a different path, that requires us to introduce another very important notion: the
minimal isometric dilation.
Before presenting the isometric dilation theorem, it is natural to ask whether it
is expected to be of any use. A unitary dilation can be useful because unitaries
are “completely understood” thanks to the spectral theorem. Are isometries well
understood? The following theorem shows that, in a sense, isometries are indeed
very well understood.
For a Hilbert space G, we write 2 (N, G) for the direct sum ⊕n∈N G. The
unilateral shift of multiplicity dim G (or simply the shift) is the operator S :
2 (N, G) → 2 (N, G) given by
The space G is called the multiplicity space. Clearly, the shift is an isometry.
Similarly, the bilateral shift on 2 (Z, G) is defined to be the operator
where we indicate with a box the element at the 0th summand of 2 (Z, G).
Theorem 2.2 (Wold Decomposition) Let V be an isometry on a Hilbert space H.
Then there exists a (unique) direct sum decomposition H = Hs ⊕ Hu such that
Hs and Hu are reducing for V , and such that V Hs is unitarily equivalent to a
unilateral shift and V Hu is unitary.
Dilation Theory: A Guided Tour 557
For the proof, the reader has no choice but to define Hu = ∩n≥0 V n H. Once one
shows that Hu is reducing, it remains to show that V H⊥ is a unilateral shift. Hint:
u
the multiplicity space is H V H, (this suggestive notation is commonly used in
the theory; it means “the orthogonal complement of V H inside H”, which is in this
case just (V H)⊥ ).
Theorem 2.3 (Sz.-Nagy’s Isometric Dilation Theorem) Let T be a contraction
on a Hilbert space H. Then there exists a Hilbert space K containing H and an
isometry V on K, such that
T∗ = V∗ H (2.2)
which6is readily seen to satisfy V ∗ H = T ∗ . That V is an isometry, and the fact that
K = n∈N V n H, can be proved directly and without any pain.
The uniqueness is routine, but let’s walk through it for once. If Vi ∈ B(Ki ) are
two minimal isometric dilations, then we can define a map W on span{V1n h : n ∈
N, h ∈ H} ⊆ K1 by first prescribing
W V1n h = V2n h ∈ K2 .
This map preserves inner products: assuming that m ≤ n, and using the fact that Vi
is an isometric dilation of T , we see
W V1m g, W V1n h = V2m g, V2n h = g, V2n−m h = g, T n−m h = V1m g, V1n h.
558 O. M. Shalit
The map W therefore well defines an isometry from the dense subspace span{V1n h :
n ∈ N, h ∈ H} ⊆ K1 onto the dense subspace span{V2n h : n ∈ N, h ∈ H} ⊆ K2 ,
and therefore extends to a unitary which, by definition, intertwines V1 and V2 .
∗
Note that the minimal isometric dilation is actually a coextension: T ∗ = V H .
A coextension is always a power dilation, so T n = PH V n H for all n ∈ N,
but, of course, the converse is not true (see Theorem 3.10 and (3.6) below for the
general form of a power dilation). Note also that the minimality requirement is more
stringent than the minimality required from the minimal unitary dilation. The above
proof of existence together with the uniqueness assertion actually show that every
isometry which is a power dilation of T and is minimal in the sense of the theorem,
is in fact a coextension.
Because the minimal isometric dilation V is actually a coextension of T , the
adjoint V ∗ is a coisometric extension of T ∗ . Some people prefer to speak about
coisometric extensions instead of isometric coextensions.
Once the existence of an isometric dilation is known, the existence of a
unitary dilation follows immediately, by the Wold decomposition. Indeed, given a
contraction T , we can dilate it to an isometry V . Since V ∼ = S ⊕ Vu , where S is a
unilateral shift and Vu is a unitary, we can define a unitary dilation of T by U ⊕ Vu ,
where U is the bilateral shift of the same multiplicity as S. This proves the existence
part of Theorem 2.1; the uniqueness of the minimal unitary dilation is proved as for
the minimal isometric one.
The minimal unitary dilation of a single contraction can serve as the basis of the
development of operator theory for non-normal operators. This idea was developed
to a high degree by Sz.-Nagy and Foias and others; see the monograph [23]
([23] also contains references to alternative approaches to non-normal operators,
in particular the theories of de Branges–Rovnyak, Lax–Phillips, and Livsič and his
school). The minimal unitary dilation can be used to define a refined functional
calculus on contractions, it can be employed to analyze one-parameter semigroups
of operators, it provides a “functional model” by which to analyze contractions and
by which they can be classified, and it has led to considerable progress in the study
of invariant subspaces.
To sketch just one of the above applications, let us briefly consider the functional
calculus (the following discussion might be a bit difficult for readers with little
background in function theory and measure theory; they may skip to the beginning
of Sect. 3 without much loss). We let H ∞ = H ∞ (D) denote the algebra of bounded
analytic functions on the open unit disc D. Given an operator T ∈ B(H), we wish
to define a functional calculus f → f (T ) for all f ∈ H ∞ . If the spectrum of T
is contained in D, then we can apply the holomorphic functional calculus to T to
Dilation Theory: A Guided Tour 559
for i = 1, 2. These are clearly isometric coextensions, but they do not commute:
Here and below we use the shorthand notation p(T ) = p(T1 , . . . , Td ) whenever
p is a polynomial in d variables and T = (T1 , . . . , Td ) is a d-tuple of operators. The
proof of the above theorem (which is implicit in the lines preceding it) gives rise to
an interesting principle: whenever we have a unitary or a normal dilation then we
have a von Neumann type inequality. This principle can be used in reverse, to show
that for three or more commuting contractions there might be no unitary dilation, in
general.
Example 3.5 There exist three contractions T1 , T2 , T3 on a Hilbert space H and a
complex polynomial p such that
Ti e = fi
Ti fi = −gi
Ti fj = gk , k = i, j
Ti gj = δij h
Ti h = 0
are three commuting contractions that have no isometric dilation. However, these
operators can be shown to satisfy von Neumann’s inequality.
What is it exactly that lies behind this dramatic difference between d = 2 and
d = 3? Some people consider this to be an intriguing mystery, and there has been
effort made into trying to understand which d-tuples are the ones that admit a unitary
dilation (see, e.g., [160] and the references therein), or at least finding sufficient
conditions for the existence of a nice dilation.
A particularly nice notion of dilation is that of regular dilation. For a d-tuple T =
(T1 , . . . , Td ) ∈ B(H)d and n = (n1 , . . . , nd ) ∈ Nd , we write T n = T1n1 · · · Tdnd . If
n = (n1 , . . . , nd ) ∈ Zd , then we define
T (n) = (T n− )∗ T n+
The conditions (3.5) are sometimes called Brehmer’s conditions. For the proof,
one shows that the function n → T (n) is a positive definite function on the group
Zd (see Section I.9 in [23]), and uses the fact that every positive definite function on
a group has a unitary dilation [23, Section I.7].
Corollary 3.8 The following are sufficient conditions for a d-tuple T =
(T1 , . . . , Td ) of commuting contractions on a Hilbert space to have a regular
unitary dilation:
1. di=1 Ti 2 ≤ 1.
2. T1 , . . . , Td are all isometries.
3. T1 , . . . , Td doubly commute, in the sense that Ti Tj∗ = Tj∗ Ti for all i = j (in
addition to Ti Tj = Tj Ti for all i, j ).
Proof It is not hard to show that the conditions listed in the corollary are sufficient
for Brehmer’s conditions (3.5) to hold.
dilation theorem, Theorem 3.2). The restriction of the isometry U to the subspace
6
n∈N U H is clearly
n
1. an isometry,
2. a dilation of A,
3. a minimal dilation,
and6therefore (by uniqueness of the minimal isometric dilation), the restriction of U
to n∈N U n H is 6 unitarily equivalent to the minimal isometric dilation V on H, so
we identify K = n∈N U n H and V = U H . It follows (either from our knowledge
on the minimal dilation, or simply from the fact that U is a coextension) that V is a
coextension of A. With respect to the decomposition L = K ⊕ K⊥ ,
V X RQ
U= , W =
0 Z P N
Ts = PH Vs H , for all s ∈ S.
Note that Sz.-Nagy’s unitary dilation theorem can be rephrased by saying that
every semigroup of contractions over S = N has a unitary dilation, in the above
sense. Similarly, there are notions of extension and coextension of a semigroup
of operators. Some positive results have been obtained for various semigroups. Sz.-
566 O. M. Shalit
Nagy proved that every semigroup T = {Tt }t ∈R+ of contractions that is point-strong
continuous (in the sense that t → Tt h is continuous for all h ∈ H) has isometric and
unitary dilations, which are also point-strong continuous (see [23, Section I.10]).
This result was extended to the two parameter case by Słociński [151] and Ptak
[130]; the latter also obtained the existence of regular dilations for certain types
of multi-parameter semigroups. Douglas proved that every commutative semigroup
of isometries has a unitary extension [52]. Letting the commutative semigroup be
S = Nd , we recover Theorem 3.1. Douglas’s result was generalized by Laca to
semigroups of isometries parametrized by an Ore semigroup [92], and in fact to
“twisted” representations.
A result that somewhat sheds light on the question, which tuples of operators
have a unitary dilation and which don’t, is due to Opela. If T = {Ti }i∈I ⊂ B(H)
is a family of operators, we say that T commutes according to the graph G with
vertex set I , if Ti Tj = Tj Ti whenever {i, j } is an edge in the (undirected) graph G.
We can consider T as a semigroup parameterized by a certain quotient of the free
semigroup over I . Opela proved the following compelling result: given a graph G,
every family T = {Ti }i∈I of contractions commuting according to G has a unitary
dilation that commutes according to G, if and only if G contains no cycles [108].
It is interesting that in the general setting of semigroups of operators, one can say
something about the structure of dilations.
Theorem 3.10 (Sarason’s Lemma [136]) Let V = {Vs }s∈S ⊂ B(K) be a
semigroup of operators over a semigroup with unit S, and let H be a subspace
of K. Then the family T = {Ts := PH Vs H }s∈S is a semigroup over S if and only
if there exist two subspaces M ⊆ N ⊆ K, invariant under Vs for all s, such that
H = N M := N ∩ M⊥ .
Proof The sufficiency of the condition is easy to see, if one writes the elements of
the semigroup V as 3 × 3 block operator matrices with respect to the decomposition
K = M ⊕ H ⊕ N ⊥.
For the converse, one has no choice but to define N = ∨s∈S Vs H (clearly an
invariant space containing H), and then it remains to prove that M := N H is
invariant for V , or—what is the same—that PH Vt M = 0 for all t ∈ S. Fixing t, we
know that for all s,
PH Vt PH Vs PH = Tt Ts = Tt s = PH Vt s PH = PH Vt Vs PH .
Sarason’s lemma is interesting and useful also in the case of dilations of a single
contraction.
Remark 3.11 Up to this point in the survey, rather than attempting to present a
general framework that encapsulates as much of the theory as possible, I chose
to sew the different parts together with a thin thread. There are, of course, also
“high level” approaches. In Sect. 7 we will see how the theory fits in the framework
of operator algebras, which is one unifying viewpoint (see also [41, 118, 121]).
There are other viewpoints. A notable one is due to Sz.-Nagy—very soon after
he proved his unitary dilation theorem for a single contraction, he found a far-
reaching generalization in terms of dilations of positive functions on ∗-semigroups;
see [164], which contains a theorem from which a multitude of dilation theorems
can be deduced (see also [162] for a more recent discussion with some perspective).
Another brief but high level look on dilation theory can be found in Arveson’s survey
[13].
The purpose of this section is to illustrate how classical dilation theory can be
applied in a nontrivial way to prove theorems in complex function theory. The
example we choose is classical—Pick’s interpolation theorem—and originates in
the work of Sarason [137]. Sarason’s idea to use commutant lifting to solve the
Pick interpolation problem works for a variety of other interpolation problems as
well, including Carathéodory interpolation, matrix valued interpolation, and mixed
problems. It can also be applied in different function spaces and multivariable
settings. Here we will focus on the simplest case. Good references for operator
theoretic methods and interpolation are [2] and [63], and the reader is referred to
these sources for details and references.
Recall that H ∞ denotes the algebra of bounded analytic functions on the unit disc
D = {z ∈ C : |z| < 1}. For f ∈ H ∞ we define
f ∞ = sup |f (z)|.
z∈D
568 O. M. Shalit
and
f ∞ ≤ 1. (4.2)
It is common knowledge that one can always find a polynomial (unique, if we take
it to be of degree less than or equal to n − 1) that interpolates the data, in the sense
that (4.1) holds. The whole point is that we require (4.2) to hold as well. Clearly,
this problem is closely related to the problem of finding the H ∞ function of minimal
norm that interpolates the points.
Recall that an n × n matrix A = (aij )ni,j =1 is said to be positive semidefinite if
for every v = (v1 , . . . , vn )T ∈ Cn
n
Av, v = aij vj v i ≥ 0.
i,j =1
The n × n matrix on the left hand side of (4.3) is called the Pick matrix. What
is remarkable about this theorem is that it gives an explicit and practical necessary
and sufficient condition for the solvability of the interpolation problem: that the Pick
matrix be positive semidefinite.
At this point it is not entirely clear how this problem is related to operator theory
on Hilbert spaces, since there is currently no Hilbert space in sight. To relate this
problem to operator theory we will represent H ∞ as an operator algebra. The space
on which H ∞ acts is an interesting object in itself, and to this space we devote the
next subsection. Some important properties of H ∞ functions as operators will be
studied in Sect. 4.3, and then, in Sect. 4.4, we will prove Theorem 4.1.
Dilation Theory: A Guided Tour 569
on the unit disc D that satisfy |an | < ∞. It is not hard to see that H is a linear
2 2
We learn that the linear functional h → h(w) is a bounded functional, and that the
element of H 2 that implements this functional is kw . The functions kw are called
kernel functions, and the function k : D × D → C given by k(z, w) = kw (z)
is called the reproducing kernel of H 2 . The fact that point evaluation in H 2 is a
bounded linear functional lies at the root of a deep connection between function
theory on the one hand, and operator theory, on the other.
The property of H 2 observed in the last paragraph is so useful and important that
it is worth a general definition. A Hilbert space H ⊆ CX consisting of functions
on a set X, in which point evaluation h → h(x) is bounded for all x ∈ X, is said
to be a Hilbert function space or a reproducing kernel Hilbert space. See [119]
for a general introduction to this subject, and [2] for an introduction geared towards
570 O. M. Shalit
Pick interpolation (for readers that are in a hurry, Chapter 6 in [147] contains an
elementary introduction to H 2 as a Hilbert function space). If H is a Hilbert function
space on X, then by the Riesz representation theorem, for every x ∈ X there is an
element kx ∈ H such that h(x) = h, kx for all h ∈ H, and one may define the
reproducing kernel of H by k(x, y) = ky (x).
The multiplier algebra of a Hilbert function space H on a set X is defined to be
for all h ∈ H 2 . This formula for the norm in H 2 implies that H ∞ ⊆ Mult(H 2 ), and
that Mf ≤ f ∞ .
Dilation Theory: A Guided Tour 571
N
N
Tp = T an Mzn 1 = an Mzn T 1 = Mp f = fp.
n=0 n=0
while f (w)pn (w) → f (w)h(w), on the other hand. This means that T h = f h for
all h and therefore f ∈ Mult(H 2 ) = H ∞ , as required.
Let z1 , . . . , zn ∈ D. It is not hard to see that kz1 , . . . , kzn are linearly independent.
Let G = span{kz1 , . . . , kzn }, and let A = PG Mz G . By Proposition 4.2, G is
coinvariant for Mz , i.e., Mz∗ G ⊆ G, and A∗ = Mz∗ G is the diagonal operator given
by
We can now prove Theorem 4.1. We first show that (4.3) is a necessary condition.
Suppose that f ∈ H ∞ satisfies f (zi ) = wi for all i = 1, . . . , n and that f ∞ ≤ 1.
Define B = PG Mf G , where G = span{kz1 , . . . , kzn } as in the previous subsection.
Then, by (4.5) B ∗ is the diagonal operator given by B ∗ kzi = wi kzi . Since Mf ≤
1, also B ∗ ≤ 1, thus for all α1 , . . . , αn ∈ C,
; n ;2 ; ;2 ; n ;2 ; n ;2
; ; ; n ; ; ; ; ;
; ; ; ∗ ; ; ; ; ;
0≤; αi kzi ; − ;B αi kzi ; = ; αi kzi ; − ; wi αi kzi ;
; ; ; ; ; ; ; ;
i=1 i=1 i=1 i=1
n
= αj 1 − wj wi αi kzj , kzi
i,j =1
n
1 − wi wj
= αj αi .
1 − zi zj
i,j =1
That is, the Pick matrix is positive semidefinite, and (4.3) holds.
Conversely, suppose that (4.3) holds. Define a diagonal operator D : G → G
by Dkzi = wi kzi for i = 1, . . . , n, and let B = D ∗ . Then the above computation
can be rearranged to show that B = B ∗ ≤ 1. Now, the diagonal operator B ∗
clearly commutes with the diagonal operator A∗ = Mz∗ G, so B commutes with
A. Since Mz is the minimal isometric dilation of A, the commutant lifting theorem
(Theorem 3.9) implies that B has a coextension to an operator T that commutes
with Mz and has T ≤ 1. By Proposition 4.5, T = Mf for some f ∈ H ∞ , and
by Proposition 4.2, f (zi ) = wi for all i = 1, . . . , n. Since f ∞ = T ≤ 1, the
proof is complete.
Dilation Theory: A Guided Tour 573
Classical dilation theory does not end with dilating commuting contractions to
commuting unitaries. Let us say that a d-tuple N = (N1 , . . . , Nd ) is a normal
tuple if N1 , . . . , Nd are all normal operators and, in addition, they all commute with
one another. Recall that the joint spectrum σ (N) of a normal tuple is the set
where M(C ∗ (N)) is the space of all nonzero complex homomorphisms from the
unital C*-algebra C ∗ (N) generated by N to C. If N acts on a finite dimensional
space, then the joint spectrum is the set of joint eigenvalues, belonging to an
orthogonal set of joint eigenvectors that simultaneously diagonalize N1 , . . . , Nd .
A commuting tuple of unitaries U = (U1 , . . . , Ud ) is the same thing as a normal
tuple with joint spectrum contained in the torus Td . Since normal tuples are in a
sense “completely understood”, it is natural to ask which operator tuples T have a
normal dilation N (where the definition of dilation is as in (3.1)) with the spectrum
σ (N) prescribed to be contained in some set X ⊂ Cd .
Suppose that T = (T1 , . . . , Td ) is a commuting tuple of operators and that N =
(N1 , . . . , Nd ) is a normal dilation with σ (N) = X ⊂ Cd . Then we immediately
find that
for every polynomial p in d variables. In fact, it is not too hard to see that the above
inequality persists when p is taken to be a rational function that is regular on X. This
motivates the following definition: a subset X ⊆ Cd is said to be a K-spectral set
for T if X contains the joint spectrum σ (T ) of T , and if for every rational function
f that is regular on X,
2. One may also discuss polynomial spectral sets, in which (5.1) is required to
hold only for polynomials [38]. If X is polynomially convex (and in particular, if
X is convex), then considering polynomials instead of rational functions leads to
the same notion.
Thus, with the terminology introduced above, we can rephrase Theorem 3.4 by
2
saying that the bidisc D is a spectral set for every pair T = (T1 , T2 ) of commuting
contractions, and Example 3.5 shows that there exists three commuting contractions
3
for which the tridisc D is not a spectral set.
The notion of a spectral set for a single operator is due to von Neumann [168].
A nice presentation of von Neumann’s theory can be found in Sections 153–155
of [133]. The reader is referred to [19] for a rather recent survey with a certain
emphasis on the single variable case. To give just a specimen of the kind of result
that one can encounter, which is quite of a different nature than what I am covering
in this survey, let me mention the result of Crouzeix [40], which says that for every
T ∈ B(H), the numerical range W (T ) := {T h, h : h = 1} of T is a K-spectral
set for some K ≥ 2 (it is easy to see that one cannot have a constant smaller than 2;
Crouzeix conjectured that K = 2, and this conjecture is still open at the time of me
writing this survey).
It is plain to see that if T has a commuting normal dilation N with spectrum
σ (N) ⊆ X, then X is a polynomial spectral set for T , and it is true that in fact
X is a spectral set. It is natural to ask whether the converse implication holds, that
is, whether the assumption that a set X is a spectral set for a tuple T implies that
there exists a normal dilation with spectrum constrained to X (or even to the Shilov
boundary ∂X). There are cases when this is true (see [19]), but in general the answer
is no. For example, we already mentioned that Parrott’s example [110] of three
commuting contractions that have no unitary dilation (hence also no normal dilation
3
with spectrum contained in D ) does not involve a violation of von Neumann’s
3
inequality, in other words the tuple T from (3.3) has D as a spectral set but has no
unitary dilation.
The situation was clarified by Arveson’s work [11], where the notion of complete
spectral set was introduced. To explain this notion, we need matrix valued polyno-
mials and rational functions. Matrix valued polynomials in several commuting (or
noncommuting) variables, and the prescriptions for evaluating them at d-tuples of
commuting (or noncommuting) operators, are defined in a similar manner to their
definition in the one variable case in Remark 1.3. Once one knows how to evaluate
a rational function in several variables at a commuting tuple, the passage to matrix
valued rational functions is done similarly.
Given a tuple T ∈ B(H)d of commuting contractions, we say that a set X ⊂ Cd
is a complete K-spectral set for T , if σ (T ) ⊆ X and if for every matrix valued
rational function f that is regular on X, (5.1) holds, where now for an n × n matrix
valued rational function f X,∞ = supz∈X f (z)Mn . If X is a complete K-spectral
set for T with K = 1, then it is simply said to be a complete spectral set for T .
Dilation Theory: A Guided Tour 575
f (T ) = PH f (N) H .
Putting Arveson’s dilation theorem together with some comments made above,
3
we see that D is a spectral set for the triple T from (3.3), but it is not a
complete spectral set. On the other hand, we know that for every pair of commuting
2
contractions T = (T1 , T2 ), the bidsic D is a complete spectral set. Agler and
McCarthy proved a sharper result: if T = (T1 , T2 ) acts on a finite dimensional
space, and T1 , , T2 < 1, then there exists a one dimensional complex algebraic
subvariety V ⊆ D2 (in fact, a so-called distinguished variety, which means that
V ∩ ∂(D2 ) = V ∩ T2 ), such that V is a complete spectral set for T [3].
If X ⊂ C is a spectral set for an operator T , one may ask whether or not it is
a complete spectral set. We close this section by mentioning some notable results
in this direction. It is known that if X ⊂ C is a compact spectral set for T such
that rat (X) + rat (X) is dense in C(∂X), then X is a complete spectral set, and T
has a normal dilation with spectrum in ∂X. The condition is satisfied, for example,
when X is the closure of a bounded and simply connected open set (this result is due
to Berger, Foias and Lebow (independently); see [118, Theorem 4.4]). The same is
true if X is an annulus (Agler [1]), but false if X is triply connected (Agler et al. [5]
and Dritschel and McCullough [54]).
If a pair of commuting operators T = (T1 , T2 ) has the symmetrized bidisc :=
{(z1 + z2 , z1 z2 ) : z1 , z2 ∈ D} as a spectral set, then in fact is a complete spectral
set for T (Agler and Young [4]). Pairs of operators having as a spectral set have
a well developed model theory (see, e.g., Sarkar [139] and the references therein).
Building on earlier work of Bhattacharyya et al. [29], and inspired by Agler and
McCarthy’s distinguished varieties result mentioned above, Pal and Shalit showed
that if is a spectral set for a pair T = (T1 , T2 ) of commuting operators acting on a
finite dimensional space, then there exists a distinguished one dimensional algebraic
variety V ⊆ which is a complete spectral set for T [109].
576 O. M. Shalit
p(T ) = PH p(U ) H
6.2 Models
Another direction in which dilation theory for commuting d-tuples has been
developed is that of operator models. Roughly, the idea is that certain classes of
d-tuples of operators can be exhibited as the compressions of a particular “model”
d-tuple of operators. We will demonstrate this with a representative example; for a
broader point of view see [105], Chapter 14 in [2], or the surveys [8] and [138].
Our example is the d-shift on the Drury-Arveson space Hd2 [12, 55] (see also the
survey [146]).
For a fixed d, we let Hd2 denote the space of all analytic functions
f (z) = α cα zα on the unit ball Bd such that (with standard multi-index notation)
α!
f 2H 2 := |cα |2 < ∞.
d
α
|α|!
This norm turns the space Hd2 into a Hilbert space of analytic functions on Bd , such
that point evaluation is bounded. In fact, Hd2 is the reproducing kernel Hilbert space
determined by the kernel k(z, w) = 1−z,w1
. Some readers might jump to their feet
and object that this space is nothing but the good old symmetric Fock space, but it
is fruitful and enlightening to consider it as a space of analytic functions (so please,
sit down).
For the record, let the reader know that the possibility d = ∞ is allowed, but we
do not dwell upon this point.
578 O. M. Shalit
Si f (z) = zi f (z) , i = 1, . . . , d,
and this inequality replaces von Neumann’s inequality in this setting (and this was
Drury’s motivation [55]). It can be shown [55] (see also [12, 44]) that there exists
no constant C such that P (S) ≤ C supz∈Bd |p(z)|, and in particular, commuting
row contractions in general do not have normal dilations with spectrum contained
in Bd .
is a tuple V = (V1 , . . . , Vd ) such that Vi∗ Vj = δij I , for all i, j . Thus, the operators
V1 , . . . , Vd are all isometries which have mutually orthogonal ranges, and this is
equivalent to the condition that the row operator [V1 V2 · · · Vd ] is an isometry. The
Sz.-Nagy isometric dilation theorem extends to the setting of (noncommuting) row
contractions. The following theorem is due to Frazho [64] (the case d = 2), Bunce
[33] (the case d ∈ N∪{∞}) and Popescu [123] (who proved the existence of dilation
in the case d ∈ N ∪ {∞}, and later developed a far reaching generalization of Sz-
Nagy’s and Foias’s theory for noncommuting tuples and more).
Theorem 6.2 (Row Isometric Dilation of Row Contractions) Let T ∈ B(H)d
be a row contraction. Then there exists a Hilbert space K containing H and a row
isometry V = (V1 , . . . , Vd ) ∈ B(K)d such that Vi∗ H = Ti∗ for all i.
There is also a very closely related dilation result, that shows that the shift
L = (L1 , . . . , Ld ) on the full Fock space is a universal model for pure row
contractions, which reads similarly to Theorem 6.1, with the free shift L replacing
the commutative shift S. Correspondingly, there is a von Neumann type inequality
p(T ) ≤ p(L) which holds for every row contraction T and every polynomial
p in noncommuting variables [123, 124].
Popescu has a large body of work in which this dilation/model theory is
developed, applied, and generalized. In particular, the theory can be modified to
accommodate tuples satisfying certain polynomial relations [127] (see also [150,
Section 8]) or tuples in certain noncommutative polydomains [128].
The isometric dilation of a row contractions lies at the heart of the free functional
calculus for row contractions (see, e.g., [126]), and is important for understanding
the algebraic structure of noncommutative Hardy algebras (also called analytic
Toeplitz algebras, see [45]), as well as for the study and classification of algebras of
bounded nc analytic functions on the nc unit ball and its subvarieties [134, 135].
Until now, we have only considered operators on Hilbert spaces. But there are
other kinds of interesting spaces, and the concept of dilations has appeared and
been used in various settings. In the setting of Banach spaces, one may hope to
dilate a contraction to an invertible isometry (that is, a surjective isometry); more
generally one may wish to dilate a semigroup of operators to a group representation.
Results along these lines, including a direct analogoue of Sz.-Nagy’s unitary dilation
theorem, were obtained by Stroescu; see [161].
However, Banach spaces form a huge class of spaces, and the dilation theory in
the context of general Banach spaces contains the additional aspect that one might
like to ensure that the dilating space shares some properties with the original space.
For example, if T is a contraction on an Lp -space, one might wish to dilate to an
invertible isometry acting on an Lp -space. Moreover, if T is positive, in the sense
that Tf ≥ 0 (almost everywhere) whenever f ≥ 0 (almost everywhere), then one
580 O. M. Shalit
J T n = QU n J , for all n ∈ N.
Note that even in the case p = 2, this is not exactly Sz.-Nagy’s dilation theorem:
the assumptions are stronger, but so is the conclusion. For a modern approach to
dilations in Banach spaces, generalizations, and also an overview of the history
of the theory and its applications, see [62]. Operator algebras are another class of
spaces in which dilation theory was developed and applied; we will discuss this
setting in Sects. 7 and 8 below.
The operator algebraic outlook on dilation theory began with Arveson’s visionary
papers [10, 11]. Arveson sought to develop a systematic study of nonselfadjoint
operator algebras, which is based on studying the relations between an operator
algebra and the C*-algebras that it generates. From the outset, the approach was
general and powerful enough to cover also certain operator spaces. On the one hand,
this approach opened the door by which operator algebraic techniques entered into
Dilation Theory: A Guided Tour 583
operator theory: these techniques have shed light on classical dilation theory, and
they also created a powerful framework by which new dilation results could be
obtained. On the other hand, the general philosophy of dilation theory found its way
into operator algebras, and has led to remarkable developments.
In this section I will present Stinespring’s dilation theorem, and how Arveson’s
extension theorem and his notion of C*-dilation have made Stinespring’s theorem
into a “dilation machine” that produces and explains dilation results in operator
theory. Then I will briefly discuss how dilation theory is related to the notions of
boundary representations and the C*-envelope, which lie at the heart of the above
mentioned analysis of the relationship between on operator algebra/space and the
C*-algebras that it generates.
I will not attempt to cover all the manifold ways in which dilation theory appears
in the theory of operator algebras, and I’ll just mention a few (of my favorite) recent
examples: [51, 82, 84, 85]. The reader is referred to the survey [48] or the paper [41]
in order to get an idea of the role it plays, in particular in operator algebras related
to dynamical systems and semicrossed products.
acting elementwise as
The map φ is said be completely positive (or CP for short) if φ (n) is positive for
all n. Likewise, φ is said to be completely contractive (or CC for short) if φ (n) is
contractive for all n. A map is UCP if it is a unital CP map, and UCC if it is a unital
CC map.
Completely positive maps were introduced by Stinespring [159], but it was
Arveson who observed how important they are and opened the door to their
becoming an indispensable tool in operator theory and operator algebras [10]. There
are several excellent sources to learn about operator spaces/systems and completely
positive (and bounded) maps; see for example [118] and [122].
Completely positive maps arise also in mathematical physics in a natural way
[89]. The evolution of an open quantum system is described by a semigroup of
completely positive maps [49], and noisy channels in quantum information theory
are modelled as trace preserving completely positive maps [107]. In quantum
probability [111], semigroups of unit preserving completely positive maps play the
role of Markov semigroups.
The simplest examples of completely positive maps are ∗-homomorphisms
between C*-algebras. Next, a map of the form B(K) 4 T → V ∗ T V ∈ B(H), where
V is some fixed operator in B(H, K), is readily seen to be completely positive.
Since compositions of CP maps are evidently CP, we see that whenever A is a
C*-algebra, π : A → B(K) is a ∗-homomorphism, and V ∈ B(H, K), then the
map a → V ∗ π(a)V is a CP map. The following fundamental theorem shows that
essentially all CP maps on C*-algebras are of this form.
Theorem 7.1 (Stinespring’s Theorem [159]) Let A be a unital C*-algebra and
let φ : A → B(H) be a CP map. Then there exists a Hilbert space K, an operator
V ∈ B(H, K), and a ∗-representation π : A → B(K) such that
The tuple (π, K, V ) can be chosen such that K = [π(A)H]—the smallest closed
subspace containing π(a)h for all a ∈ A and h ∈ H—and in this case the triple
(π, K, V ) is unique up to unitary equivalence.
Proof On the algebraic tensor product A ⊗ H, we define a semi-inner product by
setting a ⊗ g, b ⊗ h = g, φ(a ∗ b)hH and extending sesquilinearly (the complete
positivity guarantees that this is a positive semidefinite form). Quotienting out the
kernel and then completing gives rise to the Hilbert space K. The image of all the
elementary tensors b ⊗ h ∈ A ⊗ H in K form a total set, and we continue to
denote these images as b ⊗ h. One needs to check that for every a ∈ A, the map
π(a) : b ⊗ h → ab ⊗ h extends to a well defined, bounded linear operator on K.
Once this is done, it is easy to verify that the map a → π(a) is a ∗-homomorphism.
To recover φ, we define V : H → K by V (h) = 1 ⊗ h, and then all that remains
to do is to compute V ∗ (a ⊗h), g = a ⊗h, V (g) = a ⊗h, 1⊗g = h, φ(a ∗ )g,
Dilation Theory: A Guided Tour 585
V ∗ π(a)V h = V ∗ (a ⊗ h) = φ(a)h,
φ(a) = PH π(a) H .
The utility of completely positive maps comes from the following extension theorem
of Arveson. For a proof, see Arveson’s paper or Paulsen’s book [118, Chapter 7].
Theorem 7.3 (Arveson’s Extension Theorem [10]) Let M be an operator system
in a C*-algebra A, and let φ : M → B(H) be a CP map. Then there exists a CP
map φ̂ : A → B(H) such that φ̂ = φ and which extends φ, i.e., φ̂(a) = φ(a)
for all a ∈ M.
We will now see how the combination of Stinespring’s dilation theorem and
Arveson’s extension theorem serve as kind of all purpose “dilation machine”, that
produces dilation theorems in varied settings.
Let 1 ∈ M ⊆ A be a unital operator space. A linear map φ : M → B(H) is
said to have a C*-dilation to A if there exists a ∗-representation π : A → B(K),
K ⊇ H, such that
Arveson showed that every UCP map is UCC, and that, conversely, every UCC
map as above extends to a UCP map φ : ML := M + M∗ → B(H) given by
∗ ∗
φ (a + b ) = φ(a) + φ(b) . Combining this basic fact with Theorems 7.1 and 7.3
we obtain the following versatile dilation theorem.
Theorem 7.4 Every UCC or UCP map has a C*-dilation.
586 O. M. Shalit
Arveson’s dilation theorem1 (Theorem 5.1) follows from the above theorem,
once one carefully works through the delicate issues of joint spectrum, Shilov
boundary and functional calculus (see [11]). We shall illustrate the use of the dilation
machine by proving Arveson’s dilation theorem for the simple but representative
d
example of the polydisc D .
Theorem 7.5 (Arveson’s Dilation Theorem for the Polydisc) A d-tuple of com-
muting contractions T = (T1 , . . . , Td ) has a unitary dilation if and only if the
d
polydisc D is a complete spectral set for T .
Proof To relate the statement of the theorem to the language of Sect. 5, we note that
d
the Shilov boundary of D is just the torus (∂D)d = Td , and therefore a unitary
dilation is nothing but a normal dilation with joint spectrum contained in Td . Recall
d
that D being a complete spectral set is equivalent to that
d
for every matrix valued polynomial p (since D is convex it suffices to consider
matrix valued polynomials, and there is no need to worry about matrix valued
rational functions).
If U = (U1 , . . . , Ud ) is a tuple of commuting unitaries and p is a matrix valued
polynomial, then, using the spectral theorem, it is not hard to see that p(U ) =
supz∈σ (U ) p(z) ≤ p∞ . Now, if U is a dilation of T then p(T ) ≤ p(U ),
and so the inequality (7.1) holds. That was the easy direction.
d
Conversely, suppose that D is a complete spectral set for a commuting tuple
T ∈ B(H)d , that is, suppose that (7.1) holds for every matrix valued polynomial p.
Let M = C[z1 , . . . , zd ] be the space of polynomials in d variables, considered as a
unital subspace of the C*-algebra C(Td ), equipped with the usual supremum norm.
It is useful to note that
d
by applying the maximum modulus principle in several variables. The fact that D
is a complete spectral set for T implies that the unital map φ : M → B(H) given
by φ(p) = p(T ), is completely contractive. By Theorem 7.4, φ has a C*-dilation
π : C(Td ) → B(K), such that
1 The reader should be aware that Theorem 7.4 is sometimes referred to as Arveson’s dilation
theorem, whereas I used this name already for the more specific Theorem 5.1.
Dilation Theory: A Guided Tour 587
p(T ) = PH p(U ) H
φr (a) = Kr (T )∗ (a ⊗ I )Kr (T ).
588 O. M. Shalit
We compute that
φr (S) = Kr (T )∗ (S ⊗ I )Kr (T )h
= Kr (T )∗ en+1 ⊗ r n DrT T n∗ h
= r 2n+1 T n+1 DrT
2
T n∗ h = rT h.
The ideas in this section are best motivated by the following classical example.
Example 7.7 Consider the disc algebra A(D), which is equal to the closure of the
polynomials with respect to the norm p∞ = supz∈D |p(z)|. The disc algebra
is an operator algebra, being a subalgebra of the C*-algebra C(D) of continuous
functions on the disc D. Moreover, C ∗ (A(D)) = C(D), that is, the C*-subalgebra
generated by A(D) ⊆ C(D) is equal to C(D). However, C(D) is not determined
uniquely by being “the C*-algebra generated by the disc algebra”. In fact, by the
maximum modulus principle, A(D) is also isometrically isomorphic to the closed
subalgebra of C(T) generated by all polynomials, and the C*-subalgebra of C(T)
generated by the polynomials is equal to C(T).
Now, C(T) is the quotient of C(D) by the ideal of all continuous functions
vanishing on the circle T. If π : C(D) → C(T) denotes the quotient map, then
π(f ) = f T , and we note, using the maximum principle again, that π is isometric
on A(D). It turns out that T is the minimal subset E ⊆ D such that the map
f → f E is isometric on A(D).
The above phenomenon arises in all uniform algebras, that is, in all unital
subalgebras A of C(X) that separate the points of X, where X is some compact
Hausdorff space. For every such algebra there exists a set ∂A ⊆ X—called the
Shilov boundary of A—which is the unique minimal closed subset E ⊆ X such
that f → f E is isometric (see [68] for the theory of uniform algebras).
Dilation Theory: A Guided Tour 589
for all f ∈ A. For example, the Lebesgue measure on the circle is a representing
measure for the point 0, because
2π
1
f (0) = f (eit )dt
2π 0
for every f ∈ A(D), as an easy calculation shows. Every point can be represented
by the delta measure δz ; the points on the circle are singled out by being those with
590 O. M. Shalit
a unique representing measure, that is, they can be represented only by the delta
measure. In the general case of a uniform algebra A ⊆ C(X), the points in X that
have a unique representing measure are referred to as the Choquet boundary of
A. It is not hard to show that the Choquet boundary of A(D) is T. In general, the
Choquet boundary of a uniform algebra is dense in the Shilov boundary.
Let return to the noncommutative case, so let A ⊆ B = C ∗ (A) be again a
unital operator algebra generating a C*-algebra. Point evaluations correspond to the
irreducible representations of a commutative C*-algebra, and probability measures
correspond to states, that is, positive maps into C. With this in mind, the reader will
hopefully agree that the following generalization is potentially useful: an irreducible
representation π : B → B(H) is said to be a boundary representation if the only
UCP map : B → B(H) that extends π A is π itself.
Arveson proved in [10] that if an operator algebra A ⊆ B = C ∗ (A) has
sufficiently many boundary representations, in the sense that
for all A ∈ Mn (A), then the Shilov ideal exists, and is equal to the intersection
of all boundary ideals. For some important operator algebras, the existence of
sufficiently many boundary representations was obtained (see also [11]), but
the problem of existence of boundary representations in general remained open
almost 45 years.2 Following a sequence of important developments [15, 53, 101],
Davidson and Kennedy proved that every operator system has sufficiently many
boundary representations [42]. Their proof implies that every unital operator space,
and in particular every unital operator algebra, has sufficiently many boundary
representations as well.
2 The existence of the C*-envelope was obtained much earlier, without making use of boundary
boundary representation if and only if the restriction π S has the unique extension
property.
Now, the unique extension property nicely captures the idea that S is in some
sense rigid in B, but it is hard to verify it in practice. The following notion is more
wieldy. A UCP map ψ : S → B(K) is said to be a dilation of a UCP map φ : S →
B(H) if φ(a) = PH ψ(a) H for all a ∈ S. The dilation ψ is said to be a trivial
dilation if H is reducing for φ(S), that is, ψ = φ ⊕ ρ for some UCP map ρ. A UCP
map φ is said to be maximal if it has only trivial dilations.
Penetrating observations of Muhly and Solel [101], and consequently Dritschel
and McCullough [53], can be reformulated as the following theorem. The beauty
is that the notion of maximality is intrinsic to the operator system S, and does not
take the containing C*-algebra B into account (similar reformulations exist for the
categories of unital operator spaces and operator algebras).
Theorem 7.9 A UCP map φ : S → B(H) has the unique extension property if and
only if it is maximal.
Following Dritschel and McCullough’s proof of the existence of the C*-envelope
[53] and Arveson’s consequent work [15], Davidson and Kennedy proved the
following theorem (as above, similar reformulations exist for the categories of unital
operator spaces and operator algebras).
Theorem 7.10 Every UCP map can be dilated to a maximal UCP map, and every
pure UCP map can be dilated to a pure maximal UCP map.
Davidson and Kennedy proved that pureness guarantees that the ∗-representation,
which is the unique UCP extension of the maximal dilation, is in fact irreducible.
Moreover, they showed that pure UCP maps completely norm an operator space.
Thus, by dilating sufficiently many pure UCP maps, and making use of the above
theorems, they concluded that there exist sufficiently many boundary representa-
tions [42].
Example 7.11 Let us see what are the maximal dilations in the case of the disc
algebra A(D) ⊆ C(D) (we switch back from the category of operator systems to
the category of unital operator algebras). A representation π of C(D) is determined
uniquely by a normal operator N with spectrum in D by the relation N = π(z). A
UCC representation φ : A(D) → B(H) is determined uniquely by the image of the
coordinate function z, which is a contraction T = φ(z) ∈ B(H). Conversely, by the
A(D) functional calculus (see Sect. 2.2), every contraction T ∈ B(H) gives rise to a
UCC homomorphism of A(D) into B(H). In this context, a dilation of a UCC map
φ into B(H) is simply a representation ρ : A(D) → B(K) such that
f (T ) = φ(f ) = PH ρ(f ) H = PH f (V ) H ,
With the above notation, it is not hard to see the following two equivalent
statements: (i) a dilation ρ : A(D) → B(K) is maximal if and only if V is a
unitary, and (ii) a representation π : C(D) → B(K) is such that ρ = π A(D) has
the unique extension property if and only if V is a unitary. The fact that every UCC
representation of A(D) has a maximal dilation, is equivalent to the fact that every
contraction has a unitary dilation.
What are the boundary representations of the disc algebra? The irreducible
representations of C(D) are just point evaluations δz for z ∈ D. The boundary
representations are those point evaluations δz whose restriction to A(D) have a
unique extension to a UCP map C(D) → C. But UCP maps into the scalars are
just states, and states of C(D) are given by probability measures. Hence, boundary
representations are point evaluations δz such that z has a unique representing
measure, so that z ∈ T.
The Shilov ideal can be obtained as the intersection of the kernels of the boundary
representations, and so it is the ideal of functions vanishing on T. The C*-envelope
is the quotient of C(D) by this ideal, thus it is C(T), as we noted before.
Note that to find the boundary representations of the disc algebra we did not
need to invoke the machinery of maximal dilations. In the commutative case, the
existence of sufficiently many boundary representations is no mystery: all of them
are obtained as extensions of evaluation at a boundary point. The machinery of max-
imal dilations allows us to find the boundary representations in the noncommutative
case, where there are no function theoretic tools at our disposal.
holds for all a ∈ A and s ∈ S. It is a fact, not hard to show, that if is a Markov
semigroup then every dilation is strong. Examples show that this is not true for
general CP-semigroups. Sometimes, to lighten the terminology a bit, we just say
that α is a (strong) dilation.
Remark 8.1 It is worth pausing to emphasize that the dilation defined above is
entirely different from Stinespring’s dilation: the Stinespring dilations of s and s
cannot be composed. The reader should also be aware that there are other notions
of dilations, for example in which the “small” algebra B is embedded as a unital
subalgebra of the “large” algebra A (see, e.g., [67] or [167] and the references
therein), or where additional restrictions are imposed (see, e.g. [91], and the papers
that cite it).
594 O. M. Shalit
There are two general approaches by which strong dilations of CP-semigroups can
be constructed.
The Muhly–Solel Approach One approach, due to Muhly and Solel [102], seeks
to represent a CP-semigroup in a form similar to (8.1), and then to import ideas
from classical dilation theory. To give little more detail, if = {s }s∈S is a
CP-semigroup on a von Neumann algebra B ⊆ B(H), then one tries to find a
product system E E = {Es }s∈S of B -correspondences over S (see Sect. 6.5) and
a completely contractive covariant representation T = {Ts }s∈S such that
s (idEs E a)V
αs (a) = V s∗
[150] they were finally formally introduced (at the same time, subproduct systems
of Hilbert spaces were introduced in Bhat and Mukherjee’s paper [27]).
The approach to dilations introduced in [150] consists of two parts: first, embed
the Arveson–Stinespring subproduct system associated with a CP-semigroup into
a product system, and then dilate the representation to an isometric dilation. This
approach was used to find necessary and sufficient conditions for the existence of
dilations. In particular, it was used to prove that a Markov semigroup has a (certain
kind of) minimal dilation if and only if the Arveson–Stinespring subproduct system
can be embedded into a product system. Moreover, the framework was used to show
that there exist CP-semigroups over N3 that have no minimal strong dilations, as
was suggested from experience with classical dilation theory. Vernik later used these
methods to prove an analogue of Opela’s theorem (see 3.3) for completely positive
maps commuting according to a graph [166].
The reader is referred to [150] for more details. The main drawback of that
approach is that it works only for CP-semigroups of normal maps on von Neumann
algebras.
The Bhat–Skeide Approach The second main approach to dilations of CP-
semigroups is due to Bhat and Skeide [28]. It has several advantages, one of
which is that it works for semigroups on unital C*-algebras (rather than von
Neumann algebras). The Bhat–Skeide approach is based on a fundamental and
useful representation theorem for CP maps called Paschke’s GNS representation
[112], which I will now describe.
For every CP map φ : A → B between two unital C*-algebras, there exists a
C*-correspondence E from A to B (see Sect. 6.5) and a vector ξ ∈ E such that
φ(a) = ξ, aξ for all a ∈ A. The existence of such a representation follows from
a construction: one defines E to be the completion of the algebraic tensor product
A ⊗ B with respect to the B-valued inner product
a ⊗ b, a ⊗ b = b∗ φ(a ∗ a )b ,
equipped with the natural left and right actions. Letting ξ = 1A ⊗1B , it is immediate
that φ(a) = ξ, aξ for all a ∈ A. Moreover, ξ is cyclic, in the sense that it
generates E as a C*-correspondence. The pair (E, ξ ) is referred to as the GNS
representation of φ. The GNS representation is unique in the sense that whenever
F is a C*-correspondence from A to B and η ∈ F is a cyclic vector such that
φ(a) = η, aη for all a ∈ A, then there is an isomorphism of C*-correspondences
from E onto F that maps ξ to η.
In the Bhat–Skeide approach to dilations, the idea is to find a product system
F E = {Fs }s∈S of B-correspondences and a unit, i.e., a family ξ E = {ξs ∈ Fs }s∈S
satisfying ξs+t = ξs E ξt , such that is recovered as
for all s ∈ S and all b ∈ B. If is a Markov semigroup, the dilation is obtained via
a direct limit construction. For non Markov semigroups, a dilation can be obtained
via a unitalization procedure. In [28], dilations were constructed this way in the
continuous and discrete one-parameter cases. This strategy bypasses product system
representations, but, interestingly, it can also be used to prove the existence of an
isometric dilation for any completely contractive covariant representation of a one-
parameter product system [155].
Again, it turns out that constructing such a product system is not always possible.
However, if one lets (Fs , ξs ) be the GNS representation of the CP map s , then it
is not hard to see that F = {Fs }s∈S is a subproduct system (called the GNS
subproduct system) and ξ E = {ξs } is a unit.
The above observation was used by Shalit and Skeide to study the existence
of dilations of CP-semigroups in a very general setting [149]. If one can embed
the GNS subproduct system into a product system, then one has (8.3), and can
invoke the Bhat–Skeide approach to obtain a dilation. The paper [149] develops
this framework to give a unified treatment of dilation theory of CP-semigroups over
a large class of monoids S, including noncommutative ones. One of the main results
in [149], is the following, which generalizes a result obtained earlier in [150].
Theorem 8.4 A Markov semigroup over an Ore monoid admits a full strict dilation
if and only if its GNS subproduct system embeds into a product system.
This theorem essentially enables to recover almost all the other known dilation
theorems and counter examples. It is used in [149] to show that every Markov
semigroup over S = N2 on a von Neumann algebra has a unital dilation, and also
that for certain multi-parameter semigroups (the so called quantized convolution
semigroups) there is always a dilation. The theorem is also used in the converse
direction, to construct a large class of examples that have no dilation whatsoever.
In the setting of normal semigroups on von Neumann algebras, the Bhat–Skeide
and Muhly–Solel approaches to dilations are connected to each other by a functor
called commutant; see [149, Appendix A(iv)] for details.
New Phenomena We noted above some similarities between the theory of isomet-
ric dilations of contractions, and the dilation theory of CP-semigroups. In particular,
in both theories, there always exists a dilation when the semigroup is parameterized
by S = N, N2 , R+ , and the results support the possibility that this is true for S = R2+
as well. Moreover, in both settings, there exist semigroups over S = N3 for which
there is no dilation.
But there are also some surprises. By Corollary 3.8, if a tuple of commuting
contractions T = (T1 , . . . , Td ) satisfies di=1 Ti 2 ≤ 1, then T has a regular
unitary dilation. Therefore, one might think that if a commuting d-tuple of CP maps
1 , . . . , d are such that di=1 i is sufficiently small, then this tuple has a
dilation. This is false, at least in a certain sense (that is, if one requires a strong and
minimal dilation); see [150, Section 5.3].
Moreover, by Corollary 3.8, a tuple of commuting isometries always has a unitary
dilation, and it follows that every tuple of commuting coisometries has an isometric
598 O. M. Shalit
Besides the interesting interpretation of dilations given at the end of Sect. 8.1,
dilations have some deep applications in noncommutative dynamics. In this section,
we will use the term CP-semigroup to mean a one-parameter semigroup =
{s }s∈S (where S = N or S = R+ ) of normal CP maps acting on a von Neumann
algebra B. In case of continuous time (i.e., S = R+ ), we will also assume that
is point weak-∗ continuous, in the sense that t → ρ(t (b)) is continuous for every
ρ ∈ B∗ . We will use the same convention for E- or E0 -semigroups.
The Noncommutative Poisson Boundary Let be a normal UCP map on a von
Neumann algebra B. Then one can show that the fixed point set {b ∈ B : (b) = b}
is an operator system, and moreover that it is the image of a completely positive
projection E : B → B. Hence, the Choi-Effros product x ◦ y = E(xy) turns the
fixed point set into a von Neumann algebra H ∞ (B, ), called the noncommutative
Poisson boundary of . The projection E and the concrete structure on H ∞ (B, )
are hard to get a grip with.
Arveson observed that if (A, α, p) is the minimal dilation of , and if Aα is the
fixed point algebra of α, then the compression a → pap is a unital, completely
positive order isomorphism between Aα and H ∞ (B, ). Hence Aα is a concrete
realization of the noncommutative Poisson boundary. See the survey [81] for details
(it has been observed that this result holds true also for dilations of abelian CP-
semigroups [129]).
Continuity of CP-Semigroups Recall that one-parameter CP-semigroups on a von
Neumann algebra B ⊆ B(H) are assumed to be point weak-∗ continuous. Since
CP-semigroups are bounded, this condition is equivalent to point weak-operator
continuity, i.e., that t → t (b)g, h is continuous for all b ∈ B and all g, h ∈ H.
Another natural kind of continuity to consider is point strong-operator continuity,
which means that t → t (b)h is continuous in norm for all b ∈ B and h ∈ H.
For brevity, below we shall say a semigroup is weakly continuous if it is point
weak-operator continuous, and strongly continuous if it is point strong-operator
continuous.
Dilation Theory: A Guided Tour 599
Strong continuity is in some ways easier to work with and hence it is desirable,
but it is natural to use the weak-∗ topology, because it is independent of the
representation of the von Neumann algebra. Happily, it turns out that weak (and
hence point weak-∗) continuity implies strong continuity.
One possible approach to prove the above statement is via dilation theory. First,
one notices that the implication is easy for E-semigroups. Indeed, if α is an E-
semigroup on A ⊆ B(K), then
αt (a)k − αs (a)k2 = αt (a ∗ a)k, k + αs (a ∗ a)k, k − 2 Reαt (a)k, αs (a)k.
Assuming that s tends to t, the expression on the right hand side tends to zero if
α is weakly continuous. Now, if is a weakly continuous CP-semigroup, then its
dilation α given by Theorem 8.2 is also weakly continuous. By the above argument,
α is strongly continuous, and this continuity is obviously inherited by t (·) =
pαt (·)p. Hence, by dilation theory, weak continuity implies strong continuity.
The above argument is a half cheat, because, for a long time, the known proofs
that started from assuming weak continuity and ended with a weakly continuous
dilation, actually assumed implicitly, somewhere along the way, that CP-semigroups
are strongly continuous [14, 28, 102]. This gap was pointed out and fixed by
Markiewicz and Shalit [98], who proved directly that a weakly continuous CP-
semigroup is strongly continuous. Later, Skeide proved that the minimal dilation of a
weakly continuous semigroup of CP maps is strongly continuous, independently of
[98], thereby recovering the result “weakly continuous ⇒ strongly continuous” with
a proof that truly goes through the construction of a dilation; see [156, Appendix
A.2].
Existence of E0 -Semigroups As we have seen above, dilations can be used to
study CP-semigroups. We will now see an example, where dilations are used in
the theory of E0 -semigroups.
The fundamental classification theory E0 -semigroups on B = B(H) was
developed Arveson, Powers, and others about two decades ago; see the monograph
[14] for the theory, and in particular for the results stated below (the classification
theory of E0 -semigroups on arbitrary C* or von Neumann algebras is due to Skeide;
see [156]). For such E0 -semigroups there exists a crude grouping into type I, type
II, and type III semigroups. However, it is not at all obvious that there exist any E0 -
semigroups of every type. Given a semigroup of isometries on a Hilbert space H ,
one may use second quantization to construct E0 -semigroups on the symmetric and
anti-symmetric Fock spaces over H , called the CCR and CAR flows, respectively.
CAR and CCR flows are classified in term of their index. These E0 -semigroups are
of type I, and, conversely, every type I E0 -semigroup is cocycle conjugate to a CCR
flow, which is, in turn, conjugate to a CAR flow.
It is much more difficult to construct an E0 -semigroup that is not type I. How does
one construct a non-trivial E0 -semigroup? Theorem 8.2 provides a possible way:
construct a Markov semigroup, and then take its minimal dilation. This procedure
600 O. M. Shalit
In recent years, dilation theory has found a new role in operator theory, through
the framework of matrix convexity. In this section I will quickly introduce matrix
convex sets in general, special examples, minimal and maximal matrix convex over
a given convex set, and the connection to dilation theory. Then I will survey the
connection to the UCP interpolation problem, some dilation results, and finally an
application to spectrahedral inclusion problems.
A := i Ai A∗i ; this induces a metric on B(H)d for every d, and in particular
on Mnd for every n. We will say that a nc set S is closed if Sn is closed in Mnd for all
n. We will say that S is bounded if there exists some C > 0 such that X ≤ C for
all X ∈ S.
For a tuple X = (X1 , . . . , Xn ) ∈ Mnd and a linear map φ : Mn → Mk , we write
φ(X) = (φ(X1 ), . . . , φ(Xd )) ∈ Mkd . In particular, if A and B are n × k matrices,
then we write A∗ XB = (A∗ X1 B, . . . , A∗ Xd B). Another operation that we can
preform on tuples is the direct sum, that is, if X ∈ Mm d and Y ∈ M d , then we let
n
X ⊕ Y = (X1 ⊕ Y1 , . . . , Xd ⊕ Yd ) ∈ Mm+n .d
Dilation Theory: A Guided Tour 601
X ∈ Sm , Y ∈ Sn 7⇒ X ⊕ Y ∈ Sm+n
and
The matrix range is a closed and bounded matrix convex set. Conversely, every
closed and bounded matrix convex set is the matrix range of some operator tuple.
If d = 1, then the first level W1 (A) of the matrix range of an operator A coincides
with the closure of the numerical range W (A)
We note, however, that for d ≥ 2, the first level W1 (A) does not, in general, coincide
with the closure of what is sometimes referred to as the joint numerical range of a
tuple [96].
Matrix ranges of single operators were introduced by Arveson [11], and have
been picked up again rather recently. The matrix range of an operator tuple A
is a complete invariant of the operator system generated by A, and—as we shall
602 O. M. Shalit
see below—it is useful when considering interpolation problems for UCP maps.
Moreover, in the case of a fully compressed tuple A of compact operators or normal
operators, the matrix range determines A up to unitary equivalence [115]. The
importance of matrix ranges has led to the investigation of random matrix ranges,
see [70].
Example 9.3 Let A ∈ B(H)d . The free spectrahedron determined by A is the nc
set DA = ∞n=1 DA (n) given by
⎧ ⎫
⎨
d ⎬
DA (n) = X ∈ Mnd : Re Xj ⊗ Aj ≤ I .
⎩ ⎭
j =1
A free spectrahedron is always a closed matrix convex set, that contains the origin
in its interior. Conversely, every closed matrix convex set with 0 in its interior is a
free spectrahedron. In some contexts it is more natural to work with just selfadjoint
matrices. For A ∈ B(H)dsa one defines
⎧ ⎫
⎨
d ⎬
DA
sa
= X ∈ Mdsa : Xj ⊗ Aj ≤ I .
⎩ ⎭
j =1
The first level DA (1) is called a spectrahedron. Most authors use the word
spectrahedron to describe only sets of the form DA (1) where A is a tuple of
matrices; and likewise for the term free spectrahedron. This distinction is important
for applications of the theory, since spectrahedra determined by tuples of matrices
form a class of reasonably tractable convex sets that arise in applications, and not
every convex set with 0 in its interior can be represented as DA (1) for a tuple A
acting on a finite dimensional space.
For a matrix convex set S ⊆ Md we define its polar dual to be
S ◦ = X ∈ Md : Re Xj ⊗ Aj ≤ I for all A ∈ S .
Example 9.4 Another natural and important way in which matrix convex sets arise,
is as positivity cones in operator systems. In [65] it was observed that a finite
dimensional abstract operator system M (see [118, Chapter 13]) generated by d
Dilation Theory: A Guided Tour 603
if and only if
This result was first obtained by Li and Poon [97], in the special case where A
and B each consist of commuting selfadjoint matrices. It was later recovered in [47],
in the above generality, as a consequence of Theorem 9.6 together with the fact that
for a normal tuple N, the matrix range W(N) is the minimal matrix convex set that
contains the joint spectrum σ (N) in its first level (see [47, Corollary 4.4]). The next
section is dedicated to explaining what are the minimal and maximal matrix convex
sets over a convex set, and how these notions are related to dilation theory.
by the linear inequalities that determine K. It is clear that W max (K) is matrix
convex, and a moment’s thought reveals that it contains every matrix convex set
that has K as its first level.
Dilation Theory: A Guided Tour 605
That settles the question, of whether or not there exists a matrix convex set with
first level equal to K. It follows, that there has to exist a minimal matrix convex set
that has K as its first level—simply intersect over all such matrix convex sets. There
is a useful description of this minimal matrix convex set. We define
W min (K) = X ∈ Mdsa : ∃ normal T with σ (T ) ⊆ K s.t. X ≺ T . (9.1)
because by Theorem 2.1, every contraction has a unitary dilation. Since the set of
real linear inequalities determining the disc is
D = {z ∈ C : Re eiθ z ≤ 1 for all θ ∈ R},
it follows that
W max (D) = {X : Re eiθ X ≤ I for all θ ∈ R},
which equals the set of all matrices with numerical range contained in the disc.
606 O. M. Shalit
Given a convex set K ⊆ Cd , Passer, Shalit and Solel introduced a constant θ (K)
that quantifies the difference between the minimal and maximal matrix convex sets
over K [116, Section 3]. For two convex sets K, L, we define
and θ (K) = θ (K, K). Note that CW min (L) = W min (CL).
Remark 9.8 In the theory of operator spaces, there are the notions of minimal and
maximal operator spaces over a normed space V , and there is a constant α(V )
that quantifies the difference between the minimal and maximal operator space
structures [117] (see also [118, Chapter 14] and [122, Chapter 3]). These notions
are analogous to the above notions of minimal and maximal matrix convex sets, but
one should not confuse them.
By the characterization of the minimal and maximal matrix convex sets, the
inclusion W max (K) ⊆ W min (L) is a very general kind of dilation result: it means
that every d-tuple X satisfying the linear inequalities defining K, has a normal
dilation X ≺ N such that σ (N) ⊆ L. Let us now review a few results obtained
regarding this dilation problem.
Theorem 9.9 (Theorem 6.9, [116]) For p ∈ [1, ∞], let Bp,d denote the unit ball
in Rd with respect to the p norm, and let Bp,d (C) denote the unit ball in Cd with
respect to the p norm. Then
θ (Bp,d ) = d 1−|1/2−1/p|
and
The above result was originally proved by Fritz, Netzer and Thom [65] for cones
with a symmetric base; to pass between the language of convex bodies and that of
cones, one may use the gadget developed in [116, Section 7]. In [116, Theorem 4.5]
Dilation Theory: A Guided Tour 607
it was observed that Theorem 9.10 is also a consequence of the methods of [47,
Section 7] together with some classical results in convex geometry.
Already in [57, Lemma 3.1] it was observed that there is only one matrix convex
S with S1 = [a, b] ⊂ R, namely the matrix interval given by Sn = {X ∈ (Mn )sa :
aIn ≤ X ≤ bIn }. Said differently, W max ([a, b]) = W min ([a, b]). It is natural to ask
whether there exists any other convex body (i.e., a compact convex set) K with the
property that W min (K) = W max (K).
Theorem 9.11 Let K ⊆ Rd be a convex body. Then W max (K) = W min (K) if and
only if K is a simplex, that is, if K is the convex hull of a set of affinely independent
points. In fact, W2max (K) = W2min (K) already implies that K is a simplex.
The result that the equality W max (K) = W min (K) is equivalent to K being
a simplex was first obtained by Fritz, Netzer and Thom [65, Corollary 5.3] for
polyhedral cones. In [116, Theorem 4.1] it was proved for general convex bodies,
and it was also shown that one does not need to check equality Wnmax (K) =
Wnmin (K) for all n in order to deduce that K is a simplex—it suffices to check
this for some n ≥ 2d−1 . For simplex pointed convex bodies, it was shown that
W2max (K) = W2min (K) already implies that K is a simplex [116, Theorem 8.8].
Huber and Netzer later obtained this for all polyhedral cones [80], and finally
Aubrun, Lami, Palazuelos and Plavala proved the result for all cones [16, Corollary
2].
Remark 9.12 The minimal matrix convex set over a “commutative” convex set
K ⊆ Rd can be considered as the matrix convex hull of K. There are some variations
on this theme. Helton, Klep and McCullough studied the matrix convex hull of free
semialgebraic sets [75]. Instead of W min (K) and W max (K), which are the minimal
and maximal matrix convex sets with prescribed first level, one can also discuss the
minimal and maximal matrix convex sets with a prescribed kth level (see [90], or
[169, 170] for the version of this notion in the framework of operator systems).
In the recent paper [114], Passer and Paulsen define, given a matrix convex set
S, the minimal and maximal matrix convex sets W min-k (S) and W max-k (S) such
that Wkmin-k (S) = Wkmax-k (S) = Sk , and they utilize quantitative measures of
discrepancy between W min-k (S), W max-k (S) and S to glean information on the
operator system corresponding to S; unfortunately, these results are beyond the
scope of this survey. The paper [114] also ties together some of the earlier work
in this direction, so it is a good place to start if one is interested in this problem.
There are many other interesting dilation results in [46, 50, 65, 76, 113, 116]. In this
section I will review a few more.
608 O. M. Shalit
Problem 9.13 Fix d ∈ N. What is the smallest constant Cd such that for every
d-tuple of contractions A, there exists a d-tuple of commuting normal operators B,
such that A ≺ B holds with Bi ≤ Cd for all i?
√
First, we note that the sharp dilation constant θ (B∞,d ) = d obtained in
Theorem 9.10 implies the following result, which is a solution to Problem 9.13
in the selfadjoint setting.
Theorem 9.14 (Theorem 6.7, [116]) For every d-tuple A = (A1 , . . . , Ad ) of
selfadjoint contractions, there √ exists a d-tuple of commuting selfadjoints N =
(N
√ 1 , . . . , Nd ) with Ni ≤ d for i = 1, . . . , d, such that A ≺ N. Moreover,
d is the optimal constant for selfadjoints.
It is interesting to note that one of the proofs of the above theorem goes through
a concrete construction of the dilation. The nonselfadjoint version of Problem 9.13
is more difficult, and it does not correspond to an inclusion problem of some W max
in some W min . The best general result in the nonselfadjoint case is the following
theorem obtained by Passer.
Theorem 9.15 (Theorem 4.4, [113]) For every d-tuple A = (A1 , . . . , Ad ) of con-
√ exists a d-tuple of commuting normal operators N = (N1 , . . . , Nd )
tractions, there
with Ni ≤ 2d for i = 1, . . . , d, such that A ≺ N.
Thus
√ √
d ≤ Cd ≤ 2d.
In the next section we will improve the lower bound in the case d = 2.
Helton, Klep, McCullough and Schweighofer obtained a remarkable result,
which is analogous to Theorem 9.14, but in which the dilation constant is indepen-
dent of the number of operators d [76]. Following Ben-Tal and Nemirovski [21],
Helton et al. defined a constant ϑ(n) as follows:
1
n
n
= min ai xi2 dμ(x) : |ai | = 1
ϑ(n) ∂Bn i=1 i=1
1
A = V ∗ NV .
ϑ(n)
Dilation Theory: A Guided Tour 609
Moreover, ϑ(n) is the smallest constant such that the above holds for all finite sets
of contractive selfadjoints F ⊆ B(H)sa .
Note the difference from Theorem 9.14: the dimension of matrices is fixed at
n × n, but the number of matrices being simultaneously dilated is not fixed. In other
words, the constant ϑ(n) depends only on the size of the matrices being dilated (in
fact, it is shown that n can be replaced with the maximal rank of the matrices being
dilated). It is also shown that
√
πn
ϑ(n) ∼ .
2
In the next subsection I will explain the motivation for obtaining this result.
Any dilation result, such as Theorem 9.14 or 9.16, leads to a von Neumann type
inequality. For example, if A is a d-tuple of selfadjoint contractions, then by
Theorem 9.14, for every matrix valued polynomial p of degree at most one, we
have the following inequality:
√ √
p(A) ≤ sup p(z) : z ∈ [− d, d]d .
This result is by no means trivial, but it is the kind of application of dilation theory
that we have already seen above several times.
We will now see a deep application of Helton, Klep, McCullough and
Schweighofer’s theorem (Theorem 9.16) that is of a different nature from the
applications that we have seen hitherto, and is the main motivation for the
extraordinary paper [76]. The application builds on earlier work of Ben-Tal and
Nemirovski [21] in control theory and optimization, related to what is sometimes
called the matrix cube problem. I will give a brief account; the reader who seeks a
deeper understanding should start with the introductions of [21] and [76].
In the analysis of a linear controlled dynamical system (as in [21]), one is led to
the problem of deciding whether the cube [−1, 1]d is contained in the spectrahedron
DAsa (1), for a given a d-tuple of selfadjoint n × n matrices A , . . . , A ; this is
1 d
called the matrix cube problem. More generally, given another d-tuple of selfadjoint
matrices B1 , . . . , Bd , it is of practical interest to solve the spectrahedral inclusion
problem, that is, to be able to decide whether
DBsa (1) ⊆ DA
sa
(1).
610 O. M. Shalit
Note that the matrix cube problem is a special case of the spectrahedral inclusion
problem, since [−1, 1]d = DC sa (1) for the d-tuple of 2d × 2d diagonal matri-
to be NP hard (note that the naive solution of checking whether all the vertices of the
cube are in DA sa (1) requires one to test the positive semidefiniteness of 2d matrices).
[74], it was observed that the free relaxation, that is, the problem DBsa ⊆ DA sa
equivalent to the UCP interpolation problem, that is, to the existence of a UCP map
sending Bi to Ai for all i = 1, . . . , d [74, Theorem 3.5]. Now, the UCP interpolation
problem can be shown to be equivalent to the solution of a certain semidefinite
program [74, Section 4]. In practice, there are numerical software packages that
can solve such problems efficiently.
So we see that instead of solving the matrix cube problem [−1, 1]d ⊆ DA sa (1),
one can solve the free relaxation DC ⊆ DA . Now, the whole point of the sharp
sa sa
results in [76] is that they give a tight estimate of how well the tractable free
relaxation approximates the hard matrix cube problem. To explain this, we need
the following lemma.
Lemma 9.17 Suppose that A is a d-tuple of selfadjoint n × n matrices. Then,
[−1, 1]d ⊆ DA
sa
(1) ⇒ DC
sa
⊆ ϑ(n)DA
sa
.
Proof Suppose that [−1, 1]d ⊆ DA sa (1). If X ∈ D sa (n), then by Theorem 9.16,
C
X ≺ ϑ(n)N, where N is a normal tuples and σ (N) ⊆ [−1, 1]d ⊆ DA sa (1). So
Xj ⊗ Aj ≺ ϑ(n) Nj ⊗ Aj ≤ ϑ(n)I,
where the last inequality follows easily by the spectral theorem and the assumption
[−1, 1]d ⊆ DA sa (1).
Finally, we can now understand how to give an approximate solution to the
matrix cube problem. Simply, one tests whether DC sa ⊆ ϑ(n)D sa , which is a
A
tractable problem. If the inclusion holds, then it holds at every level and in particular
[−1, 1]d ⊆ ϑ(n)DA sa (1). If not, then, using the lemma, we conclude that [−1, 1]d
This section is dedicated to presenting the results Gerhold and Shalit from [69], on
dilations of q-commuting unitaries.
Let θ ∈ R and write q = eiθ . If u and v are two unitaries that satisfy
vu = quv, then we say that u and v are q-commuting. We denote by Aθ the
universal C*-algebra generated by a pair of q-commuting unitaries, and we call Aθ
θ
a rational/irrational rotation C*-algebra if 2π is rational/irrational respectively.
We shall write uθ , vθ for the generators of Aθ . The rotation C*-algebras have been
of widespread interest ever since they were introduced by Rieffel [132]. A good
reference for this subject is Boca’s book [31].
In an attempt to make some progress in our understanding of the general constant
Cd from Problem 9.13, Malte Gerhold and I studied a certain refinement of that
problem which is of independent interest. Instead of dilating arbitrary tuples of
contractions, we considered the task of dilating pairs of unitaries u, v that satisfy
the q-commutation relation vu = quv, and studied the dependence of the dilation
constant on the parameter q. In the context of Problem 9.13, it is worth noting that,
by a result of Buske and Peters [34] (see also [86]), every pair of q-commuting
contractions has a q-commuting unitary power dilation; therefore, this work has
implications to all pairs of q-commuting operators. Surprisingly, our dilation results
also have implications for the continuity of the norm and the spectrum of the almost
Mathieu operator from mathematical physics (this application will be discussed in
the final section).
For every θ ∈ R we define the optimal dilation constant
We note that the infimum is actually a minimum, and that it is equal to the infimum
of the constants c that satisfy: for every q-commuting pair of unitaries U, V there
exists a commuting normal dilation M, N such that M, N ≤ c (see [69,
Proposition 2.3]). Thus, cθ is a lower bound for the constant C2 from Problem 9.13.
Theorem 10.1 (Theorem 3.2, [69]) Let θ, θ ∈ R, set q = eiθ , q = eiθ , and put
c = e 4 |θ−θ | . Then for any pair of q-commuting unitaries U, V there exists a pair
1
Proof The proof makes use of the Weyl operators on symmetric Fock space (see
[111, Section 20]). For a Hilbert space H let H ⊗s k be the k-fold symmetric tensor
product of H , and let
∞
K
(H ) := H ⊗s k
k=0
∞
1
e(x) := √ x ⊗k , x ∈ H,
k=0
k!
form a linearly independent and total subset of (H ). For z ∈ H we define the Weyl
unitary W (z) ∈ B((H )) which is determined by
z2
W (z)e(x) = e(z + x) exp − − x, z
2
so
; ; ; ; ⊥ 2
; ; ; ; − p 2y |θ −θ |
;P W (z) (H ) ;
= ;P W (y) (H ) ; = e = e− 4 .
Now if we put
|θ −θ | |θ −θ |
U =e 4 P W (z) (H )
, V =e 4 P W (y) (H )
,
and
U = W (z) , V = W (y)
then we get the statement for this particular q-commuting pair U, V . Since the Weyl
unitaries give rise to a universal representation of Aθ , the general result follows (see
[69, Proposition 2.3]).
From the above result we obtained continuity of the dilation scale.
Corollary 10.2 (Corollary 3.4, [69]) The optimal dilation scale cθ depends Lips-
chitz continuously on θ . More precisely, for all θ, θ ∈ R we have
|cθ − cθ | ≤ 0.39 θ − θ .
4
cθ,θ = .
uγ + u∗γ + vγ + vγ∗
4
cθ = .
uθ + u∗θ + vθ + vθ∗
Proof Since it is a nice construction that we have not yet seen, let us show just that
4
the value of cθ,θ is no bigger than uγ +u∗ +v ∗ ; for the optimality of the dilation
γ +vγ
γ
constant we refer the reader to [69] (the formula for cθ = cθ,0 follows, since it is
not hard to see that cθ = c−θ ).
614 O. M. Shalit
π(uγ ) π(vγ )
U =U⊗ , V =V ⊗ .
ϕ(uγ ) ϕ(vγ )
Wh = h ⊗ x , h ∈ H.
Then
1
W ∗U W = π(uγ )x, xU = U
ϕ(uγ )
and
1
W ∗V W = π(vγ )x, xV = V ,
ϕ(vγ )
The almost Mathieu operator hθ = uθ + u∗θ + vθ + vθ∗ , which appears in the formula
cθ = h4θ , arises as the Hamiltonian in a certain mathematical model describing
an electron in a lattice under the influence of a magnetic field; see Hofstadter
[77]. This operator has been keeping mathematicians and physicists busy for more
than a generation. Hofstadter’s paper included a picture that depicts the spectrum
(computed numerically) of hθ for various values of θ , famously known as the
Hofstadter butterfly (please go ahead and google it). From observing the Hofstadter
butterfly, one is led to making several conjectures.
First and foremost, it appears that the spectrum of hθ varies continuously with θ ;
since θ is a physical parameter of the system studied, and the spectrum is supposed
to describe possible energy levels, any other possibility is unreasonable. There are
other natural conjectures to make, suggested just by looking at the picture. The most
famous one is perhaps what Barry Simon dubbed as the Ten Martini Problem, which
asks whether the spectrum is a Cantor set for irrational angles. This problem was
settled (in greater generality) by Avila and Jitomirskaya (see [17] for the conclusive
work as well as for references to earlier work).
The continuity of the spectrum σ (hθ ) is a delicate problem that attracted a lot of
attention. For example, in [37] Choi, Elliott, and Yui showed that the spectrum σ (hθ )
of hθ depends Hölder continuously (in the Hausdorff metric) on θ , with Hölder
exponent 1/3. This was soon improved by Avron, Mouche, and Simon to Hölder
continuity with exponent 1/2 [18]. The 1/2-Hölder continuity of the spectrum also
follows from a result of Haagerup and Rørdam, who showed that there exist 1/2-
Hölder norm continuous paths θ → uθ ∈ B(H), θ → vθ ∈ B(H) [71, Corollary
5.5].
As an application of our dilation techniques, we are able to recover the best
possible continuity result regarding the spectrum of the operator hθ . This result
is not new, but our proof is new and simple, and I believe that it is a beautiful and
exciting application of dilation theory with which to close this survey. The following
theorem also implies that the rotation C*-algebras form a continuous field of C*-
algebras, a result due to Elliott [59]. Our dilation methods can also be used to recover
the result of Bellisard [20], that the norm of hθ is a Lipschitz continuous function
of θ .
Theorem 10.4 Let p be a selfadjoint ∗-polynomial in two noncommuting variables.
Then the spectrum σ (p(uθ , vθ )) of p(uθ , vθ ) is 12 -Hölder continuous in θ with
respect to the Hausdorff distance for compact subsets of R.
Proof Let us present the idea of the proof for the most important case
without going into the details of Hölder continuity. The idea is that, due to
Theorem 10.1, when θ ≈ θ we have the dilation (uθ , vθ ) ≺ (cuθ , cvθ ) with
1
c = e 4 |θ−θ | ≈ 1. Thus,
uθ x
cuθ =
y z
A similar estimate holds for the off diagonal block of cvθ which dilates vθ . By a
basic lemma in operator theory, for any selfadjoint operators a and b, the Hausdorff
distance between their spectra is bounded as follows:
because the off diagonal blocks have small norm, and therefore
hθ 0
σ (hθ ) ⊆ σ ≈ σ (chθ ) ≈ σ (hθ ).
0 ∗
In the same way one shows that σ (hθ ) is approximately contained in σ (hθ ), and
therefore the Hausdorff distance between the spectra is small.
It is interesting to note that the above proof generalizes very easily to higher
dimensional noncommutative tori. Determining the precise dilation scales for higher
dimensional noncommutative tori remains an open problem.
Acknowledgments This survey paper grew out of the talk that I gave at the International
Workshop on Operator Theory and its Applications (IWOTA) that took place in the Instituto
Superior Técnico, Lisbon, Portugal, in July 2019. I am grateful to the organizers of IWOTA 2019
for inviting me to speak in this incredibly successful workshop, and especially to Amélia Bastos,
for inviting me to contribute to these proceedings. I used a preliminary version of this survey as
lecture notes for a mini-course that I gave in the workshop Noncommutative Geometry and its
Applications, which took place in January 2020, in NISER, Bhubaneswar, India. I am grateful to
the organizers Bata Krishna Das, Sutanu Roy and Jaydeb Sarkar, for the wonderful hospitality and
the opportunity to speak and organize my thoughts on dilation theory. I also owe thanks to Michael
Skeide and to Fanciszek Szafraniec, for helpful feedback on preliminary versions. Finally, I wish
to thank an anonymous referee for several useful comments and corrections.
This project was partially supported by ISF Grant no. 195/16.
Dilation Theory: A Guided Tour 617
References
27. B.V.R. Bhat, M. Mukherjee, Inclusion systems and amalgamated products of product systems.
Infin. Dimens. Anal. Quant. Probab. Relat. Top. 13, 1–26 (2010)
28. B.V.R. Bhat, M. Skeide, Tensor product systems of Hilbert modules and dilations of
completely positive semigroups. Infin. Dimens. Anal. Quant. Probab. Relat. Top. 3, 519–575
(2000)
29. T. Bhattacharyya, S. Pal, S. Shyam Roy, Dilations of - contractions by solving operator
equations. Adv. Math. 230, 577–606 (2012)
30. A. Bluhm, I. Nechita, Joint measurability of quantum effects and the matrix diamond. J. Math.
Phys. 59, 112202 (2018)
31. F.P. Boca, Rotation C*-Algebras and Almost Mathieu Operators (The Theta Foundation,
Bucharest, 2001)
32. F.P. Boca, A. Zaharescu, Norm estimates of almost Mathieu operators. J. Funct. Anal. 220,
76–96 (2005)
33. J.W. Bunce, Models for n-tuples of noncommuting operators. J. Funct. Anal. 57, 21–30 (1984)
34. D.R. Buske, J.R. Peters, Semicrossed products of the disk algebra: contractive representations
and maximal ideals. Pac. J. Math. 185, 97–113 (1998)
35. M.D. Choi, K.R. Davidson, A 3 × 3 dilation counterexample. Bull. Lond. Math. Soc. 45,
511–519 (2013)
36. M.D. Choi, C.K. Li, Constrained unitary dilations and numerical ranges. J. Operator Theory
46, 435–447 (2001)
37. M.-D. Choi, G.A. Elliott, N. Yui, Gauss polynomials and the rotation algebra. Invent. Math.
99, 225–246 (1990)
38. D. Cohen, Dilations of matrices. Thesis (M.Sc.), Ben-Gurion University (2015).
arXiv:1503.07334
39. M.J. Crabb, A.M. Davie, Von Neumann’s inequality for hilbert space operators. Bull. Lond.
Math. Soc. 7, 49–50 (1975)
40. M. Crouzeix, Numerical range and functional calculus in Hilbert space. J. Funct. Anal. 244,
668–990 (2007)
41. K.R. Davidson, E.G. Katsoulis, Dilation theory, commutant lifting and semicrossed products.
Doc. Math. 16, 781–868 (2011)
42. K.R. Davidson, M. Kennedy, The Choquet boundary of an operator system. Duke Math. J.
164, 2989–3004 (2015)
43. K.R. Davidson, M. Kennedy, Noncommutative Choquet theory. arXiv:1905.08436
44. K.R. Davidson, D.R. Pitts, Nevanlinna-Pick interpolation for non-commutative analytic
Toeplitz algebras. Integr. Equ. Operator Theory 31, 321–337 (1998)
45. K.R. Davidson, D.R. Pitts, The algebraic structure of non-commutative analytic Toeplitz
algebras. Math. Ann. 311, 275–303 (1998)
46. K.R. Davidson, A. Dor-On, O.M. Shalit, B. Solel, Dilations, inclusions of matrix convex sets,
and completely positive maps. Int. Math. Res. Not. 2017, 4069–4130 (2017)
47. K.R. Davidson, A. Dor-On, O.M. Shalit, B. Solel, Dilations, inclusions of matrix convex sets,
and completely positive maps (2018). arXiv:1601.07993v3 [math.OA]
48. K.R. Davidson, A.H. Fuller, E.T.A. Kakariadis, Semicrossed products of operator algebras: a
survey. New York J. Math. 24a, 56–86 (2018)
49. E.B. Davies, Quantum Theory of Open Systems (Academic, Cambridge, 1976)
50. A. Dor-On, Techniques in operator algebras: classification, dilation and non-commutative
boundary theory. Thesis (Ph.D.) University of Waterloo (2017)
51. A. Dor-On, G. Salomon, Full Cuntz-Krieger dilations via non-commutative boundaries. J.
Lond. Math. Soc. 98, 416–438 (2018)
52. R. Douglas, On extending commutative ssemigroupsemigroups of isometries. Bull. Lond.
Math. Soc. 1, 157–159 (1969)
53. M. Dritschel, S. McCullough, Boundary representations for families of representations of
operator algebras and spaces. J. Operator Theory 53, 159–167 (2005)
54. M.A. Dritschel, S. McCullough, The failure of rational dilation on a triply connected domain.
J. Am. Math. Soc. 18, 873–918 (2005)
Dilation Theory: A Guided Tour 619
55. S.W. Drury, A generalization of von Neumann’s inequality to the complex ball. Proc. Am.
Math. Soc. 68, 300–304 (1978)
56. S.W. Drury, Remarks on von Neumann’s Inequality. Lecture Notes in Mathematics, vol. 995
(Springer, Berlin, 1983)
57. E.G. Effros, S. Winkler, Matrix convexity: operator analogues of the bipolar and Hahn-
Banach theorems. J. Funct. Anal. 144, 117–152 (1997)
58. E. Egerváry, On the contractive linear transformations of n-dimensional vector space. Acta
Sci. Math. Szeged 15, 178–182 (1954)
59. G.A. Elliott, Gaps in the spectrum of an almost periodic Schrödinger operator. C.R. Math.
Rep. Acad. Sci. Can. 4, 255–299 (1982)
60. D.E. Evans, J.T. Lewis, Dilations of dynamical semi-groups. Commun. Math. Phys. 50, 219–
227 (1976)
61. E. Evert, J.W. Helton, I. Klep, S. McCullough, Extreme points of matrix convex sets, free
spectrahedra, and dilation theory. J. Geom. Anal. 28, 1373–1408 (2018)
62. S. Fackler, J. Glück, A toolkit for constructing dilations on Banach spaces. Proc. Lond. Math.
Soc. 118, 416–440 (2019)
63. C. Foias, A.E. Frazho, The Commutant Lifting Approach to Interpolation Problems
(Birkhäuser, Basel, 1990)
64. A.E. Frazho, Models for noncommuting operators. J. Funct. Anal. 48, 1–11 (1982)
65. T. Fritz, T. Netzer, A. Thom, Spectrahedral containment and operator systems with finite-
dimensional realization. SIAM J. Appl. Algebra Geom. 1, 556–574 (2017)
66. A.H. Fuller, Finitely correlated representations of product systems of C*-correspondences
over Nk . J. Funct. Anal. 260, 574–611 (2011)
67. D.J. Gaebler, Continuous unital dilations of completely positive semigroups. J. Funct. Anal.
269, 998–1027 (2015)
68. T.W. Gamelin, Uniform Algebras, vol. 311 (American Mathematical Society, Providence,
2005)
69. M. Gerhold, O.M. Shalit, Dilations of q-commuting unitaries (2019). arXiv:1902.10362
70. M. Gerhold, O.M. Shalit, On the matrix range of random matrices (2020). arXiv:1911.12102
71. U. Haagerup, M. Rørdam, Perturbations of the rotation C*-algebras and of the Heisenberg
commutation relations. Duke Math. J. 77, 227–256 (1995)
72. P.R. Halmos, Normal dilations and extensions of operators. Summa Brasil. Math. 2, 125–134
(1950)
73. M. Hartz, M. Lupini, Dilation theory in finite dimensions and matrix convexity (2019).
arXiv:1910.03549
74. J.W. Helton, I. Klep, S. McCullough, The matricial relaxation of a linear matrix inequality.
Math. Program. 138, 401–445 (2013)
75. J.W. Helton, I. Klep, S. McCullough, Matrix convex hulls of free semialgebraic sets. Trans.
Am. Math. Soc. 368, 3105–3139 (2016)
76. J.W. Helton, I. Klep, S. McCullough, M. Schweighofer, Dilations, Linear Matrix Inequalities,
the Matrix Cube Problem and Beta Distributions. Memoirs of the American Mathematical
Society (American Mathematical Society, Providence, 2019)
77. D.R. Hofstadter, Energy levels and wave functions of Bloch electrons in rational and irrational
magnetic fields. Phys. Rev. B. 14, 2239–2249 (1976)
78. J.A. Holbrook, Schur norms and the multivariate von Neumann inequality, in Operator
Theory: Advances and Applications, vol. 127 (Birkhäuser, Basel, 2001)
79. Z. Hu, R. Xia, S. Kais, A quantum algorithm for evolving open quantum dynamics on
quantum computing devices (2019). arXiv:1904.00910
80. B. Huber, T. Netzer, A note on non-commutative polytopes and polyhedra (2019).
arXiv:1809.00476
81. M. Izumi, E0 -semigroups: around and beyond Arveson’s work. J. Oper. Theory 68, 335–363
(2012)
82. E.T.A. Kakariadis, O.M. Shalit, Operator algebras of monomial ideals in noncommuting
variables. J. Math. Anal. Appl. 472, 738–813 (2019)
620 O. M. Shalit
83. I. Kaplansky, Modules over operator algebras. Am. J. Math. 75, 839–858 (1953)
84. E. Katsoulis, C. Ramsey, Crossed Products of Operator Algebras. Memoirs of the American
Mathematical Society (American Mathematical Society, Providence, 2019)
85. M. Kennedy, O.M. Shalit, Essential normality, essential norms and hyperrigidity. J. Funct.
Anal. 270, 2812–2815 (2016)
86. D. Keshari, N. Mallick, q-commuting dilation. Proc. Am. Math. Soc. 147, 655–669 (2019)
87. G. Knese, The von Neumann inequality for 3 × 3 matrices. Bull. Lond. Math. Soc. 48, 53–57
(2016)
88. Ł. Kosiński, Three-point Nevanlinna-Pick problem in the polydisc. Proc. Lond. Math. Soc.
111, 887–910 (2015)
89. K. Kraus, States, Effects, and Operations: Fundamental Notions of Quantum Theory. Lecture
Notes in Physics, vol. 190 (Springer, Berlin, 1983)
90. T.-L. Kriel, An introduction to matrix convex sets and free spectrahedra (2016).
arXiv:1611.03103v6
91. B. Kümmerer, Markov dilations and W*-algberas. J. Funct. Anal. 63, 139–177 (1985)
92. M. Laca, From endomorphisms to automorphisms and back: dilations and full corners. J.
Lond. Math. Soc. 61, 893–904 (2000)
93. E.C. Lance, Hilbert C*-modules: A Toolkit for Operator Algebraists, vol. 210 (Cambridge
University Press, Cambridge, 1995)
94. J. Levick, R.T.W. Martin, Matrix N-dilations of quantum channels. Oper. Matrices 12, 977–
995 (2018)
95. E. Levy, O.M. Shalit, Dilation theory in finite dimensions: the possible, the impossible and
the unknown. Rocky Mountain J. Math. 44, 203–221 (2014)
96. C.K. Li, Y.T. Poon, Convexity of the joint numerical range. SIAM J. Matrix Anal. Appl. 21,
668–678 (2000)
97. C.K. Li, Y.T. Poon, Interpolation by completely positive maps. Linear Multilinear Algebra
59, 1159–1170 (2011)
98. D. Markiewicz, O.M. Shalit, Continuity of CP-semigroups in the point-strong topology. J.
Operator Theory 64, 149–154 (2010)
99. J.E. McCarthy, O.M. Shalit, Unitary N-dilations for tuples of commuting matrices. Proc. Am.
Math. Soc. 14, 563–571 (2013)
100. P.S. Muhly, B. Solel, Tensor algebras over C ∗ -correspondences: representations, dilations,
and C ∗ -envelopes. J. Funct. Anal. 158, 389–457 (1998)
101. P.S. Muhly, B. Solel, An algebraic characterization of boundary representations, in Operator
Theory: Advances and Applications, vol. 104 (Birkhäuser, Basel, 1998), pp. 189–196
102. P.S. Muhly, B. Solel, Quantum Markov processes (correspondences and dilations). Int. J.
Math. 13, 863–906 (2002)
103. P.S. Muhly, B. Solel, Hardy algebras, W*-correspondences and interpolation theory. Math.
Ann. 330, 353–415 (2004)
104. P. Muhly, B. Solel, Quantum Markov semigroups (product systems and subordination). Int. J.
Math. 18, 633–669 (2007)
105. V. Müller, F-H. Vasilescu, Standard models for some commuting multioperators. Proc. Am.
Math. Soc. 117, 979–989 (1993)
106. B. Nagy, On contractions in Hilbert space. Acta Sci. Math 15, 87–92 (2013)
107. I.L. Nielsen, M.A. Chuang, Quantum Computation and Quantum Information (Cambridge
University Press, Cambridge, 2000)
108. D. Opela, A generalization of Andô’s theorem and Parrott’s example. Proc. Am Math. Soc.
134, 2703–2710 (2006)
109. S. Pal, O.M. Shalit, Spectral sets and distinguished varieties in the symmetrized bidisc. J.
Funct. Anal. 266, 5779–5800 (2014)
110. S. Parrott, Unitary dilations for commuting contractions. Pac. J. Math. 34, 481–490 (1970)
111. K.R. Parthasarathy, An Introduction to Quantum Stochastic Calculus. Monographs in Mathe-
matics, vol. 85 (Birkhäuser, Basel, 2012)
112. W.L. Paschke, Inner product modules over B ∗ -algebras. Trans. Am. Math. Soc. 182, 443–468
(1973)
Dilation Theory: A Guided Tour 621
113. B. Passer, Shape, scale, and minimality of matrix ranges. Trans. Am. Math. Soc. 372, 1451–
1484 (2019)
114. B. Passer, V.I. Paulsen, Matrix range characterization of operator system properties (preprint).
arXiv:1912.06279
115. B. Passer, O.M. Shalit, Compressions of compact tuples. Linear Algebra Appl. 564, 264–283
(2019)
116. B. Passer, O.M. Shalit, B. Solel, Minimal and maximal matrix convex sets. J. Funct. Anal.
274, 3197–3253 (2018)
117. V.I. Paulsen, Representations of function algebras, abstract operator spaces, and Banach space
geometry. J. Funct. Anal. 109(1), 113–129 (1992)
118. V.I. Paulsen, Completely Bounded Maps and Operator Algebras (Cambridge University Press,
Cambridge, 2002)
119. V.I. Paulsen, M. Raghupathi, An Introduction to the Theory of Reproducing Kernel Hilbert
Spaces. Cambridge Studies in Advanced Mathematics, vol. 152 (Cambridge University Press,
Cambridge, 2016)
120. V.I. Paulsen, I.G. Todorov, M. Tomforde, Operator system structures on ordered spaces. Proc.
Lond. Math. Soc. 102, 25–49 (2011)
121. G. Pisier, Similarity Problems and Completely Bounded Maps. Lecture Notes of Mathematics,
vol. 1618 (Springer, Berlin, 1996)
122. G. Pisier, Introduction to Operator Space Theory, vol. 294 (Cambridge University Press,
Cambridge, 2003)
123. G. Popescu, Isometric dilations for infinite sequences of noncommuting operators. Trans. Am.
Math. Soc. 316, 523–536 (1989)
124. G. Popescu, Von Neumann inequality for (B(H)n )1 . Math. Scand. 68, 292–304 (1991)
125. G. Popescu, Poisson transforms on some C*-algebras generated by isometries. J. Funct. Anal.
161, 27–61 (1999)
126. G. Popescu, Free holomorphic functions on the unit ball of B(H)n . J. Funct. Anal. 241, 268–
333 (2006)
127. G. Popescu, Operator theory on noncommutative varieties. Ind. Univ. Math. J. 56, 389–442
(2006)
128. G. Popescu, Berezin transforms on noncommutative polydomains. Trans. Am. Math. Soc.
368, 4357–4416 (2016)
129. B. Prunaru, Lifting fixed points of completely positive semigroups. Integr. Equ. Oper. Theory
72, 219–222 (2012)
130. M. Ptak, Unitary dilations of multiparameter semigroups of operators. Ann. Polon. Math. 45,
237–243 (1985)
131. M. Rieffel, Induced representations of C*-algebras. Adv. Math. 13, 176–257 (1974)
132. M. Rieffel, C*-algebras associated with irrational rotations. Pac. J. Math. 93, 415–429 (1981)
133. F. Riesz, B. Sz.-Nagy, Functional Analysis (Dover, New York, 1990) (first published in 1955)
134. G. Salomon, O.M. Shalit, E. Shamovich, Algebras of bounded noncommutative analytic
functions on subvarieties of the noncommutative unit ball. Trans. Am. Math. Soc. 370, 8639–
8690 (2018)
135. G. Salomon, O.M. Shalit, E. Shamovich, Algebras of noncommutative functions on sub-
varieties of the noncommutative ball: the bounded and completely bounded isomorphism
problem. J. Funct. Anal. (2019). arXiv:1806.00410
136. D. Sarason, On spectral sets having connected complement. Acta Sci. Math. 26, 289–299
(1965)
137. D. Sarason, Generalized interpolation in H ∞ . Trans. Am. Math. Soc. 127, 179–203 (1967)
138. J. Sarkar, Applications of Hilbert module approach to multivariable operator theory, in
Operator Theory, ed. by D. Alpay (Springer, Berlin, 2014)
139. J. Sarkar, Operator theory on symmetrized bidisc. Ind. Univ. Math. J. 64, 847–873 (2015)
140. D. SeLegue, Minimal Dilations of CP maps and C*-extension of the Szegö limit theorem.
Ph.D Dissertation, University of California, Berkeley (1997)
622 O. M. Shalit
141. O.M. Shalit, E0 -dilation of strongly commuting CP0 -semigroups. J. Funct. Anal. 255, 46–89
(2008)
142. O.M. Shalit, What type of dynamics arise in E0 -dilations of commuting Quantum Markov
Semigroups?. Infin. Dimens. Anal. Quant. Probab. Relat. Top. 11, 393–403 (2008)
143. O.M. Shalit, Representing a product system representation as a contractive semigroup and
applications to regular isometric dilations. Can. Math. Bull. 53, 550–563 (2010)
144. O.M. Shalit, E-dilations of strongly commuting CP-semigroups (the nonunital case). Houston
J. Math. 35(1), 203–232 (2011)
145. O.M. Shalit, A sneaky proof of the maximum modulus principle. Am. Math. Month. 120(4),
359–362 (2013)
146. O.M. Shalit, Operator theory and function theory in Drury-Arveson space and its quotients,
in Operator Theory, ed. by D. Alpay (Springer, Berlin, 2014)
147. O.M. Shalit, A First Course in Functional Analysis (Chapman and Hall/CRC, Boca Raton,
2017)
148. O.M. Shalit, M. Skeide, Three commuting, unital, completely positive maps that have no
minimal dilation. Integr. Equ. Oper. Theory 71, 55–63 (2011)
149. O.M. Shalit, M. Skeide, CP-semigroups and dilations, subproduct systems and superproduct
systems: the multi-parameter case and beyond** (in preparation)
150. O.M. Shalit, B. Solel, Subproduct systems. Doc. Math. 14, 801–868 (2009)
151. M. Słociński, Unitary dilation of two-parameter semi-groups of contractions. Bull. Acad.
Polon. Sci. Ser. Sci. Math. Astronom. Phys. 22, 1011–1014 (1974)
152. A. Skalski, On isometric dilations of product systems of C*-correspondences and applications
to families of contractions associated to higher-rank graphs. Ind. Univ. Math. J. 58, 2227–
2252 (2009)
153. A. Skalski, J. Zacharias, Wold decomposition for representations of product systems of C*-
correspondences. Int. J. Math. 19, 455–479 (2008)
154. M. Skeide, Hilbert modules and applications in quantum probability. Habilitationsschrift,
Cottbus (2001). http://web.unimol.it/skeide/
155. M. Skeide, Isometric dilations of representations of product systems via commutants. Int. J.
Math. 19, 521–539 (2008)
156. M. Skeide, Classification of E0 -semigroups by product systems. Mem. Am. Math. Soc. 240,
1137 (2016)
157. B. Solel, Representations of product systems over semigroups an dilations of commuting CP
maps. J. Funct. Anal. 235, 593–618 (2006)
158. B. Solel, Regular dilations of representations of product systems. Math. Proc. R. Ir. Acad.
108, 89–110 (2008)
159. W.F. Stinespring, Positive functions on C*-algebras. Proc. Am. Math. Soc. 6, 211–216 (1955)
160. J. Stochel, F.H. Szafraniec, Unitary dilation of several contractions, in Operator Theory:
Advances and Applications, vol. 127 (Birkhäuser, Basel, 2001), pp. 585–598
161. E. Stroescu, Isometric dilations of contractions on Banach spaces. Pac. J. Math. 47, 257–262
(1973)
162. F.H. Szafraniec, Murphy’s Positive definite kernels and Hilbert C*-modules sreorganized.
Banach Center Publ. 89, 275–295 (2010)
163. B. Sz.-Nagy, Sur les contractions de l’espace de Hilbert. Acta Sci. Math. 15, 87–92 (1953)
164. B. Sz.-Nagy, Extensions of linear transformations in Hilbert space which extend beyond this
space. Appendix to F. Riesz and B. Sz.-Nagy, Functional Analysis, Ungar, New York, 1960.
Translation of “Prolongements des transformations de l’espace de Hilbert qui sortent de cet
espace”, Budapest (1955)
165. N.T. Varopoulos, On an inequality of von Neumann and an application of the metric theory
of tensor products to operators theory. J. Funct. Anal. 16, 83–100 (1974)
166. A. Vernik, Dilations of CP-maps commuting according to a graph. Houston J. Math. 42,
1291–1329 (2016)
167. F. vom Ende, G. Dirr, Unitary dilations of discrete-time quantum-dynamical semigroups. J.
Math. Phys. 60, 1–17 (2019)
Dilation Theory: A Guided Tour 623
168. J. von Neumann, Eine Spektraltheorie für allgemeine Operatoren eines unitären Raumes.
Math. Nachr. 4, 258–281 (1951)
169. B. Xhabli, Universal operator system structures on ordered spaces and their applications.
Thesis (Ph.D.) University of Houston (2009)
170. B. Xhabli, The super operator system structures and their applications in quantum entangle-
ment theory. J. Funct. Anal. 262, 1466–1497 (2012)
171. A. Zalar, Operator positivestellensatze for noncommutative polynomials positive on matrix
convex sets. J. Math. Anal. Appl. 445, 32–80 (2017)
Riesz-Fischer Maps, Semi-frames
and Frames in Rigged Hilbert Spaces
Francesco Tschinke
Abstract In this note we present a review, some considerations and new results
about maps with values in a distribution space and domain in a σ -finite measure
space X. Namely, this is a survey about Bessel maps, frames and bases (in particular
Riesz and Gel’fand bases) in a distribution space. In this setting, the Riesz-Fischer
maps and semi-frames are defined and new results about them are obtained. Some
examples in tempered distributions space are examined.
1 Introduction
Given a Hilbert space H with inner product ·|· and norm ·, a frame is a sequence
of vectors {fn } in H if there exist A, B > 0 such that:
∞
Af 2 ≤ |f |fn |2 ≤ Bf 2 , ∀f ∈ H.
k=1
As known, this notion generalizes orthonormal bases, and has reached an increasing
level of popularity in many fields of interests, such as signal theory, image
processing, etc., but it is also an important tool in pure mathematics: in fact it plays
key roles in wavelet theory, time-frequency analysis, the theory of shift-invariant
spaces, sampling theory and many other areas (see [10, 11, 19, 20, 24]).
F. Tschinke ()
Università di Palermo, Palermo, Italy
e-mail: [email protected]
Today, the notion of continuous frames in Hilbert spaces and their link with the
theory of coherent states is well-known in the literature.
With the collaboration of C. Trapani and T. Triolo [29], the author introduced
bases and frames in distributional spaces. To illustrate the motivations for this study,
we have to consider the rigged Hilbert space (or Gel’fand triple) [16, 17]: that is, if
H is a Hilbert space, the triple:
D[t] ⊂ H ⊂ D× [t × ],
where the set S × (R) is known as space of tempered distributions. Then ωλ (x) =
√1 e −iλx -that does not belong to L2 (R)—is a generalized eigenvector of A with λ
2π
as eigenvalue.
Riesz-Fischer Maps, Semi-frames and Frames in Rigged Hilbert Spaces 627
With a limiting procedure, the Fourier transform can be extended to L2 (R). Since
a function f ∈ L2 (R) defines a regular tempered distribution, we have, for all
φ ∈ S(R):
1
φ|f : = f (x)φ(x)dx = √ fˆ(λ)eiλx dλ φ(x)dx
R R 2π R
= fˆ(λ)φ̌(λ)dλ = fˆ(λ)φ|ωλ dλ.
R R
That is:
f = fˆ(λ)ωλ dλ. (1.2)
R
in weak sense. The family {ωλ ; λ ∈ R} of the previous example can be considered
as the range of a weakly measurable function ω : R → S × (R) which allows a
representation as in (1.2) of any f ∈ L2 (R) in terms of generalized eigenvectors of
A. This is an example of a distribution basis. More precisely, since the Fourier-
Plancherel theorem corresponds to the Parseval identity, this is an example of
Gel’fand distribution basis [29, Subsec. 3.4], that plays, in S × (R), the role of an
orthonormal basis in a Hilbert space.
The example above is a particular case of the Gel’fand-Maurin theorem (see
[16, 18] for details), which states that, if D is a domain in a Hilbert space H which
is a nuclear space under a certain topology τ , and A is an essentially self-adjoint
operator on D which maps D[t] into D[t] continuously, then A admits a complete
set of generalized eigenvectors.
If σ (A) is the spectrum of the closure of the operator A, the completeness of the
set {ωλ ; λ ∈ σ (A)} is understood in the sense that the Parseval identity holds, that
is:
1/2
f = | f |ωλ |2 dλ , ∀f ∈ D. (1.3)
σ (A)
628 F. Tschinke
Revisiting some results of [29] about Bessel maps, frames and (Gel’fand and
Riesz) bases in distribution set-up, in this paper the notions of Riesz-Fischer map
and of semi-frames in a space of distributions are proposed.
After some preliminaries and notations (Sect. 2), in Sect. 3 distribution Bessel
maps are considered and the notion of distribution Riesz-Fischer maps is proposed,
showing some new results about them (such as bounds and duality properties).
Since distribution Bessel maps are not, in general, bounded by a Hilbert norm,
we consider appropriate to define in Sect. 4 the distribution semi-frames, notion
already introduced in a Hilbert space by J.-P. Antoine and P. Balasz [3]. Finally,
distribution frames, distribution bases, Gel’fand and Riesz bases, considered in [29],
are revisited in Sect. 5 with some additional examples.
qM (F ) = sup | F |g |, F ∈ D× ,
g∈M
where M is a bounded subset of D[t]. In this way, a rigged Hilbert space is defined
in a standard fashion:
D[t] +→ H +→ D× [t × ], (2.1)
Hence, in this case, L(D, D× ) is a † -invariant vector space. We also denote by L(D)
the algebra of all continuous linear operators Y : D[t] → D[t] and by L(D× )
the algebra of all continuous linear operators Z : D× [t × ] → D× [t × ]. If D[t] is
reflexive, for every Y ∈ L(D) there exists a unique operator Y × ∈ L(D× ), the
adjoint of Y , such that
|Y g = Y × |g , ∀ ∈ D× , g ∈ D.
Definition 3.1 Let D[t] be a locally convex space. A weakly measurable map ω is
a Bessel distribution map (briefly: Bessel map) if for every f ∈ D,
| f |ωx |2 dμ < ∞.
X
(iii) for every bounded subset M of D there exists CM > 0 such that:
sup ξ(x) ωx |f dμ ≤ CM ξ 2 , ∀ξ ∈ L2 (X, μ). (3.1)
f ∈M X
is continuous.
Let ω be a Bessel map: the previous lemma allows to define on D × D the
sesquilinear form :
(f, g) = f |ωx ωx |g dμ.
X
632 F. Tschinke
This means that is jointly continuous on D[t]. Hence there exists an operator
Sω ∈ L(D, D× ), with Sω = Sω† , Sω ≥ 0, such that:
(f, g) = Sω f |g = f |ωx ωx |g dμ, ∀f, g ∈ D (3.2)
X
that is,
Sω f = f |ωx ωx dμ, ∀f ∈ D.
X
Since ω is a Bessel distribution map and ξ ∈ L2 (X, μ), we put for all g ∈ D:
$ξω (g) := ξ(x) ωx |g dμ. (3.3)
X
$ξω := ξ(x)ωx dμ
X
Tω : ξ → $ξω .
By (3.1), it follows that Tω is continuous from L2 (X, μ), endowed with its natural
norm, into D× [t × ]. Hence, it possesses a continuous adjoint Tω× : D[t] →
L2 (X, μ), which is called the analysis operator, acting as follows:
Definition 3.4 Let D[t] be a locally convex space. A weakly measurable map ω :
x ∈ X → ωx ∈ D× is called a Riesz-Fischer distribution map (briefly: Riesz-
Fischer map) if, for every h ∈ L2 (X, μ), there exists f ∈ D such that:
f1 − f2 ∈ ω⊥ := {g ∈ D : g|ωx = 0, μ − a.e.}.
f |ωx = h(x)
admits a solution f ∈ D if, and only if, there exists a bounded subset M of D such
that
in other words:
|μ̃(F )| ≤ sup | f |F |,
f ∈M
634 F. Tschinke
for every F ∈ D× . Since D is reflexive, there exists f¯ ∈ D such that μ̃(F ) = f¯|F .
The statement follows from the fact that μ(ωx ) = h(x).
If M is a subspace of D and the topology of D is generated by the family of
seminorms {pα }α∈I , then the topology on the quotient space D/M is defined, as
usual, by the seminorms {p̃α }α∈I , where
Proof Since ω⊥ is closed, it follows that the quotient D/ω⊥ := Dω⊥ is a Fréchet
space. If f ∈ D, we put f˜ := f + ω⊥ . Let h ∈ L2 (X, μ) and f a solution of
(3.4) corresponding to h; then, we can define an operator S : L2 (X, μ) → Dω⊥ by
h → f˜. Let us consider a sequence hn ∈ L2 (X, μ) such that hn → 0 and, for each
n ∈ N, let fn be a corresponding solution of (3.4). One has that
|hn (x)|2dμ → 0, i.e. | fn |ωx |2 dμ → 0.
X X
This implies that fn |ωx → 0 in measure, so there exists a subsequence such
that fnk |ωx → 0 a.e. (see [13]). On the other hand, if Shn = f˜n is a sequence
convergent to f˜ in Dω⊥ w.r. to the quotient topology defined by the seminorms
p̃(·), it follows that the sequence is convergent in the weak topology of Dω⊥ , i.e.:
f˜n |F̃ → f˜|F̃ ∀F̃ ∈ Dω×⊥ .
Thus, if f˜n → f˜ in the topology of Dω⊥ , then fn |F → f |F , for all F ∈ ω⊥⊥ ,
and, in particular, since ω ⊂ ω⊥⊥ , one has fn |ωx → f |ωx . Since fn |ωx has
a subsequence convergent to 0, one has f ∈ ω⊥ . From the closed graph theorem, it
follows that the map S is continuous, i.e. for all continuous seminorms p̃ on Dω⊥
Riesz-Fischer Maps, Semi-frames and Frames in Rigged Hilbert Spaces 635
there exists C > 0 such that: p̃(Sh) ≤ Ch2 , for all h ∈ L2 (X, μ). The statement
follows from the definition of Riesz-Fischer map.
Corollary 3.7 Assume that D[t] is a Fréchet space. If the map ω : x ∈ X → ωx ∈
D× is a total Riesz-Fischer map, then for every continuous seminorm p on D, there
exists a constant C > 0 such that, for the solution f of (3.4),
p(f ) ≤ C f |ωx 2 .
Remark 3.8 For an arbitrary weakly measurable map ω, we define the subset of
D[t]: D(Vω ) := {f ∈ D : f |ωx ∈ L2 (X, μ)} and the analysis operator Vω :
f ∈ D(Vω ) → f |ωx ∈ L2 (X, μ). Clearly, ω is a Riesz-Fischer map if and only
if Vω : D(Vω ) → L2 (X, μ) is surjective. If ω is total, it is injective too, so Vω is
invertible. A consequence of Corollary 3.7 is that Vω−1 : L2 (X, μ) → D(Vω ) is
continuous.
3.3 Duality
and
f |g = f |θx ωx |g dμ, ∀f, g ∈ D.
X
for all h ∈ L2 (X, μ). It follows that θx |g ∈ L2 (X, μ) (see [28, Chapter 6,
Exercise 4]).
636 F. Tschinke
Let us consider
g= g|ωx θx dμ
X
Since D is reflexive, there exists f˜ ∈ D×× = D such that μ̃(G) = f˜|G . In
particular
f˜|g = h(x) ωx |g dμ.
X
This implies that the synthesis operator Tω takes values in H, it is bounded and
Tω ≤ B 1/2 ; its hilbertian adjoint Cω := Tω∗ extends the analysis operator Tω× .
The action of Cω can be easily described: if g ∈ H and {gn } is a sequence
of elements of D, norm converging to g, then the sequence {ηn }, where ηn (x) =
gn |ωx , is convergent in L2 (X, μ). Put η = limn→∞ ηn . Then,
Tω ξ |g = lim ξ(x) ωx |gn dμ = ξ(x)η(x)dμ.
n→∞ X X
Hence Tω∗ g = η.
The function η ∈ L2 (X, μ) depends linearly on g, for each x ∈ X. Thus we can
define a linear functional ω̌x by
g|ω̌x = lim gn |ωx , g ∈ H; gn → g. (4.1)
n→∞
Of course, for each x ∈ X, ω̌x extends ωx ; however ω̌x need not be continuous, as
a functional on H. We conclude that:
Tω∗ : g → g|ω̌x ∈ L2 (X, μ).
Moreover, in this case, the sesquilinear form in (3.2), which is well defined on
D × D, is bounded with respect to · and possesses a bounded extension ˆ to H.
Hence there exists a bounded operator Ŝω in H, such that:
ˆ
(f, g) = Ŝω f |g , ∀f, g ∈ H. (4.2)
638 F. Tschinke
Since
Ŝω f |g = f |ωx ωx |g dμ, ∀f, g ∈ D,
X
Ŝω extends the frame operator Sω and Sω : D → H. It is easily seen that Ŝω = Ŝω∗
and Ŝω = Tω Tω∗ . By definition, we have:
Then Ŝω is bounded, self-adjoint and injective too. This means that Ran Sω is dense
in H, and Ŝω−1 is densely defined. If ω is not a frame, Ŝω−1 is an unbounded, self-
adjoint operator (see [3]).
Remark 4.2 If {ωx }x∈X is an upper semi-frame, then there exists a continuous
seminorm p on D such that f |ωx 2 ≤ p(f ) for all f ∈ D. In fact, the injection
D +→ H is continuous, i.e. f ≤ p(f ) for all f ∈ D. The converse is not true:
let us consider the rigged Hilbert space S(R) +→ L2 (R) +→ S × (R); the system of
derivative of Dirac’s deltas {δx }x∈R is total. Since S(R) is a Fréchet space, (ii) of
Proposition 3.2 it holds. However {δx }x∈R is not a distribution upper semi-frame; in
fact:
| φ|δx |2 dx = φ 22 ∀φ ∈ S(R),
R
d
but the derivative operator dx : S(R) → L2 (R) is unbounded (clearly with respect
to the topology of the Hilbert norm).
Remark 4.3 In [29] it is defined the notion of bounded Bessel map, that is a Bessel
map in rigged Hilbert space such that:
| f |ωx |2 dμ ≤ Bf 2 , ∀f ∈ D.
X
It is a more general notion than upper bounded semi-frame. In fact, we can consider,
as example, the distribution ωx := ηK (x)δx where ηK (x) is a C ∞ -function with
compact support K and M := maxx∈K |ηK (x)|:
| φ|ωx | dx =
2
| φ|ηK (x)δx |2 dx
R R
= |ηK (x)φ(x)|2 dx ≤ M 2 |φ(x)|2 dx ≤ M 2 φ22 .
R K
Therefore ω is a bounded Bessel map, but it is not total, then it is not an upper
semi-frame.
Riesz-Fischer Maps, Semi-frames and Frames in Rigged Hilbert Spaces 639
This section is devoted to distribution frames, with main results already shown in
[29].
Definition 4.6 ([29, Definition 3.6]) Let D[t] ⊂ H ⊂ D× [t × ] be a rigged
Hilbert space, with D[t] a reflexive space and ω a Bessel map. We say that ω is
a distribution frame if there exist A, B > 0 such that:
Af 2 ≤ | f |ωx |2 dμ ≤ Bf 2 , ∀f ∈ D.
X
This inequality, together with the fact that Ŝω is symmetric, implies that Ŝω has a
bounded inverse Ŝω−1 everywhere defined in H.
Remark 4.7 It is worth noticing that the fact that ω and Sω extend to H does not
mean that ω a frame in the Hilbert space H, because we do not know if the extension
of Sω has the form of (3.2) with f, g ∈ H.
To conclude this section, we recall a list of properties of frames proved in [29].
640 F. Tschinke
Lemma 4.8 ([29, Lemma 3.8]) Let ω be a distribution frame. Then, there exists
Rω ∈ L(D) such that Sω Rω f = Rω× Sω f = f , for every f ∈ D.
As a consequence, the reconstruction formulas for distribution frames hold for
all f ∈ D:
f = Rω× Sω f = f |ωx Rω× ωx dμ;
X
f = Sω Rω f = Rω f |ωx ωx dμ.
X
Where θx := Rω× ωx . The frame operator Sθ for θ is well defined and we have:
Sθ = ID,D× Rω .
The distribution function θ , constructed in Proposition 4.10, is also a distribution
frame, called the canonical dual frame of ω. Indeed, it results that [29]:
B −1 f 2 ≤ Sθ f |f ≤ A−1 f 2 , ∀f ∈ D.
It is clear that a Parseval distribution frame is a frame in the sense of Definition 4.6
with Sω = ID , the identity operator of D.
Lemma 4.12 ([29, Lemma 3.14]) Let D ⊂ H ⊂ D× be a rigged Hilbert space
and ω : x ∈ X → ωx ∈ D× a weakly measurable map. The following statements
are equivalent.
(i) ω is a Parseval
= distribution frame;
(ii) f |g== X f |ωx ωx |g dμ, ∀f, g ∈ D;
(iii) f = X f |ωx ωx dμ, the integral on the r.h.s. is understood as a continuous
conjugate linear functional on D, that is an element of D× .
The representation in (iii) of Lemma 4.12 is not necessarily unique.
5 Distribution Basis
Definition 5.1 ([29, Definition 2.3]) Let D[t] be a locally convex space, D× its
conjugate dual and ω : x ∈ X → ωx ∈ D× a weakly measurable map. Then ω
is a distribution basis for D if, for every f ∈ D, there exists a unique measurable
function ξf such that:
f |g = ξf (x) ωx |g dμ, ∀f, g ∈ D
X
in weak sense.
Remark 5.2 Clearly, if ω is a distribution basis, then it is μ-independent. Further-
more, since f ∈ D → ξf (x) ∈ C continuously, there exists a unique weakly
μ-measurable map θ : X → D× such that: ξf (x) = f |θx for every f ∈ D. We
call θ dual map of ω. If θ is μ-independent, then it is a distribution basis too.
The Gel’fand distribution basis, introduced in [29], is a good substitute for the notion
of an orthonormal basis which is meaningless in the present framework.
642 F. Tschinke
For all x ∈ R the set of functions ζ := {ζx (y)}x∈R is a Gel’fand distribution basis,
because the synthesis operator Tζ : L2 (R) → L2 (R) defined by:
1
(Tζ ξ )(x) = √ ξ(y)e−ixy dy = ξ̂ (x), ∀ξ ∈ L2 (R)
2π R
is an isometry onto L2 (R) by Plancherel theorem. The analysis operator is: Tζ∗ f =
fˇ, for all f ∈ L2 (R).
Riesz-Fischer Maps, Semi-frames and Frames in Rigged Hilbert Spaces 643
Example ([29, Example 3.18]) Let us consider again S(R) +→ L2 (R) +→ S × (R).
For x ∈ R, let us consider the Dirac delta δx : S(R) → C, φ → φ|δx := φ(x).
The set of Dirac deltas δ := {δx }x∈R is a Gel’fand distribution basis. In fact, the
Parseval identity holds:
| δx |φ |2 dx = |φ(x)|2 dx = φ22 , ∀φ ∈ S(R).
R R
Proposition 5.4 and (5.1) suggest a more general class of bases that will play the
same role as Riesz bases in the ordinary Hilbert space framework.
Definition 5.5 Let D ⊂ H ⊂ D× be a rigged Hilbert space. A weakly measurable
map ω : x ∈ X → ωx ∈ D× is a Riesz distribution basis if ω is a μ-independent
distribution frame.
One has the following:
Proposition 5.6 ([29, Proposition 3.19]) Let D ⊂ H ⊂ D× be a rigged Hilbert
space and let ω : x ∈ X → ωx ∈ D× be a Bessel distribution map. Then the
following statements are equivalent:
(a) ω is a Riesz distribution basis;
(b) If ζ is a Gel’fand distribution basis, then the operator W defined, for f ∈ H,
by:
f = ξf (x)ζx dμ → Wf = ξf (x)ωx dμ
X X
Example Let us consider f ∈ C ∞ (R): 0 < m < |f (x)| < M. Let us define
ωx := f (x)δx : then {ωx }x∈R is a distribution frame, in fact:
| ωx |φ |2 dx = |f (x)φ(x)|2 dx ≤ M 2 φ22 , ∀φ ∈ S(R),
R R
and
2
m φ22 ≤ |f (x)φ(x)|2 dx ≤ M 2 φ22 , ∀φ ∈ S(R).
R
one has:
ξ(x) ωx |g dx = ξ(x)f (x) δx |gdx = 0, ∀g ∈ S(R).
R R
Since {δx }x∈R is μ-independent, it follows that ξ(x)f (x) = 0 a.e., then ξ(x) = 0
a.e.. By definition, {ωx }x∈R is a Riesz distribution basis.
6 Concluding Remarks
Acknowledgement The author would like to thank R. Corso for some valuable comments and
suggestions.
References
1. S.T. Ali, J.P. Antoine, J.P. Gazeau, Continuous frames in Hilbert spaces. Ann. Phys. 222, 1–37
(1993)
2. S.T. Ali, J.P. Antoine, J.P. Gazeau, Coherent States, Wavelets and Their Generalizations, 2nd
edn. (Springer, Berlin, 2014)
Riesz-Fischer Maps, Semi-frames and Frames in Rigged Hilbert Spaces 645
3. J.-P. Antoine, P. Balazs, Frames and semi-frames. J. Phys. A Math. Theor. 44, 205201 (2011)
4. J.-P. Antoine, C. Trapani, Reproducing pairs of measurable functions and partial inner product
spaces. Adv. Oper. Theory 2, 126–146 (2017)
5. J.-P. Antoine, A. Inoue, C. Trapani, Partial *-Algebras and Their Operator Realizations
(Kluwer, Dordrecht, 2002)
6. P. Balazs, D.T. Stoeva, J.-P. Antoine, Classification of general sequences by frame-related
operators. Sampl. Theory Signal Image Process. 10, 151–170 (2011)
7. P. Balasz, M. Speckbacher, Frames, their relatives and reproducing kernel Hilbert spaces. J.
Phys. A Math. Theor. 53, 015204 (2020)
8. G. Bellomonte, C. Trapani, Riesz-like bases in rigged Hilbert spaces. Zeitschr. Anal. Anwen.
35, 243–265 (2016)
9. P. Casazza, O. Christensen, S. Li, A. Lindner, Riesz-Fischer sequences and lower frame
bounds. Zeitschr. Anal. Anwen. 21, 305–314 (2002)
10. O. Christensen, Frames and Bases: An Introductory Course (Birkhäuser, Boston, 2008)
11. O. Christensen, An Introduction to Frames and Riesz Bases (Birkhaäuser, Boston, 2016)
12. E. Cordero, H. Feichtinger, F. Luef, Banach Gelfand triples for Gabor analysis, in Pseudod-
ifferential Operators. Lecture Notes in Mathematics, vol. 1949 (Springer, Berlin, 2008), pp.
1–33
13. G. de Barra, Measure Theory and Integration (New Age International (P) limited Publishers,
Darya Ganj, 1981)
14. H.G. Feichtinger, K. Gröchenig, Gabor frames and time-frequency analysis of Distributions. J.
Funct. Anal. 146, 464–495 (1997)
15. H.G. Feichtinger, G. Zimmermann, A Banach space of test functions for Gabor analysis, in
Gabor Analysis and Algorithms: Theory and Applications (Birkhäuser, Boston, 1998)
16. I.M. Gel’fand, N.Ya. Vilenkin, Generalized Functions, vol. IV (Academic, New York, 1964)
17. I.M. Gel’fand, G.E. Shilov, E. Saletan, Generalized Functions, vol. III (Academic, New York,
1967)
18. G.G. Gould, The spectral representation of normal operators on a rigged Hilbert space. J.
London Math. Soc. 43, 745–754 (1968)
19. K. Gröchenig, Foundations of Time-Frequency Analysis (Birkhäuser, Basel, 2001)
20. C. Heil, A Basis Theory Primer. Expanded Edition (Birkhäuser/Springer, New York, 2011)
21. J. Horvath, Topological Vector Spaces and Distributions (Addison-Wesley, Boston, 1966)
22. H. Hosseini Giv, M. Radjabalipour, On the structure and properties of lower bounded analytic
frames. Iran. J. Sci. Technol. 37, 227–230 (2013)
23. M.S. Jakobsen, J. Lemvig, Density and duality theorems for regular Gabor frames. J. Funct.
Anal. 270, 229–263 (2016)
24. G. Kaiser, A Friendly Guide to Wavelets (Birkhäuser, Boston, 1994)
25. G. Kyriazis, P. Petrushev, On the construction of frames for spaces of distributions. J. Funct.
Anal. 257, 2159–2187 (2009)
26. S. Pilipovic, D.T. Stoeva, Fréchet frames, general definition and expansion. Anal. Appl. 12,
195–208 (2014)
27. M. Reed, B. Simon, Methods of Modern Mathematical Physics, vols. I and II (Academic, New
York, 1980)
28. W. Rudin, Real and Complex analysis (McGraw-Hill, New York, 1987)
29. C. Trapani, S. Triolo, F. Tschinke, Distribution frames and bases. J. Fourier Anal. and Appl.
25, 2109–2140 (2019)
30. R.M. Young, An Introduction to Nonharmonic Fourier Series, 2nd edn. (Academic, Cambridge,
2001)
Periodic Coherent States Decomposition
and Quantum Dynamics on the Flat
Torus
Lorenzo Zanelli
1 Introduction
functions in L2 (Tn ).
L. Zanelli ()
Department of Mathematics “Tullio Levi-Civita”, University of Padova, Padova, Italy
e-mail: [email protected]
The first aim of our paper is to prove the decomposition of any ϕ ∈ L2 (Tn ) with
respect to the family of periodic coherent states given by the periodization of
(1.1). In view of this target, we recall that the periodization operator
(φ)(y) := φ(y − 2πk)
k∈Zn
maps S(Rn ) into C ∞ (Tn ), as is shown for example in [14, Theorem 6.2]. Thus, we
can define for all 0 < h ≤ 1,
(x,ξ ) (y) := φ(x,ξ )(y − 2πk), (x, ξ ) ∈ Tn × h Zn , y ∈ Tn . (1.3)
k∈Zn
Notice that the family of coherent states in (1.3) is well posed also for ξ ∈ Rn
and the related phase space is Tn × Rn . However, our target is to show that the
decomposition of periodic functions can be done with respect to the minimal set of
coherent states in (1.3) for ξ ∈ h Zn ⊂ Rn . Furthermore, we notice that the phase
space Tn × h Zn is necessary in order to deal with a well defined setting of toroidal
Weyl operators acting on L2 (Tn ) and more in general with semiclassical toroidal
Pseudodifferential operators (see Sect. 2).
The first result of the paper is the following.
Theorem 1.1 Let ϕh ∈ C ∞ (Tn ) be such that !x ϕh L2 ≤ c h−M for some c > 0,
M ∈ N, ϕh L2 = 1 with 0 < h ≤ 1, h−1 ∈ N and let (x,ξ ) be as in (1.3). Then
ϕh = (x,ξ ) , ϕh L2 0(x,ξ ) dx + OL2 (h∞ ). (1.4)
ξ ∈h Zn Tn
on the torus are defined also in [7], with a related resolution of the identity, in the
understanding of the Quantum Hall effect. We also recall [5] where coherent states
and Bargmann Transform are studied on L2 (Sn ). The literature on coherent states
are quite rich, and thus we address the reader to [1].
We now devote our attention to the periodic coherent states decomposition for
eigenfunctions of elliptic semiclassical toroidal Pseudodifferential operators (see
Sect. 2). We will see that the formula (1.4) can be reduced in view of a phase-space
localization of eigenfunctions.
This is the content of the second main result of the paper.
Theorem 1.2 Let Oph (b) be an elliptic semiclassical "do as in (2.1) and h−1 ∈ N.
Let E ∈ R, and let ψh ∈ C ∞ (Tn ) be such that ψh L2 = 1 and !x ψh L2 ≤
c h−M , and which is eigenfunction of the eigenvalue problem on Tn given by
Oph (b)ψh = Eh ψh
where Eh ≤ E for any 0 < h ≤ 1. Then, there exists g(h, E) ∈ R+ such that
ψh = (x,ξ ), ψh L2 0(x,ξ ) dx + OL2 (h∞ ). (1.6)
ξ ∈h Zn , |ξ |≤g(h,E) Tn
We notice that for the operators −h2 !x +V (x) all the eigenfunctions with eigen-
values Eh ≤ E fulfill !x ψh L2 ≤ c h−2 . In particular, we have the asymptotics
g(h, E) → +∞ as h → 0+ . We also underline that the function g(E, h) and the
estimate on remainder OL2 (h∞ ) do not depend on the particular choice of ψh . This
implies that all these eigenfunctions take the form
(1.6) and therefore also any finite
linear combination of eigenfunctions of kind 1≤α≤N cα ψh,α where |cα | ≤ 1.
We remind that Weyl Law on the number N (h) of eigenvalues Eh,α ≤ E (with
their multiplicity) for semiclassical elliptic operators (see for example [9]) reads
N (h) 2 (2πh)−n (vol(U (E)) + O(1)).
The proof of the above result is mainly based on a uniform estimate for our
toroidal version of the Fourier-Bros-Iagolnitzer (FBI) transform
on the unbounded region given by all x ∈ Tn and ξ ∈ hZn such that |ξ | > g(h, E).
The FBI transform on any compact manifold has already been defined and studied
in the literature, see for example [16].
We remind that, in the euclidean setting of R2n , the function
is the usual version of the FBI transform, which is well posed for any ψh ∈ S (Rn ).
This is used to study the phase space localization by the Microsupport of ψh (see for
example [10]), namely MS(ψh ) the complement of the set of points (x0 , ξ0 ) such
650 L. Zanelli
i.e. MS(ψh ) ⊆ U (E). The well posedness of W F (ψh ) and MS(ψh ) in the periodic
setting can be seen starting from the euclidean setting and thanks to distributional
inclusion L2 (Tn ) ⊂ S (Rn ), (see for example section 3.1 of [3]). The semiclassical
study in the phase space for eigenfunctions in the periodic setting has also been
studied in [18] with respect to weak KAM theory.
In our Theorem 1.2 we are interested to show another kind of semiclassical
localization, namely to localize the bounded region
which will be bigger than MS(ψh ), h—dependent and such that the coherent state
decomposition of ψh can be done up to a remainder OL2 (h∞ ).
We now focus our attention to the decompositon (1.6) under the time evolution.
Theorem 1.3 Let ϕh ∈ C ∞ (Tn ), L2 —normalized such that
ϕh = cj ψh,j (1.7)
1≤j ≤J (h)
where ψh,j are given in Theorem 1.2 and J (h) ≤ J0 h−Q for some J0 , Q > 0. Let
Oph (b) be an elliptic semiclassical "do as in (2.1) and
The equality (1.8) shows that time evolution under the L2 —unitary map Uh (t)
does not change such a decomposition, since (h) does not depend on time.
The function (h) is not necessarily the same as the function f (h) contained in
Theorem 1.1 but we have that (h) ≥ f (h). In other words, this quantum dynamics
preserves the coherent state decomposition (1.5). The same result holds for any
eigenfunctions in Theorem 1.2 since in this case Uh (t)ψh = exp{(−iEh t)/ h}ψh .
Notice that here we can assume that Q > n, namely the linear combination
Periodic Coherent States Decomposition and Quantum Dynamics on the Flat Torus 651
(1.7) can be done with more eigenfunctions than the ones that have eigenvalues
Eh ≤ E with fixed energy E > 0. Notice also that we have (x,ξ ) , Uh (t)ϕh L2 =
Uh (−t)(x,ξ ), ϕh L2 for any t ∈ R and that the time evolution of the periodization
of coherent states has been used in [17] in the context of optimal transport theory.
Let us define the flat torus Tn := (R/2πZ)n and introduce the class of symbols b ∈
m
Sρ,δ (Tn × Rn ), m ∈ R, 0 ≤ δ, ρ ≤ 1, given by functions in C ∞ (Tn × Rn ; R) which
are 2π-periodic in each variable xj , 1 ≤ j ≤ n and for which for all α, β ∈ Zn+
there exists Cαβ > 0 such that for all (x, ξ ) ∈ Tn × Rn ,
This is the semiclassical version (see [12, 13]) of the quantization by Pseudod-
ifferential Operators on the torus developed in [14] and [15]. See also [11] for the
notion of vector valued Pseudodifferential Operators on the torus.
We now notice that we have a map Oph (b) : C ∞ (Tn ) −→ D (Tn ). Indeed,
remind that u ∈ D (Tn ) are the linear maps u : C ∞ (Tn ) −→ C such that there
exist C > 0 and k ∈ N, for which |u(φ)| ≤ C |α|≤k ∂xα φ∞ for all φ ∈ C ∞ (Tn ).
Given a symbol b ∈ S m (Tn × Rn ), the toroidal Weyl quantization reads (see
[12, 13])
h
−n
Oph (b)ψ(x) := (2π)
w
eix−y,κ b y, κ ψ(2y − x) dy.
n T
n 2
κ∈Z
In particular, it holds
|b(x, ξ )| ≥ C ξ m
for any x ∈ Tn and |ξ | ≥ c. This property guarantees bounded sublevels sets for b
and discrete spectrum for the operator Oph (b) for any fixed 0 < h ≤ 1. As we see
in Theorem 1.2, this assumption permits also to prove the semiclassical localization
of all the eigenfunctions within these sublevels sets, and this localization can be
studied by our semiclassical coherent states (1.3).
= (x,ξ )(y).
Periodic Coherent States Decomposition and Quantum Dynamics on the Flat Torus 653
We mainly adapt, in our toroidal setting, the proof of [10, Proposition 3.1.6] written
for the euclidean setting. Thus, we define the operator T 0 on functions " ∈ L2 (Tn ×
hZn ) as
(T ")(y) :=
0
"(x, ξ )0(x,ξ )(y) dx.
ξ ∈h Zn Tn
It can be easily seen that T 0 equals the adjoint of the operator (T ψ)(x, ξ ) :=
(x,ξ ), ψL2 (Tn ) , i.e.
which implies
where FT−1
n stands for the inverse toroidal Fourier Transform, and FRn is the usual
euclidean version. In view of (3.3) it follows
(T ψ)(x, ξ ) := (x,ξ ), ψL2 (Tn ) = FRn φx,ξ |Zn , FTn ψL2 (Zn )
= (k),
φx,ξ (k)0 ψ
k∈Zn
where O(h∞ ) does not depend on the functions ψ1 , ψ2 . We now use the assumption
!x ψL2 ≤ c h−M for some fixed c, M > 0 so that Fourier components fulfill
k | ≤ |k|−2 (2π)n/2 c h−M , and ψL2 = 1 gives |ψ
|ψ 0 | ≤ (2π)n/2 . Consequently,
1 (k)| ≤ (2π)n/2 + (2π)n/2 c
|ψ |k|−2 h−M , (3.5)
k∈Zn k∈Zn \{0}
and
1 (k)0 ψ
ψ 2 (μ) ≤ 1 (k)|
|ψ 2 (μ)|.
|ψ (3.6)
k,μ∈Zn k∈Zn μ∈Zn
To conclude, since
δ(k − μ) = eiα(k−μ),
α∈Zn
we get
T ψ1 , T ψ2 L2 (Tn ×hZn ) = 1 (k)0 ψ
ψ 2 (k) + O(h∞ )
k∈Zn
The estimates (3.5)–(3.6) together with (3.4) ensure that the remainder in (3.7) has
order O(h∞ ).
Periodic Coherent States Decomposition and Quantum Dynamics on the Flat Torus 655
is given by an L2 -convergent series. Thus, for any fixed ϕh we can say that there
exists f (h) > 0 such that
ϕh = (x,ξ ) , ϕh L2 0(x,ξ ) dx + O(h∞ ).
ξ ∈h Zn ,|ξ |<f (h) T
n
We apply the statement of Theorem 1.1, for a set of linearly independent eigen-
functions ψh,i generating all the eigenspaces linked to eigenvalues Eh ≤ E and
fi (h) > 0 given by Theorem 1.1:
ψh,i = (x,ξ ) , ψh,i L2 0(x,ξ ) dx + Rh,i
ξ ∈h Zn ,|ξ |<f Tn
i (h)
We define:
Since
any eigenfunction ψh linked to Eh ≤ E will be written as ψh =
i ψh,i , ψh ψh,i then the linearity of decomposition (1.4) ensures also the decom-
position (1.6) for such ψh . Namely,
ψh = (x,ξ ) , ψh L2 0(x,ξ ) dx + Rh
ξ ∈h Zn , |ξ |≤g(h,E) Tn
656 L. Zanelli
where Rh = 1≤i≤N (E,h) Ri,h . To conclude:
Rh L2 ≤ Ri,h L2 ≤ N (E, h) max Ri,h L2
1≤i≤N (E,h)
1≤i≤N (E,h)
where the L2 -normalized eigenfunctions ψh,j of Oph (b) are given in Theorem 1.2
and we assume J (h) ≤ J0 h−Q for some J0 , Q > 0 that are independent on 0 <
h ≤ 1.
Define
where fi (h) are associated to the functions ψh,i and given by Theorem 1.1.
We now observe that if Uh (t) := exp{(−iOph (b)t)/ h} then
cj e− h Ej,h ψh,j
i
U (t)ϕh =
1≤j ≤J (h)
for any t ∈ R.
We can now apply the decomposition formula (1.4) with the condition on the
frequencies |ξ | ≤ (h) and for the wave function U (t)ϕh and get the expected
result, namely
Uh (t)ϕh = (x,ξ ), Uh (t)ϕh L2 0(x,ξ ) dx + Rj,h
ξ ∈h Zn , |ξ |≤(h) Tn 1≤j ≤J
for any t ∈ R. The remainder Rh := 1≤j ≤J Rj,h can be estimated as in the
previous Theorem, namely
Rh L2 ≤ Rj,h L2 ≤ J0 h−Q max Rj,h L2
1≤j ≤J
1≤i≤J
References
1. J.P. Antoine, F. Bagarello, J.P. Gazeau (eds.), Coherent States and Their Applications. A
Contemporary Panorama. Springer Proceedings in Physics, vol. 205 (Springer, Berlin, 2018)
2. A. Bouzouina, S. De Bievre, Equipartition of the eigenfunctions of quantized ergodic maps on
the torus. Commun. Math. Phys. 178, 83–105 (1996)
3. F. Cardin, L. Zanelli, The geometry of the semiclassical wave front set for Schrödinger
eigenfunctions on the torus. Math. Phys. Anal. Geom. 20(2), Art. 10, 20 (2017)
4. M. Combescure, D. Robert, Coherent States and Applications in Mathematical Physics
(Springer, Berlin, 2012)
5. E. Diaz-Ortiz, C. Villegas-Blas, On a Bargmann transform and coherent states for the n-sphere.
J. Math. Phys. 53(6), 062103, 25 (2012)
6. F. Faure, S. Nonnenmacher, S. De Bievre, Scarred eigenstates for quantum cat maps of minimal
periods. Commun. Math. Phys. 239, 449–492 (2003)
7. M. Fremling, Coherent state wave functions on a torus with a constant magnetic field. J. Phys.
A 46(27), 275302, 23 (2013)
8. R. Fresneda, J.P. Gazeau, D. Noguera, Quantum localisation on the circle. J. Math. Phys. 59(5),
052105, 19 (2018)
9. V. Guillemin, S. Sternberg, Semi-Classical Analysis (International Press, Boston, 2013)
10. A. Martinez, Introduction to Semiclassical and Microlocal Analysis (Springer, New York,
2002)
11. B.B. Martinez, R. Denk, J.H. Monzón, T. Nau, Generation of semigroups for vector-valued
pseudodifferential operators on the torus. J. Fourier Anal. Appl. 22, 823–853 (2016)
12. A. Parmeggiani, L. Zanelli, Wigner measures supported on weak KAM tori. J. Anal. Math.
123, 107–137 (2014)
13. T. Paul, L. Zanelli, On the dynamics of WKB wave functions whose phase are weak KAM
solutions of H-J equation. J. Fourier Anal. Appl. 20, 1291–1327 (2014)
14. M. Ruzhansky, V. Turunen, Quantization of pseudo-differential operators on the torus. J.
Fourier Anal. Appl. 16, 943–982 (2010)
15. M. Ruzhansky, V. Turunen, Pseudo-Differential Operators and Symmetries: Background
Analysis and Advanced Topics (Birkhäuser, Basel, 2010)
16. J. Wunsch, M. Zworski, The FBI transform on compact C ∞ -manifolds. Trans. Amer. Math.
Soc. 353, 1151–1167 (2001)
17. L. Zanelli, On the optimal transport of semiclassical measures. Appl. Math. Optim. 74, 325–
342 (2016)
18. L. Zanelli, Schrödinger spectra and the effective Hamiltonian of the weak KAM theory on the
flat torus. J. Math. Phys. 57(8), 081507, 12 (2016)