An_Average_Set_Size_Theorem
An_Average_Set_Size_Theorem
DAVID REIMER
Department of Mathematics,
The College of New Jersey, Ewing, NJ 086280718, USA
(e-mail: [email protected])
In this paper we prove that, if a collection of sets A is union-closed, then the average set
size of A is at least 12 log2 (|A|).
1. Introduction
In what follows, all logs will be base two. For n ∈ N define [n] ≡ {1, 2, . . . , n}. Let P([n]) be
the power set of [n], and for A ⊆ B ⊆ [n] define [A, B] as [A, B] ≡ {C ⊆ [n] : A ⊆ C ⊆ B}.
For A ⊆ P([n]), we say A is union-closed if A, B ∈ A implies A ∪ B ∈ A.
The Union-Closed Sets Conjecture, attributed to P. Frankl (see [4, p. 525]), states that,
if A ⊆ P([n]) is union-closed, then some element of [n] belongs to at least half of the
sets in A. This paper will prove a theorem that puts a lower bound on the average set
size of a collection of union-closed sets. The theorem, conjectured by Jeff Kahn [3], was
motivated by, and can be shown to follow from, Frankl’s conjecture.
When A = P([n]), both sides of the equation become 2n and hence the theorem is
sharp. Later we will show that the only sharp cases are the families A = P(M), where
M ⊆ [n].
We will prove Theorem 1.1 using two lemmas. The first lemma gives a strong result in
the special case that A is a filter.
The following definition treats P([n]) as a digraph whose vertices are the sets, and
whose edges are pairs of sets that differ by exactly one element, directed from the smaller
set to the larger set.
Definition. Let A, B ∈ P([n]). Then A ⊂· B if and only if A ∪ {i} = B, for some i ∈ [n],
V (P([n])) ≡ P([n]),
E(P([n])) ≡ {(A, B) ∈ P([n])2 : A ⊂· B}.
It is not hard to demonstrate that this lemma implies the well-known fact that Theo-
rem 1.1 holds for filters. Simply divide both sides by 2|F| and notice that n > log(|F|).
The second lemma establishes a special bijection between a union-closed family and an
associated filter.
Lemma 1.3. Let A ⊆ P([n]) be closed under unions. Then there exist a filter F ⊆ P([n])
and a bijection from A to F, A → FA such that:
(i) A ⊆ FA ;
(ii) A, B ∈ A, A 6= B ⇒ [A, FA ] ∩ [B, FB ] = φ;
P
(iii) A∈A |FA − A| 6 |A|n − |A| log(|A|)
In Section 2 we will prove that Lemmas 1.2 and 1.3 imply Theorem 1.1, and in Section 3
we will prove these two lemmas.
Proof of Theorem 1.1. Let A be a subset of P([n]) that is closed under unions and apply
Lemma 1.3 to obtain filter F. For each A ∈ A, define edge set EA ≡ {(B, FA ) ∈ E(P([n])) :
B = FA −{i}, i ∈ FA −A}. Clearly |EA | = |FA −A|. Also note that EA ⊆ E([A, FA ])∩EB(F)
P P
and the sets EA are disjoint. Hence A∈A |EA | 6 |EB(F)|, or equivalently A∈A |FA −A| 6
|EB(F)|. Apply Lemma 1.2 to F. By part (i) of Lemma 1.3, we know A ⊆ FA , so
X X
2 |A| + 2 |FA − A| = |A|n + |EB(F)|,
A∈A A∈A
X X
2 |A| + |FA − A| > |A|n.
A∈A A∈A
An Average Set Size Theorem 91
= n|F| + |EB(F)|
We can now prove Lemma 1.3, following some preliminaries. We first define the i-rising
function RS,i (S). This is in fact a well-known operator, appearing, for example, in [1] and
[2]. Intuitively, RS,i (S) adds i to all sets of S that do not contain i, unless such a set
already exists in S. To simplify our notation, we will often refer to RS,i (S) as Ri (S).
Note that Ri (S) is one to one. We will now raise A in each of the n dimensions. For
our family A, define
A : if i = 0,
Ai ≡
Ri (Ai−1 ) : if i > 0,
Ai ≡ RAi−1 ,i ◦ RAi−2 ,i−1 · · · ◦ RA0 ,1 (A).
If Ai 6= Ai−1 we say Ai−1 rises, and note that Ai−1 ∈ Ai−1 implies that Ai ∪ {i} ∈ Ai .
We will need the following lemma.
Lemma 3.1. If Ai−1 is closed under unions and Ci−1 , Di−1 ∈ Ai−1 are such that Di−1 =
Ci−1 ∪ X for some X ⊆ [n], then Ci ∪ X ∈ Ai . Furthermore Ci−1 6= Ci−1 ∪ X implies
Ci 6= Ci ∪ X.
92 D. Reimer
Lemma 3.3. Let i, j ∈ [n] where i < j. Let Ci , Di ∈ Ai such that Di = Ci ∪ X for some
X ⊆ [n]. Then Cj ∪ X ∈ Aj . Furthermore, Ci 6= Ci ∪ X implies Cj 6= Cj ∪ X.
Proof. We need to show, for all A ∈ A, that An ∪ {i} ∈ An . Pick i ∈ [n]. We know
Ai ∈ Ai implies Ai ∪ {i} ∈ Ai . Let Ci = Ai , X = {i} and j = n and apply Lemma 3.3.
Then An ∪ {i} ∈ An .
inequality, which essentially says that the maximum sum is obtained when all the subcubes
[A, FA ] are the same size. By Jensen’s inequality we have
" #
X X
exp2 |A|−1 |FA − A| 6 |A|−1 2|FA −A| 6 2n |A|−1
A∈A A∈A
or
X
|FA − A| 6 n|A| − |A| log |A|.
A∈A
The inequalities in the above proof are only sharp if the subcubes [A, FA ] fill the cube
and are all the same size. Without loss of generality, we may assume the maximal set
M ∈ A is [n], and therefore does not rise. Hence |[M, FM ]| = 1 and, if the inequality is
sharp, |[A, FA ]| = 1, for all A ∈ A, which implies that A = P([n]). Therefore the only
sharp cases are those with A = P(M), for some M ⊆ [n], as claimed.
References
[1] Alon, N. (1983) On the density of sets of vectors. Discrete Math. 46 199–202.
[2] Frankl, P. (1983) On the trace of finite sets. J. Combin. Theory Ser. A 34 41–45.
[3] Kahn, J. Personal communication.
[4] Rival, I. (1985) Graphs and Order, Reidel, Dordrecht/Boston.