Arrays considered somewhat harmful Eric Lippert

Arrays considered somewhat harmful

Fabulous Adventures In Coding

Eric Lippert's Erstwhile Blog

I got a moral question from an author of programming language textbooks the other day requesting
my opinions on whether or not beginner programmers should be taught how to use arrays.

Rather than actually answer that question, I gave him a long list of my opinions about arrays, how I
use arrays, how we expect arrays to be used in the future, and so on. This gets a bit long, but like
Pascal, I didn't have time to make it shorter.

Let me start by saying when you definitely should not use arrays, and then wax more philosophical
about the future of modern programming and the role of the array in the coming world.

You probably should not return an array as the value of a public method or property,
particularly when the information content of the array is logically immutable. Let me give you an
example of where we got that horridly wrong in a very visible way in the framework. If you take a
look at the documentation for System.Type, you'll find that just looking at the method descriptions
gives one a sense of existential dread. One sees a whole lot of sentences like "Returns an array of
Type objects that represent the constraints on the current generic type parameter." Almost every
method on System.Type returns an array, it seems.

Now think about how that must be implemented. When you call, say, GetConstructors() on
typeof(string), the implementation cannot possibly do this, as sensible as it seems:

    public class Type
    {
        private ConstructorInfo[] ctorInfos;

        // Tempting but unsafe: every caller receives the same cached, mutable array.
        public ConstructorInfo[] GetConstructors()
        {
            if (ctorInfos == null) ctorInfos = GoGetConstructorInfosFromMetadata();
            return ctorInfos;
        }
    }

Why? Because now the caller can take that array and replace the contents of it with whatever they
please. Returning an array means that you have to make a fresh copy of the array every time you
return it. You get called a hundred times, you’d better make a hundred array instances, no matter
how large they are. It’s a performance nightmare – particularly if, like me, you are considering using
reflection to build a compiler. Do you have any idea how many times a second I try to get type
information out of reflection? Not nearly as many times as I could; every time I do it’s another
freakin’ array allocation!

The framework designers were not foolish people; unfortunately, we did not have generic types in
.NET 1.0. Clearly the sensible thing now for GetConstructors() to return is IList<ConstructorInfo>. You
can build yourself a nice read-only collection object once, and then just pass out references to it as
much as you want.
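
As a minimal sketch of that difference, assume a hypothetical TypeInfoCache class that stands in for a type caching its metadata (the class, its members, and the string data are all illustrative, not the framework's actual implementation). The array-returning method has to clone on every call, while the collection-returning method builds one read-only wrapper and hands out the same reference each time.

```csharp
using System;
using System.Collections.Generic;
using System.Collections.ObjectModel;

// Hypothetical stand-in for a type that caches metadata; not the real System.Type.
public sealed class TypeInfoCache
{
    private readonly string[] names = { "ctorA", "ctorB", "ctorC" };
    private readonly ReadOnlyCollection<string> readOnlyNames;

    public TypeInfoCache()
    {
        // Built once; the wrapper exposes no way to change the underlying array.
        readOnlyNames = Array.AsReadOnly(names);
    }

    // Array-returning API: must hand out a fresh copy on every call.
    public string[] GetNamesAsArray()
    {
        return (string[])names.Clone();
    }

    // Collection-returning API: the same read-only object is shared by all callers.
    public IList<string> GetNames()
    {
        return readOnlyNames;
    }
}

public static class Demo
{
    public static void Main()
    {
        var cache = new TypeInfoCache();

        string[] copy = cache.GetNamesAsArray();
        copy[0] = "overwritten";                       // only the caller's copy changes
        Console.WriteLine(cache.GetNamesAsArray()[0]); // ctorA

        // No allocation per call: both calls return the same object.
        Console.WriteLine(ReferenceEquals(cache.GetNames(), cache.GetNames())); // True
    }
}
```

The clone protects the cache, but its cost grows with the array's length and the number of calls; the shared wrapper costs one allocation in total.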

What is the root cause of this malaise? It is simple to state: The caller is requesting values. The callee
fulfills the request by handing back variables.

An array is a collection of variables. The caller doesn’t want variables, but it’ll take them if that’s
the only way to get the values. But in this case, as in most cases, neither the callee nor the caller
wants those variables to ever vary. Why on earth is the callee passing back variables then? Variables
vary. Therefore, a fresh, different variable must be passed back every time, so that if it does vary,
nothing bad happens to anyone else who has requested the same values.

If you are writing such an API, wrap the array in a ReadOnlyCollection<T> and return an
IEnumerable<T> or an IList<T> or something, but not an array. (And of course, do not simply cast
the array to IEnumerable<T> and think you’re done! That is still passing out variables; the caller can
simply cast back to array! Only pass out an array if it is wrapped up by a read-only object.)
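
The cast-back hazard described above can be shown in a few lines. This is a sketch with hypothetical names (CastBackDemo, GetValuesUnsafe, GetValuesSafe); it uses Array.AsReadOnly to obtain a ReadOnlyCollection<T> wrapper.

```csharp
using System;
using System.Collections.Generic;

public static class CastBackDemo
{
    private static readonly int[] secret = { 1, 2, 3 };

    // Unsafe: the static type is IEnumerable<int>, but the object is still the array.
    public static IEnumerable<int> GetValuesUnsafe()
    {
        return secret;
    }

    // Safer: callers get a read-only wrapper object, never the array itself.
    public static IEnumerable<int> GetValuesSafe()
    {
        return Array.AsReadOnly(secret);
    }

    public static void Main()
    {
        // The caller casts back to the array and mutates the callee's state.
        int[] leaked = (int[])GetValuesUnsafe();
        leaked[0] = 99;
        Console.WriteLine(secret[0]); // 99

        // The wrapper is not an int[], so the same trick fails.
        Console.WriteLine(GetValuesSafe() is int[]); // False

        // Its IList<T> mutators throw instead of writing through.
        var asList = (IList<int>)GetValuesSafe();
        try { asList[0] = 7; }
        catch (NotSupportedException) { Console.WriteLine("write rejected"); }
    }
}
```

Note that the wrapper is a view, not a snapshot: if the callee itself later changes the underlying array, callers holding the wrapper will see the change.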

That’s the situation at present. What are the implications of array characteristics for the future of
programming and programming languages?

Parallelism Problems

The physics aspects of Moore’s so-called “Law” are failing, as they eventually must. Clock speeds
have stopped increasing, transistor density has stopped increasing. The laws of thermodynamics and
the Uncertainty Principle are seeing to that. But manufacturing costs per chip are still falling, which
means that our only hope of Moore’s "Law" continuing to hold over the coming decades is to cram
more and more processors into each box.

We’re going to need programming languages that allow mere mortals to write code that is
parallelizable to multiple cores.

Side-effecting change is the enemy of parallelization. Parallelizing in a world with observable side
effects means locks, and locks mean choosing between implementing lock ordering and dealing
with random crashes or deadlocks. Lock ordering requires global knowledge of the program.
Programs are becoming increasingly complex, to the point where one person cannot reasonably and
confidently have global knowledge. Indeed, we prefer programming languages to have the property
that programs in them can be understood by understanding one part at a time, not having to
swallow the whole thing in one gulp.

Therefore we tools providers need to create ways for people to program effectively without causing
observable side effects.

Of all the sort of “basic” types, arrays most strongly work against this goal. An array’s whole
purpose is to be a mass of mutable state. Mutable state is hard for both humans and compilers to
reason about. It will be hard for us to write compilers in the future that generate performant multi-
core programs if developers use a lot of arrays.

Now, one might reasonably point out that List<T> is a mass of mutable state too. But at least one
could create a threadsafe list class, or an immutable list class, or a list class that has transactional
integrity, or uses some form of isolation or whatever. We have an extensibility model for lists
because lists are classes. We have no ability to make an “immutable array”. Arrays are what they are
and they’re never going to change.
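
To illustrate what "lists are classes" buys you, here is a rough sketch of an immutable list written as an ordinary class. The type ImmutableLinkedList<T> and its Prepend method are hypothetical names invented for this example; it is one possible design, not a framework type.

```csharp
using System;
using System.Collections;
using System.Collections.Generic;

// Hypothetical persistent list: "adding" returns a new list and leaves the old one untouched.
public sealed class ImmutableLinkedList<T> : IEnumerable<T>
{
    public static readonly ImmutableLinkedList<T> Empty = new ImmutableLinkedList<T>();

    private readonly T head;
    private readonly ImmutableLinkedList<T> tail;

    // Only the Empty sentinel has no tail.
    public bool IsEmpty { get { return tail == null; } }

    private ImmutableLinkedList() { }

    private ImmutableLinkedList(T head, ImmutableLinkedList<T> tail)
    {
        this.head = head;
        this.tail = tail;
    }

    // O(1): the new list shares every existing node with the old one.
    public ImmutableLinkedList<T> Prepend(T value)
    {
        return new ImmutableLinkedList<T>(value, this);
    }

    public IEnumerator<T> GetEnumerator()
    {
        for (var node = this; !node.IsEmpty; node = node.tail)
            yield return node.head;
    }

    IEnumerator IEnumerable.GetEnumerator() { return GetEnumerator(); }
}

public static class ImmutableDemo
{
    public static void Main()
    {
        var one = ImmutableLinkedList<int>.Empty.Prepend(1);
        var two = one.Prepend(2);

        Console.WriteLine(string.Join(",", one)); // 1
        Console.WriteLine(string.Join(",", two)); // 2,1
    }
}
```

Because no instance ever changes after construction, any number of threads can read `one` and `two` without locks. Nothing comparable can be layered onto the built-in array type.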

Conceptual Problems

We want C# to be a language in which one can draw a line between code that implements a
mechanism and code that implements a policy.

The “C” programming language is all about mechanisms. It lays bare almost exactly what the
processor is actually doing, providing only the thinnest abstraction over the memory model. And
though we want you to be able to write programs like that in C#, most of the time people should be
writing code in the “policy” realm. That is, code that emphasizes what the code is supposed to
do, not how it does it.

Coding which is more declarative than imperative, coding which avoids side effects, coding which
emphasizes algorithms and purposes over mechanisms, that kind of coding is the future in a world of
parallelism. (And you’ll note that LINQ is designed to be declarative, to abstract strongly away from
mechanisms, and to be free of side effects.)

Arrays work against all of these factors. Arrays demand imperative code, arrays are all about side
effects, arrays make you write code which emphasizes how the code works, not what the code is
doing or why it is doing it. Arrays make optimizing for things like “swapping two values” easy, but
destroy the larger ability to optimize for parallelism.
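
A small comparison may make the mechanism/policy distinction concrete. Both methods below compute the squares of the even numbers in a sequence; the class and method names are made up for this sketch.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class PolicyVersusMechanism
{
    // Mechanism-heavy: allocate, index, mutate, track a write position, trim.
    public static int[] SquaresOfEvensImperative(int[] input)
    {
        int[] buffer = new int[input.Length];
        int count = 0;
        for (int i = 0; i < input.Length; i++)
        {
            if (input[i] % 2 == 0)
            {
                buffer[count] = input[i] * input[i];
                count++;
            }
        }
        Array.Resize(ref buffer, count);
        return buffer;
    }

    // Policy-oriented: states what is wanted and mutates nothing the caller can see.
    public static IEnumerable<int> SquaresOfEvensDeclarative(IEnumerable<int> input)
    {
        return input.Where(x => x % 2 == 0).Select(x => x * x);
    }

    public static void Main()
    {
        int[] data = { 1, 2, 3, 4, 5, 6 };
        Console.WriteLine(string.Join(",", SquaresOfEvensImperative(data)));  // 4,16,36
        Console.WriteLine(string.Join(",", SquaresOfEvensDeclarative(data))); // 4,16,36
    }
}
```

The declarative version says nothing about buffers or indices, which leaves a query provider free to choose the mechanism. For instance, a query like this could in principle be run through PLINQ's AsParallel(); whether that helps depends on the workload, and result order is not preserved unless you ask for it.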

Practical Problems

And finally, given that arrays are mutable by design, the way an array restricts that mutability is
deeply weird. All the contents of the collection are mutable, but the size is fixed. What is up with
that? Does that solve a problem anyone actually has?

For this reason alone I do almost no programming with arrays anymore. Arrays simply do not model
any problem that I have at all well – I rarely need a collection which has the rather contradictory
properties of being completely mutable, and at the same time, fixed in size. If I want to mutate a
collection it is almost always to add something to it or remove something from it, not to change
what value an index maps to.

We have a class or interface for everything I need. If I need a sequence I’ll use IEnumerable<T>, if I
need a mapping from contiguous numbers to data I’ll use a List<T>, if I need a mapping across
arbitrary data I’ll use a Dictionary<K,V>, if I need a set I’ll use a HashSet<T>. I simply don’t need
arrays for anything, so I almost never use them. They don’t solve a problem I have better than the
other tools at my disposal.
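
For readers less familiar with these types, the following sketch shows each of the four choices doing the job described above. The variable names and sample data are purely illustrative.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class CollectionChoices
{
    public static void Main()
    {
        // A sequence: something to iterate, nothing more.
        IEnumerable<int> sequence = Enumerable.Range(1, 3);

        // A mapping from contiguous indices to data that can grow and shrink.
        var list = new List<string> { "a", "b" };
        list.Add("c");
        list.Remove("a");

        // A mapping across arbitrary keys.
        var ages = new Dictionary<string, int> { { "Ada", 36 } };
        ages["Alan"] = 41;

        // A set: membership without duplicates.
        var seen = new HashSet<int> { 1, 2 };
        bool added = seen.Add(2); // false, 2 is already present

        Console.WriteLine(sequence.Sum()); // 6
        Console.WriteLine(list.Count);     // 2
        Console.WriteLine(ages.Count);     // 2
        Console.WriteLine(added);          // False
    }
}
```

In each case the mutation that matters is adding or removing an element, which is exactly the operation a fixed-size array cannot express.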

Pedagogic Problems

It is important that beginning programmers understand arrays; it is an important and widely used
concept. But it is also important to me that they understand the weaknesses and shortcomings of
arrays. In almost every case, there is a better tool to use than an array.

The difficulty is, pedagogically, that it is hard to discuss the merits of those tools without already
having down concepts like classes, interfaces, generics, asymptotic performance, query expressions,
and so on. It’s a hard problem for the writer and for the teacher. Fortunately, for me, it's not a
problem that I personally have to solve.

Tags: Arrays, C#, Code Quality, Immutability, Performance, Rants, Software development methodology, Threading
