0% found this document useful (0 votes)

10 views33 pages

05-thread

The document discusses the concepts of processes and threads in operating systems, emphasizing the importance of concurrency and parallelism. It explains how threads share a process's address space and OS resources, making them more efficient than separate processes. The document also contrasts kernel threads and user-level threads, highlighting the advantages of user-level threads in terms of speed and efficiency, while addressing potential issues such as I/O blocking and preemption.

Uploaded by

nyamora208

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views33 pages

05-thread

Uploaded by

nyamora208

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 33

Operating Systems

Fall 2014

Threads

Myungjin Lee
[email protected]

1
What’s “in” a process?

• A process consists of (at least):

– An address space, containing
• the code (instructions) for the running program
• the data for the running program
– Thread state, consisting of
• The program counter (PC), indicating the next instruction
• The stack pointer register (implying the stack it points to)
• Other general purpose register values
– A set of OS resources
• open files, network connections, sound channels, …
• That’s a lot of concepts bundled together!
• Today: decompose …
– address space
– thread of control (stack, stack pointer, program counter, registers)
– OS resources

2
The Big Picture

• Threads are about concurrency and parallelism

– Parallelism: physically simultaneous operations for performance
– Concurrency: logically (and possibly physically) simultaneous
operations for convenience/simplicity
• One way to get concurrency and parallelism is to use
multiple processes
– The programs (code) of distinct processes are isolated from each
other
• Threads are another way to get concurrency and
parallelism
– Threads “share a process” – same address space, same OS
resources
– Threads have private stack, CPU state – are schedulable

3
Concurrency/Parallelism

• Imagine a web server, which might like to handle multiple requests

concurrently
– While waiting for the credit card server to approve a purchase for one client,
it could be retrieving the data requested by another client from disk, and
assembling the response for a third client from cached information
• Imagine a web client (browser), which might like to initiate multiple
requests concurrently
– The CSE home page has dozens of “src= …” html commands, each of
which is going to involve a lot of sitting around! Wouldn’t it be nice to be
able to launch these requests concurrently?
• Imagine a parallel program running on a multiprocessor, which might
like to employ “physical concurrency”
– For example, multiplying two large matrices – split the output matrix into k
regions and compute the entries in each region concurrently, using k
processors

4
What’s needed?

• In each of these examples of concurrency (web server, web

client, parallel program):
– Everybody wants to run the same code
– Everybody wants to access the same data
– Everybody has the same privileges
– Everybody uses the same resources (open files, network
connections, etc.)
• But you’d like to have multiple hardware execution states:
– an execution stack and stack pointer (SP)
• traces state of procedure calls made
– the program counter (PC), indicating the next instruction
– a set of general-purpose processor registers and their values

5
How could we achieve this?

• Given the process abstraction as we know it:

– fork several processes
– cause each to map to the same physical memory to share data
• see the shmget() system call for one way to do this (kind of)
• This is like making a pig fly – it’s really inefficient
– space: PCB, page tables, etc.
– time: creating OS structures, fork/copy address space, etc.
• Some equally bad alternatives for some of the examples:
– Entirely separate web servers
– Manually programmed asynchronous programming (non-blocking I/
O) in the web client (browser)

6
Can we do better?

• Key idea:
– separate the concept of a process (address space, OS resources)
– … from that of a minimal “thread of control” (execution state: stack,
stack pointer, program counter, registers)
• This execution state is usually called a thread, or
sometimes, a lightweight process

thread

7
Threads and processes

• Most modern OS’s (Mach (Mac OS), Chorus, Windows,

UNIX) therefore support two entities:
– the process, which defines the address space and general process
attributes (such as open files, etc.)
– the thread, which defines a sequential execution stream within a
process
• A thread is bound to a single process / address space
– address spaces, however, can have multiple threads executing
within them
– sharing data between threads is cheap: all see the same address
space
– creating threads is cheap too!
• Threads become the unit of scheduling
– processes / address spaces are just containers in which threads
execute

8
• Threads are concurrent executions sharing an address
space (and some OS resources)
• Address spaces provide isolation
– If you can’t name it, you can’t read or write it
• Hence, communicating between processes is expensive
– Must go through the OS to move data from one address space to
another
• Because threads are in the same address space,
communication is simple/cheap
– Just update a shared variable!

9
The design space

Key
older
MS/DOS UNIXes

address one thread per process one thread per process

space one process many processes

thread
Java Mach, NT,
Chorus,
Linux, …
many threads per process many threads per process
one process many processes

10
(old) Process address space

0xFFFFFFFF
stack
(dynamic allocated mem)
SP

address space heap

(dynamic allocated mem)

static data
(data segment)

code PC
(text segment)
0x00000000

11
(new) Address space with threads
thread 1 stack
SP (T1)
0xFFFFFFFF
thread 2 stack
SP (T2)

thread 3 stack
SP (T3)
address space

heap
(dynamic allocated mem)

static data
(data segment)
0x00000000 PC (T2)
code
PC (T1)
(text segment)
PC (T3)

© 2012 Gribble, Lazowska, Levy, Zahorjan 12 12

Process/thread separation

• Concurrency (multithreading) is useful for:

– handling concurrent events (e.g., web servers and clients)
– building parallel programs (e.g., matrix multiply, ray tracing)
– improving program structure (the Java argument)
• Multithreading is useful even on a uniprocessor
– even though only one thread can run at a time
• Supporting multithreading – that is, separating the concept
of a process (address space, files, etc.) from that of a
minimal thread of control (execution state), is a big win
– creating concurrency does not require creating new processes
– “faster / better / cheaper”

13
Terminology

• Just a note that there’s the potential for some confusion …

– Old world: “process” == “address space + OS resources + single
thread”
– New world: “process” typically refers to an address space + system
resources + all of its threads …
• When we mean the “address space” we need to be explicit
“thread” refers to a single thread of control within a process /
address space

• A bit like “kernel” and “operating system” …

– Old world: “kernel” == “operating system” and runs in “kernel
mode”
– New world: “kernel” typically refers to the microkernel; lots of the
operating system runs in user mode

14
“Where do threads come from, Mommy?”

• Natural answer: the OS is responsible for creating/

managing threads
– For example, the kernel call to create a new thread would
• allocate an execution stack within the process address space
• create and initialize a Thread Control Block
– stack pointer, program counter, register values
• stick it on the ready queue
– We call these kernel threads
– There is a “thread name space”
• Thread id’s (TID’s)
• TID’s are integers (surprise!)

15
Kernel threads

Mach, NT,
Chorus,
address
Linux, …
space

os kernel
thread CPU
(thread create, destroy,
signal, wait, etc.)

16
Kernel threads

• OS now manages threads and processes / address spaces

– all thread operations are implemented in the kernel
– OS schedules all of the threads in a system
• if one thread in a process blocks (e.g., on I/O), the OS knows about
it, and can run other threads from that process
• possible to overlap I/O and computation inside a process
• Kernel threads are cheaper than processes
– less state to allocate and initialize
• But, they’re still pretty expensive for fine-grained use
– orders of magnitude more expensive than a procedure call
– thread operations are all system calls
• context switch
• argument checks
– must maintain kernel state for each thread

17
“Where do threads come from, Mommy?” (2)

• There is an alternative to kernel threads

• Threads can also be managed at the user level (that is,
entirely from within the process)
– a library linked into the program manages the threads
• because threads share the same address space, the thread
manager doesn’t need to manipulate address spaces (which
only the kernel can do)
• threads differ (roughly) only in hardware contexts (PC, SP,
registers), which can be manipulated by user-level code
• the thread package multiplexes user-level threads on top of
kernel thread(s)
• each kernel thread is treated as a “virtual processor”
– we call these user-level threads

18
User-level threads
user-level
thread library

(thread create, destroy,

signal, wait, etc.)

address
space

os kernel
thread CPU

19
User-level threads: what the kernel sees

address
space

os kernel
thread CPU

20
User-level threads: the full story
user-level
thread library

(thread create, destroy,

signal, wait, etc.)
Mach, NT,
Chorus,
address
Linux, …
space

kernel threads
os kernel
thread CPU
(kernel thread create, destroy,
signal, wait, etc.)

21
User-level threads

• User-level threads are small and fast

– managed entirely by user-level library
• E.g., pthreads (libpthreads.a)
– each thread is represented simply by a PC, registers, a stack, and a
small thread control block (TCB)
– creating a thread, switching between threads, and synchronizing
threads are done via procedure calls
• no kernel involvement is necessary!
– user-level thread operations can be 10-100x faster than kernel
threads as a result

22
Performance example

• On a 700MHz Pentium running Linux 2.2.16 (only the

relative numbers matter; ignore the ancient CPU!):

– Processes
• fork/exit: 251 µs

Why?
– Kernel threads
• pthread_create()/pthread_join(): 94 µs (2.5x faster)

– User-level threads
• pthread_create()/pthread_join: 4.5 µs (another 20x
faster)
Why?

23
User-level thread implementation

• The OS schedules the kernel thread

• The kernel thread executes user code, including the thread
support library and its associated thread scheduler
• The thread scheduler determines when a user-level thread
runs
– it uses queues to keep track of what threads are doing: run, ready,
wait
• just like the OS and processes
• but, implemented at user-level as a library

24
Thread interface

• This is taken from the POSIX pthreads API:

– rcode = pthread_create(&t, attributes,
start_procedure)
• creates a new thread of control
• new thread begins executing at start_procedure
– pthread_cond_wait(condition_variable, mutex)
• the calling thread blocks, sometimes called thread_block()
– pthread_signal(condition_variable)
• starts a thread waiting on the condition variable
– pthread_exit()
• terminates the calling thread
– pthread_join(t)
• waits for the named thread to terminate

25
Thread context switch

• Very simple for user-level threads:

– save context of currently running thread
• push CPU state onto thread stack
– restore context of the next thread
• pop CPU state from next thread’s stack
– return as the new thread
• execution resumes at PC of next thread
– Note: no changes to memory mapping required!
• This is all done by assembly language
– it works at the level of the procedure calling convention
• thus, it cannot be implemented using procedure calls

26
How to keep a user-level thread from
hogging the CPU?
• Strategy 1: force everyone to cooperate
– a thread willingly gives up the CPU by calling yield()
– yield() calls into the scheduler, which context switches to another
ready thread
– what happens if a thread never calls yield()?

• Strategy 2: use preemption

– scheduler requests that a timer interrupt be delivered by the OS
periodically
• usually delivered as a UNIX signal (man signal)
• signals are just like software interrupts, but delivered to user-
level by the OS instead of delivered to OS by hardware
– at each timer interrupt, scheduler gains control and context switches
as appropriate

27
What if a thread tries to do I/O?

• The kernel thread “powering” it is lost for the duration of the

(synchronous) I/O operation!
– The kernel thread blocks in the OS, as always
– It maroons with it the state of the user-level thread
• Could have one kernel thread “powering” each user-level
thread
– “common case” operations (e.g., synchronization) would be quick
• Could have a limited-size “pool” of kernel threads
“powering” all the user-level threads in the address space
– the kernel will be scheduling these threads, obliviously to what’s
going on at user-level

28
Multiple kernel threads “powering”
each address space
user-level
thread library

(thread create, destroy,

signal, wait, etc.)

address
space

kernel threads
os kernel
thread CPU
(kernel thread create, destroy,
signal, wait, etc.)

29
What if the kernel preempts a thread
holding a lock?
• Other threads will be unable to enter the critical section and
will block (stall)

30
Addressing these problems

• Effective coordination of kernel decisions and user-level

threads requires OS-to-user-level communication
– OS notifies user-level that it is about to suspend a kernel thread
• This is called “scheduler activations”
• a research paper from UW with huge effect on practice
• each process can request one or more kernel threads
– process is given responsibility for mapping user-level
threads onto kernel threads
– kernel promises to notify user-level before it suspends or
destroys a kernel thread
• ACM TOCS 10,1

31
Summary

• You really want multiple threads per address space

• Kernel threads are much more efficient than processes, but
they’re still not cheap
– all operations require a kernel call and parameter validation
• User-level threads are:
– really fast/cheap
– great for common-case operations
• creation, synchronization, destruction
– can suffer in uncommon cases due to kernel obliviousness
• I/O
• preemption of a lock-holder
• Scheduler activations are an answer
– pretty subtle though

32
The design space

older
MS/DOS UNIXes

address one thread/process one thread/process

space one process many processes

thread
Java Mach, NT,
Chorus,
Linux, …
many threads/process many threads/process
one process many processes

Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Operating Systems: Threads
No ratings yet
Operating Systems: Threads
32 pages
Operating System 4
No ratings yet
Operating System 4
33 pages
Distributed Systems
No ratings yet
Distributed Systems
238 pages
Questions Answered in This Lecture:: - Why Are Threads Useful? - How Does One Use POSIX Pthreads?
No ratings yet
Questions Answered in This Lecture:: - Why Are Threads Useful? - How Does One Use POSIX Pthreads?
6 pages
Chapter 2 Process Management Part 2 Threads and Multithreading
No ratings yet
Chapter 2 Process Management Part 2 Threads and Multithreading
42 pages
COS 318: Operating Systems Processes and Threads
No ratings yet
COS 318: Operating Systems Processes and Threads
24 pages
4 Threads
No ratings yet
4 Threads
61 pages
Lecture 6: Threads: Operating Systems (A) (Honor Track)
No ratings yet
Lecture 6: Threads: Operating Systems (A) (Honor Track)
44 pages
Threads
No ratings yet
Threads
11 pages
Chapter 4 - Thread Concept
No ratings yet
Chapter 4 - Thread Concept
46 pages
Processes and Threads: - An Operating System Executes A Variety of Programs
No ratings yet
Processes and Threads: - An Operating System Executes A Variety of Programs
24 pages
0378b Thread
No ratings yet
0378b Thread
41 pages
Processes and Threads Ch2
No ratings yet
Processes and Threads Ch2
7 pages
Ee249 13 Rtos
No ratings yet
Ee249 13 Rtos
211 pages
CSCI 350 Ch. 4 - Threads and Concurrency: Mark Redekopp Michael Shindler & Ramesh Govindan
No ratings yet
CSCI 350 Ch. 4 - Threads and Concurrency: Mark Redekopp Michael Shindler & Ramesh Govindan
41 pages
IPC Linux
No ratings yet
IPC Linux
58 pages
2.2 DD2356 Threads
No ratings yet
2.2 DD2356 Threads
22 pages
Lecture 16
No ratings yet
Lecture 16
30 pages
Threads: Tevfik Koşar
100% (1)
Threads: Tevfik Koşar
40 pages
Threads
No ratings yet
Threads
18 pages
lec02-concurrency
No ratings yet
lec02-concurrency
38 pages
Ch. 2 Lecture 1 PDF
No ratings yet
Ch. 2 Lecture 1 PDF
59 pages
Threads in Operating System
No ratings yet
Threads in Operating System
103 pages
Chapter 2 (Two)
No ratings yet
Chapter 2 (Two)
52 pages
lecture_03
No ratings yet
lecture_03
49 pages
4389_3_71_Module_2
No ratings yet
4389_3_71_Module_2
126 pages
EE-379 Embedded Systems and Applications: Real Time Operating Systems (RTOS) Part 1: Processes or Tasks and Threads
No ratings yet
EE-379 Embedded Systems and Applications: Real Time Operating Systems (RTOS) Part 1: Processes or Tasks and Threads
22 pages
Operating System Support: Distributed Systems Course
No ratings yet
Operating System Support: Distributed Systems Course
7 pages
Chapter 2
No ratings yet
Chapter 2
21 pages
Pocess Manament Tread PDF
No ratings yet
Pocess Manament Tread PDF
62 pages
2.process and Threds
No ratings yet
2.process and Threds
48 pages
Lecture 3 - Threads
No ratings yet
Lecture 3 - Threads
28 pages
Threads-1
No ratings yet
Threads-1
8 pages
Lect 03
No ratings yet
Lect 03
203 pages
Threads
No ratings yet
Threads
38 pages
pbl4-Osama3011
No ratings yet
pbl4-Osama3011
16 pages
Module02-2
No ratings yet
Module02-2
36 pages
Process
No ratings yet
Process
33 pages
2 Processes Threads
No ratings yet
2 Processes Threads
15 pages
Chapter - 4 Threads (Full)
No ratings yet
Chapter - 4 Threads (Full)
72 pages
Mode, Space, and Context: The Basics: Jeff Chase Duke University
No ratings yet
Mode, Space, and Context: The Basics: Jeff Chase Duke University
39 pages
4.OS Threads Dr. Punit
No ratings yet
4.OS Threads Dr. Punit
48 pages
Chapter 4: Threads: in This Chapter Our Focus Is On: Multithreading Models Thread Libraries Threading Issues
No ratings yet
Chapter 4: Threads: in This Chapter Our Focus Is On: Multithreading Models Thread Libraries Threading Issues
23 pages
Lec17 Threads Introduction
No ratings yet
Lec17 Threads Introduction
20 pages
CSI3131 Module 3
No ratings yet
CSI3131 Module 3
56 pages
Lecture Thread
No ratings yet
Lecture Thread
45 pages
System Programming - II Threads
No ratings yet
System Programming - II Threads
46 pages
OS-Chap2-2021 01 22
No ratings yet
OS-Chap2-2021 01 22
103 pages
Process and Threads, Presentation4
No ratings yet
Process and Threads, Presentation4
20 pages
DSL 04 Processes
No ratings yet
DSL 04 Processes
33 pages
Lecture 04
No ratings yet
Lecture 04
11 pages
CS307 Lecture 1
No ratings yet
CS307 Lecture 1
33 pages
OS Module-2 (Highlighted)
No ratings yet
OS Module-2 (Highlighted)
43 pages
Chapter - 3 Process
No ratings yet
Chapter - 3 Process
47 pages
Chapter 4: Threads
No ratings yet
Chapter 4: Threads
33 pages
Chap 4
No ratings yet
Chap 4
44 pages
5CS4 03 Os - NK
No ratings yet
5CS4 03 Os - NK
94 pages
chapter 4
No ratings yet
chapter 4
18 pages
All My IT Tech Posts
From Everand
All My IT Tech Posts
Stephen Edwards
No ratings yet
CSC204 - Chapter 1.2
No ratings yet
CSC204 - Chapter 1.2
27 pages
Visvesvaraya Technological University Belagavi: Scheme of Teaching and Examinations and Syllabus
No ratings yet
Visvesvaraya Technological University Belagavi: Scheme of Teaching and Examinations and Syllabus
29 pages
Commonly Used Approaches To Real-Time Scheduling
No ratings yet
Commonly Used Approaches To Real-Time Scheduling
25 pages
CF Unit 3
No ratings yet
CF Unit 3
13 pages
Bangladesh University: Answer Any Five (5) From The Following Questions. Each Set Must Be Answered Together
No ratings yet
Bangladesh University: Answer Any Five (5) From The Following Questions. Each Set Must Be Answered Together
2 pages
XDC2018 Android-X86 Tech Talk
No ratings yet
XDC2018 Android-X86 Tech Talk
53 pages
Android Emulation Setup
No ratings yet
Android Emulation Setup
6 pages
Lecture 29 GPU Architecture Example
No ratings yet
Lecture 29 GPU Architecture Example
15 pages
20+ Cool Command Prompt Tricks That You Should Know (2023) - Beebom
No ratings yet
20+ Cool Command Prompt Tricks That You Should Know (2023) - Beebom
30 pages
Linux Programming Lecture Notes
79% (19)
Linux Programming Lecture Notes
190 pages
Module 2 Reviewer
No ratings yet
Module 2 Reviewer
26 pages
Log Time
No ratings yet
Log Time
9 pages
Install
No ratings yet
Install
78 pages
The PC Boot Process - Windows XP.: Fixed Disk
No ratings yet
The PC Boot Process - Windows XP.: Fixed Disk
5 pages
VMware Interview Questions V2.0
100% (1)
VMware Interview Questions V2.0
7 pages
Charlotte Pipe Revit Families: Install Csvs To Revit Mep "Lookuptables" Folder, See Install Paths Below
No ratings yet
Charlotte Pipe Revit Families: Install Csvs To Revit Mep "Lookuptables" Folder, See Install Paths Below
3 pages
Linux For Pentester
No ratings yet
Linux For Pentester
48 pages
ASEEx Slides
No ratings yet
ASEEx Slides
87 pages
Db2 Interview Questions
No ratings yet
Db2 Interview Questions
4 pages
03-0237-02 Note To User ZEUS 3.2 198401
No ratings yet
03-0237-02 Note To User ZEUS 3.2 198401
48 pages
CLI Cheat Sheet: Directory Operations IO Redirection
No ratings yet
CLI Cheat Sheet: Directory Operations IO Redirection
1 page
217 Lec2
No ratings yet
217 Lec2
24 pages
crash-2025-05-15_17.03.08-server
No ratings yet
crash-2025-05-15_17.03.08-server
33 pages
Percona Server Installation: Running PMM Server Via Docker
No ratings yet
Percona Server Installation: Running PMM Server Via Docker
3 pages
Maria DB Concepts
No ratings yet
Maria DB Concepts
6 pages
26-Synchronization in Java
No ratings yet
26-Synchronization in Java
12 pages
MCUXpresso IDE Installation Guide - Cleaned
No ratings yet
MCUXpresso IDE Installation Guide - Cleaned
14 pages
Parallel and Distributed Computing Lecture 02
No ratings yet
Parallel and Distributed Computing Lecture 02
17 pages
OS Module 1 Complete Solutions
No ratings yet
OS Module 1 Complete Solutions
32 pages
Inter and Intra Query Parallelism
No ratings yet
Inter and Intra Query Parallelism
1 page

05-thread

Uploaded by

05-thread

Uploaded by

Operating Systems

• A process consists of (at least):

• Threads are about concurrency and parallelism

• Imagine a web server, which might like to handle multiple requests

• In each of these examples of concurrency (web server, web

• Given the process abstraction as we know it:

• Most modern OS’s (Mach (Mac OS), Chorus, Windows,

address one thread per process one thread per process

address space heap

© 2012 Gribble, Lazowska, Levy, Zahorjan 12 12

• Concurrency (multithreading) is useful for:

• Just a note that there’s the potential for some confusion …

• A bit like “kernel” and “operating system” …

• Natural answer: the OS is responsible for creating/

• OS now manages threads and processes / address spaces

• There is an alternative to kernel threads

(thread create, destroy,

(thread create, destroy,

• User-level threads are small and fast

• On a 700MHz Pentium running Linux 2.2.16 (only the

• The OS schedules the kernel thread

• This is taken from the POSIX pthreads API:

• Very simple for user-level threads:

• Strategy 2: use preemption

• The kernel thread “powering” it is lost for the duration of the

(thread create, destroy,

• Effective coordination of kernel decisions and user-level

• You really want multiple threads per address space

address one thread/process one thread/process

You might also like