This document provides an overview of using MPI to parallelize the solution of the Laplace partial differential equation across multiple processors. It describes distributing the work, data, and communication required. The serial Jacobi iteration method is presented, and its parallel implementation is demonstrated through domain decomposition, data distribution across processors, and use of MPI functions like Send, Recv, and Reduce to exchange boundary data between processors at each iteration. Sample C and Fortran code templates are provided to get started in implementing the parallel solution.


Parallel Port Example

April 24, 2002


Introduction
The objective of this lecture is to go over a simple problem that illustrates the use of the MPI
library to parallelize the solution of a partial differential equation (PDE).
The Laplace problem is a simple PDE that is found at the core of many applications. More
elaborate problems often have the same communication structure that we will discuss in
this class. Thus, we will use this example to introduce the fundamentals of how
communication patterns appear in more complex PDE problems.
This lecture will demonstrate message-passing techniques, among them how to:
• Distribute work
• Distribute data
• Communicate: since each processor has its own memory, data is not shared, and
communication becomes essential.
• Synchronize



Laplace Equation
The Laplace equation is:

    ∂²T/∂x² + ∂²T/∂y² = 0

We want to find T(x,y) subject to prescribed boundary conditions on the edges of the domain.



Laplace Equation
To find an approximate solution to the equation, define a square mesh (grid) of points at
which the values of T will be computed.



The Point Jacobi Iteration
The method known as “point Jacobi iteration” calculates the value of T(i,j) as
the average of the old values of T at the four neighboring points:

    T_new(i,j) = ( T(i-1,j) + T(i+1,j) + T(i,j-1) + T(i,j+1) ) / 4



The Point Jacobi Iteration
The iteration is repeated until the solution converges.

If we want to solve for T at 1000 x 1000 points, the grid itself needs to be of
dimension 1002 x 1002, since the algorithm to calculate T(i,j) requires
values of T at i-1, i+1, j-1, and j+1.



Serial Code Implementation
In the following, NR = number of rows and NC = number of columns (excluding the boundary
rows and columns).
The serial implementation of the Jacobi iteration is shown in the C and Fortran versions that follow.



Serial Version – C
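The C listing on these slides did not survive the transcript. Below is a minimal sketch of a serial
point Jacobi solver for this problem, assuming a 1000 x 1000 interior grid, a fixed number of
iterations read from standard input, and example boundary values; the names (t, told, NR, NC,
niter) follow the slides, but the original code differs in detail.

    #include <stdio.h>
    #include <math.h>

    #define NR 1000            /* interior rows    (assumed size) */
    #define NC 1000            /* interior columns (assumed size) */

    float t[NR + 2][NC + 2];    /* grid including boundary rows/columns */
    float told[NR + 2][NC + 2]; /* copy of the previous iteration       */

    int main(void)
    {
        int   i, j, iter, niter = 0;
        float dt;               /* maximum change in one iteration */

        /* maximum number of iterations, read once */
        if (scanf("%d", &niter) != 1)
            return 1;

        /* initialize: everything 0, right boundary column 100
           (example values; the actual boundary conditions are on the slides) */
        for (i = 0; i <= NR + 1; i++)
            for (j = 0; j <= NC + 1; j++)
                t[i][j] = (j == NC + 1) ? 100.0f : 0.0f;

        for (iter = 1; iter <= niter; iter++) {
            /* copy T into Told */
            for (i = 0; i <= NR + 1; i++)
                for (j = 0; j <= NC + 1; j++)
                    told[i][j] = t[i][j];

            /* point Jacobi update: average of the four old neighbors */
            dt = 0.0f;
            for (i = 1; i <= NR; i++)
                for (j = 1; j <= NC; j++) {
                    t[i][j] = 0.25f * (told[i - 1][j] + told[i + 1][j] +
                                       told[i][j - 1] + told[i][j + 1]);
                    dt = fmaxf(dt, fabsf(t[i][j] - told[i][j]));
                }
            printf("iteration %d, max change %f\n", iter, dt);
        }
        return 0;
    }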


Serial Version – Fortran

(The Fortran listing of the serial Jacobi code is not reproduced here.)


Parallel Version: Example Using 4 Processors
Recall that in the serial case the grid boundaries were:



Simplest Decomposition for Fortran Code



Simplest Decomposition for Fortran Code
A better distribution, from the point of view of communication optimization, is the following:

The program has a “local” view of the data.
The programmer has to have a “global” view of the data.
Simplest Decomposition for C Code



Simplest Decomposition for C Code
In the parallel case, we will break the grid up across 4 processors.
There is only one set of boundary values, but when we distribute the data, each
processor needs an extra row to hold its neighbor's boundary values.

The program has a “local” view of the data.
The programmer has to have a “global” view of the data.
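To make the layout concrete, here is a hedged sketch of how the local data might be declared on
each processor in C; the names (NRL, NC, NPES, t, told) and the static sizing are assumptions for
illustration, not the original template.

    #define NC   1000            /* global number of interior columns (assumed) */
    #define NPES 4               /* number of processors in this example        */
    #define NRL  (1000 / NPES)   /* local number of interior rows per processor */

    /* Local slice of the grid: NRL interior rows plus two extra rows.
     * Row 0 holds the row received from the neighbor above (or the global top
     * boundary on PE 0); row NRL+1 holds the row from the neighbor below
     * (or the global bottom boundary on the last PE). */
    float t[NRL + 2][NC + 2];
    float told[NRL + 2][NC + 2];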



Include Files
Fortran:
* (always declare all variables)
implicit none
INCLUDE 'mpif.h'

* Initialization and clean up (always check error codes):


call MPI_Init(ierr)
call MPI_Finalize(ierr)

C:
#include "mpi.h"
/* Initialization and clean up (always check error codes): */

stat = MPI_Init(&argc, &argv);


stat = MPI_Finalize();

Note: Check for MPI_SUCCESS

if (ierr .ne. MPI_SUCCESS) then
   ! do error processing
endif



Initialization
Serial version:

Parallel version:
Just for simplicity, we will distribute rows in C and columns in Fortran; this is easier because data
is stored by rows in C (row-major order) and by columns in Fortran (column-major order).
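As an illustration of the bookkeeping involved, here is a hedged C sketch of how a processor can
compute which global rows it owns; the helper name local_row_range and the even-split
assumption (NR divisible by npes) are mine, not the template's.

    /* Compute the range of global rows owned by processor `mype`
     * when NR interior rows are split evenly over `npes` processors.
     * Assumes NR is divisible by npes, as in the 1000-row / 4-PE example. */
    static void local_row_range(int NR, int npes, int mype,
                                int *first_row, int *last_row)
    {
        int NRL = NR / npes;              /* local number of rows        */
        *first_row = mype * NRL + 1;      /* 1-based, excludes boundary  */
        *last_row  = *first_row + NRL - 1;
    }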



Parallel Version: Boundary Conditions
Fortran Version

We need to know our processor number (MYPE) and how many PEs we are using.
Each processor will work on different data depending on MYPE.
Here are the boundary conditions in the serial code, where
NRL = local number of rows, NRL = NR/NPROC



Parallel C Version: Boundary Conditions

We need to know our processor number (MYPE) and how many PEs we are using. Each processor
will work on different data depending on MYPE.
Here are the boundary conditions in the serial code, where
NRL = local number of rows, NRL = NR/NPROC
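The boundary-condition code on this slide is not reproduced in the transcript. The following is a
minimal sketch of how boundary values might be set in the row-distributed C case, reusing the
declarations from the earlier layout sketch; the specific boundary values (0 and 100) are
placeholders, not the actual data.

    /* Apply boundary conditions on the local slice t[0..NRL+1][0..NC+1].
     * Every PE holds the left and right boundary columns of its own rows;
     * only PE 0 holds the global top row, only PE npes-1 the global bottom row. */
    for (int i = 0; i <= NRL + 1; i++) {
        t[i][0]      = 0.0f;            /* left boundary              */
        t[i][NC + 1] = 100.0f;          /* right boundary (assumed)   */
    }
    if (mype == 0)                      /* global top boundary row    */
        for (int j = 0; j <= NC + 1; j++)
            t[0][j] = 0.0f;
    if (mype == npes - 1)               /* global bottom boundary row */
        for (int j = 0; j <= NC + 1; j++)
            t[NRL + 1][j] = 100.0f;     /* assumed value              */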



Processor Information
Fortran:
Number of processors:
call MPI_Comm_size(MPI_COMM_WORLD, npes, ierr)
Processor Number:
call MPI_Comm_rank(MPI_COMM_WORLD, mype, ierr)
C:
Number of processors:
stat = MPI_Comm_size(MPI_COMM_WORLD, &npes);
Processor Number:
stat = MPI_Comm_rank(MPI_COMM_WORLD, &mype);



Maximum Number of Iterations
Only 1 PE has to do I/O (usually PE0).
Then PE0 (or root PE) will broadcast niter to all others. Use the
collective operation MPI_Bcast.
Fortran:

Here, the number of elements is how many values we are passing; in this case
only one: niter.
C:
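The broadcast call itself is missing from the transcript. A hedged C version, assuming niter is an
int and PE 0 is the root, is:

    /* Called by every rank with the same arguments; after the call,
     * all PEs hold the value of niter that PE 0 read. */
    stat = MPI_Bcast(&niter, 1, MPI_INT, 0, MPI_COMM_WORLD);

The Fortran call takes the same arguments plus a trailing ierr status argument.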



Main Loop
for (iter = 1; iter <= NITER; iter++) {
    Do averaging (each PE averages its own rows, e.g. rows 1 to 250 with 4 PEs)
    Copy T into Told
    /* This is where we use MPI communication calls: we need to
       exchange boundary data between processors */
    Send values down
    Send values up
    Receive values from above
    Receive values from below
    Find the max change
    Synchronize
}



Parallel Template: Send data up
Once the new T values have been calculated:
SEND
• All processors except processor 0 send their “first” row (in C) to their neighbor above
(mype - 1), as sketched below.
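A minimal sketch of this send, assuming the row-distributed layout sketched earlier (row 1 of the
local array t is contiguous in memory) and a hypothetical tag UP_TAG like the one on the later
Variations slide; whether the boundary columns are included in the count (NC or NC + 2) depends
on the actual template:

    if (mype != 0) {
        /* send my first interior row to the neighbor above (mype - 1) */
        MPI_Send(&t[1][0], NC + 2, MPI_FLOAT, mype - 1, UP_TAG, MPI_COMM_WORLD);
    }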



Parallel Template: Send data down
SEND
• All processors except the last one send their “last” row to their neighbor below (mype + 1).



Parallel Template: Receive from above
Receive
• All processors except PE0 receive from their neighbor above and unpack it in row 0.



Parallel Template: Receive from below
Receive
• All processors except processor (NPES-1) receive from the neighbor below and unpack it in
the last row, as sketched below.

Example: PE1 receives 2 messages – there is no guarantee of the order in which they will be
received.
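A hedged sketch of the two matching receives, using the same assumed layout and tags as the send
sketch (DOWN_TAG is a hypothetical tag for the downward sends); ghost row 0 is filled from
above and ghost row NRL+1 from below:

    MPI_Status status;

    if (mype != 0) {
        /* row sent down by the PE above lands in ghost row 0 */
        MPI_Recv(&t[0][0], NC + 2, MPI_FLOAT, mype - 1, DOWN_TAG,
                 MPI_COMM_WORLD, &status);
    }
    if (mype != npes - 1) {
        /* row sent up by the PE below lands in ghost row NRL+1 */
        MPI_Recv(&t[NRL + 1][0], NC + 2, MPI_FLOAT, mype + 1, UP_TAG,
                 MPI_COMM_WORLD, &status);
    }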
Parallel Template (C)

(The full C template listing, laplace.t3e.c, spans several slides and is not reproduced here.)
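Since the template listing did not survive the transcript, the sketch below is a compact, hedged
consolidation of the structure described on the preceding slides: local Jacobi averaging, a halo
exchange with MPI_Send/MPI_Recv, and a reduction of the maximum change. All specifics (NRL, NC,
the tags UP_TAG and DOWN_TAG, the example boundary values, the printout) are assumptions; the
real laplace.t3e.c differs in detail.

    #include <stdio.h>
    #include <math.h>
    #include "mpi.h"

    #define NR 1000                 /* global interior rows    (assumed) */
    #define NC 1000                 /* global interior columns (assumed) */
    #define UP_TAG   10             /* hypothetical message tags */
    #define DOWN_TAG 20

    int main(int argc, char *argv[])
    {
        int i, j, iter, niter = 0, mype, npes, NRL;
        float dt, dtg;
        MPI_Status status;

        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &mype);
        MPI_Comm_size(MPI_COMM_WORLD, &npes);
        NRL = NR / npes;                         /* assumes an even split of rows */

        float t[NRL + 2][NC + 2], told[NRL + 2][NC + 2];

        /* initialize: everything 0, right boundary column 100 (example values) */
        for (i = 0; i <= NRL + 1; i++)
            for (j = 0; j <= NC + 1; j++)
                t[i][j] = (j == NC + 1) ? 100.0f : 0.0f;

        /* PE0 reads the number of iterations and broadcasts it */
        if (mype == 0) scanf("%d", &niter);
        MPI_Bcast(&niter, 1, MPI_INT, 0, MPI_COMM_WORLD);

        for (iter = 1; iter <= niter; iter++) {
            for (i = 0; i <= NRL + 1; i++)       /* copy T into Told */
                for (j = 0; j <= NC + 1; j++)
                    told[i][j] = t[i][j];

            dt = 0.0f;                           /* point Jacobi averaging */
            for (i = 1; i <= NRL; i++)
                for (j = 1; j <= NC; j++) {
                    t[i][j] = 0.25f * (told[i-1][j] + told[i+1][j] +
                                       told[i][j-1] + told[i][j+1]);
                    dt = fmaxf(dt, fabsf(t[i][j] - told[i][j]));
                }

            /* halo exchange: send last row down, first row up, then receive */
            if (mype != npes - 1)
                MPI_Send(&t[NRL][0], NC + 2, MPI_FLOAT, mype + 1, DOWN_TAG, MPI_COMM_WORLD);
            if (mype != 0)
                MPI_Send(&t[1][0], NC + 2, MPI_FLOAT, mype - 1, UP_TAG, MPI_COMM_WORLD);
            if (mype != 0)                       /* from above into ghost row 0 */
                MPI_Recv(&t[0][0], NC + 2, MPI_FLOAT, mype - 1, DOWN_TAG,
                         MPI_COMM_WORLD, &status);
            if (mype != npes - 1)                /* from below into ghost row NRL+1 */
                MPI_Recv(&t[NRL + 1][0], NC + 2, MPI_FLOAT, mype + 1, UP_TAG,
                         MPI_COMM_WORLD, &status);

            /* global maximum change, gathered on PE0 */
            MPI_Reduce(&dt, &dtg, 1, MPI_FLOAT, MPI_MAX, 0, MPI_COMM_WORLD);
            if (mype == 0) printf("iteration %d, max change %f\n", iter, dtg);
        }

        MPI_Finalize();
        return 0;
    }

Note that the back-to-back blocking sends rely on MPI buffering these relatively small messages;
the MPI_Sendrecv variation shown later removes that assumption.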


Parallel Template (Fortran)

(The full Fortran template listing, laplace.t3e.f, spans several slides and is not reproduced here.)


Variations

if ( mype != 0 ) {
    up = mype - 1;
    MPI_Send( t, NC, MPI_FLOAT, up, UP_TAG, comm );
}

Alternatively:

up = mype - 1;
if ( mype == 0 ) up = MPI_PROC_NULL;
MPI_Send( t, NC, MPI_FLOAT, up, UP_TAG, comm );



Variations

if( mype.ne.0 ) then
   left = mype - 1
   call MPI_Send( t, NC, MPI_REAL, left, L_TAG, comm, ierr )
endif

Alternatively:

left = mype - 1
if( mype.eq.0 ) left = MPI_PROC_NULL
call MPI_Send( t, NC, MPI_REAL, left, L_TAG, comm, ierr )

Note: You may also MPI_Recv from MPI_PROC_NULL


Variations
Send and receive at the same time:
MPI_Sendrecv( … )
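MPI_Sendrecv combines a matching send and receive into one call, which also removes the reliance
on message buffering. A hedged C sketch for the upward exchange in the row-distributed example
(row 1 is sent up while ghost row NRL+1 is filled from below; the tag and count conventions are
assumptions, and MPI_PROC_NULL turns the edge cases into no-ops):

    MPI_Status status;
    int up   = (mype == 0)        ? MPI_PROC_NULL : mype - 1;
    int down = (mype == npes - 1) ? MPI_PROC_NULL : mype + 1;

    MPI_Sendrecv(&t[1][0],       NC + 2, MPI_FLOAT, up,   UP_TAG,
                 &t[NRL + 1][0], NC + 2, MPI_FLOAT, down, UP_TAG,
                 MPI_COMM_WORLD, &status);

A second, symmetric MPI_Sendrecv handles the downward exchange.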



Finding Maximum Change

Each PE can find its own maximum change dt.

To find the global change dtg in C:

    MPI_Reduce(&dt, &dtg, 1, MPI_FLOAT, MPI_MAX, PE0, comm);

To find the global change dtg in Fortran:

    call MPI_Reduce(dt, dtg, 1, MPI_REAL, MPI_MAX, PE0, comm, ierr)
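Only PE0 holds dtg after the reduce. If every PE needs the result, for example to decide together
whether to stop iterating, one option (an extension not shown on the slides, offered here as an
assumption about how the template might be used) is to follow the reduce with a broadcast of dtg,
or to replace it with MPI_Allreduce:

    /* Give every PE the global maximum change so all ranks can test convergence. */
    MPI_Allreduce(&dt, &dtg, 1, MPI_FLOAT, MPI_MAX, MPI_COMM_WORLD);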



Domain Decomposition



Data Distribution I
Domain Decomposition I

• All processors have the entire T array.
• Each processor works on its own part, TW, of T.
• After every iteration, all processors broadcast their TW to all other processors.
• Increased memory requirements.
• Increased number of operations.
Data Distribution II
Domain Decomposition II

• Each processor has a sub-grid.
• Communicate boundary values only.
• Reduced memory.
• Reduced communication.
• Have to keep track of neighbors in two directions.
Exercise
1. Copy the following parallel templates into your /tmp directory on jaromir:
/tmp/training/laplace/laplace.t3e.c
/tmp/training/laplace/laplace.t3e.f

2. These are template files; your job is to go into the sections marked "<<<<<<" in the source code
and add the necessary statements so that the code will run on 4 PEs.

Useful Web reference for this exercise:


To view a list of all MPI calls, with syntax and descriptions, access the Message Passing
Interface Standard at:
http://www-unix.mcs.anl.gov/mpi/www/
3. To compile the program, after you have modified it, rename the new programs laplace_mpi_c.c
and laplace_mpi_f.f and execute:
cc -o laplace_mpi_c -lmpi laplace_mpi_c.c
f90 -o laplace_mpi_f -lmpi laplace_mpi_f.f



Exercise
4. To run:
echo 200 | mpprun -n 4 ./laplace_mpi_c
echo 200 | mpprun -n 4 ./laplace_mpi_f

5. You can check your program against the solutions laplace_mpi_c.c and laplace_mpi_f.f.



Source Codes
The following are the C and Fortran templates that you need to parallelize for the Exercise.
laplace.t3e.c



Source Codes
laplace.t3e.f

