Week 1-PART 2-Understanding Data Step Processing

This document outlines the processing of SAS DATA steps in epidemiological research, detailing the two main phases: compilation and execution. During the compilation phase, the input buffer and program data vector are created, and syntax errors are checked, while the execution phase involves reading data values and initializing variables. Key concepts such as automatic variables _N_ and _ERROR_ are also introduced, which help track the execution process and errors.

Uploaded by

KinSparkin'

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Week 1-PART 2-Understanding Data Step Processing

Uploaded by

KinSparkin'

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 19

Week 1-Part 2

Understanding DATA
STEP processing
PHEB 631: SAS PROGRAMMING IN
EPIDEMIOLOGICAL RESEARCH
Xiaohui Xu, Ph.D.
Department of Epidemiology and Biostatistics
Part 2 Understanding SAS
programs Processing
Lecture Outlines
• Understanding the steps involved in processing SAS
programs
• Identify the two phases that occur when a DATA step is processed
• Interpret automatic variables
• Identify the processing phase in which an error occurs
2.1 Understanding SAS DATA step
Processing
• A SAS DATA step is processed in two phases:
2.1.1 Compilation Phase

• 2.1.1.1 Input Buffer

• At the beginning of the compilation phase, the input buffer (an area of memory) is
created to hold a record from the external file.
2.1.1 Compilation Phase

• 2.1.1.2 Program Data Vector (PDV)

• After the input buffer is created, the program data vector is created. The program data vector
is the area of memory where SAS builds a data set, one observation at a time.
• The program data vector contains two automatic variables that can be used for processing but
which are not written to the data set as part of an observation.
 _N_ counts the number of times that the DATA step begins to execute.
 _ERROR_ signals the occurrence of an error that is caused by the data during execution.
The default value is 0, which means there is no error. When one or more errors occur, the
value is set to 1.
2.1.1 Compilation Phase

• 2.1.1.3 Data Set Variables

• As the INPUT statement is compiled, a slot is added to the program data vector for
each variable in the new data set.
2.1.1 Compilation Phase

• 2.1.1.3 Data Set Variables

• As the INPUT statement is compiled, a slot is added to the program data vector for
each variable in the new data set.
2.1.1 Compilation Phase

• 2.1.1.3 Data Set Variables

• As the INPUT statement is compiled, a slot is added to the program data vector for
each variable in the new data set.
2.1.1 Compilation Phase

• 2.1.1.3 Data Set Variables

• As the INPUT statement is compiled, a slot is added to the program data vector for
each variable in the new data set.
2.1.1 Compilation
Phase

• 2.1.1.4 Descriptor Portion of the SAS

Data Set
• At the bottom of the DATA step (in this
example, when the RUN statement is
encountered), the compilation phase is
complete, and the descriptor portion of
the new SAS data set is created. The
descriptor portion of the data set
includes
• The name of the data set
• The number of observations and
variables
• The names and attributes of the
variables.
2.1.1 Compilation Phase

• 2.1.1.5 Syntax Checking

• During the compilation phase, SAS also scans each statement in the DATA
step, looking for syntax errors. Syntax errors include
 Missing or misspelled keywords
 Invalid variable names
 Missing or invalid punctuation
 Invalid options.
2.1.2. Execution
Phase
• After the DATA step is compiled, it
is ready for execution. During the
execution phase, the data portion
of the data set is created. The data
portion contains the data values.
2.1.2. Execution Phase
• 2.1.2.1 Initializing Variables
• At the beginning of the execution phase, the value of _N_ is 1. Because there are no
data errors, the value of _ERROR_ is 0.
*Numeric values are of 2 types- Std and non-std e.g., date, currency ($);
2.1.2. Execution Phase
• 2.1.2.2 Input Data
• When an INPUT statement begins to read data values from a record that is held in the
input buffer, it uses an input pointer to keep track of its position.
• The input pointer starts at column 1 of the first record, unless otherwise directed. As
the INPUT statement executes, the raw data is read by the order and is assigned to
variables in the program data vector.
Iterations of the data step until end of the data

SAS Programming 2 Data Manipulation Techniques - Quizzes PDF
100% (1)
SAS Programming 2 Data Manipulation Techniques - Quizzes PDF
92 pages
Data Driven System Engineering: Automotive ECU Development
From Everand
Data Driven System Engineering: Automotive ECU Development
James Wen
No ratings yet
SAS Programming 2: Data Manipulation Techniques - Syntax: Course Notes
100% (1)
SAS Programming 2: Data Manipulation Techniques - Syntax: Course Notes
20 pages
Sas Handbook: By: Luis Montes
No ratings yet
Sas Handbook: By: Luis Montes
20 pages
Introduction to the simulation of power plants for EBSILON®Professional Version 15
From Everand
Introduction to the simulation of power plants for EBSILON®Professional Version 15
Steffen Swat
No ratings yet
Oracle Database Administration Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
From Everand
Oracle Database Administration Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Vibrant Publishers
5/5 (1)
SAS DATA Step - Compile, Execution, and The Program Data Vector
No ratings yet
SAS DATA Step - Compile, Execution, and The Program Data Vector
10 pages
Arthur Xuejun Li, City of Hope National Medical Center, Duarte, CA
No ratings yet
Arthur Xuejun Li, City of Hope National Medical Center, Duarte, CA
12 pages
Introduction To DATA Step Processing - How The DATA Step Works - A Basic Introduction - Step-By-Step Programming With Base SAS (R) Software
No ratings yet
Introduction To DATA Step Processing - How The DATA Step Works - A Basic Introduction - Step-By-Step Programming With Base SAS (R) Software
7 pages
1.overview of SAS
No ratings yet
1.overview of SAS
20 pages
interview3
No ratings yet
interview3
5 pages
Datastep SAS
No ratings yet
Datastep SAS
11 pages
Programming With The KEEP, RENAME, and DROP Data Set Options
No ratings yet
Programming With The KEEP, RENAME, and DROP Data Set Options
13 pages
SAS Interview Questions
No ratings yet
SAS Interview Questions
4 pages
Ocean Technologies, Hyderabad.: SAS Interview Questions:Base SAS
No ratings yet
Ocean Technologies, Hyderabad.: SAS Interview Questions:Base SAS
5 pages
PharmaSUG-2024-AP-144
No ratings yet
PharmaSUG-2024-AP-144
6 pages
Sas fAQ'S
No ratings yet
Sas fAQ'S
112 pages
Sas
No ratings yet
Sas
84 pages
How MERGE Really Works: Bob Virgile Robert Virgile Associates, Inc
No ratings yet
How MERGE Really Works: Bob Virgile Robert Virgile Associates, Inc
7 pages
Training Schedule: Basic Data Manipulation
No ratings yet
Training Schedule: Basic Data Manipulation
46 pages
DataStepPDV in Sas
No ratings yet
DataStepPDV in Sas
22 pages
Mainframe Sas Online Training 01
No ratings yet
Mainframe Sas Online Training 01
27 pages
LEARN SAS Within 7 Weeks: Part2 (Introduction To SAS - The Data Step)
100% (3)
LEARN SAS Within 7 Weeks: Part2 (Introduction To SAS - The Data Step)
63 pages
Notes On The SAS Data Step and An Introduction To Simulation
No ratings yet
Notes On The SAS Data Step and An Introduction To Simulation
37 pages
Tranforming SAS Data Sets
No ratings yet
Tranforming SAS Data Sets
41 pages
Sas Certification Course Contenst
No ratings yet
Sas Certification Course Contenst
2 pages
Sas Interview Questions
No ratings yet
Sas Interview Questions
15 pages
1 Base SAS Training 12th May 2015
100% (2)
1 Base SAS Training 12th May 2015
494 pages
Day 1
No ratings yet
Day 1
13 pages
SAS Basics
100% (1)
SAS Basics
29 pages
TUTORIAL I: SAS Basics and Data Management I. SAS Basics: SAS (Statistical Analysis Software)
No ratings yet
TUTORIAL I: SAS Basics and Data Management I. SAS Basics: SAS (Statistical Analysis Software)
13 pages
A Step by Step Guide To Learning SAS
No ratings yet
A Step by Step Guide To Learning SAS
49 pages
Complete Sas
100% (1)
Complete Sas
284 pages
SAS A00-215 Certification Exam Syllabus
No ratings yet
SAS A00-215 Certification Exam Syllabus
5 pages
A Step by Step Guide To Learning SAS
No ratings yet
A Step by Step Guide To Learning SAS
49 pages
SAS Tips
No ratings yet
SAS Tips
34 pages
What Is SAS?: Statistical Analysis Software Main Uses of SAS
No ratings yet
What Is SAS?: Statistical Analysis Software Main Uses of SAS
284 pages
Quiz Chapter 6
No ratings yet
Quiz Chapter 6
11 pages
Basics of Sas
No ratings yet
Basics of Sas
14 pages
Sas - Data Step Processing
No ratings yet
Sas - Data Step Processing
9 pages
SAS+Programming Resource+Guide
No ratings yet
SAS+Programming Resource+Guide
118 pages
SAP interface programming with RFC and VBA: Edit SAP data with MS Access
From Everand
SAP interface programming with RFC and VBA: Edit SAP data with MS Access
Karl Josef Hensel
No ratings yet
Visual Basic 2010 Coding Briefs Data Access
From Everand
Visual Basic 2010 Coding Briefs Data Access
Kevin Hough
5/5 (1)
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet
Crystal Reports Introduction: Versions 2008-2016
From Everand
Crystal Reports Introduction: Versions 2008-2016
Seth Bonder
No ratings yet
Knight's Microsoft SQL Server 2012 Integration Services 24-Hour Trainer
From Everand
Knight's Microsoft SQL Server 2012 Integration Services 24-Hour Trainer
Brian Knight
No ratings yet
Pivot Tables In Depth For Microsoft Excel 2016
From Everand
Pivot Tables In Depth For Microsoft Excel 2016
Suljan Qeska
3.5/5 (3)
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
GETTING STARTED WITH OPENOFFICE CALC
From Everand
GETTING STARTED WITH OPENOFFICE CALC
Remy Lentzner
No ratings yet
Introduction to Oracle Database Administration
From Everand
Introduction to Oracle Database Administration
Ying Wang
5/5 (1)
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
From Everand
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
Kim Chantala
No ratings yet
Tableau 8.2 Training Manual: From Clutter to Clarity
From Everand
Tableau 8.2 Training Manual: From Clutter to Clarity
Larry Keller
No ratings yet
Defect Prediction in Software Development & Maintainence
From Everand
Defect Prediction in Software Development & Maintainence
Rudra Kumar
No ratings yet
Oracle SQL Developer 2.1
From Everand
Oracle SQL Developer 2.1
Sue Harper
No ratings yet
Straight Road to Excel 2013/2016 Pivot Tables: Get Your Hands Dirty
From Everand
Straight Road to Excel 2013/2016 Pivot Tables: Get Your Hands Dirty
Sam Akrasi
No ratings yet
SAS Interview Questions You'll Most Likely Be Asked
From Everand
SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Windows Batch File Programming
From Everand
Windows Batch File Programming
Michael Elliott
2/5 (2)
How to Track Schedules, Costs and Earned Value with Microsoft Project
From Everand
How to Track Schedules, Costs and Earned Value with Microsoft Project
Akram Najjar
No ratings yet
Hallo Microsoft Excel: Mastering Data Analytics
From Everand
Hallo Microsoft Excel: Mastering Data Analytics
Agus Kurniawan
No ratings yet
Using Vocals Determine Human Emotion
From Everand
Using Vocals Determine Human Emotion
Faiz ul haque Zeya
No ratings yet
Learn Date and Time in Android - CodeProject
No ratings yet
Learn Date and Time in Android - CodeProject
3 pages
Each Question Carries 2 Marks
No ratings yet
Each Question Carries 2 Marks
35 pages
VHDL Ams
No ratings yet
VHDL Ams
28 pages
Jbasic Users Guide
No ratings yet
Jbasic Users Guide
247 pages
Lesson 2 - PHP Variables
No ratings yet
Lesson 2 - PHP Variables
10 pages
Types of C Constants: C Constants Can Be Divided Into Two Major Categories: Primary Constants Secondary Constants
No ratings yet
Types of C Constants: C Constants Can Be Divided Into Two Major Categories: Primary Constants Secondary Constants
21 pages
The Amstrad Programmer's Guide - Bryan Skinner (1985)
No ratings yet
The Amstrad Programmer's Guide - Bryan Skinner (1985)
222 pages
AMOS-18 Programming Reference
No ratings yet
AMOS-18 Programming Reference
778 pages
Atv31 Mapa Modbus
No ratings yet
Atv31 Mapa Modbus
56 pages
Verilog Language Reference: Verilog Modeling Style Guide (CFE), Product Version 3.1
No ratings yet
Verilog Language Reference: Verilog Modeling Style Guide (CFE), Product Version 3.1
33 pages
Basic-Plus-2 Reference Manual: Order Number: AA-JP30B-TK
No ratings yet
Basic-Plus-2 Reference Manual: Order Number: AA-JP30B-TK
539 pages
13 Data Representation
No ratings yet
13 Data Representation
33 pages
A Look at Procedure C++ (Object Oriented Program)
No ratings yet
A Look at Procedure C++ (Object Oriented Program)
23 pages
C-Programing by Pankaj Sir
100% (1)
C-Programing by Pankaj Sir
43 pages
Top 20 UiPath Interview Questions and Answers-1
No ratings yet
Top 20 UiPath Interview Questions and Answers-1
39 pages
PPT06-Function and Recursion
No ratings yet
PPT06-Function and Recursion
31 pages
AtomMotion Manual V1.0
100% (1)
AtomMotion Manual V1.0
211 pages
Power Builder Manual PDF
100% (3)
Power Builder Manual PDF
1,280 pages
CS101 - Final - Term - Solved - MCQS PDF
0% (1)
CS101 - Final - Term - Solved - MCQS PDF
48 pages
Learn Python 3 - Hello World Cheatsheet - Codecademy
No ratings yet
Learn Python 3 - Hello World Cheatsheet - Codecademy
5 pages
AQA Memory Game
No ratings yet
AQA Memory Game
33 pages
2 Computer Programming Module 2
No ratings yet
2 Computer Programming Module 2
5 pages
Quiz Questions For Chapter 1
No ratings yet
Quiz Questions For Chapter 1
19 pages
Postman Readme Guide
No ratings yet
Postman Readme Guide
36 pages
Chapter 7
No ratings yet
Chapter 7
16 pages
DCIT22-Computer Programming 1 - Learning Module 3
No ratings yet
DCIT22-Computer Programming 1 - Learning Module 3
8 pages
Ass Introduction To Object Oriented Programming & C++
100% (1)
Ass Introduction To Object Oriented Programming & C++
23 pages
Ado - Lisp Library (Adolisp)
No ratings yet
Ado - Lisp Library (Adolisp)
25 pages
CPNM 1 - Introduction: Mridul Sankar Barik
No ratings yet
CPNM 1 - Introduction: Mridul Sankar Barik
21 pages
GA4 Ecommerce Tracking - Part 2 - Ecommerce Events PDF
No ratings yet
GA4 Ecommerce Tracking - Part 2 - Ecommerce Events PDF
18 pages

Week 1-PART 2-Understanding Data Step Processing

Uploaded by

Week 1-PART 2-Understanding Data Step Processing

Uploaded by

Week 1-Part 2

• 2.1.1.1 Input Buffer

• 2.1.1.2 Program Data Vector (PDV)

• 2.1.1.3 Data Set Variables

• 2.1.1.3 Data Set Variables

• 2.1.1.3 Data Set Variables

• 2.1.1.3 Data Set Variables

• 2.1.1.4 Descriptor Portion of the SAS

• 2.1.1.5 Syntax Checking

You might also like