0% found this document useful (0 votes)

38 views

CS109a Lecture1

The document discusses exploratory data analysis and introduces key concepts: 1. It defines data and describes where data can come from, such as internal sources, existing external sources, and external sources requiring collection. 2. It discusses common data types like numeric, boolean, strings, and compound types like dates, lists, and dictionaries. It also describes common data formats like tabular, structured, and semi-structured data. 3. The document introduces the concept of exploratory data analysis and describes examining descriptive statistics, data visualization, and using examples to explore data.

Uploaded by

Sang Nguyễn

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views

CS109a Lecture1

Uploaded by

Sang Nguyễn

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 67

lecture #1: exploratory data analysis

CS 109A, STAT 121A, AC 209A: Data Science

Weiwei Pan, Pavlos Protopapas, Kevin Rader

Fall 2016
Harvard University
announcements

■ How to optimally communicate with us (Piazza then help line)

■ Issues with Gutenberg in HW0 (for now stop using Gutenberg)
■ Extension students: labs stream on Friday (posted same day),
interactive lab via zoom, TF ofﬁce hours, extra day on HW
■ Cross-registered students already have or will get full access to
Canvas today (please check!)
■ HW0 must be submitted (through Vocareum)!
■ HW0 submission deadline extended to Wednesday (today) at
11:59pm.
■ Time and effort required to complete HW0 will vary depending on
familiarity with programming (this is the learning curve), and is a
good indicator of ﬁt of the class (the programming will not get
easier)
■ HW1 is released
1
the data science process

On the ﬁrst day of class you were introduced to the “data science”
process.

■ Ask questions
■ Data Collection
■ Data Exploration
■ Data Modeling
■ Data Analysis
■ Visualization and Presentation of Results

Note: This process is not linear!

2
lecture outline

What Is Data?

Exploring Data
Descriptive Statistics
Data Visualization

An Example

What Next?

3
.what is data?
what is data?

“A datum is a single measurement of something on a scale that is

understandable to both the recorder and the reader. Data is
multiple such measurements.”

Provocative claim: everything is (can be) data!

5
where does it come from?

■ Internal sources: already collected by or is part of the overall

data collection of you organization.
For example: business-centric data that is available in the
organization data base to record day to day operations; scientiﬁc
or experimental data
■ Existing External Sources: available in ready to read format from
an outside source for free or for a fee.
For example: public government databases, stock market data,
Yelp reviews
■ External Sources Requiring Collection Efforts: available from
external source but acquisition requires special processing.
For example: data appearing only in print form, or data on
websites
6
where does it come from?

How to get data generated, published or hosted online:

■ API (Application Programming Interface): using a prebuilt set of

functions developed by a company to access their services. Often
pay to use.
For example: Google Map API, Facebook API, Twitter API
■ RSS (Rich Site Summary): summarizes frequently updated online
content in standard format. Free to read if the site has one.
For example: news-related sites, blogs
■ Web scraping: using software, scripts or by-hand extracting data
from what is displayed on a page or what is contained in the HTML
ﬁle.

6
web scraping

■ Why do it? Older government or smaller news sites might not have
APIs for accessing data, or publish RSS feeds or have databases for
download. You don’t want to pay to use the API or the database.
■ How do you it? See HW0
■ Should you do it?
⇒ You just want to explore: Are you violating their terms of service?
Privacy concerns for website and their clients?
⇒ You want to publish your analysis or product: Do they have an API or
fee that you’re bypassing? Are they willing to share this data? Are you
violating their terms of service? Are there privacy concerns?

7
what does it look like?

What kind of values are in your data (data types)?

Simple or atomic:

■ Numeric: integers, ﬂoats

■ Boolean: binary or true false values
■ Strings: sequence of symbols

8
what does it look like?

What kind of values are in your data (data types)?

Compound, composed of a bunch of atomic types:

■ Date and time: compound value with a speciﬁc structure

■ Lists: a list is a sequence of values
■ Dictionaries: A dictionary is a collection of key-value pairs, a pair
of values x : y where x is usually a string called key representing
the “name” of the value, and y is a value of any type.
Example: Student record
∙ First: Weiwei
∙ Last: Pan
∙ Classes: [CS109A, STAT121A, AC209A]

8
what does it look like?

How is your data represented and stored (data format)?

■ Tabular Data: a dataset that is a two-dimensional table, where

each row typically represents a single data record, and each
column represents one type of measurement (csv, tsp, xlsx etc.).
■ Structured Data: each data record is presented in a form of a,
possibly complex and multi-tiered, dictionary (json, xml etc.)
■ Semistructured Data: not all records are represented by the same
set of keys or some data records are not represented using the
key-value pair structure.

8
what does it look like?

How is your data represented and stored (data format)?

■ Textual Data
■ Temporal Data
■ Geolocation Data

8
more on tabular data

In tabular data, we expect each record or observation to represent a

set of measurements of a single object or event.

Hight Radius Do I Like It?

Cylinder # 1 10 5 Yes
Cylinder # 2 3 7.5 No

Each type of measurement is called a variable or an attribute of the

data (e.g. Height, Radius and “Do I Like It?” are variables or
attributes). The number of attributes is called the dimension of the
data.

We expect each table to contain a set of records or observations of

the same kind of object or event (e.g. our table above contains
observations of cylinders).
9
more on tabular data

You’ll see later that it’s important to distinguish between classes of

variables or attributes based on the type of values they can take on.

■ Quantitative variable: is numerical and can be

⇒ discrete - a ﬁnite number of values are possible in any bounded
interval
For example: “Number of siblings” is a discrete variable
⇒ continuous - an inﬁnite number of values are possible in any bounded
interval
For example: “Height” is a continuous variable

■ Categorical variable: no inherent order among the values

For example: “What kind of pet you have” is a categorical variable

9
is the data any good?

Common issues with data:

■ Missing values: how do we ﬁll in?

■ Wrong values: how can we detect and correct?
■ Messy format
■ Not usable: the data cannot answer the question posed

10
handling messy data

The following is a table accounting for produce deliveries over a

weekend.

What are the variables in this dataset?

What object or event are we measuring?

Friday Saturday Sunday

Morning 15 158 10
Afternoon 2 90 20
Evening 55 12 45

11
handling messy data

We’re measuring individual deliveries; the variables are Time, Day,

Number of Produce.

Friday Saturday Sunday

Morning 15 158 10
Afternoon 2 90 20
Evening 55 12 45

Problem: each column header represents a single value rather than

a variable. Row headers are “hiding” the Day variable. The values of
the variable, “Number of Produce”, is not recorded in a single
column.

11
handling messy data

We need to reorganize the information to make explicit the event

we’re observing and the variables associated to this event.

Delivery Time Day No. of Produce

1 Morning Friday 15
2 Morning Saturday 158
3 Morning Sunday 10
4 Afternoon Friday 2
5 Afternoon Saturday 90
6 Afternoon Sunday 20
7 Evening Friday 55
8 Evening Saturday 12
9 Evening Sunday 45

11
handling messy data

What object or event are we measuring?

What are the variables in this dataset?

11
handling messy data

We’re measuring individual deliveries; the variables are Time, Day,

Number of Produce:

11
handling messy data

Common causes of messiness are:

■ Column headers are values, not variable names

■ Variables are stored in both rows and columns
■ Multiple variables are stored in one column
■ Multiple types of experimental units stored in same table

In general, we want each ﬁle to correspond to a dataset, each

column to represent a single variable and each row to represent a
single observation.

11
.exploring data
talk outline

What Is Data?

Exploring Data
Descriptive Statistics
Data Visualization

An Example

What Next?

13
basic terms

Population versus sample:

■ Population is the entire set of objects or events under study.

Population can be hypothetical “all students” or all students in
this class.
■ Sample is a “representative” subset of the objects or events under
study. Needed because it’s impossible or intractable to obtain or
compute with population data.

Biases in samples:

■ Selection: some subjects or records are more likely to be selected

■ Volunteer/nonresponse: subjects or records who are not easily
available are not represented
For example: I usually only hear from students for whom
something has gone terribly wrong in the course.
14
describing data

Given some large dataset, we’d like to compute a few quantities that
intuitively summarizes the data. To begin with we’d like to know

■ what are typical values for our variables or attributes?

■ how representative are these typical values?

15
Location: Mean
centrality

1. The Mean

The meanTo of
calculate the
a set of n average
number ofx samples
of a set ofofobservations,
a variable isadd their x
denoted
value and
and is deﬁned by divide by the number of observations:

xn x 1 ∑
n
x1x1++x x2 2++x.3. +. +...+
n
x = = 1 xi
x=
n
n
n
=
n
n
i=1
" xi
i=1

The mean describes what a “typical” sample value looks like, or

where is the “center” of the distribution of the data.
16
centrality

The median of a set of n number of samples, ordered by value, of a

variable is is deﬁned by


 x , if n is odd

 ⌊n/2⌋+1
Median =

 x + xn/2+1

 n/2 , if n is even
2

Example:

Ages: 17, 19, 21, 22, 23, 23, 23, 38

22+23
Median = 2 = 22.5

The median describes what a “typical” sample looks like, or where is

the “center” of the distribution of the samples.
16
centrality

The mean is sensitive to outliers.

16
centrality

The mean is sensitive to skewness (asymmetry) of distributions.

16
centrality

How hard (in terms of algorithmic complexity) is it to calculate

■ the mean
■ the median

16
centrality

How hard (in terms of algorithmic complexity) is it to calculate

■ the mean: at most O(n)

■ the median: at leat O(n log n)

Note: Practicality of implementation has to be considered!

16
centrality

For samples of categorical variables, neither mean or median make

sense.

The mode might be a better way to ﬁnd the most “representative”

value. 16
spread

The spread of samples measures how well the mean or median

describes the sample set.

One way to measuring spread of a set of samples is via the range.

Range = Maximum Value − Minimum Value

17
spread

The (sample) variance, denoted s2 , measures how much on average

the sample values “deviates” from the mean
∑n 2
|xi − x|
s2 = i=1
n−1
Note: the term |xi − x| measure the amount by which xi deviates
from the mean x. Squaring these deviation means that s2 is sensitive
to extreme values (outliers).

Note: s2 doesn’t have the same units as xi ! What does a variance of

1, 008 mean? Or 0.0001?

17
spread

The (sample) standard deviation, denoted s, is the square root of

the variance √
∑n 2
i=1 |xi − x|
s=
n−1
Note: s has the same units as xi !

17
talk outline

What Is Data?

Exploring Data
Descriptive Statistics
Data Visualization

An Example

What Next?

18
why data visualization?

The following data sets comprise the Anscombe’s Quartet; all four
sets of data have identical simple summary statistics.

Dataset I Dataset II Dataset III Dataset IV

x y x y x y x y
10 8.04 10 9.14 10 7.46 8 6.58
8 6.95 8 8.14 8 6.77 8 5.76
13 7.58 13 8.74 13 12.74 8 7.71
9 8.81 9 8.77 9 7.11 8 8.84
11 8.33 11 9.26 11 7.81 8 8.47
14 9.96 14 8.1 14 8.84 8 7.04
6 7.24 6 6.13 6 6.08 8 5.25
4 4.26 4 3.1 4 5.39 19 12.5
12 10.84 12 9.13 12 8.15 8 5.56
7 4.82 7 7.26 7 6.42 8 7.91
5 5.68 5 4.74 5 5.73 8 6.89
Sum: 99.00 82.51 99.00 82.51 99.00 82.51 99.00 82.51
Avg: 9.00 7.50 9.00 7.50 9.00 7.50 9.00 7.50
Std: 3.32 2.03 3.32 2.03 3.32 2.03 3.32 2.03

19
why data visualization?

The following data sets comprise the Anscombe’s Quartet; all four
sets of data have identical simple summary statistics.

19
why data visualization?

If I tell you that the average score for Homework 0 Part A is: 7.64/15.

What does that suggest?

19
why data visualization?

If I then show you the following graph, what does it suggest?

19
what is data visualization good for?

Analyze:

■ Identify hidden patterns and trends

■ Help formulate/test hypothese
■ Help determine the next step in analysis/modeling

20
what is data visualization good for?

Communicate:

■ Present information and ideas succinctly

■ Provide evidence and support
■ Inﬂuence and persuade

20
visualization design principles

Basic data visualization guidelines from Edward Tufte:

■ Maximize data to ink ratio: show the data

Bad Better

21
visualization design principles

Basic data visualization guidelines from Edward Tufte:

■ Maximize data to ink ratio: show the data

size of effect in graph
■ Don’t lie with scale: minimize size of effect in data (Lie Factor)
Bad Better
8.100

8.095

8.090

21
visualization design principles

Basic data visualization guidelines from Edward Tufte:

■ Maximize data to ink ratio: show the data
■ Don’t lie with scale: minimize size of effect in graph
size of effect in data (Lie Factor)
■ Minimize chart-junk: show data variation, not design variation
Bad Better

The number of visual parameters should not exceed the

dimension of the data! 21
visualization design principles

Basic data visualization guidelines from Edward Tufte:

■ Maximize data to ink ratio: show the data

size of effect in graph
■ Don’t lie with scale: minimize size of effect in data (Lie Factor)
■ Minimize chart-junk: show data variation, not design variation
■ Clear, detailed and thorough labeling (including important events)

21
types of data visualizations

What do you want your visualization to show about your data?

■ Distribution: how a variable or variables in the dataset distribute

over a range of possible values.
■ Relationship: how the values of multiple variables in the dataset
relate
■ Composition: how the dataset breaks down into subgroups
■ Comparison: how trends in multiple variable or datasets compare

22
distribution
Effect of Bin Size on Histogram
• Simulated 1000 N(0,1) and 500 N(1,1)

Frequency
Frequency
Effect of Bin SizeEffect of Bin Size on Histogram
on Histogram
A histogram is a way to visualize how 1-dimensional data is
• Simulated 1000 N(0,1) and• 500
Simulated
N(1,1)1000 N(0,1) and 500 N(1,1)
distributed across certain values.

Frequency
Frequency
Frequency
Frequency
Frequency

Note: Trends in histograms are sensitive to number of bins.

Frequency
Frequency

23
distribution

A scatter plot is a way to visualize how multi-dimensional data is

distributed across certain values.

23
relationships

A scatter plot is also a way to visualize the relationship between the

different attributes of multi-dimensional data.

24
composition

A pie chart is a way to visualize the static composition of a group.

25
composition

A stacked area graph is a way to visualize the composition of a

group as it changes over time.

25
comparisons

Plotting multiple histograms or curves on the same axes is a way to

visualize how different variables compare.

26
visualizing the impossible

Often your dataset seem too complex to visualize:

■ Data is too high dimensional (how do you plot 100 variables on

the same set of axes?)
■ Some variables are categorical (how do you plot values like “Cat”
or “No”?)

27
reducing the dimension

When the data is high dimensional, a scatter plot of all data

attributes can be impossible or unhelpful.

The above is the data from Homework set #0! 28

reducing the dimension

Relationships may be easier to spot by producing multiple plots of

lower dimensionality.

28
adding extra dimensions

For 3D data, color coding a categorical attribute can be effective.

The above visualizes a set of Iris measurements. The variables are:

petal length, sepal length, Iris type (setosa, versicolor, virginica).
29
adding extra dimensions

For 3D data, a quantitative attribute can be encoded by size in a

bubble chart.

The above visualizes a set of consumer products. The variables are:

revenue, consumer rating, product type and product cost.
29
.an example
effectiveness of drugs

Use some simple visualizations to explore the following dataset.

Bacteria Name Group No. Res. to Drug 1 Res. to Drug 2 Res. to Drug 3
Brucella abortus 1 0.1 3 49
Diplococcus pneumoniae 2 4.75 0.007 0.125
Aerobacter aerogenes 1 0.3 1 47.2
Streptococcus viridans 2 4.9 0.03 -1.45

31
effectiveness of drugs

Bar graph showing resistance of each bacteria to each drug:

Any patterns?
31
effectiveness of drugs

Bar graph showing resistance of each bacteria to each drug (grouped

by Group Number):

Any patterns?
31
effectiveness of drugs

Scatter plot of Drug #1 vs Drug #3 resistance:

Note: The process of data exploration is iterative (visualize for

trends, re-visualize to conﬁrm)!
31
.what next?
explain

We can see that birth weight is positively correlated with femur

length.

Can we describe exactly how they are correlated?

33
predict

We can see that types of iris seem to be distinguished by petal and

sepal lengths.

Can we predict the type of iris given petal and sepal lengths?
34

Hourglass Workout Program by Luisagiuliet 2
76% (21)
Hourglass Workout Program by Luisagiuliet 2
51 pages
12 Week Program: Summer Body Starts Now
87% (46)
12 Week Program: Summer Body Starts Now
70 pages
Read People Like A Book by Patrick King-Edited
58% (81)
Read People Like A Book by Patrick King-Edited
12 pages
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
77% (13)
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
260 pages
Cheat Code To The Universe
94% (79)
Cheat Code To The Universe
34 pages
Facial Gains Guide (001 081)
91% (45)
Facial Gains Guide (001 081)
81 pages
Curse of Strahd
95% (467)
Curse of Strahd
258 pages
The Psychiatric Interview - Daniel Carlat
91% (34)
The Psychiatric Interview - Daniel Carlat
473 pages
The Borax Conspiracy
91% (57)
The Borax Conspiracy
14 pages
The Secret Language of Attraction
86% (108)
The Secret Language of Attraction
278 pages
How To Develop and Write A Grant Proposal
83% (542)
How To Develop and Write A Grant Proposal
17 pages
Penis Enlargement Secret
60% (124)
Penis Enlargement Secret
12 pages
Workbook For The Body Keeps The Score
89% (53)
Workbook For The Body Keeps The Score
111 pages
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
83% (1016)
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
13 pages
KamaSutra Positions
78% (69)
KamaSutra Positions
55 pages
7 Hermetic Principles
93% (30)
7 Hermetic Principles
3 pages
27 Feedback Mechanisms Pogil Key
77% (13)
27 Feedback Mechanisms Pogil Key
6 pages
Frank Hammond - List of Demons
92% (92)
Frank Hammond - List of Demons
3 pages
Phone Codes
79% (28)
Phone Codes
5 pages
36 Questions That Lead To Love
91% (35)
36 Questions That Lead To Love
3 pages
How 2 Setup Trust
97% (307)
How 2 Setup Trust
3 pages
The 36 Questions That Lead To Love - The New York Times
94% (34)
The 36 Questions That Lead To Love - The New York Times
3 pages
100 Questions To Ask Your Partner
78% (36)
100 Questions To Ask Your Partner
2 pages
Satanic Calendar
25% (56)
Satanic Calendar
4 pages
The 36 Questions That Lead To Love - The New York Times
95% (21)
The 36 Questions That Lead To Love - The New York Times
3 pages
14 Easiest & Hardest Muscles To Build (Ranked With Solutions)
100% (8)
14 Easiest & Hardest Muscles To Build (Ranked With Solutions)
27 pages
Jeffrey Epstein39s Little Black Book Unredacted PDF
75% (12)
Jeffrey Epstein39s Little Black Book Unredacted PDF
95 pages
1001 Songs
70% (73)
1001 Songs
1,798 pages
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
23% (954)
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
38 pages
Zodiac Sign & Their Most Common Addictions
63% (30)
Zodiac Sign & Their Most Common Addictions
9 pages
PHP UNIT 01-Notes PDF
100% (1)
PHP UNIT 01-Notes PDF
62 pages
Lecture2 Data
No ratings yet
Lecture2 Data
57 pages
Lecture 01-05 Data, Central Tendency PDF
No ratings yet
Lecture 01-05 Data, Central Tendency PDF
51 pages
Unit1-Data Science Fundamentals
No ratings yet
Unit1-Data Science Fundamentals
35 pages
4.0 Introduction to Data
No ratings yet
4.0 Introduction to Data
16 pages
Course 3
No ratings yet
Course 3
22 pages
Week 2 - 3getting To Know Your Data
No ratings yet
Week 2 - 3getting To Know Your Data
67 pages
3 Data Science Intro
No ratings yet
3 Data Science Intro
76 pages
Data Science UNIT 1 Final
No ratings yet
Data Science UNIT 1 Final
107 pages
Chapter 1.1 Introduction to Data
No ratings yet
Chapter 1.1 Introduction to Data
10 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
16 pages
Coursera - Data Analytics - Course 3
No ratings yet
Coursera - Data Analytics - Course 3
14 pages
Combine PDF
No ratings yet
Combine PDF
270 pages
Lecture 01
No ratings yet
Lecture 01
40 pages
EDA - Unit 1
No ratings yet
EDA - Unit 1
82 pages
Lecture 5 1 Flavours of Data
No ratings yet
Lecture 5 1 Flavours of Data
30 pages
Types of Data and Data Quality: KIT306/606: Data Analytics Unit Coordinator: A/Prof. Quan Bai University of Tasmania
No ratings yet
Types of Data and Data Quality: KIT306/606: Data Analytics Unit Coordinator: A/Prof. Quan Bai University of Tasmania
25 pages
FDS - UNIT 1
No ratings yet
FDS - UNIT 1
233 pages
Chapter 1
No ratings yet
Chapter 1
3 pages
C20 Combined
No ratings yet
C20 Combined
291 pages
FDS Unit 1 Notes
No ratings yet
FDS Unit 1 Notes
53 pages
Unit-2-1
No ratings yet
Unit-2-1
48 pages
What Is Data - Coursera
No ratings yet
What Is Data - Coursera
6 pages
Data Preparation Notebook
No ratings yet
Data Preparation Notebook
14 pages
DVP Unit1
No ratings yet
DVP Unit1
44 pages
22UCS303 DS-Unit III-N
No ratings yet
22UCS303 DS-Unit III-N
85 pages
EDA 1
No ratings yet
EDA 1
137 pages
FDS Module 1 Notes
No ratings yet
FDS Module 1 Notes
27 pages
Unit 3 Data Exploration (P)
No ratings yet
Unit 3 Data Exploration (P)
69 pages
(STATS) Module 3
No ratings yet
(STATS) Module 3
2 pages
Data-Preprocessing
No ratings yet
Data-Preprocessing
138 pages
TYCS DS Unit1
No ratings yet
TYCS DS Unit1
28 pages
Unit 3 Data Preprocessing - Data
No ratings yet
Unit 3 Data Preprocessing - Data
90 pages
How data is col
No ratings yet
How data is col
11 pages
Data Visualization and Story Telling Notes
No ratings yet
Data Visualization and Story Telling Notes
31 pages
CE880_Lecture3_slides
No ratings yet
CE880_Lecture3_slides
44 pages
Lecture 3 (DS) - Steps in Data Science Process
No ratings yet
Lecture 3 (DS) - Steps in Data Science Process
57 pages
Notes 3 (Prepare Coursera)
No ratings yet
Notes 3 (Prepare Coursera)
67 pages
SM Session 1 IPL 2024 Post Session Slides
No ratings yet
SM Session 1 IPL 2024 Post Session Slides
44 pages
Lecture 1,2&3
No ratings yet
Lecture 1,2&3
80 pages
Unit 1 - FoDS - Sep 2023
No ratings yet
Unit 1 - FoDS - Sep 2023
147 pages
Data Analyst work
No ratings yet
Data Analyst work
22 pages
ANL201 Study Unit 3 - 2023
No ratings yet
ANL201 Study Unit 3 - 2023
48 pages
Document (9)
No ratings yet
Document (9)
8 pages
Data Interpretation Workshop Presentation
No ratings yet
Data Interpretation Workshop Presentation
34 pages
Step 1: Ask Questions
No ratings yet
Step 1: Ask Questions
30 pages
Notes of Week-1 and Week-2
No ratings yet
Notes of Week-1 and Week-2
30 pages
Basic Economic Analytics Using Excel!
No ratings yet
Basic Economic Analytics Using Excel!
72 pages
DS_w3-4
No ratings yet
DS_w3-4
69 pages
1 Introduction
No ratings yet
1 Introduction
51 pages
Module 1 - Lecture 3 - Types of Data - 16.5.2022
No ratings yet
Module 1 - Lecture 3 - Types of Data - 16.5.2022
38 pages
Unit 1 Introduction
No ratings yet
Unit 1 Introduction
86 pages
Unit I- Data Science
No ratings yet
Unit I- Data Science
161 pages
Lesson 2 Notes
No ratings yet
Lesson 2 Notes
11 pages
fds print
No ratings yet
fds print
7 pages
CH 01
No ratings yet
CH 01
39 pages
FIT1043 - Lecture 3 - 2024
No ratings yet
FIT1043 - Lecture 3 - 2024
69 pages
Data Mining
No ratings yet
Data Mining
40 pages
Data and Information
No ratings yet
Data and Information
6 pages
Illuminating Data: A hands on guide to data visualization in R
From Everand
Illuminating Data: A hands on guide to data visualization in R
Eman Ahmad
No ratings yet
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
From Everand
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
Janet Laane Effron
No ratings yet
Portal Info Stub
No ratings yet
Portal Info Stub
12 pages
Ggeb 1
No ratings yet
Ggeb 1
3 pages
Leskovsek M. Using Linux With ChatGPT. Leverage AI Technology... 2023
No ratings yet
Leskovsek M. Using Linux With ChatGPT. Leverage AI Technology... 2023
88 pages
FAQ IOP-Versionen 2019-07-01 EN
No ratings yet
FAQ IOP-Versionen 2019-07-01 EN
3 pages
Readme UK
No ratings yet
Readme UK
16 pages
Half Yearly-Viii Comp MR
No ratings yet
Half Yearly-Viii Comp MR
2 pages
Seminar
No ratings yet
Seminar
25 pages
Flyer EyeC ProofRunner Sheetfed For Manroland Sheetfed EN
No ratings yet
Flyer EyeC ProofRunner Sheetfed For Manroland Sheetfed EN
2 pages
Enterprise Architecture
83% (12)
Enterprise Architecture
365 pages
Easybcd 2.2 Manual: Read/Download
0% (1)
Easybcd 2.2 Manual: Read/Download
2 pages
Cad Laboratory Lecture Exercises
No ratings yet
Cad Laboratory Lecture Exercises
22 pages
Programming with C 20 Concepts Coroutines Ranges and more Updated 2024 2nd Edition Andreas Fertig - Instantly access the full ebook content in just a few seconds
100% (1)
Programming with C 20 Concepts Coroutines Ranges and more Updated 2024 2nd Edition Andreas Fertig - Instantly access the full ebook content in just a few seconds
48 pages
CSI106_IntroductiontoComputerScience_Nhậpmônkhoahọcmáytính_10262024
No ratings yet
CSI106_IntroductiontoComputerScience_Nhậpmônkhoahọcmáytính_10262024
2 pages
5.2 - Smart Contracts
No ratings yet
5.2 - Smart Contracts
18 pages
Vim Beginner's Guide
No ratings yet
Vim Beginner's Guide
1 page
Resume English Graduate
100% (2)
Resume English Graduate
5 pages
About Me: Mohamed Abd-El Salam Ahmed
No ratings yet
About Me: Mohamed Abd-El Salam Ahmed
2 pages
Connecting SAC With SAP ANALYTICS Cloud Kit 1.0 - SAP Blogs
No ratings yet
Connecting SAC With SAP ANALYTICS Cloud Kit 1.0 - SAP Blogs
17 pages
NAMA: Muhammad Ridho Fasya NIM: 0702193197 KELAS: SI-4/3 Matkul: Grafika Komputer (Uas Teori & Praktikum)
No ratings yet
NAMA: Muhammad Ridho Fasya NIM: 0702193197 KELAS: SI-4/3 Matkul: Grafika Komputer (Uas Teori & Praktikum)
11 pages
COMPROVANTE PUBLICAÇÕES - HET Starters and Alternators Library
No ratings yet
COMPROVANTE PUBLICAÇÕES - HET Starters and Alternators Library
5 pages
Graphic Org
No ratings yet
Graphic Org
2 pages
Functions of C++
No ratings yet
Functions of C++
12 pages
V2460 Especificações
No ratings yet
V2460 Especificações
12 pages
PPS - Unit-2 - MCQ
No ratings yet
PPS - Unit-2 - MCQ
8 pages
Malware Analysis Report Infamous Chisel (En)
No ratings yet
Malware Analysis Report Infamous Chisel (En)
4 pages
Linux Command Line For You and Me Documentation: Release 0.1
No ratings yet
Linux Command Line For You and Me Documentation: Release 0.1
94 pages
在线论文生成器
100% (2)
在线论文生成器
7 pages
CV - Dr. Biswajit Datta
No ratings yet
CV - Dr. Biswajit Datta
8 pages
MCQ Adb PDF
No ratings yet
MCQ Adb PDF
6 pages