How to Create Anscombe’s Quartet in Excel?
Last Updated :
13 Nov, 2022
Anscombe Quartet developed a situation that, despite the different datasets, with different scatter charts, data could have the same correlation values among them. Anscombe Quartet is famous to provide four data sets that tell us the importance of graphing and trend lines in the data. In this article, we will learn about Anscombe's Quartet in excel.
Anscombe's Quartet
In 1973, Francis Anscombe told us the importance of graphing data before analyzing the data yet they have similar arithmetical identities. Anscombe's Quartet is a group of four data sets that look identical with the same mean and correlation values but when creating a scatter chart they appear very differently. We are considering eleven data values in each data set.
Anscombe's Quartet in Excel
Step 1: Given the Anscombe Quartet dataset. The below 4 images, show the 4 datasets created by Anscombe.
DataSet 1:
DataSet 2:
DataSet 3:
Dataset 4:
Step 2: By physically viewing the data above. We are not able to infer what type of correlation data could have. To have better visual knowledge, we will create Scatter charts for each dataset. We will be showing the steps of creating a scatter chart, for dataset1, a similar process can be used to create charts for the rest of the datasets. Select the dataset1, A2:B13. Go to the Insert tab.
Step 3: Under the Charts section, click on the Scatter option. Select the scatter chart from the list that appeared.
Step 4: A scatter chart is created for dataset1.
Step 5: Repeat Steps 2,3, and 4, and create similar charts for dataset2, dataset3, and dataset4.
Dataset 2:
Dataset 3:
Dataset 4:
Step 6: We can observe from the charts that, each chart has some different visualization in the scatter plot. Dataset1 has an evenly distribution, dataset2 has a parabolic shape, dataset3 is nearly a y = x line, and dataset 4 is nearly a straight line parallel to Y-Axis. Our next task is to add a trend line for each of the graphs, to have further insights into Anscombe's quartet. Select the chart created for dataset1. Go to the Layout tab, and click on the Trendline option.
Step 7: A drop-down list appears. Click on the Linear Trendline option.
Step 8: A trendline line appears in the graph.
Step 9: Repeat steps 6, 7, and 8, and add similar trendlines to dataset2, dataset3, and dataset4.
Dataset 2:
Dataset 3:
Dataset 4:
Step 10: We can observe that the trend line looks similar for each of the datasets. Let us add the equation of the trend line to know more about the current datasets. Select the chart for dataset1. Go to the Layout tab, and click on Trendline.
Step 11: A drop-down list appears. Click on the More Trendline Options.
Step 12: A new dialogue box, name, and Format Trendline appear on the screen. Check the box, Display equation on chart.
Step 13: The trend line equation appears on the chart. The equation is y = 0.5x + 3 for dataset1.
Step 14: Repeat steps 10, 11, 12, and 13, and similarly add trendlines equation for dataset2, dataset3, and dataset4.
Dataset 2:
Dataset 3:
Dataset 4:
We can observe that the equation of trendline is the same for all the datasets i.e. y = 0.5x + 3, despite the fact, that the scatter charts are very different from each other. Hence, the Anscombe quartet proved that only the summary statistics are not completely reliable, we need to look up the graphs also.
Conclusion
By this, we conclude that graphical representation is important before analyzing the data whether they are identical or whether statistical identities are the same in nature. Graphical representations are different and have different relationships in all data sets.
Similar Reads
How to Create a Step Chart in Excel
A step chart is used to represent data that changes irregularly between time intervals. Now, Excel doesn't have a feature to create a Step Chart like the one shown below but we can create one by making some changes in our data. What is a Step Chart in ExcelA Step chart is the same as a Line Chart. T
4 min read
How to Create a Contingency Table in Excel
A contingency table, also known as a crosstab is used to show the relationship between two categorical variables. In Excel, we can make a contingency table using the pivot table function. They are best for summarizing the relationship between categorical variables. A contingency table is just like a
4 min read
How to Create a Covariance Matrix in Excel?
Covariance is a statistical term that signifies the direction of the linear relationship between two variables. The direction usually refers to whether the variables vary directly or inversely to each other. It should be remembered that covariance only measures how two variables change together, it
3 min read
How To Create Interactive Charts in Excel?
In MS Excel, we can draw various charts, but among them today, we will see the interactive chart. By name, we can analyze that the chart, which is made up of interactive features, is known as an interactive chart. In general, this chart makes the representation in a better and more user-friendly way
9 min read
How to Create a Stem-and-Leaf Plot in Excel?
Stem and Leaf plot is a histogram tabulation of data. Stem and leaf plot is better for data visualization and cleanliness of the data in a certified range. The plot helps determine the frequency distribution of the data. In this article, we will learn how to create a stem and leaf plot in excel. Ste
5 min read
How to Create Slicers and Timelines in Excel?
Slicers and Timelines in Excel are used to implement with the Excel tables and Pivot Tables to help in filtering out large amounts of data to get the high level of information from a collection of data. Finding Slicer and Timeline in Excel Navigate to the Insert menu ribbon, Select the Filters secti
3 min read
How to Create a Thermometer Chart in Excel?
The Thermometer chart in Excel can be used to depict specific data based on the actual value and the target value. It can be used in a wide range of scenarios such as representing the past performance of horses in horse racing or the global temperature and it's variation throughout decades etc. In t
2 min read
How to Create an X-Y Scatter Plot in Excel?
Excel is powerful data visualization and data management tool which can be used to store, analyze, and create reports on large data. It can be used to visualize and compare data using a graph plot. In excel we can plot different kinds of graphs like line graphs, bar graphs, etc. to visualize or anal
2 min read
How to Create a Heat Map in Excel?
Heatmaps are basically used to calculate the intensity of an event. It helps us in easily understanding and interpreting any complex dataset and make deductions from it. It uses colors in the bar graph of a dataset and shows different regions with different colors, which area needs more attention an
2 min read
How to Create an Ogive Graph in Excel?
Ogive graph, the name might scare you, but believe me this is very simple to create. The ogive graph is a cumulative frequency graph. The graph is plotted between the fixed intervals vs the frequency added up before. It is a curve plotted for cumulative frequency distribution on a graph. There are t
4 min read