How to Calculate the Average for Every Column in a CSV File
Last Updated :
28 Apr, 2025
Given a CSV file, our task is to find the average of each column using Python. In this article, we will look at three ways to do this: the built-in csv module, the Pandas library, and the NumPy library.
Example:
Input:
data.csv
Age,Salary
30,50000
25,60000
28,55000
Output: Average Age: 27.67, Average Salary: 55000.00
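The expected output above is the sum of each column divided by the number of rows. As a quick check, here is a minimal sketch of that arithmetic, with the sample values typed in by hand instead of read from data.csv (the variable names are only illustrative):

```python
# Sample values copied from data.csv so the snippet runs without the file
ages = [30, 25, 28]
salaries = [50000, 60000, 55000]

# Average = sum of the values divided by how many values there are
average_age = sum(ages) / len(ages)
average_salary = sum(salaries) / len(salaries)

print(f"Average Age: {average_age:.2f}")
print(f"Average Salary: {average_salary:.2f}")
```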
Calculate the Average for Every Column in a CSV File Using Python
Below are some of the ways to calculate the average for every column in a CSV file using Python:
- Using the csv Module and Manual Calculation
- Using Pandas Library
- Using NumPy Library
Using the csv Module and Manual Calculation
In this example, the program reads a CSV file ('data.csv') with the csv module and keeps a running sum and count of the numeric values in each column. Any value that cannot be converted to a float is skipped. It then divides each sum by its count and prints the average for each column with two decimal places. A column with no numeric values is reported as 0.
Python3
# Python program for the above approach
import csv

# Open the CSV file for reading
with open('data.csv', newline='') as csvfile:
    reader = csv.reader(csvfile)
    headers = next(reader)  # Read the header row

    # Initialize variables to store column sums and counts
    sums = [0] * len(headers)
    counts = [0] * len(headers)

    # Iterate through each row in the CSV file
    for row in reader:
        for i, value in enumerate(row):
            try:
                num = float(value)
                sums[i] += num
                counts[i] += 1
            except ValueError:
                pass  # Ignore non-numeric values

# Calculate and print the average for each column
for i, header in enumerate(headers):
    average = sums[i] / counts[i] if counts[i] != 0 else 0
    print(f"Average {header}: {average:.2f}")
Output:
Average Age: 27.67
Average Salary: 55000.00
Using Pandas Library
In this example, the script uses the Pandas library to read the CSV file ('data.csv') into a DataFrame. It then calls the DataFrame's mean() method, which returns the average of every column as a Series, and prints the result. This is the most concise of the three approaches.
Python3
import pandas as pd
# Read the CSV file (replace 'data.csv' with your file path)
df = pd.read_csv('data.csv')
# Calculate column averages
column_averages = df.mean()
# Display the results
print("Average for each column:")
print(column_averages)
Output:
Average for each column:
Age          27.666667
Salary    55000.000000
dtype: float64
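If the file also contains text columns, passing numeric_only=True to mean() restricts the calculation to numeric columns. The sketch below assumes a made-up Name column and reads the sample from an in-memory string so it runs without data.csv. It also prints each average with two decimal places to match the other approaches.

```python
import io

import pandas as pd

# Inline sample; the Name column is invented for illustration
csv_text = "Name,Age,Salary\nAsha,30,50000\nBen,25,60000\nCara,28,55000\n"
df = pd.read_csv(io.StringIO(csv_text))

# numeric_only=True limits the mean to numeric columns, so Name is skipped
column_averages = df.mean(numeric_only=True)

# Iterate over (column name, average) pairs of the resulting Series
for name, average in column_averages.items():
    print(f"Average {name}: {average:.2f}")
```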
Using NumPy Library
In this example, the script reads the CSV file ('data.csv') with the csv module, skips the header row, and converts the remaining rows into a NumPy array of integers. It then calculates the average of the Age and Salary columns using np.mean() and prints the results with two decimal places. This approach assumes that every cell holds an integer; the conversion fails on a blank or non-numeric cell.
Python3
import numpy as np
import csv

# Read the CSV file (replace 'data.csv' with your file path)
with open('data.csv', 'r') as f:
    reader = csv.reader(f)
    next(reader)  # Skip the header row
    data = np.array(list(reader), dtype=int)

# Calculate column averages
age_avg = np.mean(data[:, 0])  # Column 0 (Age)
salary_avg = np.mean(data[:, 1])  # Column 1 (Salary)

# Display the results
print(f"Average Age: {age_avg:.2f}")
print(f"Average Salary: ${salary_avg:.2f}")
Output:
Average Age: 27.67
Average Salary: $55000.00
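The example above averages two columns by index. One possible way to average every column at once is to load the numeric rows with np.loadtxt() and take the mean along axis 0. This is a sketch, assuming all data cells are numeric; the sample is read from an in-memory string so it runs without data.csv, and the header names are split off by hand.

```python
import io

import numpy as np

# Inline copy of data.csv so the snippet runs without the file
csv_text = "Age,Salary\n30,50000\n25,60000\n28,55000\n"

# Take the column names from the first line
headers = csv_text.splitlines()[0].split(",")

# Load the numeric rows as floats, skipping the header row
data = np.loadtxt(io.StringIO(csv_text), delimiter=",", skiprows=1)

# axis=0 averages down the rows, giving one value per column
averages = data.mean(axis=0)

for header, average in zip(headers, averages):
    print(f"Average {header}: {average:.2f}")
```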