How to plot a CSV file in Python

Learn to plot CSV files in Python. This guide covers various methods, tips, real-world applications, and how to debug common errors.

Published on:

Mon

Apr 6, 2026

Updated on:

Wed

Apr 8, 2026

The Replit Team

ON THIS PAGE

Example H2

Python helps you plot data from CSV files, a key skill for data visualization. You can transform raw spreadsheet information into clear graphs that reveal trends and patterns within your data.

In this article, we'll explore several techniques to plot your data. We'll cover practical tips, show real-world applications, and offer advice to debug common issues you might face along the way.

Simple way to plot CSV data with `pandas`

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('data.csv') df.plot(x='Month', y='Sales') plt.show()--OUTPUT--[A line plot showing sales figures across different months]

The pandas library is your workhorse for this task. The pd.read_csv() function reads your file and organizes the data into a DataFrame—a structure that's perfect for manipulation and analysis. This step is crucial because it sets up your data for easy plotting.

With the data loaded, you can call the df.plot() method directly on the DataFrame. This function is a convenient pandas feature that uses matplotlib to create the graph for you. You just need to specify which columns map to the x and y axes. Finally, plt.show() displays the visual plot you've just defined.

Basic CSV plotting approaches

While pandas is convenient, you can gain more control over your plots by using Python's csv module and the matplotlib library directly.

Using the `csv` module and `matplotlib`

import csv import matplotlib.pyplot as plt months = [] sales = [] with open('data.csv', 'r') as file: reader = csv.DictReader(file) for row in reader: months.append(row['Month']) sales.append(float(row['Sales'])) plt.plot(months, sales, marker='o') plt.show()--OUTPUT--[A line plot with markers showing sales across months]

This method offers more direct control by reading the file and building data lists yourself. Instead of relying on pandas, you use Python's built-in csv module to parse the data row by row.

The csv.DictReader object treats each row as a dictionary, letting you access data by its column header, like row['Month'].
As you loop through the reader, you append values to the months and sales lists. It's crucial to convert numerical data using float() so matplotlib can plot it correctly.

Creating a scatter plot with `plt.scatter()`

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('data.csv') plt.scatter(df['Temperature'], df['Sales'], alpha=0.7) plt.xlabel('Temperature (°C)') plt.ylabel('Sales ($)') plt.show()--OUTPUT--[A scatter plot showing the relationship between temperature and sales]

A scatter plot is ideal for visualizing the relationship between two variables. The plt.scatter() function takes two data series—in this case, temperature and sales—and plots each pair as a point. This helps you spot correlations, like whether higher temperatures lead to more sales.

The alpha=0.7 argument makes the points slightly transparent, which helps reveal areas where data points overlap.
Functions like plt.xlabel() and plt.ylabel() are essential for labeling your axes, making your plot clear and easy to interpret.

Plotting multiple columns from a CSV file

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('sales_data.csv') plt.figure(figsize=(10, 5)) plt.plot(df['Month'], df['Product_A'], label='Product A') plt.plot(df['Month'], df['Product_B'], label='Product B') plt.legend() plt.show()--OUTPUT--[A line plot comparing sales of Product A and Product B over months]

To compare different data series, you can plot them on the same axes. Simply call the plt.plot() function multiple times, once for each column you want to visualize. This overlays the lines on a single graph, making comparisons straightforward.

The label argument in plt.plot() is key for distinguishing between lines.
Calling plt.legend() tells matplotlib to display a legend using the labels you provided.
You can use plt.figure() to adjust the plot's dimensions for better readability.

Advanced CSV plotting techniques

Building on the basic plotting methods, you can now use powerful libraries to create more polished, interactive, and customized visualizations from your CSV data.

Using `seaborn` for enhanced visualizations

import pandas as pd import seaborn as sns import matplotlib.pyplot as plt df = pd.read_csv('data.csv') sns.set_theme(style="whitegrid") sns.lineplot(x='Year', y='Value', hue='Category', data=df) plt.title('Trends by Category') plt.show()--OUTPUT--[A seaborn line plot showing trends across different categories over years]

Seaborn is a library built on matplotlib that simplifies creating attractive statistical plots. The sns.set_theme() function applies a professional visual style, like a grid, with just one line. The real power comes from functions like sns.lineplot(), which can interpret your DataFrame directly.

The hue parameter is a key feature. It automatically groups your data by the specified column—in this case, 'Category'.
It then plots each group as a distinct, color-coded line and adds a legend for you.

This approach lets you visualize trends across different categories without writing complex loops or manual plotting calls.

Interactive plotting with `plotly.express`

import pandas as pd import plotly.express as px df = pd.read_csv('data.csv') fig = px.line(df, x='Date', y='Value', color='Category', title='Interactive Time Series') fig.show()--OUTPUT--[An interactive plotly line chart that can be zoomed and panned]

For plots you can interact with, plotly.express is a top-tier choice. It generates dynamic charts that let you explore the data directly, rather than just viewing a static image. The syntax is clean and expressive, making it easy to get started.

The px.line() function builds a plot from your DataFrame. Much like seaborn, it uses a color parameter to automatically group and color-code your data by category.
Calling fig.show() renders a fully interactive chart. You can zoom, pan, and hover over data points to see their specific values.

Customizing plots with `pandas` styling options

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('data.csv') ax = df.plot(x='Year', y='Revenue', figsize=(10, 6), grid=True, style='.-', colormap='viridis') ax.set_ylabel('Revenue ($ millions)') plt.show()--OUTPUT--[A colorful styled line plot showing revenue over years with grid lines]

You can customize your visuals directly within the df.plot() method. This function accepts several arguments that control the plot's appearance, giving you a quick way to create a more polished chart without extra lines of code.

Parameters like figsize and grid adjust the plot's dimensions and add a background grid for readability.
The style argument defines the line's look, such as using '.-' to show both data points and a connecting line.
By assigning the plot to a variable like ax, you get an Axes object that lets you make further tweaks, like setting a custom y-axis label with ax.set_ylabel().

Move faster with Replit

Replit is an AI-powered development platform where all Python dependencies pre-installed, so you can skip setup and start coding instantly. This lets you move from learning individual techniques to building complete working applications faster.

Instead of piecing together plotting code, describe the app you want to build and Agent 4 will take it from idea to working product. For example, you could build:

A sales dashboard that automatically reads a CSV file and generates an interactive line chart to track monthly revenue.
A market analysis tool that creates a scatter plot to visualize the relationship between ad spend and user sign-ups from your campaign data.
A performance visualizer that plots sales data for multiple products on a single graph, helping you compare their trends over time.

Simply describe your app, and Replit will write the code, test it, and fix issues automatically, all within your browser.

Common errors and challenges

Even with powerful tools, you might hit a few snags when plotting CSV data, but most common issues have straightforward fixes.

Handling missing values in CSV data plots

Missing data, often appearing as empty cells or NaN values, can cause errors or create misleading gaps in your plots. Before plotting, you need to decide how to handle them.

You can remove rows with missing data entirely using the dropna() method. This is a quick fix but might discard valuable information.
Alternatively, you can fill the gaps with a specific value—like zero or the column's average—using the fillna() method, which preserves your dataset's size.

Fixing date parsing issues for time series plots with `parse_dates`

Dates are often tricky because Python might read them as plain text strings instead of chronological data, which messes up time series plots. To fix this, you can instruct pandas to interpret date columns correctly right from the start. When you load your data, use the parse_dates argument in the pd.read_csv() function to specify which columns contain dates.

Converting string columns to numeric types for plotting

Sometimes a column that looks numeric is actually stored as a string, especially if it contains symbols like commas or currency signs. Trying to plot this data will usually result in a TypeError. You can convert these columns to a numeric format using functions like astype(float) or pd.to_numeric, which also helps handle any non-numeric values that might be hiding in your data.

Handling missing values in CSV data plots

Missing data, often appearing as empty cells or NaN values, can create misleading gaps in your plots. When a plotting library encounters these points, it often breaks the line, which can distort the visual trend. The code below demonstrates this exact problem.

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('data_with_gaps.csv') plt.plot(df['Date'], df['Value']) plt.show()

When plt.plot() encounters missing data, it stops drawing the line and restarts at the next valid point, creating visual breaks. The code below demonstrates a common approach to fix this before plotting.

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('data_with_gaps.csv') df = df.dropna(subset=['Value']) plt.plot(df['Date'], df['Value']) plt.show()

The fix is to clean the data before plotting. By calling dropna(subset=['Value']), you're telling pandas to remove any rows where the 'Value' column is empty. This gives the plot function a clean series of data points to work with, resulting in a continuous line without visual breaks. It’s a straightforward way to focus on the overall trend, but remember that you are removing data points from your analysis.

Fixing date parsing issues for time series plots with `parse_dates`

When dates are read as text, your time series plot won't work correctly. The x-axis will be sorted alphabetically, not chronologically, creating a tangled mess. The code below shows what happens when you plot data without proper date parsing.

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('time_series.csv') plt.plot(df['Date'], df['Value']) plt.show()

The pd.read_csv() function reads the Date column as text by default, so the plot connects points alphabetically instead of chronologically. The following code shows how to fix this during the import process.

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('time_series.csv', parse_dates=['Date']) plt.plot(df['Date'], df['Value']) plt.show()

The fix is simple. When loading your data with pd.read_csv(), add the parse_dates=['Date'] argument. This tells pandas to treat the 'Date' column as actual dates, not just text. As a result, your plot will be sorted chronologically, showing the correct time series trend. This is crucial for any dataset where time is a key factor, ensuring your visualizations accurately reflect how data changes over time.

Converting string columns to numeric types for plotting

Sometimes, a column that appears numeric is actually stored as a string, often due to currency symbols or commas. Attempting to plot this data will raise a TypeError because plotting functions can't perform mathematical operations on text. The code below demonstrates this common issue.

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('numeric_data.csv') plt.scatter(df['X'], df['Y']) plt.show()

The plt.scatter() function can't map text values to coordinates on the plot, causing the operation to fail. The following code demonstrates how to prepare the data correctly before you attempt to visualize it.

import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('numeric_data.csv') df['X'] = pd.to_numeric(df['X'], errors='coerce') df['Y'] = pd.to_numeric(df['Y'], errors='coerce') plt.scatter(df['X'], df['Y']) plt.show()

The fix is to convert the text-based columns into numbers before plotting using the pd.to_numeric() function.

The key is the errors='coerce' argument. It replaces any values that can't be converted with NaN (Not a Number), preventing your code from crashing.

This ensures your plotting function receives clean, numeric data. You'll often encounter this issue when your CSV contains formatted numbers, like currency or figures with commas, which Python reads as strings.

Real-world applications

With common data issues solved, you can apply these plotting skills to real-world scenarios like visualizing weather data and analyzing sales performance.

Visualizing weather data with `plot()`

The plot() method lets you easily compare multiple weather metrics, such as temperature and humidity, on a single graph to see how they trend together over time.

import pandas as pd import matplotlib.pyplot as plt weather_data = pd.read_csv('weather_data.csv', parse_dates=['Date']) weather_data.plot(x='Date', y=['Temperature', 'Humidity']) plt.ylabel('Value') plt.show()

This approach leverages the power of pandas for multi-line plotting. First, pd.read_csv() loads the data, while parse_dates=['Date'] ensures your time axis is correctly ordered chronologically.

The key step is calling plot() directly on the DataFrame.
When you pass a list of column names to the y parameter, like ['Temperature', 'Humidity'], pandas automatically generates a separate line for each one.

This is a concise way to overlay multiple datasets on the same axes without needing separate plotting commands for each line.

Analyzing sales performance with `heatmap()`

A `heatmap()` is ideal for analyzing sales performance, as it turns your data into a color-coded grid that instantly reveals high-performing regions and quarters.

import pandas as pd import seaborn as sns import matplotlib.pyplot as plt sales_data = pd.read_csv('quarterly_sales.csv') pivot_table = sales_data.pivot_table(values='Sales', index='Region', columns='Quarter') sns.heatmap(pivot_table, annot=True, cmap='YlGnBu', fmt='.0f') plt.show()

This code first reshapes your data using pivot_table(). This function is essential, as it organizes your sales figures into a grid with regions as rows and quarters as columns—the exact format a heatmap needs.

The sns.heatmap() function then visualizes this new grid structure.
The annot=True argument writes the actual sales value directly onto each cell.
cmap='YlGnBu' sets the color scheme, while fmt='.0f' ensures the numbers are displayed as integers without decimal points.

Get started with Replit

Now, turn these techniques into a working tool. Describe what you want to build to Replit Agent, like “a dashboard that reads a sales CSV and plots monthly revenue” or “a tool that generates a heatmap from quarterly sales data”.

Replit Agent will write the code, test for errors, and deploy your application for you. Start building with Replit.

Get started free

Create and deploy websites, automations, internal tools, data pipelines and more in any programming language without setup, downloads or extra tools. All in a single cloud workspace with AI built in.

Get started free

Create and deploy websites, automations, internal tools, data pipelines and more in any programming language without setup, downloads or extra tools. All in a single cloud workspace with AI built in.

Get started for free

Follow @Replit