How to Create a Histogram in Excel: Step-by-Step

A histogram is a bar chart that shows the frequency distribution of continuous data. Unlike a standard bar chart that compares categories, a histogram groups numeric values into ranges (called bins)...

Key Insights

  • Excel offers two primary methods for creating histograms: the built-in chart type (Excel 2016+) for quick visualization, and the Data Analysis Toolpak for precise statistical control over bin ranges.
  • Proper data preparation is critical—your numeric values must be in a single column without blank cells or text values, and you should understand your data’s range before choosing bin widths.
  • The shape of your histogram tells a story: normal distributions suggest predictable processes, skewed distributions indicate bias, and bimodal patterns often reveal hidden subgroups in your data.

Introduction to Histograms

A histogram is a bar chart that shows the frequency distribution of continuous data. Unlike a standard bar chart that compares categories, a histogram groups numeric values into ranges (called bins) and displays how many observations fall into each range. This makes it indispensable for understanding the shape, spread, and central tendency of your data at a glance.

You’ll find histograms everywhere in practical data analysis. Quality control engineers use them to verify that manufacturing measurements cluster around target values. Marketing analysts examine response time distributions to identify performance bottlenecks. Educators analyze exam score distributions to calibrate grading curves. If you’re working with any substantial dataset of numeric values, a histogram should be one of your first exploratory tools.

The power of a histogram lies in its ability to reveal patterns that summary statistics hide. A mean of 75 and standard deviation of 10 could describe a perfectly normal distribution—or it could mask a bimodal distribution with two distinct peaks. Only visualization tells you which you’re dealing with.

Preparing Your Data

Before creating a histogram, you need clean, properly formatted data. Excel expects your numeric values in a single column with a header row. No blank cells. No text values mixed in. No merged cells.

Here’s a sample dataset representing customer response times in seconds for a support ticket system. Copy this into column A of a new worksheet:

Response Time (seconds)
45
67
23
89
34
56
78
42
91
33
55
67
88
44
76
29
63
51
84
37
72
48
95
31
59
66
82
40
73
54
87
36
61
49
92
28
58
71
85
43
77
52
94
35
64
46
83
39
69
57

This gives you 50 data points ranging from 23 to 95 seconds. Before proceeding, always check your data’s range using =MIN(A2:A51) and =MAX(A2:A51). Understanding your data’s boundaries helps you make informed decisions about bin widths later.

Remove any rows with errors or non-numeric values. You can quickly check for problems by using =COUNT(A2:A51) (should return 50) and =COUNTA(A2:A51) (should also return 50 if all values are numeric).

Method 1: Using the Built-in Histogram Chart (Excel 2016+)

Excel 2016 introduced histograms as a native chart type, making creation straightforward. This method works well for quick exploratory analysis when you don’t need precise control over bin boundaries.

Follow these steps:

  1. Select your data range including the header (A1:A51)
  2. Navigate to Insert → Charts → Insert Statistic Chart (the icon with bars and a curve)
  3. Select Histogram from the dropdown

Excel automatically calculates bin widths using Scott’s normal reference rule, which optimizes for normally distributed data. For our response time data, you’ll likely see 5-7 bins.

To customize the bins, right-click on the horizontal axis and select “Format Axis.” You’ll find these options:

Axis Options:
├── By Category (treats each value as separate)
├── Automatic (Excel's algorithm)
├── Bin width: [specify exact width in units]
├── Number of bins: [specify exact count]
└── Overflow bin / Underflow bin (for outlier handling)

For our response time data, setting a bin width of 15 seconds creates intuitive groupings: 20-35, 35-50, 50-65, 65-80, 80-95. This makes the chart immediately interpretable for stakeholders who think in terms of “under 30 seconds” or “over a minute.”

The overflow and underflow bin options are particularly useful when you want to group extreme values. Setting an overflow bin at 90 seconds would group all responses taking longer than 90 seconds into a single “90+” category.

Method 2: Using the Data Analysis Toolpak

The Data Analysis Toolpak provides more statistical rigor and explicit control over bin ranges. This method is essential when you need reproducible bin boundaries or want to compare multiple datasets using identical bins.

First, enable the Toolpak:

  1. Go to File → Options → Add-ins
  2. In the Manage dropdown, select “Excel Add-ins” and click Go
  3. Check “Analysis Toolpak” and click OK

Now create your bin ranges. In column C, define the upper boundary of each bin:

Bin
30
45
60
75
90

These values represent the upper limit of each bin. Values up to 30 go in the first bin, 31-45 in the second, and so on. Values above 90 will appear in a “More” category.

To generate the histogram:

  1. Go to Data → Data Analysis → Histogram → OK
  2. Set Input Range to your data (A1:A51, including header)
  3. Set Bin Range to your bins (C1:C6, including header)
  4. Check “Labels” since we included headers
  5. Choose an Output Range or New Worksheet
  6. Check “Chart Output” to generate the visual

The Toolpak produces a frequency table alongside the chart:

Bin         Frequency
30          4
45          10
60          12
75          11
90          10
More        3

This frequency table is valuable for further analysis. You can calculate cumulative frequencies, percentages, or use the data in statistical tests.

Customizing Your Histogram

Default Excel histograms rarely meet presentation standards. Here’s a VBA macro that applies professional formatting to any selected histogram:

Sub FormatHistogram()
    Dim cht As Chart
    Dim ser As Series
    
    ' Get the active chart
    If ActiveChart Is Nothing Then
        MsgBox "Please select a histogram chart first."
        Exit Sub
    End If
    
    Set cht = ActiveChart
    Set ser = cht.SeriesCollection(1)
    
    ' Remove gaps between bars (true histogram style)
    ser.Format.Fill.ForeColor.RGB = RGB(70, 130, 180)  ' Steel blue
    ser.Format.Line.ForeColor.RGB = RGB(255, 255, 255)  ' White border
    ser.Format.Line.Weight = 1
    
    ' Access chart via parent and modify gap width
    cht.ChartGroups(1).GapWidth = 5  ' Minimal gap
    
    ' Format title
    If cht.HasTitle Then
        cht.ChartTitle.Font.Size = 14
        cht.ChartTitle.Font.Bold = True
    End If
    
    ' Format axes
    cht.Axes(xlCategory).TickLabels.Font.Size = 10
    cht.Axes(xlValue).TickLabels.Font.Size = 10
    cht.Axes(xlValue).HasTitle = True
    cht.Axes(xlValue).AxisTitle.Text = "Frequency"
    
    ' Remove gridlines for cleaner look
    cht.Axes(xlValue).HasMajorGridlines = False
    
End Sub

Key formatting principles:

  • Eliminate gaps: Set gap width to 0-5% for a true histogram appearance. Bar charts have gaps; histograms shouldn’t.
  • Use neutral colors: Steel blue, slate gray, or muted greens work better than Excel’s default bright colors.
  • Label axes clearly: Always include “Frequency” or “Count” on the y-axis and descriptive units on the x-axis.
  • Add a descriptive title: “Response Time Distribution (n=50)” tells viewers exactly what they’re seeing.

Interpreting Histogram Results

The shape of your histogram reveals critical information about your data’s underlying process.

Normal distribution: A symmetric bell curve centered around the mean suggests a stable, predictable process influenced by many small random factors. Our response time data shows a roughly uniform distribution, which is unusual and might indicate artificial constraints or data collection issues.

Right-skewed distribution: A long tail extending toward higher values is common in time-based data (response times, processing duration) where there’s a natural minimum but no maximum. This suggests most values cluster low, with occasional outliers.

Left-skewed distribution: A long tail toward lower values often appears in ceiling-effect scenarios, like exam scores where many students achieve near-perfect results.

Bimodal distribution: Two distinct peaks indicate your data likely contains two separate populations. If our response times showed peaks at 40 and 80 seconds, we might investigate whether two different support teams or ticket types exist in the data.

Look for outliers as isolated bars separated from the main distribution. In quality control contexts, these often indicate measurement errors or process failures requiring investigation.

Troubleshooting Common Issues

Blank cells cause incorrect counts: Excel’s Histogram tool handles blank cells inconsistently. Use =IF(ISBLANK(A2),"",A2) to explicitly identify blanks, then filter them out before analysis.

Text values corrupt your histogram: A single text value in your data range can cause the entire histogram to fail. Use =ISNUMBER(A2) to verify each cell, or filter your data to numeric values only.

Too many or too few bins: The Freedman-Diaconis rule provides a good starting point: bin width = 2 × IQR × n^(-1/3), where IQR is the interquartile range and n is the sample size. For 50 data points with an IQR of 30, this suggests bins of approximately 13 units.

Large datasets slow Excel: For datasets exceeding 100,000 rows, consider using Power Query to pre-aggregate your data into frequency counts before charting. This dramatically improves performance while producing identical visual results.

Histograms remain one of the most powerful tools in your analytical toolkit. Master these techniques, and you’ll extract insights from numeric data that spreadsheets of summary statistics could never reveal.

Liked this? There's more.

Every week: one practical technique, explained simply, with code you can use immediately.