Section 2.5: Frequency Distributions

Fundamentals of Social Statistics by Adam J. McKee

A frequency distribution is used to organize and summarize data.  Remember that frequency means the number of times something happens, and its symbol is f.  The symbol N, the number of subjects, is also used to stand for frequency.

A frequency distribution is simply a table that tells us how many things were observed in a particular category.  The most common way to present a frequency distribution is to have the scores (or categories) listed in order from highest to lowest in the first column, and the number of cases listed in the second column.  Many researchers also include the percentage of cases associated with each score.

Frequency distributions fall under the heading of descriptive statistics.  Recall that the purpose of descriptive statistics is to organize and summarize data.  Constructing a frequency distribution does both.  Because all of the possible values of a variable are collapsed into categories with the number of scores that fell into that category, the frequency distribution has the potential to reduce a very large, incomprehensible matrix of data into something that can be seen and interpreted at a glance.

It is common to present frequency distributions as both tables and graphs.  Either way, two things are presented to the reader.  The first is a list of the categories that the variable has been divided into.  Also, the table or graph presents the number of people that fell into each of the categories.  A very simple frequency distribution may involve the variable gender.  A table would provide a column with male and female in it, and another column that has the number of males and the number of females.  A histogram would have a bar showing the number of males, and another bar showing the number of males.

Since the scores (or categories) are ordered, a frequency distribution shows how scores are distributed on a scale; that’s where the name comes from.  It is a matter of custom that the scores be listed from highest to lowest.  Most computer programs, such as Microsoft’s Excel, will sort data from least to greatest (ascending), or greatest to least (descending).

When presented with a frequency distribution table, you can determine several other descriptive statistics.  To determine the number of participants in the entire study, add up the frequency for every category (Σf = N).  To determine the sum of the scores, take each X (score category) value and multiply it by its frequency, then sum those products (note that this does not work for categories that are a range of scores).

Proportions and Percentages

You can easily obtain the proportion or percentage of study participants that fall into a particular category in a frequency distribution.  To obtain the proportion, divide the frequency of the category you are interested in by the total number of people in the study (proportion = f/N).  The proportion associated with each category is called its relative frequency. Recall that while proportions can be expressed as fractions, it is more common to see them expressed in decimal form.  To determine the percentage of participants that fall into a particular category, compute the proportion of people that fall into the category you are interested in and multiply that by 100.  These percentages are often shown in frequency distributions by adding a column headed with the percent sign (%).

Summary

A frequency distribution is a method used in statistics to organize, summarize, and present data in a comprehensible manner. Represented by the symbol “f”, it outlines how often a particular event or category occurs. Essentially, it’s a table where scores or categories are listed, typically from highest to lowest, with their corresponding frequencies or counts. Many researchers also incorporate the percentage representation for each category. This technique falls under descriptive statistics, aiming to present large datasets in an easily digestible format. Frequency distributions can be displayed as tables or visualized as graphs, such as histograms, where categories and their counts are visually represented. For instance, for the “gender” variable, the table or graph would showcase the counts of males and females. Through frequency distributions, one can deduce various descriptive statistics, like the total number of participants or the sum of scores. The section also highlights how to derive proportions and percentages from these distributions, transforming raw counts into relative frequencies or percentages for easier analysis and interpretation.


[ Back | Contents | Next ]

Last Modified:  09/25/2023

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.