One of the first things that you may wish to do when you have entered your data into a computer is to summarize them in some way so that you can get a ‘feel’ for the data. This can be done by producing diagrams, tables or summary statistics (Chapters 5 and 6). Diagrams are often powerful tools for conveying information about the data, for providing simple summary pictures, and for spotting outliers and trends before any formal analyses are performed.
One Variable
Frequency Distributions
An empirical frequency distribution of a variable relates each possible observation, class of observations (i.e. range of values) or category, as appropriate, to its observed frequency of occurrence. If we replace each frequency by a relative frequency (the percentage of the total frequency), we can compare frequency distributions in two or more groups of individuals.
Displaying Frequency Distributions
Once the frequencies (or relative frequencies) have been obtained for categorical or some discrete numerical data, these can be displayed visually.
- Bar or column chart – a separate horizontal or vertical bar is drawn for each category, its length being proportional to the frequency in that category. The bars are separated by small gaps to indicate that the data are categorical or discrete (Fig. 4.1a).
- Pie chart – a circular ‘pie’ is split into sectors, one for each category, so that the area of each sector is proportional to the frequency in that category (Fig. 4.1b).