1. Introduction
We begin this note by considering first the magnitude of the difference between the arithmetic mean
and the geometric mean
. Some new explicit formulas for the magnitude of this difference will be introduced here.
For the sake of simplicity, we shall focus our attention in this introduction on the special case in which the group (or the sample) under investigation is of size
. For this special case, we let
and
denote the scores for a nonnegative quantitative variable measured for individuals 1 and 2, respectively, in the group (or sample). The difference between the arithmetic mean
and the geometric mean
of these scores can, of course, be expressed simply as
Now it is interesting to note that the variance
of the square root of these scores (i.e., the variance of
and
) can be written as
Thus, from formula (2) we see that
Note that
in formulas (2) and (3) is not the variance of the
scores but rather the variance of the square root of these scores. From formula (3), we note that
With the introduction of formulas (2) and (3), we obtain a simple explanation for the inequality (4) in the special case considered in this paragraph.
The inequality (4) is well known, and it is called the arithmetic mean/geometric mean (AM/GM) inequality. (A more general version of the AM/GM inequality is called Jensen’s inequality; e.g., see Roberts and Varberg 1973.)
We next consider two groups (or two samples, one sample from each of two populations) in the special case in which each group is of size
. For expository purposes, we let
,
, and
denote the arithmetic mean of the
scores, the geometric mean of the
scores, and the variance of the square root of these scores, respectively, for group 1, and we let
,
, and
denote the corresponding quantities for group 2. From formula (3), we obtain
and
and thus
From formula (5) we see that the difference between
and
is determined completely by
—that is, by the difference between the variance of the square root of the scores in group 1 and the variance of the square root of the scores in group 2.
We also see from formula (5) that
whenever
and that
whenever
Thus, we find that
and
will have the same sign whenever
and
and
will have different signs (i.e.,
and
, or
and
) whenever
Here now is a simple example in which
and
. This is the sign-switching phenomenon noted in Trond Petersen’s paper and cited in the abstract of the present note: For sample 1, we let
, so
and
; and for sample 2, we let
, so
and
. Thus,
, and
. For further insight into this example of the sign-switching phenomenon, see Appendix A1 in this note.
2. Some More General Formulas
For expository purposes, we next consider the special case in which the group (or the sample) under investigation is of size
. For this case, we let
denote the scores for a nonnegative quantitative variable measured for individuals 1, 2, 3, and 4, respectively, in the group (or sample). The difference between the arithmetic mean and the geometric mean can, of course, be expressed simply as
Now, for this special case, we let
denote the variance of the square root of the scores
and
(i.e., the variance of
and
), the variance of the square root of the scores
and
(i.e., the variance of
and
), and the variance of
and
, respectively. With the variances as defined previously, we now introduce
to denote the following “variance” formula:
Thus, with
defined by (8), we see that
can also be expressed as follows:
where
is the arithmetic mean of the
scores (namely,
) and
is the geometric mean of these scores. Thus, in the special case considered here, we see that
As we did earlier with formula (3), we can now use formula (10) to provide an explicit explanation of the inequality
in the special case considered here. In addition, we can also use formula (10) in this case to obtain a formula corresponding to formula (5) with the
and
in formula (5) replaced now by the corresponding “variance” formulas—that is,
and
, respectively—which are obtained by applying formula (8), using the
scores in group 1 to obtain
and the corresponding
scores in group 2 to obtain
. Similarly, using formula (10) and the formula corresponding to formula (5) with
and
in formula (5) replaced for
and
, respectively, we can also obtain the inequality corresponding to the inequalities in (6a) and (6b) and the inequalities presented after formula (5) and before (6a) and (6b).
The results presented previously—for example, in formulas (3) and (10)—can be directly generalized to the case in which the group (or the sample) under investigation is of size
(for
). We have presented the results obtained for the case where
and
, and for expository purposes, instead of presenting now the general formula, we provide more insight into how the general formula can be obtained by considering next the result for the case where
—that is, where
. For expository purposes, we let
,
, and
denote the arithmetic mean
, the geometric mean
, and the corresponding “variance” formula
—namely, formula (8)—respectively, obtained using the scores
, and we let
,
, and
denote the corresponding quantities obtained using the corresponding scores
. From formula (10), we see that
and
We now introduce
to denote the following “variance” formula:
where
denotes the variance of
and
. Applying the formulas in (11) to formula (12), we find that
where
is the arithmetic mean of the
scores (namely,
) and
is the geometric mean of these scores. Thus, in the case considered here too, we again see that
where
is now defined by formula (12). Considering the definition of the “variance” terms in formula (12), we find that
can also be rewritten as
As noted earlier in this section, the results presented previously (e.g., formulas [3], [10], and [14] and the related formulas [2], [8], [12], and [15]) can be directly generalized to the case in which the group (or sample) under investigation is of size
(for
). In addition, it is also possible to obtain somewhat similar (but more complicated) results when the size
of the group (or sample) cannot be expressed as
(for
). These cases will be considered further in Appendixes A2 and A3.
3. A Different Perspective
We now present an approach to the examination of the magnitude of the difference between the arithmetic mean and the geometric mean that is quite different from the approach presented earlier in this note. For a group (or a sample) of any given size
, we again let
and
denote the arithmetic mean and the geometric mean, respectively, of the
scores—that is,
From (16) we see that
where log denotes the natural logarithm. The logarithmic function,
, is a concave function, defined for all values of
(e.g., see Roberts and Varberg 1973). Because of this concavity, we find that
we thus obtain
As noted in Section 1, inequality (19) is well known. To gain more insight into the magnitude of the difference between the left side and right side of these inequalities, we next consider inequality (18) in more detail.
For the sake of simplicity, we begin here by considering again the special case in which
. In this case, for specified
scores (say,
and
), inequality (18) can be expressed as
we also find, more generally, that
for
and
. Inequality (20) states that the logarithm of the midpoint between
and
is greater than or equal to the midpoint between
and
; in addition, inequality (21) states, more generally, that the logarithm of each point in the interval between
and
is greater than or equal to the height of the corresponding point on the line segment (or chord) between
and
. Note that
is a weighted average of
and
, and the term on the right side of (21) is the corresponding weighted average of
and
. For specified values of
and
, a simple examination of the graph of the logarithmic function will make clear what is the magnitude of the difference between the term on the left side of inequality (20) and the term on the right side of this inequality; a similar simple examination of the graph of the logarithmic function can also be applied with respect to the terms on each side of inequality (21).
Next we consider the special case in which
. In this example, for specified
scores (say,
,
, and
), inequality (18) can be expressed as
Inequality (22) states that the logarithm of the arithmetic average of
,
, and
is greater than or equal to the arithmetic average of the corresponding logarithms. To gain more insight into this inequality, let us first rewrite the arithmetic average
as a weighted average of two quantities as follows:
where
. By applying inequality (21) to this weighted average of
and
, we then obtain the inequality
by applying inequality (20) to
(i.e., to the midpoint between
and
), we see that
(Note that the term on the left side of [22] is equal to the term on the left side of [23] and the term on the right side of [22] is equal to the term on the right side of [24].) Inequality (23) states that the logarithm of the weighted average of
and
is greater than or equal to the corresponding weighted average of
and
, and inequality (24) states that this weighted average of
and
is greater than or equal to the corresponding weighted average of (1) the midpoint between
and
and (2)
. For specified values of
,
, and
, a simple examination of the graph of the logarithmic function will make clear what is the magnitude of the difference between the term on the left side of inequality (23) (i.e., the logarithm of the weighted average of
and
, where
is the midpoint between
and
) and the term on the right side of inequality (23) (i.e., the height of the corresponding point on the line segment [or chord] between
and
). And, similarly, a simple examination of the graph of the logarithmic function will make clear what is the magnitude of the difference between the term on the left side of inequality (24) (which was the same as the term on the right side of inequality [23]—that is, the height of the corresponding point on the line segment between
and
) and the term on the right side of inequality (24)—that is, the height of the corresponding point on the line segment between
and
. (Recall that
is the midpoint between
and
and
is the midpoint on the line segment between
and
.) Thus, from the previous comments on inequalities (23) and (24), we see that inequality (22) can be viewed as a comparison of the logarithm of the weighted average of
and
(where
is the midpoint between
and
) and the height of the corresponding point on the line segment between
and
.
We have considered inequality (18) in the special case in which
and
to gain more insight into the magnitude of the difference between the left side and right side of this inequality. The same approach can be applied in turn for
.
The approach presented in this section is intended to shed some light on the difference between
and
in inequality (18), and earlier in this note we introduced explicit formulas—for example, see formulas (3), (10), (14) and (2), (8), (12), (15), as well as formulas in the Appendix—that shed light on the magnitude of the difference between
and
in the inequalities (4) and (19). We can rewrite the inequalities (4) and (19) as
where
. Earlier in this note, we gave explicit formulas for the quantity
—for example, see formulas (2), (8), (12), and (15) as well as the formulas in the Appendix. It may also be sometimes useful to view the quantity
simply (tautologically) as the nonnegative quantity
.
4. The Difference between the Slopes When a Continuous Dependent Variable Is Expressed in Raw Form Versus Logged Form
The sign-switching phenomenon noted in Petersen’s paper is presented there in the context in which comparison is made between results obtained when a continuous dependent variable is expressed in raw form and the corresponding results obtained when the dependent variable is expressed in logged form. It is possible that the slope obtained in the former case can be positive while the corresponding slope obtained in the latter case can be negative or that the slope in the former case can be negative while the corresponding slope in the latter case can be positive. In the special case when the independent variable can take on two possible scores—say, the scores 1 and 2—the observations on the continuous dependent variable corresponding to the independent variable at score 1 can be viewed as the observations in group 1 (or sample 1), and the observations on the dependent variable corresponding to the independent variable at score 2 can be viewed as the observations in group 2 (or sample 2). A negative slope obtained when the dependent variable is expressed in raw form and a positive slope obtained when the dependent variable is expressed in logged form would be described by the inequalities
and
; similarly, a reversal of the sign of the slopes would be described by the inequalities
and
. With the formulas for
and
in (16) and (17), respectively, we recall that the arithmetic mean of the logarithm of the
scores (i.e., the arithmetic mean of the logged form of the scores on the dependent variable) is equal to the logarithm of the geometric mean of the
scores (i.e., the logarithm of the geometric mean of the raw form of the scores on the dependent variable).