analysis of variance


Also found in: Dictionary, Thesaurus, Medical, Legal, Financial, Acronyms, Wikipedia.

analysis of variance

[ə¦nal·ə·səs əv ′ver·ē·əns]
(statistics)
A method for partitioning the total variance in experimental data into components assignable to specific sources.

analysis of variance (ANOVA)

(STATISTICS) a procedure used to test whether differences between the MEANS of several groups are likely to be found in the population from which those groups were drawn. An example might be three groups of people with different educational backgrounds for whom the mean wage level has been calculated. ANOVA provides a way of testing whether the differences between the means are statistically significant by dividing the variability of the observations into two types. One type, called ‘within group’ variability, is the VARIANCE within each group in the SAMPLE. The second type is the variability between the group means (‘between groups’ variability). If this is large compared to the ‘within group’ variability it is likely that the population’s means are not equal.

The assumptions underlying the use of analysis of variance are:

  1. each group must be a RANDOM SAMPLE from a normal population (see NORMAL DISTRIBUTION);
  2. the variance of the groups in the population are equal.

However, the technique is robust and can be used even if the normality and equal variance assumptions do not hold. The random sample condition is nevertheless essential. See also SIGNIFICANCE TEST.

Analysis of Variance

 

a statistical method in mathematics for determining the effect of separate factors on the result of an experiment. Analysis of variance was first proposed by the British statistician R. A. Fisher in 1925 for analyzing results of agricultural experiments designed to reveal under what conditions a specific agricultural crop provided a maximum yield. Modern applications of analysis of variance encompass a broad range of problems in economics, biology, and technology and are generally interpreted in terms of a statistical theory for determining systematic variations among the results of measurements made under the influence of several varying factors.

If the values of the unknown constants a1, … , an could be measured by various methods or means of measurement M1, …, Mm and, in each instance, a systematic error could depend on the selected method as well as on the unknown value of ai being measured, then the results of the measurements xij are represented as sums of the form

xij = ai + bij + δiji 1, 2, ... , = n; j = 1, 2, ... , m

where bij is a systematic error arising during the measurement of a1 by method Mi, and δij is a random error. Such a model is called a two-factor layout of analysis of variance (the first factor is the quantity being measured and the second, the method of measurement). Variances in the empirical distributions, corresponding to the sets of random values

where

are expressed by the formulas

These variances satisfy the identity =

s2 = s02 + s12 + s22

which also explains the origin of the term “analysis of variance.”

If the values of systematic errors do not depend on the method of measurement (that is, there are no systematic variations among the methods of measurement), then the ratio s22/s02 is close to 1. This property is the basic criterion for the statistical determination of systematic variations. If s22 differs significantly from 1, then the hypothesis about the absence of systematic variations is rejected. The significance of the difference is determined according to the probability distribution of the random errors in the measurements. Specifically, if all measurements are of equal accuracy and the random errors are normally distributed, then the critical values for the ratio s22/s02 are determined by F -distribution tables (distribution of the variance ratio).

The above scheme allows only for detecting the existence of systematic variations and, generally speaking, is not suitable for their numerical evaluation with subsequent elimination from the observed results. Such evaluation may be achieved only through numerous measurements (with repeated applications of the described scheme).

REFERENCES

Scheffe, H. Dispersionnyi analiz. Moscow, 1963. (Translated from English.)
Smirnov, N. V., and I. V. Dunin-Barkovskii. Kurs teorii veroiatnostei i matematicheskoi statistiki dlia tekhnicheskikh prilozhenii, 2nd ed. Moscow, 1965.

L. N. BOL’SHEV

References in periodicals archive ?
The results of an analysis of variance and Duncan's mean separation test for static bending strength of calabrian pine wood are shown in Table 3.
A Friedman analysis of variance also revealed statistically significant differences in mean ranks of courses which addressed multicultural content, [chi square] (8, N=23)=28.
multiple regression with dummy variables and two-way analysis of variance.
3] Finally, guidelines are available for selecting one of the four types of analysis of variance tables that are incorporated into the SAS-76 computer program.
Thus, the differences between obtained values in the report of "Global Entrepreneurship Monitor" which were used as data in our analysis of variance were recognized to be random and it cannot be said that Danish entrepreneurs pay more attention to a particular sphere.
Analysis of variance was performed in order to identify possible response patterns of subgroups of respondent caregivers according to socio-demographic variables.
Motor ratings in the two groups were analyzed using repeated measures analysis of variance.
Topics include the development of psychology as a science in South Korea and Latvia, Islamic counseling in Indonesia, mental health problems among Iraqi children, post-tsunami perspectives of students and their parents, management of ADHD, and non-parametric three way analysis of variance with repeated measures.
Linear models; the theory and application of analysis of variance.
A multivariate analysis of variance (MANOVA) estimated effects of family help and time spent on homework on homework attitudes and homework management strategies.
Results of the analysis of variance to test the main effect of disability beneficiary status on the variable of age yielded a statistically significant difference, F (2, 110) = 7.
The latest version of this add-on module includes Complex Samples General Linear Models (CSGLM), enabling users to build linear regression, analysis of variance (ANOVA) and analysis of covariance (ANCOVA) models for evaluating and predicting numerical outcomes.

Full browser ?