factor analysis

Also found in: Dictionary, Thesaurus, Medical, Legal, Financial, Acronyms, Wikipedia.

factor analysis

[′fak·tər ə‚nal·ə·səs]
Given sets of variables which are related linearly, factor analysis studies techniques of approximating each set relative to the others; usually the variables denote numbers.

factor analysis

a MULTIVARIATE statistical technique in which the covariances (or CORRELATIONS) between a large set of observed VARIABLES are explained in terms of a small number of new variables called factors. The ideas originated in the work on correlation by Galton and Spearman, and were developed primarily in studies of intelligence. Most applications are found in psychology and sociology.

The technique is ‘variable directed’, with no distinction between INDEPENDENT and DEPENDENT VARIABLES in the data set. There are four steps to the analysis. The first is to derive a correlation matrix in which each variable in the data set is correlated with all the other variables. The next step is to extract the factors. The aim of this stage is to determine the minimum number of factors that can account adequately for the observed correlations between the original variables. If the number of factors identified is close to the number of original variables, there is little point to the factor analysis. Sometimes it is difficult to assign a meaningful name to the factors. The purpose of the third (optional) step, rotation, is to find simpler and more easily interpretable factors. If a satisfactory model has been derived, the fourth step is to compute scores for each factor for each case in the data set. The factor scores can then be used in subsequent analyses.

Factor analysis attracts a lot of criticism (Chatfield and Collins, 1980). The observed correlation matrix is generally assumed to have been constructed using product moment correlations. Hence, the usual assumptions of an interval measurement, normal distributions and homogeneity of variance are needed. Against this, it is argued the technique is fairly robust. Another problem is that the different methods of extraction and rotation tend to produce different solutions. Further, although factors may be clearly identified from the analysis, it may be difficult to give them a meaningful interpretation. Despite the need for so many judgmental decisions in its use, factor analysis remains a useful exploratory tool.

Factor Analysis


a branch of multivariate analysis embracing methods for estimating the dimensions of a set of observed variables by studying the structure of the covariance or correlation matrices.

The basic assumption underlying factor analysis is that the correlations between a large number of observable variables are determined by the existence of a smaller number of hypothetical unobservable variables, or factors. A general model for factor analysis is provided in terms of the random variables X1 . . .,Xn, which are the observation results, by the following linear model:

Here, the random variables fj are common factors, the random variables Ui are factors specific to the variables Xi and are not correlated with the fj, and the εj, are random errors. It is assumed that k < n, that the random variables ej are independent of each other and of the fj and Ui, and that E∊i = 0 and D∊i = Factor Analysis The constant coefficients aij are called loadings (weights): aij is the loading of the ith variable on the jth factor. The quantities aij, bi, and Factor Analysis are taken as unknown parameters that have to be estimated.

In the form given above, the model for factor analysis is characterized by some indeterminacy, since n variables are expressed in terms of n + k other variables. Equations (*), however, imply a hypothesis, regarding the covariance matrix, that can be tested. For example, if the factors fj are uncorrelated, Dfi = 1, Bi = 0, and cij are the elements of the matrix of covariances between the Xi, then there follows from equation (*) an expression for the cij in terms of the loadings and the variances of the errors:

The general model for factor analysis is thus equivalent to a hypothesis regarding the covariance matrix: the covariance matrix can be represented as the sum of the matrix A A’ and the diagonal matrix with elements Factor Analysis, where

A = {aij}

The estimation procedure in factor analysis consists of two steps. First, the factor structure (that is, the number of factors required to account for the correlations between the Xi) is determined, and the loadings are estimated. Second, the factors are estimated on the basis of the observation results. The fundamental obstacle to the interpretation of the set of factors is that for k > 1 neither the loadings nor the factors can be determined uniquely, since the factors fj in equations (*) can be replaced by means of any orthogonal transformation. This property of the model is made use of to transform (rotate) the factors; the transformation is chosen so that the observed variables have the maximum possible loadings on one factor and minimum possible loadings on the remaining factors.

Various practical methods are known for estimating loadings. The methods assume that X1,. . ., Xn obey a multivariate normal distribution with covariance matrix C = {cij}. The maximum likelihood method is noteworthy. It leads to a unique set of estimates of the cij, but for the estimates of the aij it yields equations that are satisfied by an infinite set of solutions with equally good statistical properties.

Factor analysis is regarded as dating from 1904. Although it was originally developed for problems in psychology, the range of its applications is much broader, and it is now used to solve various practical problems in such fields as medicine, economics, and chemistry. A rigorous theoretical grounding, however, has not yet been provided for many results and methods of factor analysis that are widely used in practice. The mathematical description of modern factor analysis in a rigorous manner is an extremely difficult task and remains uncompleted.


Lawley, D., and A. Maxwell. Faktornyi analiz kak statisticheskii metod. Moscow, 1967. (Translated from English.)
Harman, H. Sovremennyi faktornyi analiz. Moscow, 1972. (Translated from English.)


References in periodicals archive ?
Data of study 1 was analysed by exploratory factor analysis with varimax rotation as suggested by the theory (Watson el al.
A useful checklist has been provided on what to report in a factor analysis article.
It can be concluded that MFA is also a particular case of factor analysis framework; only it considers both the class information and nearest neighborhood of the samples for the partition design.
In order to fill the research gap in literature on pharmaceutical project management and to have an in-depth analysis of the reasons that delay the projects, this study took the help of statistical factor analysis including both exploratory and confirmatory factor analyses.
Use of Exploratory Factor Analysis in Published Research: Common Errors and Some Comment on Improved Practice.
For the prediction of body weight in the indigenous Mengali sheep, factor analysis scores in the multiple regression analysis was applied to the current data, as a superior alternative to remove multicollinearity problem being observed in multiple regression analysis, path analysis and canonic correlation analysis.
In order to obtain a final version of the QSMP, confirmatory factor analysis was carried out, and confirmed the factor structure of the five large factors obtained through the interjudge analysis (Indiscipline, Antisocial Behavior, Bullying, Disruptive Behavior and Academic Indifference).
544 respectively, indicating that sufficient correlation existed between different foods to proceed with factor analysis.
In this research, applying realistic data from the city of Isfahan in Iran, it is shown how with the aid of factor analysis technique we can analyze information of regional municipalities.