AMOS Tutorial
AMOS Tutorial
An Introduction
August 2012
Table of Contents
Section 1: Introduction ................................................................................................................3
1.1 About this Document/Prerequisites ....................................................................................3
1.2 Accessing AMOS ..............................................................................................................3
1.3 Documentation ..................................................................................................................4
1.4 Getting Help with AMOS ..................................................................................................4
Section 2: SEM Basics ................................................................................................................5
2.1 Overview of Structural Equation Modeling ........................................................................5
2.2 SEM Nomenclature............................................................................................................6
2.3 Why SEM? ........................................................................................................................8
Section 3: SEM Assumptions ......................................................................................................8
3.1 A Reasonable Sample Size .................................................................................................8
3.2 Continuously and Normally Distributed Endogenous Variables .........................................9
3.3 Model Identification (Identified Equations) ........................................................................9
3.4 Complete Data or Appropriate Handling of Incomplete Data ........................................... 13
3.5 Theoretical Basis for Model Specification and Causality.................................................. 14
Section 4: Building and Testing a Model using AMOS Graphics ............................................... 15
4.1 Illustration of the SEM-Multiple Regression Relationship ................................................ 15
4.2 Drawing a model using AMOS Graphics ......................................................................... 19
4.3 Reading Data into AMOS ................................................................................................ 26
4.4 Selecting AMOS Analysis Options and Running your Model .......................................... 33
Section 5: Interpreting AMOS Output ....................................................................................... 35
5.1 Evaluating Global Model Fit ............................................................................................ 36
5.2 Tests of Absolute Fit ........................................................................................................ 38
5.3 Tests of Relative Fit ......................................................................................................... 38
5.4 Modifying the Model to Obtain Superior Goodness of Fit ................................................ 39
5.5 Viewing Path Diagram Output ......................................................................................... 46
5.6 Significance Tests of Individual Parameters ..................................................................... 49
Section 6: Putting it all together - A substantive interpretation of the findings ........................... 50
References ................................................................................................................................ 51
2
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Section 1: Introduction
1.1 About this Document/Prerequisites
This course is a brief introduction and overview of structural equation modeling using the
AMOS (Analysis of Moment Structures) software. Structural equation modeling (SEM)
encompasses such diverse statistical techniques as path analysis, confirmatory factor analysis,
causal modeling with latent variables, and even analysis of variance and multiple linear
regression. The course features an introduction to the logic of SEM, the assumptions and
required input for SEM analysis, and how to perform SEM analyses using AMOS.
By the end of the course you should be able to fit structural equation models using AMOS. You
will also gain an appreciation for the types of research questions well-suited to SEM and an
overview of the assumptions underlying SEM methods.
You should already know how to conduct a multiple linear regression analysis using SAS, SPSS,
or a similar general statistical software package. You should also understand how to interpret the
output from a multiple linear regression analysis. Finally, you should understand basic Microsoft
Windows navigation operations: opening files and folders, saving your work, recalling
previously saved work, etc.
3
The Division of Statistics + Scientific Computation, The University of Texas at Austin
1.3 Documentation
The AMOS manual is the AMOS 16.0 User's Guide by James Arbuckle and can be found online.
It contains over twenty examples that map to models typically fitted by many investigators.
These same examples, including sample data, are included with the student and commercial
versions of AMOS, so you can easily fit and modify the models described in the AMOS manual.
The previous AMOS manual is the AMOS 4.0 User's Guide by James Arbuckle and Werner
Wothke; this manual also contains numerous examples. A copy of the AMOS 4.0 User's Guide is
available at the PCL for check out by faculty, students, and staff at UT Austin. Barbara Byrne
has also written a book on using AMOS. The title is Structural Equation Modeling with AMOS:
Basic Concepts, Applications, and Programming. The book is published by Lawrence Erlbaum
Associates, Inc. Lawrence Erlbaum Associates, Inc. also publishes the journal Structural
Equation Modeling on a quarterly basis. The journal contains software reviews, empirical
articles, and theoretical pieces, as well as a teachers section and book reviews.
A number of textbooks about SEM are available, ranging from Ken Bollens encyclopedic
reference book to Rick Hoyles more applied edited volume. Several commonly cited titles are
shown below.
Bollen, K.A. (1989). Structural Equations with Latent Variables. New York: John Wiley and
Sons.
Loehlin, J.C. (1997). Latent Variable Models. Mahwah, NJ: Lawrence Erlbaum Associates.
Hoyle, R. (1995). Structural Equation Modeling: Concepts, Issues, and Applications. Thousand
Oaks, CA: Sage Publications.
Hatcher, L. (1996). A Step-by-Step Approach to using the SAS System for Factor Analysis and
Structural Equation Modeling. Cary, NC: SAS Institute, Inc.
4
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Non-UT and UT AMOS users will find Ed Rigdon's SEM FAQ Web site to be a useful
resource; see the information on the SEMNET online discussion group for information on how to
subscribe to this forum to post questions and learn more about SEM.
5
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The researcher first specifies a model based on theory, then determines how to measure
constructs, collects data, and then inputs the data into the SEM software package. The package
fits the data to the specified model and produces the results, which include overall model fit
statistics and parameter estimates.
The input to the analysis is usually a covariance matrix of measured variables such as survey
item scores, though sometimes matrices of correlations or matrices of covariances and means are
used. In practice, the data analyst usually supplies SEM programs with raw data, and the
programs convert these data into covariances and means for its own use.
The model consists of a set of relationships among the measured variables. These relationships
are then expressed as restrictions on the total set of possible relationships.
The results feature overall indexes of model fit as well as parameter estimates, standard errors,
and test statistics for each free parameter in the model.
By contrast, we do claim that F1 causes the scores observed on the measured variables I1 and I2.
Causal effects are represented by single-headed arrows in the path diagram. F1 and F2 can be
conceptualized as the variance the two indicators share (i.e., what the two indicators have in
common.) As you have probably guessed by now, F1 and F2 are latent factors; I1 through I4 are
observed variables. Perhaps they are survey items. E1 through E4 are residual or error variances
that also cause response variation in I1 through I4. This diagram tells us that scores or responses
on survey items one through four are caused by two correlated factors, along with variance that
is unique to each item. Some of that unique variance might be due to measurement error.
Some of the paths shown in the diagram are labeled with the number 1. This means that those
paths coefficients have fixed values set to 1.00. These fixed values are included by necessity:
they set the scale of measurement for the latent factors and residuals. Alternatively, you can set
7
The Division of Statistics + Scientific Computation, The University of Texas at Austin
the variances of the factors to 1.00 to obtain implicitly standardized solutions. Note: you should
not use this latter method when you perform a multiple group analysis.
Assumptions underlying the statistical analyses are clear and testable, giving the
investigator full control and potentially furthering understanding of the analyses.
Graphical interface software boosts creativity and facilitates rapid model debugging (a
feature limited to selected SEM software packages).
SEM programs provide overall tests of model fit and individual parameter estimate tests
simultaneously.
Regression coefficients, means, and variances may be compared simultaneously, even
across multiple between-subjects groups.
Measurement and confirmatory factor analysis models can be used to purge errors,
making estimated relationships among latent variables less contaminated by measurement
error.
Ability to fit non-standard models, including flexible handling of longitudinal data,
databases with autocorrelated error structures (time series analysis), and databases with
non-normally distributed variables and incomplete data.
This last feature of SEM is its most attractive quality. SEM provides a unifying
framework under which numerous linear models may be fit using flexible, powerful
software.
data are perfectly well-behaved (i.e., normally distributed, no missing data or outlying cases,
etc.). Notice that Bentler and Chou mention five cases per parameter estimate rather than per
measured variable. Measured variables typically have at least one path coefficient associated
with another variable in the analysis, plus a residual term or variance estimate, so it is important
to recognize that the Bentler and Chou and Stevens recommendations dovetail at approximately
15 cases per measured variable, minimum. More generally, Loehlin (1992) reports the results of
Monte Carlo simulation studies using confirmatory factor analysis models. After reviewing the
literature, he concludes that for this class of model with two to four factors, the investigator
should plan on collecting at least 100 cases, with 200 being better (if possible). Consequences of
using smaller samples include more convergence failures (the software cannot reach a
satisfactory solution), improper solutions (including negative error variance estimates for
measured variables), and lowered accuracy of parameter estimates and, in particular, standard
errors SEM program standard errors are computed under the assumption of large sample sizes.
When data are not normally distributed or are otherwise flawed in some way (almost always the
case), larger samples are required. It is difficult to make absolute recommendations as to what
sample sizes are required when data are skewed, kurtotic, incomplete, or otherwise less than
perfect. The general recommendation is thus to obtain more data whenever possible.
there are an infinite number of possible parameter estimate values are said to be underidentified.
Finally, models that have more than one possible solution (but one best or optimal solution) for
each parameter estimate are considered overidentified.
The following equation, drawn from Rigdon (1997) may help make this clearer:
x + 2y = 7
In the above equation, there are an infinite number of solutions for x and y (e.g., x = 5 and y =1,
or x = 3 and y = 2, or x = 1 and y = 3, etc.). These values are therefore underidentified because
there are fewer "knowns" than "unknowns." A just-identified model is one in which there are as
many knowns as unknowns.
x + 2y = 7
3x - y = 7
For this equation, there are just as many knowns as unknowns, and thus there is one best pair of
values (x = 3, y = 2).
An overidentified model occurs when every parameter is identified and at least one parameter is
overidentified (i.e., it can be solved for in more than way--instead of solving for this parameter
with one equation, more than one equation will generate this parameter estimate). Typically,
most people who use structural equation modeling prefer to work with models that are
overidentified. An overidentified model has positive degrees of freedom and may not fit as well
as a model that is just identified. Imposing restrictions on the model when you have an
overidentified model provides you with a test of your hypotheses, which can then be evaluated
using the chi-square statistic of absolute model fit and various descriptive model fit indices. The
positive degrees of freedom associated with an overidentified model allows the model to be
falsified with the chi-square test. When an overidentified model does fit well, then the researcher
typically considers the model to be an adequate fit for the data.
Identification is a structural or mathematical requirement in order for the SEM analysis to take
place. A number of rules can be used to assess the identification level of your models, but these
rules are not perfect, and they are very difficult (almost impossible, in fact) to evaluate by hand,
especially for complex models. SEM software programs such as AMOS perform identification
checks as part of the model fitting process. They usually provide reasonable warnings about
underidentification conditions.
10
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The remedy for all forms of underidentification is to try to locate the source of the identification
problem and determine if the source is empirical underidentification or structural
underidentification. For structural underidentification, the only remedy is to respecify the model.
Empirical underidentification may be correctable by collecting more data or respecifying the
model.
An example from Rigdon (1997) may be informative to highlight these issues. Consider the
following model:
It contains one factor, F1, two error variances or residuals, e1 and e2, and one factor loading
value connecting F1 to v2. This model requires four parameters to be estimated: the factors
variance, the two error variances, and the one factor loading.
11
The Division of Statistics + Scientific Computation, The University of Texas at Austin
How many of the available inputs can be used in the analysis? Three. How do you know there
are three available inputs? You can use the formula
[Q(Q + 1)] / 2
where Q represents the number of measured variables in the database that are used in the model.
In this model there are two observed variables, I1 and I2, so via the formula shown above,
[2(2+1)]/2 = 3. There are two variances, one for each of the two variables, and one covariance
between I1 and I2.
How is it possible to estimate four unknown parameters from three inputs? The answer is that it
is not possible: There are three available knowns or degrees of freedom available, but there are
four unknown parameters to estimate, so overall, the model has 3 4 = -1 degrees of freedom, a
clearly impossible state of affairs. This model is clearly underidentified additional constraints
will need to be imposed on this model in order to achieve a satisfactory level of identification.
Now consider a second model:
12
The Division of Statistics + Scientific Computation, The University of Texas at Austin
This new model has [4(4+1)] / 2 = 10 available degrees of freedom because there are four
observed variables used in the model. Subtracting four error variances, two factor loadings, and
two factor variances, and one covariance between the factors from the 10 available degrees of
freedom results in one left over or available degree of freedom. This model is structurally
identified. In fact, it is overidentified because there is one positive degree of freedom present.
As it turns out, if the parameter estimate of the covariance between F1 and F2 becomes zero or
very close to zero, the model can become empirically underidentified because even though it is
structurally identified by the covariance specified between F1 and F2, it is not identified on an
empirical basis from the computer softwares perspective.
In practice, all successfully fitted models are just-identified or overidentified. Typically you want
to use overidentified models because these models allow you to test statistical hypotheses,
including global model fit (Loehlin, 1992).
five percent (or fewer) cases are not missing completely at random, inconsistent parameter
estimates can result. Otherwise, missing data experts (e.g., Little and Rubin, 1987) recommend
using a maximum likelihood estimation method for analysis, a method that makes use of all
available data points. AMOS features maximum likelihood estimation in the presence of missing
data.
14
The Division of Statistics + Scientific Computation, The University of Texas at Austin
15
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The salient output from SPSS shows the correlations among each of the predictors, as well as the
predictors and the dependent variable. An R2 value of .32 is then shown, with unstandardized and
standardized regression coefficients and significance tests shown in the final table.
Now consider the equivalent model fit in AMOS:
16
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The three predictors are allowed to covary; predictors covariances are shown in the diagram.
For example, the covariance between education and the socioeconomic indicator is 3.47. The
variance of each predictor appears above its rectangle. The unstandardized regression weights,
which correspond to the B coefficients displayed in the SPSS output, link the predictor variables
to the dependent variables rectangle in the AMOS diagram. For instance, the unstandardized
regression coefficient of the education variable is .11. Since the unstandardized regression
coefficient represents the amount of change in the dependent variable per single unit change in
the predictor variable, this result suggests that for every single unit of increase in education level,
powerlessness in 1971 is reduced by -.11 units in the population from which Wheaton et al. drew
their sample.
The diagram also features a residual variance associated with the measurement of powerlessness
in 1971. Since this variance is not directly measured, it is represented as a latent variable in the
AMOS diagram. The estimate of the error variance is 6.86.
dependent variable explainable by the collective set of the predictors. AMOS produces these
statistics as well, but it uses a separate diagram to display the standardized coefficients and R 2
value.
In this diagram the covariances have been replaced by correlations. For example, the correlation
between the socioeconomic indicator and the powerlessness measure in 1967 is -.32, the same
value shown in the SPSS Correlations table above. Standardized regression coefficients now link
the predictors to the dependent variable, and the R2 value for the dependent variable appears
above its rectangle on the diagram. The standardized regression weights represent the amount of
change in the dependent variable that is attributable to a single standard deviation units worth of
change in the predictor variable.
AMOS also produces tabular output similar to that of SPSS that displays the unstandardized and
standardized regression coefficients, the standard error estimates of the unstandardized
regression coefficients, and tests of statistical significance of the null hypothesis that each
unstandardized regression coefficient equals zero. The AMOS tabular output will be discussed at
more length below.
18
The Division of Statistics + Scientific Computation, The University of Texas at Austin
AMOS can fit models that are not subject to these limitations. An example of such a model
applied to the Wheaton et al. database appears in the next section.
19
The Division of Statistics + Scientific Computation, The University of Texas at Austin
How can you test this model and develop from scratch a publication quality path diagram like the
one shown above? The first step is to launch AMOS. If you are using the UT server, click on the
Start menu from the Windows taskbar, select All Programs, select Statistics, then choose AMOS
16 Graphics. You will now see a blank AMOS Graphics diagram page that looks like this:
20
The Division of Statistics + Scientific Computation, The University of Texas at Austin
You will also see a floating toolbar appear. The toolbar may partially obscure the AMOS
drawing area, but you can move it out of the way by dragging it to the side of your computer
screen. Select the toolbar by single-clicking on it with your mouse pointer. If you place your
mouse pointer over any tool icon with the toolbar selected, AMOS displays the name and
function of the tool. You can also right-click on the tool icon for more detailed AMOS help on
how to use the tool.
To add or remove individual tools from the toolbar, choose Move Tools from the Tools menu in
AMOS Graphics. You may need to resize your toolbar from time to time to view all available
tools, particularly if you have many tools displayed or you have a small monitor.
Tools are activated by single-clicking on their icons with your mouse pointer. A tool that is
active or in use will have an icon that appears to be depressed or lowered. To deactivate a
particular tool, single-click on its icon once again. Its icon should change to look like the other
tools on the toolbar.
Select the Draw Latent Variables and Indicators tool from the toolbar by first single-clicking
on the toolbar to make it active and then single-clicking on the Draw Latent Variables and
Indicators button to make it active. This tool resembles a factor analysis model with three
indicator variables:
21
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Move your mouse pointer to the drawing surface and draw an oval by clicking and holding your
mouse button. Once you have a satisfactory oval drawn, click on the oval twice. You will now
have a latent variable with two observed indicators.
Perhaps your latent and observed variables are too close to the edge of the AMOS drawing
space. To move them closer to the center of the diagram, click on the Preserve Symmetries
button.
Next, click on the red moving truck tool (Move Objects); then click and drag the latent variable
and its indicators closer to the center of the AMOS drawing space. When you are finished, you
will have a diagram that looks like this:
So far, you have drawn part of the desired model. This part of the model could represent a latent
variable called Socioeconomic Status (SES) with two observed indicators, Education, and
Socioeconomic Index (SEI). Recall that rectangles represent observed or measured variables,
while ovals represent latent or unobserved variables. The two ovals leading to the two observed
variables represent the residuals for those observed variables, while the larger oval represents
their shared variance. This shared variance is captured by the latent variable, Socioeconomic
Status.
The model has two more segments that resemble this section (alienation at 1967 and alienation at
1971), so you can copy the portion of the model you have already built. To do that:
22
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Click on the Select All Objects tool (represented by a raised hand with four fingers
extended). The entire diagram should change color from black to blue.
Click on the Duplicate Objects tool icon (it resembles a photocopier), click on the latent
variables oval, and drag your mouse pointer to the right. You should see a replica of the
single latent variable-two indicator variable structure appear.
Hint: If you hold down the shift key while you perform this operation, the new portion of the
diagram will be horizontally aligned with the previous diagram section.
When you finish with this step, your drawing should look like this:
Click on the Deselect Objects tool button to remove the object selection. This button resembles
a hand with four fingers retracted or bent.
Rotate the indicators of this second latent variable so that they point down on the AMOS
diagram space instead of up. Select the Rotate Indicators tool button.
23
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Click once on the second latent variables oval. The two indicators and their residuals rotate 90
degrees clockwise. Click the oval once more. The indicators rotate another 90 degrees. Click on
the Rotate Indicators tool button to deactivate it.
Click on the Preserve Symmetries tool and then click on the Move Objects tool. Click on the
second latent variable and move it beneath the first latent variable and to the left on the diagram.
When you finish, your diagram should look like this:
24
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Copy the latent and observed variables in the lower set by selecting the Duplicate Objects
button from the toolbar once again, selecting the latent variable of the lower latent variable,
and drag it to the right. Your diagram will now look like this:
Notice that a number of the paths are fixed to a value of 1.00. These are present to ensure proper
model identification. Perhaps you want to have those paths be consistently on the left side of
each variable set. To do that, use the Reflect Indicators tool.
25
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Once you have selected the tool button, click once on each of the two lower latent variables.
Your updated diagram will now look like this:
Congratulations! You have just specified the measurement part of your model. Now you must
tell AMOS where to find the data for conducting the analysis and you must label the observed
variables.
26
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Click on File Name to specify the name of the data file. The View Data button launches a data
files external application. For instance, if you specify a SPSS data file using the File Name
button, and you then click View Data, AMOS will launch SPSS so that you can view the
contents of the data file. The Grouping Variable button allows you to specify a grouping
variable within a database, so you can easily set up and test models that involve multiple groups
of subjects.
Access
dBase 3 5
Microsft Excel 3, 4, 5, and 97
FoxPro 2.0, 2.5 and 2.6
Lotus wk1, wk3, and wk4
SPSS *.sav files, versions 7.0.2 through 9.0 (both raw data and matrix formats)
Comma-delimited text files (semicolon-delimited in countries where the comma is used
as a decimal separator).
AMOS recognizes empty or blank cells in MS Excel and Access as missing data. Systemmissing data (represented by blank cells in the SPSS data editor spreadsheet) are also properly
recognized by AMOS as missing data points. Comma-delimited data files should have two
consecutive commas listed to represent a missing data point.
27
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Simulated data for this example model are in an SPSS for Windows data file called Wheatongenerated.sav. If you have not already downloaded this data file, you can retrieve it at the
following Web address:
http://ssc.utexas.edu/software/software-tutorials#amos
To read these data into AMOS, select Data Files from the AMOS Graphics File menu.
File
Data Files
In the dialog box that appears, click on the File Name button. AMOS will produce a file opening
dialog box that resembles the box shown below.
In this example, the Wheaton-generated SPSS database has been located and selected in the File
Name slot of this dialog box. You should follow suit, using your copy of AMOS Graphics:
locate the Wheaton-generated.sav SPSS data file on your computer, and choose it using AMOS.
Once you have located and specified the appropriate data file, click on the Open button. You
will now see the following window:
28
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Click OK to return to AMOS Graphics. You have now made the data available to AMOS.
Once AMOS receives the data, you can label observed variables. Select the Variables in
Dataset option from the View/Set menu to see a window like the one shown below.
View/Set
Variables in Dataset
Select each of these entries one at a time and drag them onto the appropriate rectangles on the
path diagram. You may notice that some of the variable names are too large to fit inside their
rectangles. One partial remedy for this problem is to enlarge the entire model by clicking on the
Resize Diagram to Fit the Page tool bar icon.
29
The Division of Statistics + Scientific Computation, The University of Texas at Austin
You can also use the Shape Change tool icon to alter the shape of the rectangles so that they are
wide enough to hold the variable names:
Select each of the rectangles using the Select Single Objects tool (it resembles a hand with the
index finger extended).
Next, click on the Shape Change tool icon to alter the shape of the rectangles. Finally, you can
reduce the size of the variable name font by double-clicking on the rectangle for a particular
variable. After you make these adjustments, your finished measurement model diagram should
look like this:
30
The Division of Statistics + Scientific Computation, The University of Texas at Austin
(i.e., the residuals). After the macro runs, each residual will have a name, beginning with e1.
To specify the paths among the latent variables, use single-headed arrows to represent causal
relationships and dual-headed arrows to represent bidirectional relationships (correlations).
To create a residual for a latent variable, use the Add Unique Variable tool icon.
Select the tools icon on the toolbar. Next, click once on a latent variable to create a unique
residual variable for that latent variable. You can click on the latent variable again to move the
position of the new residual variable 45 degrees to the right (clockwise). By clicking on the
latent variable repeatedly, you can move the new residual variable around the original latent
variable in 45 degree increments until you find a location where it looks best on the diagram.
After you make these adjustments, your finished model diagram should resemble the following
image.
32
The Division of Statistics + Scientific Computation, The University of Texas at Austin
After you have drawn your model, be sure to save it by choosing the File menu, and then
selecting Save As. When you save the model, AMOS automatically creates two back-up model
files in the same directory in which you have your original diagram. The original diagram file
will have the extension .amw; the back-up files will have the extensions .bk1 and .bk2,
respectively.
33
The Division of Statistics + Scientific Computation, The University of Texas at Austin
In this analysis, the investigator requests a number of options, including a standardized solution,
squared multiple correlations, the sample covariance matrix, and the covariance matrix of the
residuals remaining after AMOS fits the model.
Next, examine the Estimation tab. Notice that this tab provides a check box that allows you to
estimate means and intercepts.
Hint: If your database has any cases with incomplete data, AMOS will require you to estimate
means and intercepts; so you must select this check box if your database has any missing data on
observed variables included in your model.
Because this models database does not contain any missing data and we are not interested in
means at present, we leave the Estimation tab settings at their default values.
34
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Before you run the model, be sure to save it by choosing Save As from the File menu and saving
a copy of the model file to an appropriate location on your computers disk drive.
To run the model, close the Analysis Properties window and click on the Calculate Estimates
tool icon. It resembles an abacus:
35
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The bottom portion of this output shows a chi-square value of 76.10 with 6 degrees of freedom.
This test statistic tests the overall fit of the model to the data. The null hypothesis under test is
that the model fits the data, so you hope to find a small, non-significant chi-square value for this
test. The significance test for the chi-square model fit statistic is described in the next section.
The table that appears contains a list of output categories on the left side of the window and an
open space on the right. When you select a particular subset of output for display, it appears in
the right-hand space. Consider the Notes for Model section of the output.
You should always examine the Notes for Model section of the AMOS output after each AMOS
analysis finishes because AMOS will display most errors and warnings in this section of the
36
The Division of Statistics + Scientific Computation, The University of Texas at Austin
output. In the output shown above, AMOS reports that the minimum was achieved with no errors
or warnings. The chi-square test of absolute model fit is reported, along with its degrees of
freedom and probability value. The interpretation of these results will be discussed in further
detail below.
The absence of errors or warnings in this section of the output means that it is safe for you to
proceed to the next output section of interest, the Fit Measures output.
The Fit Measures output contains five columns. The first column shows the name of each fit
measure. The second column, labeled Default model, contains the fit statistics for the model you
specified in your AMOS Graphics diagram.
The next two columns, labeled Saturated and Independence, refer to two baseline or comparison
models automatically fitted by AMOS as part of every analysis. The Saturated model contains as
37
The Division of Statistics + Scientific Computation, The University of Texas at Austin
many parameter estimates as there are available degrees of freedom or inputs into the analysis.
The Saturated model is thus the least restricted model possible that can be fit by AMOS. By
contrast, the Independence model is one of the most restrictive models that can be fit: it contains
estimates of the variances of the observed variables only. In other words, the Independence
model assumes all relationships between the observed variables are zero.
Commonly reported fit statistics are the chi-square (labeled Discrepancy in the output shown
above), its degrees of freedom (DF), its probability value (P), the Tucker-Lewis Index (TLI), and
the Root Mean Square Error of Approximation (RMSEA) and its lower and upper confidence
interval boundaries. There is also a Standardized Root Mean Residual (Standardized RMR)
available through the Tools, Macro menu, but it is important to note that this fit index is only
available for complete datasets (it will not be printed for databases containing incomplete data).
Various rules of thumb for each of these fit statistics exist. These rules of thumb change as
statisticians publish new simulation studies that further document the behavior of various
measures of fit. The chi-square test is an absolute test of model fit: If the probability value (P) is
below .05, the model is rejected. The other measures of fit are descriptive. Hu and Bentler (1999)
recommend RMSEA values below .06 and Tucker-Lewis Index values of .95 or higher. Since the
RMSEA for this model is .11 and the Tucker-Lewis Index value is .92, the model does not fit
well according to the descriptive measures of fit.
The final column in the Fit Measures table is labeled Macro and contains the name of the
corresponding macro variable for each fit statistic reported by AMOS in the Fit Measures table.
These macro variables may be specified as part of the path diagram display if you want to
display a fit index value as part of the AMOS diagram output. For example, you could use the
CMIN macro variable to display the fitted models chi-square on the diagram. Macro variables
are used in the diagrams title, which you can specify by selecting the Title button from the
AMOS toolbar.
Formulas for the discrepancy fit functions used to generate the chi-square test of overall model
fit and the descriptive model fit statistics may be found in the AMOS manual and also in the
AMOS program help files.
39
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The Threshold for Modification Indices allows you to specify what level of chi-square change
is required for a path to be included in the modification index output. The default value is 4.00
because it slightly exceeds the tabled critical value of a chi-square distribution with one degree
of freedom: 3.84. Any additional parameter estimated by AMOS should result in an expected
reduction in the model chi-square of at least 3.84. The modification index results appear below.
40
The Division of Statistics + Scientific Computation, The University of Texas at Austin
All possible variances were estimated, so there are no unmodeled variances that could be
estimated in a modified or revised model. Thus, the Variances section contains no model
modification information. There are, however, possible regression weights and covariances that
can be incorporated into a re-specified model that would result in substantial changes in the
model fit chi-square test statistic.
The largest modification index values are found in the first four pairs of residual covariances. For
example, the covariance of e4 with e6 is expected to be .901 if you were to re-specify the model
with that covariance added and then refit the model. That models chi-square test of overall fit
should be approximately 30.127 units lower than the present models value of 76.102.
Should you allow the four pairs of error covariances to be estimated in a modified model? From
looking at modification index results, the answer appears to be Yes, but it is wise to reconsider
the conceptual implications of model modification before you proceed further. It is important to
understand that when you modify a model based upon the modification index output, you are:
1. Re-specifying your model
41
The Division of Statistics + Scientific Computation, The University of Texas at Austin
As a practical consideration, it is also worth noting that AMOS provides modification index
output only when complete data are input into the program. In other words, you cannot obtain
modification index information when you use missing data with AMOS.
Can you apply these principles to the model from the current example? Yes. In examining the
model, you can see that there are two instruments that have multiple measurements: Anomia and
Powerlessness are measured in 1967 and 1971. Since these data come from the same research
participants, it seems reasonable to conclude that there may be shared variance between Anomia
in 1967 and Anomia in 1971 that is not captured by the present model. Similarly, there may also
be shared variance between Powerlessness in 1967 and Powerlessness in 1971 that is not
accounted for in the present model.
You can correlate the residuals of these two sets of variables to incorporate these sets of shared
variance into the model. To correlate the residuals, return to the AMOS Graphics window and
select the Draw Covariances tool, represented by a double-headed arrow.
42
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Draw a correlation between e3, the error term for Anomia in 1967, and e5, the error term for
Anomia in 1971. Repeat the process for the Powerlessness residuals.
Hint: You can alter the amount of curvature in the correlation lines by using the Shape Change
tool.
43
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Inclusion of the two correlated residuals results in a substantial drop in the model fit chi-square
from 76.1 to 7.8. Notice that degrees of freedom of the chi-square test is reduced from six
degrees of freedom to four degrees of freedom. The two degrees of freedom change occurs
because each parameter estimated by the model consumes one degree of freedom; since you
included two new parameters in the modified model (the two residual correlations), the new
model has two fewer degrees of freedom remaining.
44
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The chi-square value of 7.817 with four degrees of freedom is non-significant at the .05 level: its
p-value is .099. This finding suggests that model fits the data acceptably in the population from
which Wheaton et al. drew their sample. Corroborating evidence is provided by the RMSEA fit
statistic the obtained value of .032 is well below the desired .06 cutoff. Similarly, the TuckerLewis Index result of .993 is considerably above the .95 threshold denoting satisfactory model
fit.
Once you obtain a model that fits well and is theoretically consistent, you may interpret the
parameter estimates and individual tests of significance of each parameter estimate. AMOS
provides two ways for you to examine parameter estimates. One method uses the path diagram
output to visually display the parameter estimates while the other approach uses tables similar to
those containing the overall model fit statistics.
45
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Clicking on the up-arrow will cause AMOS to display the parameter estimates. Unstandardized
or standardized estimates can be chosen by clicking on the appropriate selection. Standardized
estimates are selected in the figure shown below.
Clicking on the down-arrow returns you to the AMOS Graphics drawing interface, where you
can modify your existing model and then re-run it, or you can open a new model or pre-existing
model file. When you click on the up-arrow, the following parameter estimates are displayed as
part of the output.
46
The Division of Statistics + Scientific Computation, The University of Texas at Austin
The values associated with each path are standardized regression coefficients. These values
represent the amount of change in Y given a standard deviation unit change in X. (The
corresponding unstandardized coefficients represent the amount of change in Y given a single
raw score unit change in X).
AMOS also prints the R2 values for each dependent or mediating variable above the variable. For
example, the R2 value for Anomia67 is .62. Although AMOS does a good job of laying out the
coefficients in its default display, it may be necessary from time to time to move a particular
parameter estimate value so that the drawing appears less cluttered and more easily interpretable.
To move a parameter on the output diagram, use the Move Parameter tool.
Select the tool and move your mouse pointer over the offending variable until it is highlighted in
red. Then click and pull the mouse in a direction you think would allow the parameter estimate
value to be displayed more appropriately. A good choice in the present diagram is the .39 R 2
47
The Division of Statistics + Scientific Computation, The University of Texas at Austin
value for the Alienation 1967 latent variable. Currently, it is partially hidden by the SES to
Alienation 1967 path. By dragging the parameter estimate object slightly to the right of the path
object, you can see the .39 value unobstructed.
A nice feature of AMOS is its high-quality graphical output. You can take this output and copy it
to the Windows clipboard. From there you can insert it into a word processor such as MS Word
or a presentation package like MS PowerPoint.
This model has several interesting features worth noting. First, it contains both latent
(unobserved) and manifest (observed) variables. Second, it contains both causal relationships
among latent variables, represented by single-headed arrows, and correlational or bi-directional
relationships among several of the residuals. These are represented by the dual-headed arrows
connecting e3 with e5 and e4 with e6, respectively. As discussed above, because the two anomia
and powerlessness measures are identical and measured on the same research participants across
time, it makes sense that they share variance due to causes not accounted for by the alienation
48
The Division of Statistics + Scientific Computation, The University of Texas at Austin
latent factors. The correlations between the residuals accounts for that additional shared
variance.
The table displays the unstandardized estimate, its standard error (abbreviated S.E.), and the
estimate divided by the standard error (abbreviated C.R. for Critical Ratio). The probability
value associated with the null hypothesis that the test is zero is displayed under the P column. All
of the regression coefficients in this model are significantly different from zero beyond the .01
level.
Standardized estimates allow you to evaluate the relative contributions of each predictor variable
to each outcome variable. The standardized estimates for the fitted model appear below.
49
The Division of Statistics + Scientific Computation, The University of Texas at Austin
There is not much difference between the standardized and unstandardized coefficients in this
example, probably because the units are derived from survey measurement items. By contrast,
variables with very different measurement scales entered into the same model can result in sharp
discrepancies between the standardized and unstandardized regression coefficient output.
50
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Hoyle and Panter (in Hoyle, 1995) and Hatcher (1994) provide excellent discussions of how to
write the results of structural equation models for publication in journals and textbooks. Hoyle
and Panter provide guidelines for writing about structural equation models whereas Hatcher
provides sample text from a mock write-up of the results of a SEM analysis ostensibly performed
for a manuscript to be submitted for publication in a scholarly journal. Both texts full citations
appear in the References section, shown below, and are available through the UT library system.
In this course you learned
what SEM is and where to locate SEM resources in print and on line.
SEM nomenclature.
the mathematical basis of SEM and its assumptions.
how to draw a model using AMOS.
how to run the AMOS model and evaluate several key components of the AMOS
graphics and text output, including overall model fit and test statistics for individual path
coefficients.
how to modify and respecify a non-fitting model.
References
Benter, P. M. & Chou, C. P. (1987). Practical issues in structural modeling. Sociological
Methods and Research, 16(1), 78-117.
Bollen, K. A. (1989). Structural equations with latent variables. New York, NY: John Wiley &
Sons.
Hatcher, L. (1994). A step-by-step approach to using the SAS system for factor analysis and
structural equation modeling. Cary, NC: SAS Institute.
Hoyle, R. (1995). Structural equation modeling : concepts, issues and applications. Thousand
Oaks, CA: Sage Publications.
Hu, L. & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis:
Conventional criteria versus new alternatives. Structural Equation Modeling, 6(1), 1-55.
Little, R. J. A. & Rubin, D. A. (1987). Statistical analysis with missing data. New York NY:
John Wiley & Sons.
Loehlin, J. C. (1992). Latent variable models. Hillsdale, NJ: Lawrence Erlbaum Publishers.
Rigdon, E. (1997). Approaches to testing identification.
http://www.gsu.edu/~mkteer/identifi.html
51
The Division of Statistics + Scientific Computation, The University of Texas at Austin
Roth, P. (1994). Missing data: A conceptual review for applied psychologists. Personnel
Psychology, 47, 537-560.
Stevens, J. (1996). Applied multivariate statistics for the social sciences. Mahwah, NJ: Lawrence
Erlbaum Publishers.
Wheaton, B., Muthn, B., Alwin, D., & Summers, G. (1977). Assessing reliability and stability
in panel models. In D.R. Heise (Eds.): Sociological Methodology. San Fransisco: Jossey-Bass.
52
The Division of Statistics + Scientific Computation, The University of Texas at Austin