Categorical Data Analysis
Website for CATEGORICAL DATA ANALYSIS, 3rd edition
For the third edition of Categorical Data Analysis by Alan Agresti
(Wiley, 2013), this site contains (1) information on the use of other
software (SAS, R and S-plus, Stata, SPSS, and others), (2) data sets
for examples and many exercises (for many of which, only excerpts were
shown in the text itself), (3) short answers for some of the
exercises, (4) corrections of errors in early printings of the book. Also,
there's
1. Software Appendix
In this appendix we provide details about how to use R, SAS, Stata,
and SPSS statistical software for categorical data analysis, with
examples in many cases showing how to perform analyses discussed in
the text. This supplements the brief description found in Appendix A
of the "Categorical Data Analysis" text, 3rd edition, Wiley (2013).
For each package, the material is organized by chapter of presentation
and refers to datasets analyzed in those chapters. The full data sets
are available at
datasets.
SAS
Go to SAS for a pdf file
containing details about the use of SAS for CDA, with illustrations
for data sets in the CDA text.
R and S-Plus
Go to R for a pdf file containing
details about the use of R for CDA, and illustrations for data sets in
the CDA text. Here is a
manual that Dr. Laura Thompson prepared on
the use of R and S-Plus to conduct all the analyses in the 2nd edition
of the CDA text.
Stata
Go to Stata for discussion
of using Stata for CDA.
SPSS
Go to SPSS for discussion
of using SPSS for CDA.
Other software
Go to other software
for discussion of other software useful for CDA, such as StatXact and
LogXact.
2. Primary datasets:
Here are
datasets for many of the main examples
in the text, and for some of the exercises. See
data files for some individual
files (Crabs for Table 4.3, Teratology for Table 4.7, Credit for
Exercise 5.22, Endometrial for Table 7.2, Infection for Table 6.9,
SoreThroat for Table 6.15, Substance use for Table 9.3, MBTI for Table
9.17, Substance2 for Table 10.1, Insomnia for Table 12.3, Abortion for
Table 13.3). The horseshoe crab data are used to illustrate logistic
regression (modeling whether a female crab has at least one satellite)
and models for count data (e.g., negative binomial modeling of the
number of satellites). For the count data, better models allow
zero-inflation, as discussed in my book "Foundations of Linear and
Generalized Linear Models" (published by Wiley, 2015).
3. Selected short solutions to exercises:
Here is a pdf file of short
solutions for some of the exercises at the ends of the
chapters. These are mainly the solutions that were provided for some
of the odd-numbered exercises from the 2nd edition of the book.
Please report errors to AA@STAT.UFL.EDU, so they can be corrected in
future revisions of this site. The author regrets that he cannot
provide solutions of exercises not in this file.
4. Corrections:
Here is a pdf file
showing corrections of
typos/errors in the third edition.
Copyright © 2021, Alan Agresti.