Categorical Data Analysis

Website for CATEGORICAL DATA ANALYSIS, 3rd edition

For the third edition of Categorical Data Analysis by Alan Agresti (Wiley, 2013), this site contains (1) information on the use of software (SAS, R and S-Plus, Stata, SPSS, and others), (2) data sets for examples and many exercises (for many of which only excerpts were shown in the text itself), (3) short answers for some of the exercises, and (4) corrections of errors in early printings of the book. Also, there's

1. Software Appendix

In this appendix we provide details about how to use R, SAS, Stata, and SPSS statistical software for categorical data analysis, with examples in many cases showing how to perform analyses discussed in the text. This supplements the brief description found in Appendix A of the "Categorical Data Analysis" text, 3rd edition, Wiley (2013). For each package, the material is organized by chapter of presentation and refers to datasets analyzed in those chapters. The full data sets are available at datasets.

SAS

Go to SAS for a pdf file containing details about the use of SAS for CDA, with illustrations for data sets in the CDA text.

R and S-Plus

Go to R for a pdf file containing details about the use of R for CDA, with illustrations for data sets in the CDA text. Here is a manual that Dr. Laura Thompson prepared on the use of R and S-Plus to conduct all the analyses in the 2nd edition of the CDA text.

Stata

Go to Stata for discussion of using Stata for CDA.

SPSS

Go to SPSS for discussion of using SPSS for CDA.

Other software

Go to other software for discussion of other software useful for CDA, such as StatXact and LogXact.

2. Primary datasets:

Here are datasets for many of the main examples in the text, and for some of the exercises. See data files for some individual files (Crabs for Table 4.3, Teratology for Table 4.7, Credit for Exercise 5.22, Endometrial for Table 7.2, Infection for Table 6.9, SoreThroat for Table 6.15, Substance use for Table 9.3, MBTI for Table 9.17, Substance2 for Table 10.1, Insomnia for Table 12.3, Abortion for Table 13.3). The horseshoe crab data are used to illustrate logistic regression (modeling whether a female crab has at least one satellite) and models for count data (e.g., negative binomial modeling of the number of satellites). For the count data, better models allow zero-inflation, as discussed in my book "Foundations of Linear and Generalized Linear Models" (published by Wiley, 2015).
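As a rough illustration of the binary-response analysis mentioned for the horseshoe crab data, here is a minimal Python sketch. It is not taken from the text or from the software appendix. The records in `toy_crabs` are invented for illustration and are not the Crabs data file, and the helper `fit_logistic` is a hypothetical name for a plain Newton-Raphson fit of a one-predictor logistic regression; in practice one would use the R, SAS, Stata, or SPSS routines described in the appendix.

```python
import math

# Hypothetical toy records: (carapace width in cm, number of satellites).
# These values are invented for illustration; they are NOT the Crabs data file.
toy_crabs = [
    (22.5, 0), (23.8, 0), (24.3, 3), (24.7, 0), (25.2, 0), (25.8, 4),
    (26.0, 0), (26.5, 5), (27.1, 2), (28.3, 8), (29.0, 0), (30.0, 6),
]


def fit_logistic(x, y, iterations=25):
    """Fit logit P(y = 1) = a + b * (x - mean(x)) by Newton-Raphson.

    Hypothetical helper for this sketch; returns (a, b, mean(x)).
    """
    x_bar = sum(x) / len(x)
    xc = [xi - x_bar for xi in x]  # centering keeps the 2x2 system well conditioned
    a, b = 0.0, 0.0
    for _ in range(iterations):
        p = [1.0 / (1.0 + math.exp(-(a + b * xi))) for xi in xc]
        # Score vector (gradient of the log-likelihood).
        g0 = sum(yi - pi for yi, pi in zip(y, p))
        g1 = sum((yi - pi) * xi for yi, pi, xi in zip(y, p, xc))
        # Information matrix (negative Hessian), with weights p * (1 - p).
        w = [pi * (1.0 - pi) for pi in p]
        h00 = sum(w)
        h01 = sum(wi * xi for wi, xi in zip(w, xc))
        h11 = sum(wi * xi * xi for wi, xi in zip(w, xc))
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system by the explicit inverse.
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b, x_bar


widths = [width for width, _ in toy_crabs]
# Binary response: 1 if the crab has at least one satellite, 0 otherwise.
has_satellite = [1 if satellites > 0 else 0 for _, satellites in toy_crabs]

a, b, x_bar = fit_logistic(widths, has_satellite)
print(f"logit P(at least one satellite) = {a:.3f} + {b:.3f} * (width - {x_bar:.2f})")
```

The same pattern applies to the actual data file once it is read in: the satellite count is collapsed to an indicator for the logistic model, while the count itself is kept as the response for the negative binomial and zero-inflated models.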

3. Selected short solutions to exercises:

Here is a pdf file of short solutions for some of the exercises at the ends of the chapters. These are mainly the solutions that were provided for some of the odd-numbered exercises from the 2nd edition of the book. Please report errors to AA@STAT.UFL.EDU so that they can be corrected in future revisions of this site. The author regrets that he cannot provide solutions to exercises not in this file.

4. Corrections:

Here is a pdf file showing corrections of typos/errors in the third edition.


Copyright © 2021, Alan Agresti.