Skip to contents

Simple Linear Regression & ANCOVA (Chapters 1–2)

Datasets for introducing simple linear regression, ANCOVA, and the fundamental concepts of controlling for covariates.

gcse
GCSE and London Reading Test Data
classdata_07
Class Survey Data (2007)
pisa2000
PISA 2000 International Reading Assessment Data
civic_ed
Civic Education Study: Pre-Post Survey Data

One-way ANOVA (Chapter 3)

Datasets for categorical predictors, dummy variable coding, and group comparisons via ANOVA as a special case of regression.

reading
Reading Comprehension Instruction Experiment
instruction
Reading Instruction Methods Study

Multiple Regression: Continuous Predictors (Chapter 4)

Datasets for partial regression, standardized coefficients, and the confounding/sign-reversal phenomenon.

crime
Florida County Crime Rates

Interactions (Chapter 5)

Datasets for interaction effects between dummy variables, continuous and dummy variables, and two continuous predictors.

individuals
Bureau of Labor Statistics March 2000 CPS Individual Data
faculty
Faculty Salary Data

Nonlinear Relationships & Model Building (Chapters 6–7)

Datasets for log transformations, polynomial regression, and model selection strategies.

nels_data
National Education Longitudinal Study of 1988 (NELS:88)
hsb_sub
High School and Beyond Subset

Model Diagnostics (Chapter 8)

Datasets for residual analysis, normality checks, influence diagnostics, and collinearity assessment.

hsbs1
High School and Beyond Survey (Full Sample)

Logistic Regression (Chapters 9–11)

Datasets for binary outcomes, odds ratios, Simpson’s paradox, maximum likelihood estimation, and model fit diagnostics.

gss_1
General Social Survey Data
berkeley
UC Berkeley Graduate Admissions Data (Five Departments)
berk_sub
UC Berkeley Graduate Admissions Subset (Engineering and Psychology)
penalty
Death Penalty Sentencing Data
titanic
Titanic Passenger Survival Data
disc
NELS:88 Discipline and School Experiences Study
disc2
NELS:88 Discipline Study with Achievement Scores

Latent Response & GLM (Chapter 12)

Datasets for latent variable formulations, probit vs. logit models, and the generalized linear model framework.

lambert
Lambert Longitudinal Study Data
grades
Essay Grades and Writing Features

Ordinal Response Models (Chapter 13)

Datasets for ordinal logistic/probit regression, threshold models, and the proportional odds assumption.

womenlf
Canadian Women's Labor Force Participation
satisfaction
Satisfaction Survey Data
alcohol1_pp
Adolescent Alcohol Use Person-Period Data

Utility Functions

Helper functions for exploring the package.

list_datasets()
List All Datasets in regdatasets