All three generators share classroom codes (via seed) so outputs are linkable with linkage_create_master().
Vignettes
Added “Getting Started with ALprekDB” vignette covering installation, codebooks, and a quick-start pipeline.
Added “Building Multi-Year Panel Data” vignette for budget, classroom, and student panel construction workflows.
Added “Cross-Module Linkage and Analysis” vignette demonstrating master dataset creation, transform enrichment, and DuckDB integration.
Added “Package Architecture and Data Dictionary” vignette with S3 class hierarchy, data dictionary, and all 37 validation checks.
Package stats
85 exported functions, 38 R source files, 25 test files, ~954 tests.
R CMD check: 0 errors, 0 warnings, 0 notes.
ALprekDB 0.4.0 (2026-02-06)
New features
Linkage module
Added linkage_classroom_budget() to join classroom and budget panels by school_year + classroom_code, with automatic column deduplication (authoritative source strategy).
Added linkage_aggregate_students() to compute classroom-level summaries from student data (38 aggregate variables including demographics, GOLD gains, chronic absence rates, service density, and eDECA gains).
Added linkage_create_master() to produce a two-level master dataset: $classroom_level (~208 columns) and $student_level (~445 columns), with derived per_child_budget and per_seat_budget.
Added linkage_validate() with 8 diagnostic checks (match rate, orphans, key uniqueness, NA introduction, year coverage, region consistency).
Added classroom_bind_years() with forward-fill imputation for geographic coordinates and year_first_funded within site groups; imputation log tracks every change.
Added budget_read() with auto format detection (legacy ~176-column vs new ~28-column formats).
Added budget_clean() to transform raw Excel data to long-format intermediate with payroll tax proportional allocation (legacy) and Additional Funds 1 & 2 aggregation.
Added budget_validate() with 7 checks including reconciliation diagnostics ($1.00 tolerance).