Calculate survey confidence intervals by zone group and region

Joins weights and crosswalks to the survey seed data, then calculates confidence intervals summarized by zone group and by region. Use it to report survey fit and uncertainty.

Usage

calc_survey_ci(
  seed,
  weights,
  zone_group_crosswalk,
  puma_client_zone_xwalk,
  ci_level = 0.9,
  run_label,
  settings
)

Arguments

seed

data.table with required columns:

hh_id_dow — household-day ID
segment_name

Rows: one per household-day. Keys: (hh_id_dow). Modified by reference: no (returns copy).

weights

data.table with required columns:

hh_id_dow
zone_group_balanced_weight

Rows: one per household-day. Keys: (hh_id_dow). Modified by reference: no (returns copy).

zone_group_crosswalk

data.table. Zone group crosswalk.

puma_client_zone_xwalk

data.table. PUMA to client zone crosswalk.

ci_level

numeric(1). Confidence interval level (e.g., 0.9).
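
To illustrate how a level such as 0.9 typically becomes interval bounds, the sketch below assumes a two-sided normal-approximation interval. The source does not state which interval method calc_survey_ci uses, and mean_est and se_est are made-up values for illustration only.

```r
# Assumption: two-sided normal-approximation interval (not confirmed by the source).
ci_level <- 0.9
alpha <- 1 - ci_level
z <- qnorm(1 - alpha / 2)  # critical value, about 1.645 for a 90% interval

# Hypothetical point estimate and standard error, for illustration only
mean_est <- 2.5
se_est <- 0.2

lower <- mean_est - z * se_est
upper <- mean_est + z * se_est
c(lower = lower, upper = upper)
```

A higher ci_level gives a larger critical value and therefore a wider interval around the same estimate.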

run_label

character(1). Run label for output files.

settings

list. Must include:

outputs_dir — output directory
seed_wtd_path — seed weighted path

Value

list with two data.tables:

zone_group: survey confidence intervals by zone group. Columns:
- variable — variable name
- day_group — day group label
- mean — mean value
- lower — lower confidence bound
- upper — upper confidence bound
- Additional columns as needed for reporting
region: survey confidence intervals by region. Columns:
- variable — variable name
- day_group — day group label
- mean — mean value
- lower — lower confidence bound
- upper — upper confidence bound
- Additional columns as needed for reporting

Details

Checks if outputs are up-to-date; re-runs if needed.
Joins weights to seed and adds zone group via crosswalks.
Summarizes survey data by zone group and region using helper functions.
Drops transit boardings for region summary.
Returns a list of data.tables for zone group and region.
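
The join-and-summarize steps above can be sketched roughly as follows. This is a minimal illustration, not the package's implementation: the zone_group and trips columns, the wtd_ci helper, and the variance formula (a simple linearization of a weighted mean with a normal approximation) are all assumptions, and the real function derives the zone group from the crosswalks rather than from a column on seed.

```r
library(data.table)

# Toy inputs; `zone_group` and `trips` are hypothetical columns for illustration
seed <- data.table(
  hh_id_dow  = c("H1", "H2", "H3", "H4"),
  zone_group = c("A", "A", "B", "B"),
  trips      = c(2, 4, 1, 5)
)
weights <- data.table(
  hh_id_dow = c("H1", "H2", "H3", "H4"),
  zone_group_balanced_weight = c(1.2, 0.8, 1.0, 1.0)
)

# Join weights to seed on the household-day key; merge() returns a new table
seed_wtd <- merge(seed, weights, by = "hh_id_dow")

# Hypothetical helper: weighted mean with an approximate confidence interval
wtd_ci <- function(x, w, ci_level = 0.9) {
  m <- sum(w * x) / sum(w)
  # Assumed variance estimator: linearization of the weighted mean
  se <- sqrt(sum(w^2 * (x - m)^2)) / sum(w)
  z <- qnorm(1 - (1 - ci_level) / 2)
  list(mean = m, lower = m - z * se, upper = m + z * se)
}

# One row per zone group, then a single row for the whole region
by_zone <- seed_wtd[, wtd_ci(trips, zone_group_balanced_weight), by = zone_group]
region  <- seed_wtd[, wtd_ci(trips, zone_group_balanced_weight)]

result <- list(zone_group = by_zone, region = region)
```

Returning both tables in one named list mirrors the documented return shape, so a caller can pick result$zone_group or result$region as needed.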

Settings

outputs_dir (direct): Output directory. Default from settings.
seed_wtd_path (direct): Seed weighted path. Default from settings.
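
The source says only that the function checks whether its outputs are up to date. One common way to do such a check is to compare file modification times, sketched below. The helper name outputs_current and the temporary files are hypothetical; the actual rule used by calc_survey_ci may differ.

```r
# Hypothetical helper: TRUE when every output exists and is at least as new
# as the newest input. Missing inputs cannot be verified, so treat as stale.
outputs_current <- function(input_paths, output_paths) {
  if (!all(file.exists(output_paths))) return(FALSE)
  if (!all(file.exists(input_paths))) return(FALSE)
  min(file.mtime(output_paths)) >= max(file.mtime(input_paths))
}

# Demonstration with temporary files standing in for the seed and an output
in_file  <- tempfile(fileext = ".rds")
out_file <- tempfile(fileext = ".csv")
saveRDS(1:3, in_file)

stale_before <- !outputs_current(in_file, out_file)  # output not written yet
writeLines("x", out_file)
fresh_after <- outputs_current(in_file, out_file)    # output written after input
```

Under this assumed rule, a caller would recompute only when the check returns FALSE and otherwise read the existing files from the output directory.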

Examples

## Not run:
library(data.table)
seed <- data.table(hh_id_dow = "H1", segment_name = "A")
weights <- data.table(hh_id_dow = "H1", zone_group_balanced_weight = 1.2)
zone_group_crosswalk <- data.table()
puma_client_zone_xwalk <- data.table()
ci_level <- 0.9
run_label <- "test"
settings <- list(outputs_dir = "out", seed_wtd_path = "seed.rds")
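# Note: this minimal settings list omits `working_dir`; the error below
# shows that the function also looks it up in `settings`.
calc_survey_ci(
  seed, weights, zone_group_crosswalk, puma_client_zone_xwalk,
  ci_level, run_label, settings
)
#> Error in get("working_dir", settings): object 'working_dir' not found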
## End(Not run)