Creates a standardized household size target variable for household-level weighting and expansion, using either PUMS or survey input. Use when preparing household size targets for synthetic population or survey analysis.

Usage

prep_target_h_size(h_data, p_data, target_name = "h_size", codebook, settings)

Arguments

h_data

data.table. Household-level input. Required columns:

  • For PUMS: must include SERIALNO and the household size column named by pums_input in settings.

  • For survey: must include the household size column named by survey_input in settings.

Rows: one per household. Modified by reference: no (returns a copy).

p_data

data.table. Person-level input (not used, included for interface consistency).

target_name

character(1). Name of the target variable to create (default: "h_size").

codebook

data.table. Codebook for variable mapping (not used in this function).

settings

list. Project settings; must include targets[[target_name]] with levels, pums_input, and survey_input.

Value

data.table. Copy of household-level input with new target variable column (target_name).

  • Columns: all original plus target_name (character)

  • Values: standardized household size bins

  • Row order preserved

Details

  • Detects input type (PUMS vs. survey) by presence of SERIALNO column in h_data.

  • Selects the household size column named by pums_input (PUMS) or survey_input (survey) in settings$targets[[target_name]].

  • Applies cut_and_label to bin household size into target levels.

  • Renames output column to target_name (default: h_size).

  • Returns a copy of the input data.table with the new target variable.

  • Error handling: stops if levels do not match expected values.
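The steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the package's implementation: the signature of cut_and_label is not documented here, so base cut() stands in for it, and the function name sketch_prep_h_size, the bin breaks, the level labels, the column names NP and hh_size, and the level-count check are all hypothetical.

```r
library(data.table)

# Hypothetical settings entry; the real format of `levels` may differ.
settings <- list(
  targets = list(
    h_size = list(
      levels       = c("1", "2", "3", "4+"),  # assumed bin labels
      pums_input   = "NP",                     # assumed PUMS column name
      survey_input = "hh_size"                 # assumed survey column name
    )
  )
)

# Stand-in for the documented function; base cut() replaces cut_and_label().
sketch_prep_h_size <- function(h_data, target_name = "h_size", settings) {
  target_list <- settings$targets[[target_name]]

  # Detect input type by the presence of SERIALNO.
  is_pums  <- "SERIALNO" %in% names(h_data)
  size_col <- if (is_pums) target_list$pums_input else target_list$survey_input

  # Assumed breaks for four bins: 1, 2, 3, and 4 or more persons.
  breaks <- c(0, 1, 2, 3, Inf)
  if (length(target_list$levels) != length(breaks) - 1L) {
    stop("levels do not match the expected number of bins")
  }

  out <- copy(h_data)  # work on a copy so the caller's table is untouched
  binned <- cut(out[[size_col]], breaks = breaks, labels = target_list$levels)
  set(out, j = target_name, value = as.character(binned))
  out
}

# PUMS-style input (has SERIALNO) and survey-style input (does not).
pums_h   <- data.table(SERIALNO = c("a", "b", "c"), NP = c(1L, 3L, 6L))
survey_h <- data.table(hh_size = c(2L, 4L))

res_pums   <- sketch_prep_h_size(pums_h, settings = settings)
res_survey <- sketch_prep_h_size(survey_h, settings = settings)
```

Because the sketch calls copy() before adding the column, pums_h and survey_h keep their original columns, which mirrors the "returns a copy" behavior described above.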

Settings

  • targets[["h_size"]] (direct): must include levels, pums_input, and survey_input.
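As a hedged illustration of the shape this settings entry might take, the sketch below builds a minimal list and checks it for the three required fields. The field values and the helper missing_target_fields are hypothetical; only the field names levels, pums_input, and survey_input come from this page.

```r
# Hypothetical settings entry; the values are illustrative assumptions.
settings <- list(
  targets = list(
    h_size = list(
      levels       = c("1", "2", "3", "4+"),
      pums_input   = "NP",
      survey_input = "hh_size"
    )
  )
)

# Hypothetical helper: report which required fields a target entry lacks.
missing_target_fields <- function(settings, target_name = "h_size") {
  required <- c("levels", "pums_input", "survey_input")
  setdiff(required, names(settings$targets[[target_name]]))
}

missing <- missing_target_fields(settings)
```

Running a check like this before calling the function can surface an incomplete settings entry early.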

Examples

## Not run:
prep_target_h_size(h_data, p_data, target_name = "h_size", codebook, settings)
## End(Not run)