
Creates a standardized household income target variable for household-level weighting and expansion, using either PUMS or survey input. Use when preparing income targets for synthetic population or survey analysis.

Usage

prep_target_income(
  h_data,
  p_data,
  target_name = "h_income",
  codebook,
  settings
)

Arguments

h_data

data.table. Household-level input. Required columns:

  • For PUMS: must include SERIALNO and income column as specified in settings.

  • For survey: must include income column as specified in settings.

Rows: one per household. Modified by reference: no (returns copy).

p_data

data.table. Person-level input (not used, included for interface consistency).

target_name

character(1). Name of the target variable to create (default: "h_income").

codebook

data.table. Codebook for variable mapping; must include income value and label columns.

settings

list. Project settings; must include targets[[target_name]] with levels, pums_input, and survey_input.

Value

data.table. Copy of household-level input with new target variable column (target_name).

  • Columns: all original plus target_name (character)

  • Values: standardized income bins

  • Row order preserved

Details

  • Detects input type (PUMS vs. survey) by presence of SERIALNO column in h_data.

  • If no target levels are specified in settings, infers the income bins from the codebook.

  • Applies cut_and_label to bin income into target levels.

  • Renames output column to target_name (default: h_income).

  • Returns a copy of the input data.table with the new target variable.

  • Error handling: stops if empty income bins are detected.
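
The steps above can be sketched roughly as follows. This is an illustration only, not the package's implementation: base R cut() stands in for cut_and_label, whose signature is not documented here, and the column name HINCP, the bin edges, and the labels are all assumptions made for the example. "Empty income bins" is interpreted here as a level that receives no households, which may differ from the check the function actually performs.

```r
library(data.table)

# Hypothetical household table; the SERIALNO column marks it as PUMS input.
h_data <- data.table(
  SERIALNO = c("001", "002", "003", "004"),
  HINCP    = c(12000, 48000, 95000, 250000)
)

# Assumed bin edges and labels, standing in for settings and codebook content.
breaks <- c(0, 25000, 75000, 150000, Inf)
labels <- c("<25k", "25k-75k", "75k-150k", "150k+")
target_name <- "h_income"

# Detect the input type by the presence of SERIALNO.
is_pums    <- "SERIALNO" %in% names(h_data)
income_col <- if (is_pums) "HINCP" else "income"  # assumed column names

# Work on a copy so the caller's table is not modified by reference.
out <- copy(h_data)

# Bin income into left-closed intervals: [0, 25000), [25000, 75000), ...
binned <- cut(out[[income_col]], breaks = breaks, labels = labels,
              right = FALSE, include.lowest = TRUE)

# Stop if any level received no households (assumed meaning of "empty bins").
counts <- table(binned)
if (any(counts == 0)) {
  stop("Empty income bins: ", paste(names(counts)[counts == 0], collapse = ", "))
}

# Add the target column as character; row order is unchanged.
out[, (target_name) := as.character(binned)]
```

Because the column is added to a copy, h_data keeps its original columns while out gains h_income.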

Settings

  • targets[["h_income"]] (direct): must include levels, pums_input, and survey_input.
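
A settings entry with the three required elements might look like the sketch below. The element names come from this page; the values are assumptions for illustration. In particular, the format expected for levels is not documented here, and pums_input and survey_input are assumed to hold the name of the income column in each data source.

```r
# Hypothetical project settings; only the element names are taken from this page.
settings <- list(
  targets = list(
    h_income = list(
      levels       = c("<25k", "25k-75k", "75k-150k", "150k+"),  # assumed label format
      pums_input   = "HINCP",    # assumed: income column in the PUMS household file
      survey_input = "income"    # assumed: income column in the survey household file
    )
  )
)

# Check that the required elements are present before calling the function.
required <- c("levels", "pums_input", "survey_input")
missing_elements <- setdiff(required, names(settings$targets[["h_income"]]))
```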

Examples

## Not run:
prep_target_income(h_data, p_data, target_name = "h_income", codebook, settings)
## End(Not run)