Skip to content

pre-derive variables in gcp #121

@rebexxxxxx

Description

@rebexxxxxx

there are quite a few variables that could be derived in GCP, reducing the amount of R code and raw data downloaded. The variables im thinking about in particular are the following (contained in the clean_data.R file:

  1. d_827220437 -> apply hospital labels to this in GCP
  2. sex -> this relies on 2 variables state_d_706256705 and state_d_435027713, simple if statements in BQ would handle the definition of this
  3. biocol_type -> first, rename this to something that makes sense. second, this variable relies on if statements of the following 3 variables: d_878865966, d_167958071, d_684635302. very easily implemented in BQ.
  4. Msrv_complt -> exact same case as previous case, except with a few other variables
  5. income -> this just needs labels for the income brackets.
  6. race-> for invited participants only

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions