Live issue on things to address in the documentation #132

Open
7 tasks
epiben opened this issue Aug 14, 2024 · 0 comments
Assignees
Labels
documentation Improvements or additions to documentation

Collaborator

epiben commented Aug 14, 2024

This issue will be updated continuously as we realise things that need to be explained, addressed or justified in the documentation of the data. Items won't be removed but ticked off once they are included in the documentation, whose contents live in the docs branch.

  • MEASUREMENT.value_as_concept_id will use just 45878416 Blood; users who want to distinguish venous from arterial blood will need to use the SPECIMEN table.
  • The type_concept_id for data coming from LABKA and BCC is 32856.
  • Persons from the source are also filtered so that d_status_hen_start is later than both the date of birth and the beginning of the registry. This avoids including patients who died before the registry started as well as patients who do not have enough follow-up.
  • When using quant clinical drugs, we always use the one that has the smallest volume and is shared across concentrations for the same ingredient. If no such concept exists, we fall back to the smallest volume on a per-concentration basis.
  • These data must be analysed with a Central European time zone (e.g. Europe/Copenhagen); otherwise the date parts of start_date and start_datetime won't match (the same goes for end_date and end_datetime).
  • If a medication is registered in both 3-files and 8-/9-files, we assume these are different drugs even if they might actually be the same, as there's no way to disentangle this in the ETL pipeline. These kinds of decisions are deferred to the analytic phase.
  • Conditions only have eras if they have a valid era_lookback_interval value in concept_lookup_stem. This is because we want finer control and don't want to rely too much on environment variables. Drugs use a default era_lookback_interval, which can be controlled by an environment variable, although it's unlikely ever to be changed.
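
To illustrate the time zone item above, here is a minimal Python sketch of how the date part of a datetime can shift when the same instant is read in another time zone. The timestamp is made up for illustration, and the sketch assumes that datetimes are handled as time-zone-aware instants; how the data are actually stored and read is not stated in this issue.

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo

COPENHAGEN = ZoneInfo("Europe/Copenhagen")


def date_part(instant, tz):
    """Return the calendar date of a time-zone-aware instant as seen in tz."""
    return instant.astimezone(tz).date()


# Hypothetical start_datetime: 00:30 local time in Copenhagen on 15 Jan 2020
start_datetime = datetime(2020, 1, 15, 0, 30, tzinfo=COPENHAGEN)

local_date = date_part(start_datetime, COPENHAGEN)
utc_date = date_part(start_datetime, timezone.utc)

print(local_date)  # 2020-01-15, the date a matching start_date would hold
print(utc_date)    # 2020-01-14, one day earlier when read in UTC
```

Only events shortly after local midnight are affected, so a mismatch of this kind is easy to miss in spot checks.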
@epiben epiben self-assigned this Aug 14, 2024
@epiben epiben added the documentation Improvements or additions to documentation label Aug 14, 2024