This repo includes support files for phecodes, including:
- Mapping files which specify the ICD codes that define a phecode
- Information files which include descriptions of phecodes (e.g. phecode strings and phecode categories)
Here is a description of each file:
The phecodeX_info.csv file includes information related to each phecode, including the phecode string, category, and columns indicating sex-specificity and ICD-10 only status. The columns are as follows:
phecode The phecode label (two letters, "_", and numeric phecode)
phecode_string A descriptive label for phecode
category_num A numeric value corresponding to the phecode category
category A string indicating the phecode category
sex A string with values "Both", "Female", or "Male."
Sex specific phecodes (e.g. Prostate cancer) are labeled accordingly
icd10_only A Boolean value: 1 if the phecode is defined only by ICD-10 codes;
0 if the phecode is defined by both ICD-9 and -10 codes
phecode_num The unique numeric component of the phecode label. Can be useful for sorting.
This file includes the ICD-9 and -10 codes that define each phecodes. All codes are “unrolled” meaning that phecode ID_002.1 implies ID_002. This file is particularly useful for generating phecodes from ICD tables using join my mysql or merge in R. The columns are as follows:
phecode The phecode label
ICD The ICD code included in the phecode grouping
vocabulary_id A string indicating the type of ICD code (ICD9CM or
ICD10CM)
Use phecodeX_unrolled_ICD_CM.csv for the clinical modification of ICD used in the United States. Use phecodeX_unrolled_ICD_WHO.csv WHO ICD-10 codes.
A highly descriptive file that includes both phecode strings and ICD descriptions. This file is “flat” (i.e. not “unrolled”) such that child codes are not mapped to parents. This file is useful for examining which ICDs inform a particular phecode. The columns are as follows:
phecode The phecode label
phecode_string A descriptive label for phecode
category_num A numeric value corresponding to the phecode category
category A string indicating the phecode category
ICD The ICD-9 or ICD-10 code
vocabulary_id A string indicating the type of ICD code (ICD9CM, ICD10CM, or ICD10)
ICD_string A string description of the ICD code
ICD_chapter The chapter of the ICD code
Use phecodeX_ICD_CM_map_flat.csv for the clinical modification of ICD used in the United States. Use phecodeX_ICD_WHO_map_flat.csv WHO ICD-10 codes.
For files that are compatible with the R PheWAS package, along with example code, use this link.