-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HIV Acquisition Meta-Analysis #12
Comments
We recently added WIHS3 to the latest iteration of metas. These can be found on S3 at: |
HIV Acquisition Meta-Analysis with TOPMed Imputed Data0028 AFR+AMR+EUR (N=12,617)
|
Troubleshooting
details
detailsI redownloaded each set of TOPMed GWAS results from S3 to double check that we were using the correct results. This checks out. Also, confirm GWAS results in the HIV bucket were correctly copied from the associated Cromwell output.When each GWAS is complete, the results are copied over from the Cromwell bucket to the appropriate project bucket in the S3 organizational structure. We should verify that the GWAS results in the HIV bucket were copied over correctly. This checks out.
detailsWe needed to verify that the meta-analysis pipeline (bash script) we have been using is correct. We attempted to replicate the previous 1000g_p3 HIV acquisition GWAS meta-analysis results. The results were consistent, there for the pipeline is correct.
detailsDuring our lab meeting on Wednesday (9/23/2020) we discussed applying a stricter rsq threshold. It was suggested that since we haven't had much experience with TOPMed imputed data, the same filters we apply to 1000g_p3 data might not be directly applicable. We will therefore experiment with applying a stricter rsq threshold. In particular, we will use the GWAS results with rsq > 0.8 applied.
details
details
After GC was applied to each cohort/chromosome.
detailsNo GC
GC applied to each cohort
detailsInvestigate whether any one cohort is disproportionately contributing to the inflation.
|
TL;DR(1) For the 1000g_p3 GWAS, all the WIHS3 subjects were also included within WIHS2. (2) Including WIHS3 in the 1000g_p3 meta did inflate the results some, but it was not as pronounced as this current TOPMed meta. (3) ProbABEL was used to run the 1000g_p3 GWAS for both WIHS2 and WIHS3 (as opposed to RVTEST which we used for the TOPMed GWAS). The individual GWAS results were deflated for both when using ProbABEL. This could explain why adding WIHS3 to the 1000g_p3 meta did not inflate the results as much as it did in the TOPMed meta. The inflation is most evident in the AFR-specific results (go figure). All of WIHS3 was in WIHS2 for 1000g_p3When comparing the phenotype files for WIHS2 and WIHS3 from the original 1000g_p3 results, we see that every subject in WIHS3 was within WIHS2.
command line
Inflation observed when adding WIHS3 to 1000g_p3 metaWe do observed some inflation when adding WIHS3 to the 1000g_p3 results. The lambda values do not increase as markedly as with the TOPMed imputed results, but one can observed, from visual inspection, a clear shift above the line y=x. The most appreciable inflation is observed within the AFR-specific meta-analysis. Click below to expand the 1000g_p3 vs TOPMed plots for each meta. It should also be noted that the TOPMed imputed data has more coverage. In particular, when comparing the final SNP count of the cross-ancestry meta-analysis (AFR+AMR+EUR) we see that the TOPMed results have over a million more observations.
ProbABEL GWAS results were deflatedWIHS2 and WIHS3 1000g_p3 were both ran using the ProbABEL GWAS software. This is apparent from looking at the summary stats headers and also the file names (e.g. palogist in the name which is a ProbABEL term for logistic regression). The lambda values for WIHS2 and WIHS3 were 1.017 and 1.01, respectively. Though these values do not raise any red flags, we do observe in both the Manhattan plots and the QQ plots below (click to expand) there are some striking similarities. For example, it is evident that the points in the QQ plots for both WIHS2 and WIHS3 are systematically shifted below the line y=x. This characteristic of being deflated could explain why the 1000g_p3 metas were not as markedly inflated as we observed within the TOPMed meta-analysis results. 1000g_p3 ProbABEL GWAS plotsplot locations:
|
Performing meta-analyses HIV Acquisition.
The parent issue is GitHub Issue 97.
Analysis Description
hiv-acquisition-gwas-meta-analysis-description.xlsx
The text was updated successfully, but these errors were encountered: