-
Notifications
You must be signed in to change notification settings - Fork 355
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Default demini load fails in 1.0.7 germline variant2 GRCh37 for multisample projects because it uses a vcf file which is not annotated #2225
Comments
Sergey; |
Thanks Brad! I was using the old way (default in GRCh37), and in 1.0.5 (and before), it first did VEP and then gemini load.
So it annotates with vcfanno not VEP, but tries to load in the old way? I've switched to the new way with I'm fine with the new loader if it supports all VEP fields. Maybe it is just worth mentioning in the docs, that without Sergey |
Other projects finished just fine with the old loading method. So something was wrong with one project. Closing. |
I have this error in every multi-sample project, i.e. when you have more than one sample in a batch, the old (default) gemini loading does not work. Adding vcfanno: [gemini] helps. |
There is a difference between tables generated by vcf2db and gemini load that influences the downstream parsing, I put it here: quinlan-lab/vcf2db#36. So, if possible, it would be useful to get old gemini load working for multisample projects. |
Sergey; |
Thanks Brad! A header of the 1031-freebayes-decompose.vcf.gz
So it was indeed annotated with VEP, and the annotation then removed? Sergey |
Sergey; |
Thanks Brad for looking into this issue! I've upgraded to the latest dev and removed work/gemini dir. Unfortunately, is not working for me:
The same error,
So it is a naked vcf file as input to gemini load, and it fails to load, because it wants VEP annotation. I need the old loading, because vcfanno/vcf2db in bcbio produces different field names in gemini.db, and some fields are missing compared to VEP/gemini load (quinlan-lab/vcf2db#36). I definitely would like to switch to superfast vcf2db by default, but I need all fields. How you'd suggest to fix this? I see that I can modify gemini.conf/gemini.lua in GRCh37/config/vcfanno. For now I'm just reloading the gemini database after bcbio run with gemini load using the VEP-annotated vcf, but I am looking for a more smooth solution. SN |
Sergey; |
Thanks Brad. It works now. |
Hello!
Thanks for the excellent pipeline.
In 1.0.7 gemini load command fails (unnecessary details omitted):
It looks like because it uses 1031-freebayes-nomultiallelic.vcf.gz, which is a soft link
to 1031-freebayes-decompose.vcf.gz and it is not annotated.
Should be using 1031-freebayes-nomultiallelic-annotated-gemini.vcf.gz instead?
Could you please fix that?
Sergey
The text was updated successfully, but these errors were encountered: