New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Suggestions to per sample counts 4.1 PR #581

Merged

jkgoodrich merged 19 commits into dm/per_sample_counts_4_1 from jg/suggestions_to_per_sample_counts_4_1

Mar 20, 2024

Contributor

jkgoodrich commented Mar 15, 2024

No description provided.

jkgoodrich added 4 commits

March 15, 2024 11:22


          Add suggested edits to the per sample stats script

7f3423c


          get correct MT for genomes

bd486d7


          Merge branch 'main' of https://github.com/broadinstitute/gnomad_qc in…

fd5a7bb

…to jg/suggestions_to_per_sample_counts_4_1

# Conflicts:
#	gnomad_qc/v4/resources/release.py


          Fixes during testing

jkgoodrich added v4.1 Release Stats labels

jkgoodrich requested review from KoalaQin and matren395

March 15, 2024 18:02

jkgoodrich assigned jkgoodrich, KoalaQin and matren395


          Merge branch 'dm/per_sample_counts_4_1' of https://github.com/broadin…

1f48b90

…stitute/gnomad_qc into jg/suggestions_to_per_sample_counts_4_1

jkgoodrich mentioned this pull request

Calculate variants per sample in exomes and genomes #543

Merged

jkgoodrich added 3 commits

March 15, 2024 12:25


          check for struct expression

26451ce


          remove unneeded hl.struct

84e5302


          add multiple aggregations to avoid a class too large error

f380f0f

KoalaQin requested changes

View reviewed changes

Contributor

KoalaQin left a comment •

edited

Loading

Thanks for all the changes, it's cleaner than ours! My first round of going through without running the test yet, lmk when you fix the "class too large" issue.

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

jkgoodrich and others added 5 commits

March 15, 2024 14:31


          Apply suggestions from code review

3b9c4f1

Co-authored-by: Qin He <[email protected]>


          Use _localize=False in the aggregation

a05b4b1


          Merge branch 'jg/suggestions_to_per_sample_counts_4_1' of https://git…

052262d

…hub.com/broadinstitute/gnomad_qc into jg/suggestions_to_per_sample_counts_4_1


          Checkpoint before print

5c359c3


          fix from testing

6d7e8ec

KoalaQin requested changes

View reviewed changes

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Show resolved Hide resolved

jkgoodrich commented

View reviewed changes

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

jkgoodrich commented

View reviewed changes

gnomad_qc/v4/create_release/calculate_variant_statistics.py

+              Aggregated statistics can also be computed by ancestry.
+              """
+              # TODO: Maybe move to a folder called assessment and rename to

Contributor Author

jkgoodrich Mar 18, 2024

@matren395 @KoalaQin thoughts on this TODO?

Contributor

matren395 Mar 18, 2024

does 'assessment' exist in older versions or other projects?

this would be creating stats to go with the release, so I'm very okay leaving it in create_release. However, if more assessments are coming down the road, then making a new folder could make sense? I think I'm leaning against, but not super strongly

Contributor

KoalaQin Mar 19, 2024

We don't have assesment under gnomad_qc, but we do have it under gnomad_methods and there's the summary_stats.py there. One question though, are we publicly releasing this per_sample count table or just the final aggregate stats? I'm not sure if we're allowed to, like meta isn't public.

Contributor Author

jkgoodrich Mar 19, 2024

Correct, we have not productionized release stats before, but I was using assessment because that is what we used for some release level stats that went into gnomad_methods. Nope, we can't release the HT, and the stats wont actually be fully released, we will add some of them to the stats page or to other pages of the browser. That's why I was thinking moving them to an assessment folder where we can also put the script in this PR into it.

jkgoodrich commented

View reviewed changes

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

jkgoodrich and others added 3 commits

March 18, 2024 15:15


          Add filter to chr22 for release_ht

46d1a01


          Update gnomad_qc/v4/create_release/calculate_variant_statistics.py

d8247ed

Co-authored-by: Qin He <[email protected]>


          Add option to print stats

48862de

KoalaQin requested changes

View reviewed changes

Contributor

KoalaQin left a comment

Let's get Julia's PR in before thinking about combinations?

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

gnomad_qc/v4/create_release/calculate_variant_statistics.py Outdated Show resolved Hide resolved

jkgoodrich and others added 3 commits

March 19, 2024 16:07


          Apply suggestions from code review

949834f

Co-authored-by: Qin He <[email protected]>


          Add option for rare variant AF cutoff

f67be9e


          Change arguments to skip so they don't all need to be included

a9a8cfd

KoalaQin approved these changes

View reviewed changes

Contributor

KoalaQin left a comment

LGTM! Running test only took ~3.5 minutes.

jkgoodrich merged commit cbecc2f into dm/per_sample_counts_4_1

2 checks passed

jkgoodrich deleted the jg/suggestions_to_per_sample_counts_4_1 branch

March 20, 2024 22:14

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Release Stats v4.1