Analyze samples to identify nuclei-free cells #932
jashapiro
started this conversation in
Propose a new analysis
Replies: 1 comment 4 replies
-
@jashapiro if you haven't started exploring here (which I gather you haven't since you're working on #945), I read through this paper earlier today and I'm definitely interested in picking this up! I see first steps as:
|
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Proposed analysis
An intriguing paper by Montserrat-Ayuso & Esteve-Codina (2024) suggests that many cells in current atlases are low quality by virtue of not containing nuclei. We should explore to what extent these concerns affect the samples in ScPCA.
Scientific goals
We are always interested in improving the quality of the data in the atlas, and identifying nucleus-free cells is another layer of quality control that we may wish to add to the dataset.
Methods or approach
As we do have both total and spliced counts for all genes, we should be able to calculate a proxy for the intronic content as the difference between those two. We could start by looking at the distribution of that statistic across cells; this would not be exactly the same as was done using the
DropletQC
package, as that requires access to the BAM files, but we should be able to do something comparable to identify cells with very low intronic fractions.We should also look at expression of MALAT1 to see if the relationship observed by Montserrat-Ayuso & Esteve-Codina between that gene an intronic fraction is observed in our data.
Existing modules
None.
Input data
I would expect to start with the processed data, but it may be that we can obtain more clear results starting with the
filtered
data, where high mitochondrial content cells have not yet been removed. I would want to explore both to start. All analysis should be possible with the SCE objects.Scientific literature
https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-024-11015-5
Other details
I expect this analysis to require minimal resources. Timeline TBD.
Beta Was this translation helpful? Give feedback.
All reactions