-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MTase-linker: Flag for methylation degree threshold #78
Comments
Are you using the |
Ah yes, I am using What is the difference between those files? The output section doesn't include that information yet. |
Also, please make sure to use the newest version of Nanomotif (v. 0.1.15) as it resolves issues present in previous versions The difference between motifs.tsv and bin_motifs.tsv lies in a series of post-processing steps applied to generate a consensus set of motifs across contigs for each bin (genome in your case). Some contigs will not have the motifs in the sequence, and other contigs might show slight variation in the motif compared to the rest of the bin due to noise or just the context in which the motifs is observed. To account for this, we apply post-processing to find consensus motifs across a whole genome and output this in bin-motifs.tsv. < If you want to find motifs in single genome bin-motifs.tsv is your go to file. motifs.tsv and score-motifs.tsv are more relevant in regard to binning. For more details on these post-processing steps, refer to supplementary note 1 of our preprint: https://www.biorxiv.org/content/10.1101/2024.04.29.591623v1 |
Got it. Regarding the version did you mean v0.4.15? That's indicated both on the PyPi page and the your conda |
Yes, sorry about the confusion. I meant v0.4.15. |
Getting a different error now on latest version with correct file input.
|
Can you provide the bin-motifs.tsv you are using? |
Here it is:
|
The error arises from a filtering step in the motif assignment process. MTase-linker only assigns motifs that are methylated in more than 50% of their occurrences across the entire genome. This is defined by the formula: n_mod_bin / (n_mod_bin + n_nomod_bin) > 0.5 From literature (Beaulaurier 2019), we know that if a methylation motif is targeted by an MTase, typically >95% of motif occurrences are methylated. This is the reason why we choose this threshold of 50%. In your case, the two motifs have a methylation level below this threshold. As a result, MTase-linker filters these motifs out and attempts to assign an empty table, leading to the error. Thus, currently MTase-linker does not support the assignment of these motifs. Would you be interested in a configurable flag that could adjust this threshold? It would be interesting to filter the modkit pileup for methylations related to the motif, and then make a similar plot to the ones in figure S8 of the Nanomotif article. I guess you would see something like the middle plot for figure S8. You might also consider adjusting the --threshold_methylation_general, which determines whether a positions is seen as methylated or not. For further details, you might find this previous discussion helpful: link to issue #60. |
Hi @JSBoejer , that would be a good flag to have. Generating something like S8 would indeed be interesting! Thanks. |
Hello,
Wanted to share the following error obtained when running MTase-linker.
The text was updated successfully, but these errors were encountered: