You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Conservation of amino acids in multiple sequence alignments is an indicator of functional importance. In lieu of experimental function assays, one way to evaluate the design capabilities of our model is to measure how likely the model is to generate a sequence with correct functional residues.
Towards this goal, we need a utility to identify and quantify the conservation of particular amino acids in a sequence given an MSA. Given a query sequence and an MSA, the goal would be to compute some measurement of conservation (eg, entropy over amino acid distribution) for each position aligned to the query.
Consideration of alignment depth at each position would be a nice-to-have feature. Perhaps indicating positions with depth below some threshold with NaN/None values.
The text was updated successfully, but these errors were encountered:
Conservation of amino acids in multiple sequence alignments is an indicator of functional importance. In lieu of experimental function assays, one way to evaluate the design capabilities of our model is to measure how likely the model is to generate a sequence with correct functional residues.
Towards this goal, we need a utility to identify and quantify the conservation of particular amino acids in a sequence given an MSA. Given a query sequence and an MSA, the goal would be to compute some measurement of conservation (eg, entropy over amino acid distribution) for each position aligned to the query.
Consideration of alignment depth at each position would be a nice-to-have feature. Perhaps indicating positions with depth below some threshold with NaN/None values.
The text was updated successfully, but these errors were encountered: