Create evaluation utility to compute residue conservation from MSA #61

jeffreyruffolo · 2023-10-25T20:24:23Z

Conservation of amino acids in multiple sequence alignments is an indicator of functional importance. In lieu of experimental function assays, one way to evaluate the design capabilities of our model is to measure how likely the model is to generate a sequence with correct functional residues.

Towards this goal, we need a utility to identify and quantify the conservation of particular amino acids in a sequence given an MSA. Given a query sequence and an MSA, the goal would be to compute some measurement of conservation (eg, entropy over amino acid distribution) for each position aligned to the query.

Consideration of alignment depth at each position would be a nice-to-have feature. Perhaps indicating positions with depth below some threshold with NaN/None values.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create evaluation utility to compute residue conservation from MSA #61

Create evaluation utility to compute residue conservation from MSA #61

jeffreyruffolo commented Oct 25, 2023

Create evaluation utility to compute residue conservation from MSA #61

Create evaluation utility to compute residue conservation from MSA #61

Comments

jeffreyruffolo commented Oct 25, 2023