Skip to content
Yan Wong edited this page Jun 5, 2024 · 1 revision

NB: mostly copied from https://tskit.dev/tsinfer/docs/latest/tutorial.html

Usage

To use tsinfer, you need to have phased (or haploid) DNA sequence data and some knowledge of the ancestral state for

Input

The recommended way to input data into tsinfer is as a series of efficient zarr arrays in the (VCF Zarr format)[https://github.com/sgkit-dev/vcf-zarr-spec]. Genetic variation data can be efficiently converted to zarr arrays using bio2zarr. For example, here's an example of converting a VCF file:


Clone this wiki locally