Skip to content

Using population-level sequencing data in SLiM for neutral evolution simulations #606

@B-Demirbas

Description

@B-Demirbas

Hi! Following a question I posted on slim-discuss (which was answered), Ben Haller suggested I raise an issue on the GitHub on the specific problem I'm facing.

My initial goal for a project is to model neutral evolution in a population of ~500 individuals over N generations using standard Wright-Fisher simulations. The project is supposed to complement experimental work, and hence I want to use real sequencing data in the form of .vcf files containing information on SNPs.

The problem is that I only have population sequence data (for a population of ~500 individuals). It does contain information on average allele frequencies, but there is no individual-level data. As far as I can tell from the SLiM manual there is no straightforward way to load in and use population-level .vcf files directly in SLiM (it seems it always assumes it is individual-level data), is this correct?

As a workaround I now use the allelic depth data to estimate the frequency of individuals with a given SNP in a population. I use that to generate artificial individuals in a new .vcf (in Python) which I then load into SLiM. This seems to work, but it would be more convenient if there was a way to do this directly in SLiM. I attached an (abridged) example of a .vcf file that I would like to use.

example VCF population single chromosome.txt

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions