Polygenic score: a toy model

library("tidyverse")

Download the data here and load them using:

data = read_rds("assets/genotoy.rds")

The data variable is a list containing the following objects

The Q1 and B1 correspond to highly polygenic phenotypes whereas Q2 and B2 correspond to mildly polygenic phenotypes.

For each phenotype, analyse the whole dataset:

For each phenotype:

Compute a polygenic score based on the polymorphims such that \(p < .003\), \(p < .01\), \(p < .03\), \(p < .1\) and \(p < .3\)
Evaluate its accuracy using either Pearson’s \(\rho\) for quantitative traits and the area under the curve for binary traits
What’s the best choice of threshold for the \(p\)-value?