munge() applies 5 filters (if possible):
Deduplication of RSID
Filter variants on INFO in INFO column is present
Filter variant on effect allele frequency
Removes strand ambigious variants
Removes variants with
N < round(stats::quantile(N, 0.9) / 1.5)
Arguments
- dset
a
dplyr::tibble()with columnsSNP,A1A2ZNand optional columnsEAFandINFO- info_filter
INFO score filter threshold at which to remove rows
- eaf_filter
effective allele filter at which to remove rows. eaf_filter=0.01 would filter to eaf > 0.01 & eaf < 0.99