munge()
applies 5 filters (if possible):
Deduplication of RSID
Filter variants on INFO in INFO column is present
Filter variant on effect allele frequency
Removes strand ambigious variants
Removes variants with
N < round(stats::quantile(N, 0.9) / 1.5)
Arguments
- dset
a
dplyr::tibble()
with columnsSNP
,A1
A2
Z
N
and optional columnsEAF
andINFO
- info_filter
INFO score filter threshold at which to remove rows
- eaf_filter
effective allele filter at which to remove rows. eaf_filter=0.01 would filter to eaf > 0.01 & eaf < 0.99