How to adjust PGS in the context of genetic ancestry¶
pgscatalog-ancestry-ajust
is a CLI application that adjusts calculated PGS using genetic ancestry similarity data.
The process requires some complex inputs. If you’d like to automatically adjust PGS in the context of genetic ancestry automatically, it’s probably simplest to use the PGS Catalog Calculator, which uses this application internally.
Installation¶
$ pip install pgscatalog-calc
Usage¶
The following inputs are needed:
- PGS calculated on a target and reference population in long (melted) format (
--agg_scores
) PGS must be calculated using the same set of variants (the intersection)
Input file structure is the same as the output from
pgscatalog-aggregate
- PGS calculated on a target and reference population in long (melted) format (
PCA derived from a reference population using fraposa-pgsc (
--ref_pcs
)PCA for target population projected onto reference PCA using
fraposa-pgsc
(--target_pcs
)A relatedness cutoff file for the reference population (
--reference_related
)A sample information file for the reference population containing genetic ancestry group labels (
--psam
,SuperPop
column by default)Text labels of the reference and target samples (
--reference
and--dataset
), which must match the aggregated scoring file contents
$ pgscatalog-ancestry-adjust --agg_scores aggregated_scores.txt.gz \
--psam reference.psam \
--ref_pcs ref.pcs \
--target_pcs target.pcs \
--reference_related ref.king.cutoff.id \
--dataset hgdp \
--reference reference \
--outdir ancestry_results
Help¶
$ pgscatalog-ancestry-adjust --help