T3/Wheat Practical Haplotype Graph (PHG) imputation

PHG Description

The Practical Haplotype Graph (PHG) database was constructed using whole-exome capture sequencing data from a diverse set of wheat accessions.

Imputed protocols and projects
Method: Practical Haplotype Graph - PHG V2 with 2.9M markers
Visualize: Explore options
Description: Imputation pipeline and accuracy
ProtocolDownload
genotype project
accessionsCluster with PHGImputation accuracy
vs 2019_HapMap
Infinium 90K v2.1 TCAP90K_HWWAMP295 PCA 92% - details
TCAP90K_SpringAM_panel245
TCAP90K_YQV14216
TCAP90K_CSRVAL14116 PCA
VRND4_UCD_201576 PCA
TCAP90K_NAMparents_panel56
TCAP90K_HWWAMP_SRPN16
TCAP90K_SWWpanel313
TCAP90K_LeafRustPanel335
TCAP90K_SNBWWW266 PCA
Infinium 9K v2.1 NSGCwheat9K_4X542 PCA 94% - details
WorldwideDiversityPanel_9K2255
GMS USU_2021288 82% - details
USU_2022471
MRASeq RPN RPN_202086PCA 84% - details
RPN_202190PCA
RPN_202280PCA
RPN_202388PCA
RPN_2020-23286PCA
RPN_2020-21157PCA
RPN_2021-22150PCA
RPN_2022-23151PCA
MRASeq RGON RGON_2021300PCA 84% - details
RGON_2022327PCA
RGON_2023412PCA
UCD GBS UCD_2022345PCAna
SDSU GBS SDSU_2022 157PCAna
SDSU_2023155PCA
MSU GBS MSU_20233115 PCAna
ThermoFisher AgriSeq UCD_2023_AgriSeq_192 PCAna
UCD_2023_AgriSeq_294 PCA
WSU_2023_AgriSeq_A96 too many ambiguous alt alleles
WSU_2023_AgriSeq_B96
USDA 3K MNA_202276593% - details
MonSU_2022417
NDS_2022124
UCD_2022176
MonSU_2023432
UWM_2023181
WSU_2023364
Allegro UCD UCD2020_Allegro_CAD179
Notes:
1. Cluster analysis for Infinium calculated by merging unimputed data with PHG genotypes (common markers).
2. Cluster analysis for other protocols (no common markers) calculated on the unimputed data and showing which accessions are in the PHG
3. Imputation accuracy calculated by comparing every 30th marker between imputed output and 2019_HapMap for matching accessions

Imputation Pipeline

The imputation was done using the ImputePipelinePlugin of the PHG (PHG Wiki). The ImputePipelinePlugin was run with minRead=0 so that all haplotypes are imputed. PHG Version 0.0.40 was used to run the ImputePipelinePlugin.

Imputation accuracy

Genotyping ProtocolPHG accessionother accessions
Genotyping by multiplexed sequencing, GMS
Infinium 9K 93% 71%
Infinium 90K 94% 71%

PHG imputation protocol

Converting Illumina Infinium before imputation - Illumina data is saved in the T3 database as Ref = A_allele, Alt = B_allele. To impute this data and merge with other genotype protocols it is necessary to convert the Illumina so that it aligns (strand and orientation) with the reference genome. Detailed description

Imputation documentation - Testing PHG accuracy

GitHub repository of scripts to prepare and test data - PHG v2