Suitability of Some Data Quality Controls Thresholds for Genetic Association Studies of Admixed Population
This trial is active, not recruiting.
|Condition||african american genetics|
|Sponsor||National Human Genome Research Institute (NHGRI)|
|Start date||May 2016|
|End date||May 2017|
|Trial size||12407 participants|
|Trial identifier||NCT02770001, 16-HG-N110, 999916110|
Background: In genetic studies, the quality of DNA samples is tested first. Samples that are low-quality are not used. Some studies involve minority ethnic groups. And example is admixed African American. These studies often have small sample sizes. It is important to make sure samples are not discarded unnecessarily. This may happen by using quality control (QC) thresholds for homogenous groups. These may not be appropriate for an admixed group. Researchers want to study samples that failed certain QC tests. They want to see if this has to do with the ancestry of the outliers or the quality of the samples.
To study samples that fail heterozygosity and sample genotype call rate QC. To see if the failing rates have to do with the ancestry composition of the outliers or the quality of the samples.
No new participants. Researchers will review data that has already been collected.
Researchers will study DNA samples in a lab.
The samples will not include data that can identify the person the sample came from.
Continental genetic ancestry fraction
time frame: Study Completion
time frame: Study Completion
Male or female participants at least 50 years old.
- All the genotype data will be used and no individual will be excluded based on any phenotype.
|Official title||Investigating the Suitability of Some Data Quality Controls Thresholds for Genetic Association Studies of Admixed Population|
|Principal investigator||Sharon K Davis|
|Description||The purpose of this IRB proposal is to gain access to genetic data generated from participants of publically-funded genomic studies and deposited into dbGaP. It is our intention to use dbGaP data to conduct secondary analysis of the influence of admixture on the outcome of data quality control (QC) in genetic association studies to inform future studies of the optimal QC metric for the genetic association analysis of admixed population. The data we intend to use is deposited in dbGaP and is from the Michigan University Health and Retirement Study (HRS). This protocol is being sent to you because of a dbGaP requirement for this specific de-identified dataset to be reviewed by an IRB as that was not in the protocol. This work will require the use of statistical analyses tools to estimate the genetic ancestry make-up of each sample, from the genotype data, and determine how that ancestry relates to QC outcomes (i.e. whether or not the sample might be excluded from an analysis due to its ancestral genetic composition rather that the sample genetic quality). Objectives and specific aims: This work aims to investigate samples failing heterozygosity and sample genotype call rate quality control (QC) to determine whether or not the samples call rate and heterozygosity rate have to do with the ancestry composition of the outliers rather than the quality of the samples and inform future studies of potential loss if general QC is applied to genetic data of admixed sample sets. Rationale and Background: In genetic association studies DNA sample quality can vary largely across study participants and such variation has an impact on genotype call rate and genotype accuracy; samples of low DNA quality tend to have lower genotype call rate and genotype accuracy. Heterozygosity rate (proportion of heterozygous loci per individual) and genotype failure rate (proportion of missing genotypes per individual) are jointly and routinely used to identify samples with low DNA quality at the data quality control (QC) stage of genetic association studies. Excessive heterozygosity rate may indicate sample contamination whilst a reduced heterozygosity rate could indicate inbreeding . Samples with 3-7% [2,3] genotype call-rate and heterozygosity > 2-3 standard deviations from the mean heterozygosity are usually excluded from genetic case-control studies. Generally, the sample size of genetic association studies involving minority ethnic groups such as admixed African American tends to be small. It is hence important to ensure samples are not discarded unnecessarily, resulting into reduced statistical power, by using QC thresholds applied to homogenous groups which might not be appropriate for an admixed sample set. The aim of this analysis is to investigate samples failing heterozygosity and sample genotype call rate QC to determine whether or not the samples call rate and heterozygosity rate have to do with the ancestry composition of the outliers rather than the quality of the samples. The motivation is to inform future studies of potential loss if general|
Call for more information