Posts Tagged ‘rvas’

Price AL, Kryukov GV, de Bakker PI, Purcell SM, Staples J, Wei LJ, Sunyaev SR. Pooled association tests for rare variants in exon-resequencing studies. American Journal of Human Genetics (2010) 86: 832-838.

Sunday, February 1st, 2015

Pooled association tests for rare variants in exon-resequencing Simulation shows advantage of mult. rarity thresholds

Price AL, Kryukov GV, de Bakker PI, Purcell SM, Staples J, Wei LJ,
Sunyaev SR. Pooled association tests for rare variants in
exon-resequencing studies. American Journal of Human Genetics (2010)
86: 832-838.


Multiple studies indicate strong association between rare variants and
resulting phenotype. This paper describes a population-genetics
simulation framework to study the influence of variant allele
frequency on the corresponding phenotype. In a prior study, causal
relationship between variants and phenotype was resolved by performing
association test on set of variants having allele frequency below a
fixed threshold. However, here it is observed that simulation
frameworks based on a variable allele frequency threshold provide
higher accuracy in association test compared to the fixed allele
frequency model. In addition, inclusion of predicted functional
effects of variants (Polyphen-2 scores) increases the accuracy of the
variable frequency threshold model. Overall, this paper describes a novel methodology, which can be
used to explore the association between rare variants and various

Searching for missing heritability: Designing rare variant association studies

Sunday, October 5th, 2014

Searching for missing heritability… rare variant association studies Pessimistic on #RVAS in #noncoding regions

Nice overview of study design. Good journal-club material.

Mapping rare and common causal alleles for complex human diseases

Saturday, February 1st, 2014

Mapping rare & common causal alleles for complex human diseases: great primer, describing yin & yang of #RVAS v #GWAS

Found this a very illuminating primer, particularly relevant to understanding rare variants.

Soumya Raychaudhuri
Cell. 2011 September 30; 147(1): 57-69.

Some particularly useful quoted snippets below.


De novo mutations occurring spontaneously in individuals are constantly and rapidly introduced into any population. …Most of these mutations are quickly filtered out or lost by genetic drift and will never achieve appreciable allele frequencies. I illustrate this concept by a simulation in which de novo neutral mutations (conferring no effect on fitness) are introduced into a population of 2,000 diploid individuals. In 31 generations 95% of these mutations disappear from the general population, and not one of these mutations achieves an allele frequency of >1% in 200 generations (see Figure S1).

Common variant associations to phenotype are often facile to find. Their high frequencies allow case-control studies to be adequately powered to detect even modest effects. Their high r2 to other proximate common variants allows for association signals to be discovered by genotyping the marker directly, or other nearby correlated markers. But mapping those associated variants to the specific variant that functionally influence disease risk can be challenging since the statistical signals invoked by inter-correlated variants are difficult to disentangle.

On the other hand, individual rare variant associations are
challenging to find. Their low frequency renders current cohorts underpowered to detect all but the strongest effects, and lack of correlation to other markers often prevents them from being picked up by a standard genotyping marker panels. But, once a rare associated variant is identified, mapping the causal rare variants is relatively facile since recent ancestry is likely to limit the number of inter-correlated markers.

For rare variant associations, the field has not yet defined accepted standards for statistical significance that account for the burden of multiple hypothesis testing. Since there are many more rare variants than common ones, and they are not typically inter-correlated with each other, a more stringent threshold may be necessary than applied for common variants. One conservative approach is to correct for the total number of bases genome-wide, ie p=0.05/3000000000 ~ 10-11 as a significance threshold.

If a genomic region is critical to disease pathogenesis rare mutations may modulate disease susceptibility. Then many affected individuals may have rare mutations more frequently in that region, though the mutations may be different from and unrelated to one another. This concept has sparked interest in the genetics community, and workers in statistical genetics have devised strategies to examine rare variants in aggregate across a target region (Bansal et al., 2010). These “burden” tests assess if rare variants within a specific region are distributed in a non-random way, suggesting that they might be playing a roll in disease pathogenesis (see Figure 3B).


BETASEQ: A Powerful Novel Method to Control Type-I Error Inflation in Partially Sequenced Data for Rare Variant Association Testing

Monday, December 23rd, 2013

HoxB13 in prostate cancer

Friday, December 20th, 2013

mis-sense change (G84E) in HOXB13 was found overall in 1.4% of prostate cancer cases and in 0.1% of unaffected controls