Lecture 48: Population Genetics I:

Variation, Hardy-Weinberg, and Linkage Disequilibrium

(version 11 October 2004)

This material is copyrighted and MAY NOT be used for commercial purposes

 You are visitor number   since 11 October 2004) 

Population vs. Quantitative Genetics

Measures of Genetic Variation

Ideally, we would be able to look at phenotypic variation and directly infer the underlying amounts of genetic variation. However, we cannot do this.

A brief history of the struggle to measure variation

Measures of variation

With DNA sequencing, essentially all loci are 100 percent polymorphic if a large enough sample is used, as two individuals only have to differ at one base pair to be different in DNA sequence, and a typical locus is 1000 - 10,000 bp.

Allele and genotype frequencies: A single locus

Obtaining allele frequencies from genotype frequencies

Predicting genotype frequencies from allele frequencies

Assuming the Hardy-Weinberg conditions
  1. Random mating (individuals mate independent of their genotype)
  2. No selection (all genotypes leave, on average, the same number of offspring)
  3. Large population size (genetic drift can be ignored)
  4. Allele frequencies the same in both sexes
  5. Autosomal loci

Then

What happens when the sexes have different allele frequencies? (Homework problem!!)

Gamete and genotype frequencies: Two loci

When considering two (or more) loci, one must also account for the presence ot Linkage disequilibrium

Under random mating, gametes combine at random. Hence, if the frequency of (say) an AB gamete is 0.4 and an ab gamete is 0.1, then

However, the frequencies of gametes in a population can change by recombination from generation to generation unless they are in linkage equilibrium (which is also called gametic-phase equilibrium).

Example

At linkage equilibrium, the gametes have frequencies expected by independence of alleles,

Dynamics of linkage disequilibrium

If linkage disequilibrium exists, how does it change over time under random mating?

Let DAB(t) denote the disequilibrium for a particular gamete type, where

( We also use the notation pAB = Freq(AB), pA = Freq(A), etc.)

How does D change over time?

Rearranging gives

Hence, the recursion for linkage disequilibrium is

Thus, for a particular gamete type, say AB (A and B represent particular alleles at two different loci), then

Example: Human blood group data

Consider the Ainu population. Does this show indications of linkage disequilibrium?

If linkage equilibrium is present, then (say) freq(MS gamete) = freq(M)*freq(S)

Is this true? freq(MS gamete) = 0.024, while freq(M)*freq(S) = 0.405*0.271 = 0.110

Hence, the initial disequilibrium for the MS gamete is

If random mating and other Hardy-Weinberg assumptions hold, if the M-S locus distance is c = 0.1, what is the expected MS equilibrium after one generation of recombination?

What about after 20 generations?

Linkage-disequilibrium mapping

Most genes show linkage-disequilibrium between very tightly-linked markers

The usefulness of this observation is that we can use tightly-linked markers as indicators of whether a particular chromosome carries a disease allele.

How does this association arise?

Even after hundreds of generations, most chromosomes carrying the mutant allele also contain the tightly linked alleles on the original chromosome.

This feature has been exploited for very fine mapping of diseases genes, an approach called linkage-disequilibrium mapping

For many mutant alleles, there is a predominant haplotype (collection of very tightly-linked markers) with which it is associated, reflecting the haplotype of the original chromosome on which the mutant arose.

Equating the probability of no recombination to the observed proportion q of disease-bearing chromosomes with this predominant haplotype gives q= (1-c)t, where t is the age of the mutation or the age of the founding population (whichever is more recent). Solving for recombination frequency gives

Example

Hastbacka et al. (1992) examined the gene for diastrophic dysplasis (DTD), an autosomal recessive disease, in Finland.

A number of marker loci were examined, with the CSF1R locus showing the most striking correlation with DTD. The investigators were able to unambiguously determine the haplotypes of 152 DTD-bearing chromosomes and 123 normal chromosomes for the sampled individuals. Four alleles of the CSF1R marker gene were detected.

Marker Allele Normal chromosome DTD chromosome
1-1 4 3.3% 144 94.7%
1-2 28 22.7% 1 0.7%
2-1 7 5.7% 0 0%
2-2 84 68.3% 7 4.6%

Here, q = 0.947, while the current Finnish population traces back to around 2000 years to a small group of founders, which underwent around t=100 generations of exponential growth.

Using these estimates of q and t, gives an estimated recombination frequency between the CSF1R gene and the DTD gene as

c = 1- (0.947)(1/100) = 0.00051.