Gene: RPL5 (Ribosomal Protein L5)
Location: chr1:92,831,986-92,841,924 (GRCh38)
Protein Length: 297 amino acids
Coding Sequence: 891 base pairs
Samples Analyzed: 3,202 diploid individuals
Total Alleles: 6,404 (2 alleles per individual)
Data Source: 1000 Genomes Project Phase 3
Analysis Tool: OneKGP
| Variant Type | Sites | Total Alleles (AC) | Samples Affected | % Samples |
|---|---|---|---|---|
| Missense | 10 | 29 | 29 | 0.91% |
| Synonymous | 8 | 286 | 268 | 8.37% |
| Stop-gained | 0 | 0 | 0 | 0.00% |
| Frameshift | 0 | 0 | 0 | 0.00% |
| Splice-disrupting | 0 | 0 | 0 | 0.00% |
| HIGH Impact (Total) | 0 | 0 | 0 | 0.00% |
Zero loss-of-function variants detected across 6,404 alleles from 3,202 individuals, indicating extreme intolerance to protein-truncating variants.
The dN/dS ratio compares the rate of non-synonymous substitutions (dN) to synonymous substitutions (dS). Under neutrality, dN/dS ≈ 1. Values < 1 indicate purifying selection; values > 1 indicate positive selection.
Assumptions:
Observed Allele Counts:
Rates:
dN = 29 / 668 = 0.04341
dS = 286 / 223 = 1.28251
dN/dS = 0.04341 / 1.28251 = 0.0338| Metric | Value | Interpretation |
|---|---|---|
| dN/dS | 0.034 | Extreme purifying selection |
| dN | 0.043 | Non-synonymous rate per site |
| dS | 1.283 | Synonymous rate per site |
dN/dS = 0.034 is dramatically below 1, indicating:
Comparison to other gene classes:
H₀ (Null): Missense and synonymous alleles occur at neutral rates
H₁ (Alternative): Missense alleles are depleted relative to neutral expectation
Given total of 315 alleles (29 missense + 286 synonymous):
Expected Missense = 315 × (668/891) = 236.3 alleles
Expected Synonymous = 315 × (223/891) = 78.7 allelesχ² = Σ [(Observed - Expected)² / Expected]
χ² = [(29 - 236.3)² / 236.3] + [(286 - 78.7)² / 78.7]
χ² = [43010.3 / 236.3] + [42990.3 / 78.7]
χ² = 181.96 + 546.20
χ² = 728.16| Statistic | Value |
|---|---|
| χ² | 728.16 |
| df | 1 |
| Critical value (α=0.05) | 3.84 |
| Critical value (α=0.001) | 10.83 |
| p-value | < 0.0001 |
Extremely significant departure from neutrality (p < 0.0001)
The chi-square value of 728.16 vastly exceeds critical values at all standard significance levels, providing overwhelming evidence that:
| Position | Ref→Alt | AC | AF (1KGP) | gnomAD AF | Category |
|---|---|---|---|---|---|
| 92833028 | G→A | 2 | 0.0003123 | 0.0000066 | Ultra-rare |
| 92834864 | T→C | 2 | 0.0003123 | 0.0000066 | Ultra-rare |
| 92836268 | A→G | 2 | 0.0003123 | 0.0000460 | Ultra-rare |
| 92836291 | C→G | 3 | 0.0004685 | 0.0000263 | Ultra-rare |
| 92836307 | G→A | 1 | 0.0001562 | 0.0000131 | Ultra-rare |
| 92837514 | C→T | 1 | 0.0001562 | 0.0000197 | Ultra-rare |
| 92837557 | A→G | 15 | 0.0023423 | 0.0055060 | Rare |
| 92837619 | G→A | 1 | 0.0001562 | 0.0000066 | Ultra-rare |
| 92837629 | A→G | 1 | 0.0001562 | 0.0000329 | Ultra-rare |
| 92841848 | C→T | 1 | 0.0001562 | 0.0000132 | Ultra-rare |
Note: AC = Allele Count (number of alternate alleles observed); AF = Allele Frequency (AC/6404)
| Position | Ref→Alt | AC | AF (1KGP) | gnomAD AF | Category |
|---|---|---|---|---|---|
| 92833026 | G→A | 4 | 0.0006246 | 0.0001642 | Ultra-rare |
| 92833612 | C→T | 1 | 0.0001562 | 0.0001643 | Ultra-rare |
| 92833636 | G→A | 185 | 0.0288882 | 0.0208400 | Common |
| 92834847 | T→C | 93 | 0.0145222 | 0.0053350 | Low frequency |
| 92834896 | C→T | 1 | 0.0001562 | 0.0000066 | Ultra-rare |
| 92836288 | C→T | 2 | 0.0003123 | 0.0004009 | Ultra-rare |
| 92836300 | T→C | 2 | 0.0003123 | 0.0000197 | Ultra-rare |
| 92841817 | A→G | 1 | 0.0001562 | 0.0000460 | Ultra-rare |
| Metric | Missense | Synonymous | Ratio |
|---|---|---|---|
| Mean AF | 0.000488 (0.049%) | 0.005685 (0.57%) | 0.086 |
| Median AF | 0.000156 | 0.000312 | 0.500 |
| Max AF | 0.002342 | 0.028888 | 0.081 |
| Variants AF < 0.001 | 9/10 (90%) | 6/8 (75%) | - |
| Variants AF > 0.01 | 0/10 (0%) | 2/8 (25%) | - |
Using standard mutation rate estimates:
Poisson Probability Calculation:
P(0 observed | λ=0.28) = e^(-0.28) = 0.756
| Metric | Value |
|---|---|
| Observed LoF | 0 |
| Expected LoF | 0.28 |
| P(0 observed) | 0.756 |
While observing zero LoF variants is not statistically unusual by itself (p=0.756), the complete absence of:
across 6,404 alleles is consistent with:
This is supported by clinical data: RPL5 haploinsufficiency causes Diamond-Blackfan Anemia (DBA6), confirming dosage sensitivity.
| Exon | Missense Sites | Syn Sites | Missense Alleles | Syn Alleles | M/S Ratio | Constraint |
|---|---|---|---|---|---|---|
| 2 | 1 | 3 | 2 | 190 | 0.011 | EXTREME |
| 3 | 1 | 2 | 2 | 94 | 0.021 | EXTREME |
| 4 | 5 | 2 | 8 | 4 | 2.000 | Moderate |
| 5 | 4 | 0 | 18 | 0 | ∞ | HIGH |
| 8 | 1 | 1 | 1 | 1 | 1.000 | Moderate-High |
Exons 2-3 (N-terminal and early 5S-binding):
Observed: 4 missense alleles, 284 synonymous alleles
Expected under neutrality: 216 missense, 72 synonymous
χ² = 418.4, p < 0.0001Interpretation: EXTREME constraint
Exon 5 (Core 5S-binding domain):
Observed: 18 missense alleles, 0 synonymous alleles
Note: Cannot perform standard chi-square (zero cell)
Interpretation: HIGH constraint despite one more common variantExon 4 (Linker/loop region):
Observed: 8 missense, 4 synonymous
M/S ratio = 2.0 (closer to neutral, but all ultra-rare)
Interpretation: Moderate constraint with high selection pressureFor the most common missense variant (chr1:92837557, AC=15, AF=0.0023423):
Observed:
Expected under HWE:
Chi-square for HWE:
χ² = [(15-15)² / 15] + [(0-0.018)² / 0.018] = 0.018
df = 1, p = 0.89Result: No deviation from HWE (as expected for rare variants)
| Gene Class | Typical dN/dS | RPL5 Observed |
|---|---|---|
| Unconstrained | 0.8 - 1.2 | 0.034 |
| Moderately constrained | 0.3 - 0.7 | 0.034 |
| Highly constrained | 0.1 - 0.3 | 0.034 |
| Essential ribosomal | 0.02 - 0.05 | 0.034 ✓ |
While we cannot calculate exact pLI scores from 1KGP alone, the pattern suggests:
With n=3,202 individuals (6,404 alleles):
Power to detect rare variants (AF=0.001):
Power to detect common variants (AF=0.01):
Conclusion: Sample size is adequate to robustly detect selection signatures
| Test | Result | Significance | Interpretation |
|---|---|---|---|
| dN/dS ratio | 0.034 | p < 0.0001 | Extreme constraint |
| χ² test | 728.16 | p < 0.0001 | Highly significant |
| LoF depletion | 0 observed | Consistent | Strong selection |
| AF analysis | 90% ultra-rare | - | Deleterious alleles |
| Regional χ² | Exons 2-3: 418.4 | p < 0.0001 | Domain-specific |
PATHOGENIC (Likely):
VUS (Require Functional Studies):
BENIGN (Likely):
Data Source: 1000 Genomes Project Phase 3
Genome Assembly: GRCh38
Sample Size: 3,202 diploid individuals (6,404 alleles)
Analysis Tool: OneKGP
Statistical Tests:
Clinical Context:
#612528