What Is Mendelian Randomization in Drug Discovery ?

Picking the wrong target is the most expensive mistake in drug discovery. A programme can run for years before a target turns out not to drive the disease after all. Mendelian randomization is one of the few tools that can tell you, before that spend, whether a gene actually causes a disease or only travels alongside it. This page explains what it is, why it changes target choice, and how Disease Atlas by Euretos uses it.

Understand the core idea: nature's randomized trial

A randomized controlled trial works because patients are assigned to treatment or placebo at random, so the two groups differ only in the treatment. Mendelian randomization borrows that logic from biology. At conception, each person inherits their gene variants at random, independent of lifestyle, wealth, or environment. If a variant nudges a protein up or down for life, the people who carry it are, in effect, a treatment group that nature assigned at birth. Compare their disease rates to non-carriers and you get a readout of what changing that protein does, free of most of the confounding that clouds ordinary observational studies.

Three pieces make up the method. The variant is the instrument. The protein or biomarker it changes is the exposure. The disease is the outcome.

See why it matters for target choice

Most evidence linking a gene to a disease is association: the gene shows up more often in patients. Association cannot tell you direction. The gene might drive the disease, the disease might change the gene's activity, or a third factor might move both. Commit a programme to an association that runs the wrong way, and the target fails in the clinic for reasons no assay caught.

Mendelian randomization addresses this because the variant is fixed at conception, long before disease onset. Reverse causation cannot run backward in time, and most confounders are balanced across carriers and non-carriers. What remains is a causal estimate: if you perturb this target, does the disease move.

The payoff is measurable. Targets with human genetic support are roughly twice as likely to reach approval. Nelson and colleagues first showed this in 2015, finding that the share of drug mechanisms with direct genetic support rose from 2.0% at the preclinical stage to 8.2% among approved drugs. King and colleagues confirmed the roughly two-fold effect in 2019, and Minikel and colleagues refined it again in 2024. Genetic causal evidence is the closest thing target discovery has to a leading indicator of clinical success.

Limitations of Mandelian Randomization

Proper use of the method means naming its limits.

Horizontal pleiotropy: the variant might reach the disease through some other route, not through the gene you are testing. The first guard is to use only variants that sit at the gene and control its own expression, so an effect on the disease almost has to run through that gene. Two further checks catch what slips past. One looks at the direction of effect and drops any variant that tracks the disease more closely than it tracks the gene, because that variant is probably acting somewhere else. The other compares the several variants used for one gene: if they tell a consistent story the estimate holds, and if they disagree more than chance allows, that disagreement is the tell-tale sign of a hidden second pathway, and the outliers are removed.
Instrument strength: a weak genetic effect gives a noisy estimate.
Lifelong versus acute: the method reflects a lifetime of small exposure, not a drug given for months. It speaks to whether a target is causal, less to dose or timing.
Direction: separating cause from reverse cause takes care, and methods such as Steiger filtering help.
Context: a genome-wide estimate does not tell you which tissue or cell type the effect runs through.

That last point matters most for target choice, and it is where Mendelian randomization on its own stops.

See how Disease Atlas uses Mendelian randomization

Disease Atlas by Euretos treats Mendelian randomization as its strongest evidence layer, not its only one. The platform ranks the protein-coding genome against each disease, and the causal layer built on Mendelian randomization sits at the top of that ranking because it answers the causal question directly. Around it sit complementary layers: perturbation responses, network context, single-cell expression, and a foundation model that integrates the signal. Candidates rise when these layers converge, independent of how often a gene appears in the literature.

In Disease Atlas the exposure is the gene's own expression. The instruments are expression quantitative trait loci (eQTLs), the common variants that tune how much a gene is transcribed, drawn from GTEx and eQTLGen, and the outcome is disease risk from large human cohorts such as UK Biobank and FinnGen. Because eQTLs are measured tissue by tissue, the causal estimate carries a tissue context with it, which is part of how the ranking resolves to the cell types where the disease operates. Where a gene has no strong eQTL instrument, the causal layer does not force an estimate. It leans on the perturbation evidence instead, and the scorecard shows which support a given rank rests on.

Two choices make the difference. First, every layer is built from primary biological data, genetic effect sizes, perturbation magnitudes, single-cell expression, rather than from claims extracted out of papers. Second, the result is resolved to the cell types where the disease operates, which is the context a bare genetic estimate cannot give you. The researcher moves from a causal signal to a ranked, cell-type-resolved, defensible shortlist in a single session.

Is Mendelian randomization the same as a clinical trial?

No. It uses the random inheritance of gene variants as a natural experiment to estimate whether a target is causal. It is evidence for target choice, not a substitute for a trial.

What is an instrumental variable in Mendelian randomization?

A genetic variant used as a stand-in for changing a protein or biomarker. For the estimate to hold, the variant should affect the disease only through that exposure.

Can Mendelian randomization prove a drug will work?

No. It tells you whether perturbing the target is likely to move the disease. Dose, timing, and the molecule itself still have to be tested.

How is Mendelian randomization different from a GWAS association?

A genome-wide association study shows that a gene is associated with a disease. Mendelian randomization uses the same genetics to test whether the relationship is causal, and in which direction.

What is horizontal pleiotropy?

When a variant affects the disease through a pathway other than the intended target. It is the main threat to an estimate, and it is reduced by using variants at the target gene itself.

References

Nelson MR et al. The support of human genetic evidence for approved drug indications. Nature Genetics, 2015. https://www.nature.com/articles/ng.3314
King EA, Davis JW, Degner JF. Are drug targets with genetic support twice as likely to be approved? PLoS Genetics, 2019. PMID 31830040. https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1008489
Minikel EV et al. Refining the impact of genetic evidence on clinical success. Nature, 2024. https://www.nature.com/articles/s41586-024-07316-0
Gill D, Georgakis MK, Walker VM, Schmidt AF et al. Mendelian randomization for studying the effects of perturbing drug targets. Wellcome Open Research, 2021.
Common pitfalls in drug target Mendelian randomization and how to avoid them. BMC Medicine, 2024. https://bmcmedicine.biomedcentral.com/articles/10.1186/s12916-024-03700-9

What Is Mendelian Randomization in Drug Discovery?

Understand the core idea: nature's randomized trial

See why it matters for target choice

Limitations of Mandelian Randomization

See how Disease Atlas uses Mendelian randomization

Frequently asked questions

See if for your disease

References