A Complete Guide to DISGENET Scores

When you search DISGENET, you’re not just retrieving associations; you’re working with evidence. And evidence needs to be weighed, compared, and understood in context. That’s what DISGENET’s scoring metrics are designed to help you do.

In this article, we walk through every scoring metric available in DISGENET: what each one measures, how it’s calculated, and when to use it in your research.

The GDA Score and VDA Score: Your Starting Point

The GDA Score (Gene–Disease Association) and VDA Score (Variant–Disease Association) are the core evidence metrics in DISGENET. Every time you run a query, these scores answer the same fundamental question: How confident can you be in this association?

Both scores are composite metrics — they summarize the amount and type of evidence for an association by taking into account the number of sources, their level of curation, publication count, and evidence type. The result is a single number that lets you quickly distinguish well-established associations from emerging, exploratory ones.

How the scores are calculated

The GDA Score is built from five weighted evidence components:

Image showing the breakdown of GDA Score

Values (≤ x.xx) represent the maximum contribution each component can make.

Curated — Expert-reviewed databases including ClinGen, ClinVar, UniProt, Orphanet, and others — up to 0.70
Clinical Trials — Evidence from ClinicalTrials.gov — up to 0.10
Inferred — Large-scale genetic and phenotype analyses (GWAS, PheWAS, HPO, ChEMBL, and others) — up to 0.25
Models — Animal model evidence from RGD, MGD, and literature — up to 0.10
Literature — PubMed publications from Text-mined data — up to 0.40

The VDA Score uses three components:

Image showing the breakdown of VDA Score

Values (≤ x.xx) represent the maximum contribution each component can make.

Curated — Expert-reviewed variant databases — up to 0.70
Biobank — Population-scale genetic studies (FinnGen, UK Biobank) — up to 0.20
Literature — PubMed publications from Text-mined data — up to 0.40

An important note: The DISGENET Score was originally capped at 1. That cap has now been removed. Associations with exceptionally strong, multi-source support may now exceed 1 — GDA scores can reach up to ~1.55, and VDA scores up to ~1.3. This means the full range of evidence is now visible, rather than compressing the top associations into a single value.

What does the score actually tell you?

Score range	What it means
Close to 1 (or above)	Highly supported — multiple curated resources and more than ten publications
Above 0.7	Backed by more than four expert-curated databases, or by fewer expert-curated databases and other data sources (e.g., Inferred or Biobank)
0.1 – 0.2	Novel or emerging — few publications, or 1 or 2 inferred sources, no expert curation yet

High scores are your anchors for established biology. Low scores are where discovery happens.

Where to find it: The GDA and VDA Scores appear in the Summary Table after running a query. If the column is not visible, click Select Columns and enable ScoreGDA or ScoreVDA.

For more details, see the documentati on page.

The Normalized Score: Precision Ranking Across the Full Dataset

With the score cap removed, a new challenge emerges: how do you compare associations when scores can now range beyond 1? That’s exactly what the Normalized Score was designed to solve.

The Normalized Score is a min-max normalized version of the GDA and VDA Scores. It rescales all association scores proportionally to a 0–1 range, preserving their relative ordering and differences — without compression or artificial ties at the top.

The key distinction is this: the original score reflects absolute evidence level; the Normalized Score reflects relative position within the dataset.

Why this matters in practice

Image showing the summary table highlighting the Normalized Score and ScoreGDA columns

Consider Alzheimer’s Disease genes. With the score cap lifted, genes like APP, APOE, MAPT, TREM2, and ABCA7 appear with a GDA Score of 1.0 — indistinguishable from one another. With the normalization, their scores range from 0.77 to 0.87, making the relative ranking immediately clear and easy to work with downstream.

Use the Normalized Score when:

Comparing associations across different diseases or genes, where absolute score differences may be misleading.
You need a strict 0–1 range for downstream analysis or modelling.
Ranking a large set of candidates where precision matters.

My Score: Prioritization Built Around Your Research

No two research teams weigh evidence the same way. A clinical geneticist and a drug discovery scientist may be looking at the same associations — but with very different priorities. My Score puts that control in your hands.

My Score is a fully customizable scoring metric for GDA and VDA associations. You choose the evidence sources that matter most to your work and assign weights to each — DISGENET then filters the data according to the sources selected and computes a weighted score to rank the associations accordingly.

What you can do with My Score

Select any combination of sources (e.g., GenCC, UniProt, ClinicalTrials, GWAS Catalog).
Assign custom weights that sum to 1.
Save your configuration and reapply it across future searches.
Make your prioritization method transparent and reproducible across teams.

For example, a pharmacogenomics team might define My Score as:

My Score = 0.50 · GenCC + 0.25 · ClinPGx + 0.25 · ClinicalTrials

This configuration filters and ranks associations specifically through the lens of curated clinical relevance — surfacing candidates that align with their internal standards, rather than the platform’s default weighting.

Associations not supported by your selected sources will be filtered out, keeping your results focused.

My Score is available in the web interface and can be saved to your Favorites for reuse. (Available on Standard & Advanced plans.)

Historical Score: Evidence Through Time

Science doesn’t happen all at once. Associations are discovered, confirmed, and sometimes revised over years and decades. The Historical Score lets you explore that timeline.

Introduced in DISGENET v26.2, the Historical Score recalculates the GDA Score using only evidence available within a selected year or range of years. It applies a temporal filter across all evidence types — curated databases, literature, inferred data, clinical trials, and animal models.

How it works

For specialized databases (e.g., ClinVar, GenCC, and ClinGen), record-level timestamps are used to determine when evidence became available.
For all sources, supporting publication and clinical trial dates are also used to estimate evidence availability.
The evidence availability date is defined as the earliest available date, based on either a record-level timestamp or a supporting publication/clinical trial.
Literature and clinical trial counts, and the final weighted score components are dynamically recomputed using only the evidence available during the selected time period.
Results are presented as a chart (yearly score and publication/clinical trials) and a score breakdown card.
Functions to compute and plot the cumulative Historical score are available in the disgenet2r R package.

When to use the Historical Score

Benchmark a target against what was known before a landmark study or platform update.
Track evidence trajectories — understanding whether support has built steadily, spiked suddenly, or plateaued.
Reproduce historical scoring for a specific year, supporting reproducible research and audit trails.
Train AI/ML models on evidence available up to a given year, then validate their predictive power against what emerged afterward.

Choosing the Right Score

Score	The question it answers	Best for	Supported tools
GDA / VDA Score	How well-supported is this association today?	General prioritization, target validation	Web, REST API, R package, AIDA
Normalized Score	How does this association rank relative to all others?	Cross-disease comparisons, downstream analysis	Web, REST API, R package, Cytoscape App, AIDA
My Score	How does this association rank by my criteria?	Team-specific prioritization, reproducible workflows	Web
Historical Score	How well-supported was this association at a given point in time?	Temporal analysis, benchmarking, ML training	Web, REST API, R package, AIDA

Each score answers a different question. Used together with other DISGENET data attributes (publication year, citation count, association type, they give you a complete, flexible picture of the evidence landscape.

START SEARCHING

SCHEDULE A DEMO