From Data to Decision: What’s New in DISGENET v26.1

Full Traceability: Know Exactly Where Every Association Comes From

In research and drug discovery, the source of an association matters as much as the association itself. Provenance — the ability to trace each gene–disease or variant–disease association back to its original database, publication, or evidence type — is what transforms a data point into a trusted, decision-ready signal.

Image showing Gene information from the DISGENET Summary table

In pharma R&D, provenance enables target teams to assess whether a candidate is supported by specialized curated databases, preclinical models, or clinical studies — information that is critical at every triage gate. In biomarker discovery, it helps researchers distinguish well-established signals from emerging ones. In clinical genomics, it supports interpretability: can you point to the primary source, evaluate its quality, and defend your interpretation? With v26.1, the answer is a clearer yes.

A smarter NLP pipeline: result-focused evidence only

Not all sentences in a scientific paper carry the same evidential weight. A statement of hypothesis is not an experimental result. A method’s description is not a finding. The new Relation Extraction (RE) model in v26.1 understands this distinction: it filters for sentences reporting experimental outcomes and excludes those focused on hypotheses, methods, or background. This result-focused optimization directly improves the relevance and actionability of NLP-derived associations in DISGENET v26.1. For a deeper look at the model, read the dedicated blog post →

Supporting evidence is further classified using the new Polarity attribute, enabling users to differentiate between positive and negative findings directly within the Evidence table. This enables a more nuanced interpretation of gene–disease and variant–disease relationships, facilitating a more comprehensive assessment of consistency and conflicting evidence across studies.

New publication attributes: impact and translation potential

Each publication supporting an association now displays two additional attributes:

  • Citing article count — the number of articles that cite the publication, with links to those citing works, so you can immediately assess its scientific influence and follow the evidence trail forward in time.
  • Translation — a new metric reflecting the translational potential of each publication, from bench to bedside. This allows users to weigh evidence not only by volume or recency, but by its proximity to clinical application.

Together, these attributes provide a richer context for every evidence item, supporting more nuanced judgments about data quality and clinical relevance.

Image highlighting new Evidence Table attributes - Citation count and Translation columns

Prioritize with Confidence: Score Inspection, Enrichment, and Temporal Analysis

Explore what drives each score

The DISGENET score integrates multiple evidence types into a single ranking signal. In v26.1, you can now explore the contribution of each evidence component directly — click on any score value in the interface to inspect the underlying factors and understand what is driving it. This capability is available both in the web interface and via the REST API.

This level of transparency matters for target prioritization: instead of treating the score as a black box, you can assess whether a high-scoring GDA is driven by curated database support, clinical evidence, NLP-extracted findings, or a combination — and weight your decisions accordingly.

Image Showing the DISGENET Score breakdown for Gene IDE in relation to Alzheimer's Disease

Visualize how evidence has evolved over time

For every association, the web interface now includes a timeline visualization showing how the supporting evidence has grown, changed, or shifted over time. This is particularly valuable for:

  • Target validation: Is this association a recent discovery or has it been accumulating evidence for years?
  • Biomarker discovery: Is the signal gaining momentum or plateauing?
  • Clinical genomics: Has the clinical relevance of this variant been consistently reported, or does it fluctuate?

The R package has been extended with new temporal analysis functions, enabling users to run this type of time-series analysis programmatically on their gene or variant for a particular disease.

New feature is shown on the image, Evidence Timeline on the left side bar of the summary table, which appears after clicking the publication years and shows the evolution of evidence over time

Disease Enrichment Analysis — now fully integrated into your search workflow

Disease Enrichment Analysis can now be launched directly from selected gene or variant sets within your search results — no need to navigate away or reformat your data. The analysis now includes:

  • Odds ratio metrics for more robust statistical interpretation
  • Customizable plots that can be adapted to your reporting needs
  • Downloadable outputs for downstream analysis and publication

Improved Advanced Search

Advanced Search has been enhanced with more flexible text matching — handling synonyms, partial terms, and variant nomenclature more reliably — and now returns results ranked by relevance rather than by a fixed default order. Results can be downloaded for further analysis.

Meet the New AIDA

AIDA — the DISGENET AI assistant — has been rebuilt on an updated architecture and significantly expanded in scope. In addition to genes, variants, and diseases, AIDA now covers therapies, compounds, and disease subtypes, making it possible to ask cross-entity questions that previously required manual multi-step queries.

Whether you are running a target landscape assessment, exploring drug repurposing hypotheses, or mapping the evidence landscape for a disease subtype, AIDA provides a natural-language entry point into the DISGENET knowledge graph — grounded in curated, traceable evidence.

A Redesigned Interface Built Around Your Workflow

The DISGENET web interface has been substantially updated in v26.1, with every change aimed at reducing friction and accelerating insight.

Navigation and orientation:

  • A redesigned navigation bar and breadcrumb navigation let you track your exploration path and move between related entities without losing context.
  • New side panels surface contextual information on entities and scoring without requiring a page change.

Working with data:

  • Tables are now more interactive, with customizable layouts, built-in filtering, and one-click access to features such as Favorites and MyScore.
  • Action buttons for enrichment, saving, and downloading are now directly accessible from the results view.
  • The score component explorer (described above) is embedded directly into the table, enabling inline evidence inspection.

Documentation:

  • The Docs section has been updated with more visual explanations of metrics and scoring.
  • A new dedicated section on Disease Enrichment Analysis is now available.

Whether you are exploring targets in pharma R&D, validating biomarkers, or interpreting variants in a clinical context, the updated interface is designed to help you move faster — from initial search to actionable conclusion.

Try It Now

DISGENET v26.1 is live. Log in to explore the new provenance attributes, run your Disease Enrichment Analysis on the updated interface, inspect the score components for your targets, or ask AIDA a question about your disease area.