Eukaryotic Pathogen Database
From Wikipedia, the free encyclopedia
The Eukaryotic Pathogen, Vector & Host Informatics Resources, or VEuPathDB, is a database of genomic and other large-scale datasets related to various eukaryotic pathogens, as well as their vectors and hosts. VEuPathDB stores data related to its organisms of interest and provides tools for searching through and analyzing the data. It currently consists of 14 component data platforms, each dedicated to a certain research topic, in addition to the main VEuPathDB portal website. VEuPathDB includes:[1]
- Genomics resources covering eukaryotic protozoan parasites
- Host responses to parasite infection (HostDB)
- Orthologs (OrthoMCL)
- Clinical and epidemiological data (ClinEpiDB)
- Microbiome data (MicrobiomeDB)

History
VEuPathDB traces its origins to efforts in the early 2000s to organize genomic and related large-scale biological data for infectious disease research. Initial projects such as PlasmoDB (for Plasmodium spp.), CryptoDB (for Cryptosporidium), and ToxoDB (for Toxoplasma gondii) were developed as standalone databases focused on specific eukaryotic pathogens. These early component sites were integrated under the umbrella of ApiDB[2], established by the U.S. National Institute of Allergy and Infectious Diseases (NIAID) to support apicomplexan parasite research.
As the scope of the resource expanded to include a broader range of eukaryotic pathogens, the project was renamed EuPathDB to reflect its extended taxonomic coverage.[3]
In parallel, VectorBase was developed to serve the invertebrate vector research community by providing similar genomic and functional datasets for disease vectors such as mosquitoes and ticks.[4] Both EuPathDB and VectorBase were funded as part of the NIH Bioinformatics Resource Centers (BRC) program, which began supporting pathogen and vector genomic resources in 2004.
In 2019, these two major resources were formally merged to create VEuPathDB, a unified bioinformatics platform integrating the strengths of EuPathDB and VectorBase into a single portal. This merger brought together data for eukaryotic pathogens, their invertebrate vectors, and relevant host organisms, supported by common infrastructure, analysis tools, and a shared web interface. The combined resource was designed to streamline data access and analysis for researchers studying infectious diseases and host-pathogen interactions.[5]
Since the merger, VEuPathDB has continued to grow in scope and capability, incorporating thousands of curated datasets across diverse organisms and data types, expanding advanced search and visualization tools, and evolving its infrastructure to accommodate new analytic methods and user needs.[6]
Functions
VEuPathDB is an integrated database covering eukaryotic pathogens in several genera, as well as the hosts and vectors of these organisms. It provides access to detailed genome information for these pathogens. The resource was formerly known as EuPathDB and, before that, as ApiDB, an integrated resource for apicomplexan parasites that brought together the pathogen-specific databases ToxoDB, PiroplasmaDB and CryptoDB.[7]
VEuPathDB is noted for its search strategy system and its comprehensive gene pages, both of which support researchers in querying and interpreting the data.[8]
Component databases
Currently, VEuPathDB consists of 14 component data platforms, each with a particular focus, and a main portal site:[9]
- VEuPathDB (The main portal site)
- AmoebaDB (Pathogenic Amoeba)
- CryptoDB (Cryptosporidium species)
- FungiDB (Pathogenic fungi)
- GiardiaDB (Giardia species)
- MicrosporidiaDB (Microsporidia species)
- PiroplasmaDB (Pathogenic Piroplasmida)
- PlasmoDB (Plasmodium species)
- ToxoDB (Toxoplasma species)
- TrichDB (Trichomonas species)
- TriTrypDB (Kinetoplastida such as Leishmania and Trypanosoma species)
- HostDB (Host response to parasite infection)
- OrthoMCL (Orthologous protein sequences)
- ClinEpiDB (Data from clinical and epidemiological studies and trials)
- MicrobiomeDB (Microbiome data)