
The Eukaryotic Pathogen, Vector & Host Informatics Resources, or VEuPathDB, is a database of genomic and other large-scale datasets related to eukaryotic pathogens and to their vectors and hosts. VEuPathDB stores data on its organisms of interest and provides tools for searching and analyzing those data. It currently consists of 14 component data platforms, each dedicated to a particular research topic, in addition to the main VEuPathDB portal website. VEuPathDB includes:[1]
- Genomics resources covering eukaryotic protozoan parasites
- Host responses to parasite infection (HostDB)
- Orthologs (OrthoMCL)
- Clinical and epidemiological data (ClinEpiDB)
- Microbiome data (MicrobiomeDB)
History
VEuPathDB traces its origins to efforts in the early 2000s to organize genomic and related large-scale biological data for infectious disease research. Initial projects such as PlasmoDB (for Plasmodium spp.), CryptoDB (for Cryptosporidium), and ToxoDB (for Toxoplasma gondii) were developed as standalone databases focused on specific eukaryotic pathogens. These early component sites were integrated under the umbrella of ApiDB[2], established by the U.S. National Institute of Allergy and Infectious Diseases (NIAID) to support apicomplexan parasite research.
As the scope of the resource expanded to include a broader range of eukaryotic pathogens, the project was renamed EuPathDB to reflect its extended taxonomic coverage.[3]
In parallel, VectorBase was developed to serve the invertebrate vector research community by providing similar genomic and functional datasets for disease vectors such as mosquitoes and ticks.[4] Both EuPathDB and VectorBase were funded as part of the NIH Bioinformatics Resource Centers (BRC) program, which began supporting pathogen and vector genomic resources in 2004.
In 2019, these two resources were formally merged to create VEuPathDB, a unified bioinformatics platform that combines EuPathDB and VectorBase in a single portal. The merger brought together data for eukaryotic pathogens, their invertebrate vectors, and relevant host organisms, supported by common infrastructure, analysis tools, and a shared web interface. The combined resource was designed to streamline data access and analysis for researchers studying infectious diseases and host-pathogen interactions.[5]
Since the merger, VEuPathDB has continued to grow in scope and capability, incorporating thousands of curated datasets across diverse organisms and data types, expanding advanced search and visualization tools, and evolving its infrastructure to accommodate new analytic methods and user needs.[6]
Overview of Resources and Tools
VEuPathDB provides free online access to omics data from eukaryotic protozoan and fungal pathogens, arthropod vectors of disease, and host responses to pathogen infection. The goal of VEuPathDB is to make data easily accessible, findable, and reusable by laboratory scientists. All integrated data and analyses follow standard workflows and methods to ensure data accuracy and enable data interoperability.
Integrated data types include genomes and their annotation (both structural and functional), transcriptomic data (e.g., single-cell and bulk RNA sequencing and microarray data), proteomic data (e.g., mass spectrometry evidence and quantitative data), isolate sequencing data used for variant calling and copy number variation determination, epigenomic data, and whole-genome phenotyping data (e.g., CRISPR screens and large-scale subcellular localization data), among others.[7]
Standard analyses provide additional data such as InterPro domains, signal peptide and transmembrane domain predictions, and metabolic pathways.
These data and analyses underlie the Search Strategies system and enable in silico experiments that query across datasets, data types, and organisms.[5]
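The idea of querying across datasets can be illustrated with a minimal sketch. It assumes that each search returns a set of gene identifiers and that successive searches are combined with set operations (intersect, union, minus). The search names, gene IDs, and the `combine` helper are hypothetical placeholders chosen for illustration; they are not VEuPathDB records or part of any VEuPathDB interface.

```python
# Hypothetical result sets: each search is assumed to return a set of gene IDs.
# The names and IDs are placeholders, not real VEuPathDB searches or records.
has_signal_peptide = {"gene_001", "gene_002", "gene_003", "gene_005"}
expressed_in_stage = {"gene_002", "gene_003", "gene_004"}
has_human_ortholog = {"gene_003", "gene_006"}


def combine(left, right, operator):
    """Combine two result sets with a named set operation."""
    if operator == "intersect":
        return left & right
    if operator == "union":
        return left | right
    if operator == "minus":
        return left - right
    raise ValueError(f"unknown operator: {operator}")


# Keep genes found by both of the first two searches,
# then remove any gene that also has a human ortholog.
step2 = combine(has_signal_peptide, expressed_in_stage, "intersect")
candidates = combine(step2, has_human_ortholog, "minus")
print(sorted(candidates))
```

In this sketch each step narrows or widens the previous result, which is one way to picture how a multi-step query can span different data types for the same set of genes.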
Component databases
Currently, VEuPathDB consists of 14 component data platforms, each with a particular focus, and a main portal site:[8]
- VEuPathDB (main portal site)
- AmoebaDB (Pathogenic Amoeba)
- CryptoDB (Cryptosporidium species)
- FungiDB (Pathogenic fungi)
- GiardiaDB (Giardia species)
- MicrosporidiaDB (Microsporidia species)
- PiroplasmaDB (Pathogenic Piroplasmida)
- PlasmoDB (Plasmodium species)
- ToxoDB (Toxoplasma species)
- TrichDB (Trichomonas species)
- TriTrypDB (Kinetoplastida such as Leishmania and Trypanosoma species)
- HostDB (Host response to parasite infection)
- OrthoMCL (orthologous protein sequences)
- ClinEpiDB (data from clinical and epidemiological studies and trials)
- MicrobiomeDB (microbiome data)
Subscription Model
In September 2024, the National Institute of Allergy and Infectious Diseases (NIAID) contract supporting VEuPathDB was not renewed, a decision that shocked the user community.[9][10] Starting in March 2025, VEuPathDB implemented a voluntary subscription model to keep the resources open and accessible to everyone while sustaining operations.[11]
References
- "VEuPathDB". veupathdb.org. Retrieved 2026-02-17.
- Aurrecoechea, Cristina; Heiges, Mark; Wang, Haiming; Wang, Zhiming; Fischer, Steve; Rhodes, Philippa; Miller, John; Kraemer, Eileen; Stoeckert, Christian J.; Roos, David S.; Kissinger, Jessica C. (2007-01-01). "ApiDB: integrated resources for the apicomplexan bioinformatics resource center". Nucleic Acids Research. 35 (suppl_1): D427–D430. doi:10.1093/nar/gkl880. ISSN 1362-4962. PMC 1669770. PMID 17098930.
- Aurrecoechea, Cristina; Barreto, Ana; Basenko, Evelina Y.; Brestelli, John; Brunk, Brian P.; Cade, Shon; Crouch, Kathryn; Doherty, Ryan; Falke, Dave; Fischer, Steve; Gajria, Bindu; Harb, Omar S.; Heiges, Mark; Hertz-Fowler, Christiane; Hu, Sufen (2016-11-29). "EuPathDB: the eukaryotic pathogen genomics database resource". Nucleic Acids Research. 45 (D1): D581–D591. doi:10.1093/nar/gkw1105. ISSN 0305-1048. Archived from the original on 2024-05-18.
- Giraldo-Calderón, Gloria I.; Emrich, Scott J.; MacCallum, Robert M.; Maslen, Gareth; Dialynas, Emmanuel; Topalis, Pantelis; Ho, Nicholas; Gesing, Sandra; the VectorBase Consortium; Madey, Gregory; Collins, Frank H.; Lawson, Daniel (2015-01-28). "VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases". Nucleic Acids Research. 43 (D1): D707–D713. doi:10.1093/nar/gku1117. ISSN 1362-4962. PMC 4383932. PMID 25510499.
- Amos, Beatrice; Aurrecoechea, Cristina; Barba, Matthieu; Barreto, Ana; Basenko, Evelina; Bażant, Wojciech; Belnap, Robert; Blevins, Ann S; Böhme, Ulrike; Brestelli, John; Brunk, Brian P; Caddick, Mark; Callan, Danielle; Campbell, Lahcen; Christensen, Mikkel (2022-01-07). "VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center". Nucleic Acids Research. 50 (D1): D898–D911. doi:10.1093/nar/gkab929. ISSN 0305-1048.
- Alvarez-Jarreta, Jorge; Amos, Beatrice; Aurrecoechea, Cristina; Bah, Saikou; Barba, Matthieu; Barreto, Ana; Basenko, Evelina Y; Belnap, Robert; Blevins, Ann; Böhme, Ulrike; Brestelli, John; Brown, Stuart; Callan, Danielle; Campbell, Lahcen I; Christophides, George K (2024-01-05). "VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center in 2023". Nucleic Acids Research. 52 (D1): D808–D816. doi:10.1093/nar/gkad1003. ISSN 0305-1048. PMC 10767879. PMID 37953350.
- Harb, Omar S.; McDowell, Mary Ann; Roos, David S. (2024), Setubal, João Carlos; Stadler, Peter F.; Stoye, Jens (eds.), "VEuPathDB Resources: A Platform for Free Online Data Exploration, Integration, and Analysis", Comparative Genomics: Methods and Protocols, New York, NY: Springer US, pp. 573–586, doi:10.1007/978-1-0716-3838-5_19, ISBN 978-1-0716-3838-5, retrieved 2026-04-02
- "The Eukaryotic Pathogen genome resource". EuPathDB. Retrieved 2013-11-11.
- Fernandez-Prada, Christopher; Moretti, Nilmar S.; Monte-Neto, Rubens L. do (2025-01-01). "Critical loss: the effects of VEuPathDB defunding on global health". The Lancet Microbe. 6 (1). doi:10.1016/j.lanmic.2024.100980. ISSN 2666-5247. PMID 39288782.
- "Science". AAAS. Retrieved 2026-04-02.
- "VEuPathDB". veupathdb.org. Retrieved 2026-04-02.