Taxonomy Dataset
The taxonomy/
dataset is the backbone of biological consistency in the Pristine Seas Science Database. It provides harmonized reference lists and ecological traits for all taxa recorded across expeditions, enabling seamless integration of species-level data across methods, habitats, and years.
This shared taxonomy serves three essential roles:
- Identity resolution — Ensures consistent naming using authoritative sources (e.g., WoRMS, Coral Traits)
- Ecological interpretation — Assigns functional and trophic traits for downstream analyses (e.g., predator biomass, herbivore richness)
- Data integration — Supports clean joins between observation tables and trait metadata
Each major taxonomic group is documented in a dedicated table:
taxonomy/
├── fish # Reef and pelagic fishes from UVS, BRUVS, etc
├── nthos # Sessile benthic taxa from LPI, recruits, photomosaics (e.g., corals, algae, sponges)
├── inverts # Sessile and mobile invertebrates
All tables follow a standardized schema, centered on aphia_id (WoRMS) as the canonical key. Fields include accepted_name, family, trophic_group, functional_group, and additional group-specific traits.