Poster Presentation Australian Marine Sciences Association 2026 Conference

CheckEM: An Open-source Toolkit for Standardising, Cleaning, and Visualising Stereo-video Fish Survey Data (138954)

Brooke Gibbons 1 , Claude Spencer 2 , Jordan Goetze 2 , Dianne McLean 3 , Todd Bond 1 , Jacquomo Monk 4 , Matthew Navarro 1 , Daniel Agnello 1 , Charlotte Aston 1 , Charlie Huveneers 5 , Stephen J Newman 6 , Shaun Wilson 3 , Andrew Hoey 7 , Ben Radford 3 , Conrad Speed 3 , Daniel Ierodiaconou 8 , Tim Langlois 1
  1. The University of Western Australia, Crawley, WA, Australia
  2. Department of Biodiversity, Conservation and Attractions, Kensington, Western Australia, Australia
  3. Australian Institute of Marine Science, Perth, Western Australia, Australia
  4. Institute for Marine and Antarctic Studies, University of Tasmania, Hobart, Tasmania, Australia
  5. College of Science and Engineering, Flinders University, Adelaide, South Australia, Australia
  6. Department of Primary Industries and Regional Development, Hillarys, WA, Australia
  7. College of Science and Engineering, James Cook University, Townsville, QLD, Australia
  8. School of Life and Environmental Science, Deakin University, Warrnambool, Victoria, Australia

Accurate, interoperable, and representative data are fundamental to effective marine research and monitoring. Stereo-video methods are increasingly used in fish surveys worldwide due to their cost-effectiveness, non-destructive nature, and ability to produce permanent records and precise body size measurements. However, the reliability of both stereo- and mono-video data depends on standardised workflows and robust quality assurance processes. A national synthesis of fish survey datasets revealed widespread errors in metadata and species annotations, highlighting a critical need for improved data validation tools.

We present CheckEM, an open-source web application and R package designed to perform automated quality control checks on fish survey data. CheckEM identifies inconsistencies in metadata and cross-validates species annotations against taxonomic databases, known spatial distributions, and where applicable, maximum body size thresholds. The toolkit flags issues such as out-of-range observations, outdated nomenclature, and size outliers, and is compatible with outputs from multiple annotation platforms. It provides data standardisation, cleaning, and interactive visualisation through an accessible user interface, alongside downloadable error reports to support iterative data improvement.

By enhancing data accuracy, consistency, and transparency, CheckEM facilitates interoperability and reuse across datasets, strengthening collaborative research and enabling more robust ecological analyses. This tool supports improved decision-making in marine monitoring and management by ensuring higher confidence in underlying data.