Conference
Reliability-Aware Citizen Science for Environmental Machine Learning
International Conference on Pattern Recognition 2026
— 2026
Key information
Authors:
Published in
August 2026
Abstract
Citizen science (CS) campaigns based on crowdsourcing pro- vide scalable labeled data for applications such as environmental moni- toring, but volunteer-generated annotations often suffer from noise and inconsistencies. Prior work has proposed statistical filtering techniques, including outlier detection and entropy-based agreement measures, to improve annotation reliability, yet their impact on downstream machine learning performance remains unclear. To address this gap, we conduct a case study using two CS campaigns from the ForestEyes project, focused on deforestation detection in tropical rainforests. Our approach is eval- uated against standard campaign-based and PRODES-based (expert- labeled) baselines for two different satellite (Landsat-8 and Sentinel-2). The results indicate that reliability-aware filtering increases balanced ac- curacy by up to 3.0% (relative gain) over unfiltered campaign data and surpasses the PRODES-based oracle by up to 0.8%, while requiring sub- stantially fewer training samples (at least ∼ 30% fewer). These findings highlight the potential of optimized citizen science data pipelines for deforestation detection in data-scarce regions.
Publication details
Authors in the community:
Fabio Augusto Faria
ist430285
Publication version
VoR - Version of Record
Title of the publication container
International Conference on Pattern Recognition 2026
Location of the conference
Lyon
Fields of Science and Technology (FOS)
computer-and-information-sciences - Computer and information sciences
Publication language (ISO code)
eng - English
Rights type:
Only metadata available