I am excited to share our latest paper, now available on arXiv: Overhead Wildlife Locator (OWL): Benchmarking Weakly Supervised Learning for Aerial Wildlife Surveys
Counting wildlife from aerial imagery is one of the most labor-intensive steps in modern conservation. Standard object detectors need bounding-box annotations, which are reported to be up to 7x slower and 3x more expensive to produce than simple point labels. That annotation bottleneck slows down the surveys that wildlife managers depend on.
To address this, we built our MegaDetector for overhead animal localization. The framework is called OWL: a weakly supervised density-estimation approach that learns to locate and count animals from point annotations alone. It comes in three model variants, each suited to a different survey regime:
⚡ OWL-C - a fully convolutional model for fast, high-throughput screening
🧩 OWL-T - a Swin-augmented hybrid for cluttered, heterogeneous scenes
🧠 OWL-D - built on a frozen DINOv3 foundation encoder with a DPT-style fusion decoder
We benchmarked all three against strong detection baselines (POLO, YOLOv11, and HerdNet) across five public aerial datasets, from sparse fixed-wing savanna surveys to dense UAV paddock imagery. OWL-D set a new state of the art on the Delplanque benchmark (0.934 AP vs. HerdNet's 0.840) and achieved the highest detection accuracy (AP) on four of the five datasets.
The most rewarding outcome was seeing OWL hold up in a real deployment. Working with the Alaska Department of Fish and Game, we applied OWL-C to the 2022 Central Arctic caribou census under cross-herd and cross-temporal transfer (trained on 2017 Porcupine Caribou Herd imagery) and reached F1 = 0.965 with a +3.1% population-count error, across nearly 15 gigapixels of survey imagery.
To support the community, we're releasing the code, model weights, and annotated patch-level datasets for large-scale caribou aerial surveys (Porcupine Caribou Herd 2017 and Central Arctic Herd 2022). We hope OWL helps other monitoring programs facing the same annotation and scaling challenges.
Huge thanks to my co-authors Zhongqi Miao, Bruno Demuro, Caleb Robinson, Rahul Dodhia, Lasha O., Jason Holmberg, Kirk Larsen, Howard Frederick, Nathan J. Pamperin, Pablo Arbeláez, and Juan M. Lavista Ferres. This was a deeply collaborative effort spanning the Microsoft AI for Good Lab, Microsoft AI for Good Lab - Biodiversity, Universidad de los Andes - Colombia (Cinfonia), the Alaska Department of Fish and Game, Conservation X Labs, and the Tanzania Wildlife Research Institute.
📄 Read the paper:
https://lnkd.in/eWCR48dF
💻 Code, model weights, and datasets:
https://lnkd.in/eT-Pb7Vu
#AIForGood #MegaDetector #Overhead #RemoteSensing #WildlifeMonitoring #MachineLearning #DeepLearning #ComputerVision #ConservationTechnology #AerialSurveys #FoundationModels #Caribou #ConservationAI #Photogrammetry