Announcement: this year, SEVEN challenges are being organized as part of the PBVS 2026 workshop.

7th Thermal Image Super-Resolution Challenge (TISR)

The seventh Thermal Image Super-Resolution challenge is a new edition of last year's challenge, which uses a cross-spectral dataset captured with visible (Basler camera) and thermal (TAU2 camera) sensors. It consists of two tracks.

Track 1 features a single evaluation task that requires participants to generate an x8 super-resolution thermal image from the given low-resolution thermal image. The challenge uses a noiseless set of images, bicubically down-sampled by a factor of 8, as input.
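For reference, here is a minimal sketch of how such inputs can be produced with bicubic down-sampling. The use of OpenCV and the file names are illustrative assumptions, not the official data-preparation pipeline:

```python
# Minimal sketch: producing an x8 bicubic low-resolution input from a
# high-resolution thermal image. File names and OpenCV usage are assumptions.
import cv2

hr = cv2.imread("thermal_hr.png", cv2.IMREAD_UNCHANGED)  # HR thermal image
h, w = hr.shape[:2]

# Bicubic down-sampling by a factor of 8, matching the noiseless Track 1 setup.
lr = cv2.resize(hr, (w // 8, h // 8), interpolation=cv2.INTER_CUBIC)
cv2.imwrite("thermal_lr_x8.png", lr)
```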

Track 2 consists of two evaluation tasks using the above-mentioned dataset. The first evaluation involves generating an x8 super-resolution thermal image, while the second requires an x16 super-resolution thermal image. In both cases, the provided high-resolution visible image must be used as guidance for enhancing the low-resolution thermal image; architectures proposed in this track must incorporate the visible image as guidance.
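As a sketch of what a guidance-based architecture can look like, here is a minimal two-branch PyTorch model. The layer choices and feature sizes are purely illustrative assumptions; any design that incorporates the visible guidance satisfies the track requirement:

```python
# Minimal sketch of guided x8 super-resolution: a HR visible image steers the
# upsampling of a LR thermal image. The architecture is illustrative only.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GuidedSRx8(nn.Module):
    def __init__(self, feats=64):
        super().__init__()
        self.thermal_enc = nn.Conv2d(1, feats, 3, padding=1)  # LR thermal branch
        self.visible_enc = nn.Conv2d(3, feats, 3, padding=1)  # HR visible guidance branch
        self.fuse = nn.Conv2d(2 * feats, feats, 3, padding=1)
        self.out = nn.Conv2d(feats, 1, 3, padding=1)

    def forward(self, lr_thermal, hr_visible):
        # Upsample thermal features to the guidance resolution (x8), then fuse.
        t = F.interpolate(self.thermal_enc(lr_thermal), scale_factor=8,
                          mode="bicubic", align_corners=False)
        v = self.visible_enc(hr_visible)
        return self.out(F.relu(self.fuse(torch.cat([t, v], dim=1))))

model = GuidedSRx8()
sr = model(torch.randn(1, 1, 32, 32), torch.randn(1, 3, 256, 256))  # (1, 1, 256, 256)
```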

For further details and access to the dataset, please refer to the CodaLab page:

TRACK 1

TRACK 2

5th Multi-modal Aerial View Imagery Challenge: Classification (MAVIC-C)

Electro-optical (EO) sensors, which capture images in the visible spectrum such as RGB and grayscale images, have been the most prevalent in computer vision research. However, other sensors, such as synthetic aperture radar (SAR), can reproduce images from radar signals and in some cases complement EO sensors when the latter fail to capture significant information (e.g., adverse weather conditions, no visible light).

An ideal automated target recognition system would be based on multi-sensor information to compensate for the shortcomings of either sensor platform individually. However, it is currently unclear if and how using EO and SAR data together can improve the performance of automatic target recognition (ATR) systems. Thus, the motivation for this challenge is to understand if and how data from one modality can improve the learning process for the other modality and vice versa. Ideas from domain adaptation, transfer learning, or fusion are welcome.
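As one illustration of the fusion direction, here is a minimal late-fusion classifier in PyTorch; the channel counts, class count, and architecture are assumptions for illustration, not a prescribed baseline:

```python
# Minimal sketch of late fusion for EO + SAR classification: each modality has
# its own encoder, and pooled features are concatenated before the classifier.
import torch
import torch.nn as nn

class EOSARFusion(nn.Module):
    def __init__(self, num_classes=10, feats=64):
        super().__init__()
        self.eo = nn.Sequential(nn.Conv2d(3, feats, 3, padding=1), nn.ReLU(),
                                nn.AdaptiveAvgPool2d(1))   # EO branch (RGB)
        self.sar = nn.Sequential(nn.Conv2d(1, feats, 3, padding=1), nn.ReLU(),
                                 nn.AdaptiveAvgPool2d(1))  # SAR branch (1 channel)
        self.head = nn.Linear(2 * feats, num_classes)

    def forward(self, eo_img, sar_img):
        f = torch.cat([self.eo(eo_img).flatten(1), self.sar(sar_img).flatten(1)], dim=1)
        return self.head(f)

logits = EOSARFusion()(torch.randn(2, 3, 56, 56), torch.randn(2, 1, 56, 56))
```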

In addition to target recognition, this challenge focuses on accuracy and out-of-distribution detection. A robust target recognition system would provide not only a labeled target but also a confidence score for the target; a low score would correspond to an out-of-distribution sample.
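A minimal sketch of one common way to produce such a score is the maximum-softmax-probability baseline shown below; the threshold value is an illustrative assumption, not part of the challenge rules:

```python
# Minimal sketch: predicted label plus a confidence score; low confidence
# flags a likely out-of-distribution (OOD) sample. Threshold is an assumption.
import torch
import torch.nn.functional as F

def predict_with_confidence(logits: torch.Tensor, ood_threshold: float = 0.5):
    probs = F.softmax(logits, dim=-1)
    conf, label = probs.max(dim=-1)   # confidence = maximum class probability
    is_ood = conf < ood_threshold     # low score -> likely OOD
    return label, conf, is_ood

label, conf, is_ood = predict_with_confidence(torch.randn(4, 10))
```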

More details and the dataset are available on the Codabench page:

Codabench link: https://www.codabench.org/competitions/12529/

4th Multi-modal Aerial View Imagery Challenge: Translation (MAVIC-T)

We iterate on the sensor translation challenge introduced at PBVS 2023. Sensor translation algorithms allow for dataset augmentation and enable the fusion of information from multiple sensors, and multi-modal sensor translation and data generation have wide-ranging applications. We introduce a custom multi-modal paired image dataset consisting of electro-optical (EO)/synthetic aperture radar (SAR) image pairs and RGB/IR image pairs. The motivation for this challenge is to advance state-of-the-art techniques in high-fidelity, conditioned data generation. This competition challenges participants to design general methods to translate aligned images across multiple modalities.
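As a sketch of what paired translation training can look like, here is a minimal pix2pix-style L1 reconstruction objective; the toy generator and the EO-to-SAR direction are assumptions, and the challenge leaves the actual method open:

```python
# Minimal sketch of a paired image-to-image translation objective: a generator
# maps one modality to the other and is trained with an L1 reconstruction loss.
import torch
import torch.nn as nn

gen = nn.Sequential(nn.Conv2d(3, 64, 3, padding=1), nn.ReLU(),
                    nn.Conv2d(64, 1, 3, padding=1))             # toy EO -> SAR generator
l1 = nn.L1Loss()

eo, sar = torch.randn(2, 3, 64, 64), torch.randn(2, 1, 64, 64)  # aligned image pair
loss = l1(gen(eo), sar)                                         # reconstruction loss
loss.backward()
```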

More details and the dataset are available on the Codabench page:

Codabench link: https://www.codabench.org/competitions/12566/

2nd Thermal Pedestrian Multiple Object Tracking Challenge (TP-MOT)

The Thermal MOT Challenge is dedicated to advancing object tracking research in thermal imaging, a critical modality for scenarios where visible-light sensors may fail, such as low-light conditions, nighttime, and adverse weather. Unlike RGB-based datasets, which dominate MOT research, thermal imaging provides unique advantages by capturing long-wavelength infrared (LWIR) data, making it an essential tool for robust tracking in challenging environments.

This challenge introduces the Thermal MOT Dataset, the first large-scale thermal imaging dataset annotated specifically for multiple object tracking. The dataset comprises 30 sequences (9000 frames) collected at five urban intersections using a FLIR ADK thermal sensor. These sequences include diverse scenes and object types in public spaces, providing a comprehensive benchmark for thermal MOT research.

Participants are invited to develop innovative algorithms that leverage thermal data to improve tracking accuracy and robustness. The challenge emphasizes single-modality tracking in the thermal domain and encourages approaches that address the unique characteristics of thermal imagery, such as noise, resolution, and contrast.

Dataset download link: Sharepoint

Rules

  • Solutions must adhere to the tracking-by-detection paradigm in a two-stage approach: an object detection model must be used to predict boxes, and a separate model/algorithm can be used to associate boxes into tracks (see the sketch after this list).
  • For a fair comparison of MOT algorithms, the detector used must be either YOLOv8s or YOLOv11s.
  • Ground truth annotations for the training set are provided. Models must not be trained on images from the validation set.
  • The same tracking parameters must be used for all 6 test sequences; parameters must not be tuned for any particular sequence.
  • To be considered in the competition, submissions must introduce an algorithmic or model improvement over existing solutions. Submissions that rely simply on hyper-parameter tuning or brute-force tuning will not be considered.
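For orientation, here is a minimal sketch of the association stage under this paradigm, using greedy IoU matching; real trackers typically add Hungarian matching and motion models, so this is an illustration, not a competitive baseline:

```python
# Minimal sketch of tracking-by-detection stage two: associate detector boxes
# (e.g. from YOLOv8s) to existing tracks by greedy IoU matching.
def iou(a, b):
    # Boxes as [x1, y1, x2, y2].
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area = lambda t: (t[2] - t[0]) * (t[3] - t[1])
    return inter / (area(a) + area(b) - inter + 1e-9)

def associate(tracks, detections, iou_thresh=0.3):
    # Greedily match each track to its best unused detection above the threshold.
    matches, used = {}, set()
    for tid, tbox in tracks.items():
        best = max(((iou(tbox, d), j) for j, d in enumerate(detections) if j not in used),
                   default=(0.0, None))
        if best[1] is not None and best[0] >= iou_thresh:
            matches[tid] = best[1]
            used.add(best[1])
    return matches

tracks = {0: [10, 10, 50, 50]}
dets = [[12, 11, 52, 49], [200, 200, 240, 240]]
print(associate(tracks, dets))  # {0: 0}; detection 1 would start a new track
```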

Codabench link: https://www.codabench.org/competitions/12298/

1st Hyperspectral Image Super-Resolution Challenge (HISR)

The first edition of this challenge introduces hyperspectral data with real LR–HR pairs. Each scene is captured twice:

  1. low-resolution hyperspectral cube via a low-magnification lens
  2. high-resolution hyperspectral cube via a high-magnification lens with the same field of view

Task: x4 Real Super-Resolution

Participants must generate a x4 HR hyperspectral image from the given real LR input. Unlike synthetic bicubic setups, the LR data reflect true optical blur, resolution loss, and sensor noise. Evaluation considers a combination of spatial and spectral fidelity metrics.
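As an illustration, the sketch below computes PSNR (spatial fidelity) and the Spectral Angle Mapper (spectral fidelity), two metrics commonly used for hyperspectral super-resolution; whether these are the official challenge metrics is an assumption:

```python
# Minimal sketch of spatial (PSNR) and spectral (SAM) fidelity metrics for
# hyperspectral cubes shaped (H, W, bands). The metric choice is an assumption.
import numpy as np

def psnr(ref, est, peak=1.0):
    mse = np.mean((ref - est) ** 2)
    return 10 * np.log10(peak ** 2 / mse)

def sam(ref, est, eps=1e-9):
    # Mean spectral angle (radians) between per-pixel spectra.
    dot = np.sum(ref * est, axis=-1)
    denom = np.linalg.norm(ref, axis=-1) * np.linalg.norm(est, axis=-1) + eps
    return np.mean(np.arccos(np.clip(dot / denom, -1.0, 1.0)))

ref, est = np.random.rand(64, 64, 31), np.random.rand(64, 64, 31)
print(psnr(ref, est), sam(ref, est))
```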

Dataset access and submission details are available on the CodaBench page (link TBA)

1st Mars Landslide Segmentation Challenge (MARS-LS)

Detecting landslides on Mars is fundamentally more challenging than on Earth due to the absence of vegetation and the highly uniform, desert-like surface. Electro-optical imagery alone often lacks sufficient contrast to distinguish landslide features from surrounding terrain, as Martian landslides must be identified primarily through subtle morphological cues such as scarps, hummocky deposits, and runout patterns. While multi-modal data, such as RGB or grayscale imagery, digital elevation models, slope, and thermal inertia, can provide complementary information, each modality differs in spatial resolution, noise, and physical meaning, making direct integration difficult.

An ideal Martian landslide detection system would effectively leverage multi-modal data to overcome the limitations of individual modalities and improve robustness across diverse terrains. However, it remains unclear how best to fuse these heterogeneous data sources to enhance learning, generalization, and out-of-distribution performance, especially under limited labeled data. This challenge motivates the exploration of multi-modal fusion, attention-based models, and transfer or domain adaptation techniques to develop robust deep networks that achieve accurate landslide segmentation while maintaining strong generalization to unseen Martian regions.
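As one illustration of handling these heterogeneous modalities, here is a minimal early-fusion sketch that normalizes each layer separately before stacking them into one network input; the modality list and shapes are assumptions for illustration:

```python
# Minimal sketch of early fusion for heterogeneous Mars modalities: normalize
# each layer separately (units and ranges differ), then stack into channels.
import numpy as np

def normalize(layer):
    # Per-modality z-score normalization.
    return (layer - layer.mean()) / (layer.std() + 1e-9)

gray = np.random.rand(256, 256)   # grayscale imagery
dem = np.random.rand(256, 256)    # digital elevation model
slope = np.random.rand(256, 256)  # slope derived from the DEM
ti = np.random.rand(256, 256)     # thermal inertia

x = np.stack([normalize(m) for m in (gray, dem, slope, ti)], axis=0)  # (4, H, W)
# x can then be fed to any multi-channel segmentation network.
```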


Dataset access and submission details are available on the CodaBench page

Codabench link: https://www.codabench.org/competitions/12305/

1st Unified Pedestrian Pose Estimation in Thermal Imaging Challenge (UPPET)

In the context of human pose estimation (HPE) using thermal surveillance imagery, generalizing across different sensors is challenged by domain gaps. These gaps stem from variations in sensor specifications, environmental conditions, and calibration inconsistencies, which can affect the thermal characteristics and representations of human figures. This challenge aims to spotlight the problem of domain gaps in a real-world thermal surveillance context and highlight the challenges and limitations of existing methods to provide a direction for future research.

To invigorate the field of HPE in thermal imaging for realistic scenarios, we propose specially selected, challenging splits for a challenge with three tracks:

  • Track 1: Generalization of HPE in Thermal Imaging using a cross-validation scheme: The task is to train a pose estimator that accurately predicts persons’ keypoints under domain shifts (see the split sketch after this list).
  • Track 2: HPE in Thermal Imaging using a specialization scheme: The task is to train a pose estimator that accurately predicts persons’ keypoints in thermal imagery.
  • Track 3: HPE in Thermal Imaging based on synthetic data: The task is to generate one’s own synthetic thermal data for training. Explicitly, participants should use generative networks (e.g., Stable Diffusion) to produce synthetic thermal training data, and must release and open-source that data. After training only on synthetic data, the pose estimator is evaluated on real data using the specialization scheme.
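For Track 1, here is a minimal sketch of a leave-one-domain-out cross-validation split of the kind the track implies; the domain names are placeholders, not the official splits:

```python
# Minimal sketch of leave-one-domain-out cross-validation: train on all
# sensors/scenes but one, evaluate on the held-out domain. Names are placeholders.
domains = {"sensor_A": [], "sensor_B": [], "sensor_C": []}  # lists of image paths

for held_out in domains:
    train = [img for d, imgs in domains.items() if d != held_out for img in imgs]
    test = domains[held_out]
    # Train the pose estimator on `train`, then evaluate keypoint accuracy on `test`.
```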

Dataset access and submission details are available on the Eval.AI page (link TBA)