Publications

Segment Anyword

Training-free prompt learning for language-grounded segmentation, using token-level cross-attention from a frozen diffusion model to generate object masks.

Dynamic Mixture of Agents (DMoA)

A test-time LLM ensembling strategy that dynamically adapts to balance performance, diversity, and consistency, achieving state-of-the-art results.

Decoding by Contrasting Retrieval Heads

Training-free decoding that mitigates LLM hallucinations by contrasting a base model with a masked-retrieval variant, boosting summarisation by up to 18.6%.

Foveation for Segmentation of Ultra-High Resolution Images

Building on our prior foveation work (MICCAI 2020), we introduce a more computationally efficient hard-gated categorical sampling method for FoV-resolution patch configurations, with two differentiable solutions. We validate its generalizability on three vision datasets: Cityscapes, DeepGlobe, and Gleason2019 histopathology.