Publications

(arXiv 2023) › Can GPT-4 Perform Neural Architecture Search?
(NeurIPS 2022) › ReCo: Retrieve and Co-segment for Zero-shot Transfer
(NeurIPS 2022) › RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection
(arXiv 2022) › BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
(arXiv 2022) › Crosslingual Generalization through Multitask Finetuning
(ECCV 2022) › Automatic dense annotation of large-vocabulary sign language videos
(CVPRW 2022) › Unsupervised Salient Object Detection with Spectral Cluster Voting
(CVPR 2022) › Cross Modal Retrieval with Querybank Normalisation
(CVPR 2022) › Sign Language Video Retrieval with Free-Form Textual Queries
(QST 2022) › Quantum Self-Supervised Learning
(IJCV 2022) › Scaling up sign spotting through sign language dictionaries
(ToM 2022) › Audio Retrieval with Natural Language Queries: A Benchmark Study
(Technical Report 2021) › BOBSL: BBC-Oxford British Sign Language Dataset
(ICCVW 2021) › All you need are a few pixels: semantic segmentation with PixelPick
(ICCV 2021) › TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval
(ICCV 2021) › Aligning Subtitles in Sign Language Videos
(Interspeech 2021) › Audio Retrieval with Natural Language Queries
(CVPR 2021) › Read and Attend: Temporal Localisation in Sign Language Videos
(CVPRW 2021) › Sign Segmentation with Changepoint-Modulated Pseudo-Labelling
(ICASSP 2021) › Sign Language Segmentation with Temporal Convolutional Networks
(ICASSP 2021) › QUERYD: A Video Dataset with High-Quality Text and Audio Narrations
(CVPR 2021) › Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval
(AAAI 2021) › Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval
(ACCV 2020) › Watch, read and lookup: learning to spot signs from multiple supervisors
(arXiv 2020) › Explaining the Adaptive Generalisation Gap
(Livestock Science 2020) › Movement change detected by optical flow precedes, but does not predict, tail-biting in pigs
(arXiv 2020) › Iterative Averaging in the Quest for Best Test Error
(BMVC 2020) › Seeing wake words: Audio-visual Keyword Spotting
(ECCV 2020) › BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
(ICASSP 2020) › Disentangled Speech Embeddings Using Cross-Modal Self-Supervision
(ICCV 2019) › Unsupervised Learning of Landmarks by Descriptor Vector Exchange
(ICCV 2019) › Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning
(BMVC 2019) › Use What You Have: Video retrieval using representations from collaborative experts
(TPAMI 2019) › Squeeze-and-Excitation Networks
(NeurIPS 2018) › Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks
(ACM MM 2018) › Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
(ECCV 2018) › Semi-convolutional Operators for Instance Segmentation
(ECCV 2018) › Learnable PINs: Cross-Modal Embeddings for Person Identity
(CVPR 2018) › Self-Supervised Learning of Geometrically Stable Features Through Probabilistic Introspection
(CVPR 2018) › Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching
(NeurIPS-MLATL 2016) › Unknowable Manipulators: Social Network Curator Algorithms
(BMVC 2016) › Learning Grimaces by Watching TV
(Technical Report 2015) › Up-To-Date Malaria Mapping: Prediction From Remote Imagery