Research
A self-supervised monocular video depth framework that enforces temporal and geometric consistency without offline SfM, using diffusion-based temporal refinement and implicit scene-flow modelling to enable stable, online depth estimation in dynamic scenes
Dec 2024
Generalizable and accurate stereo depth estimation is vital for 3D reconstruction, especially in surgery. Supervised learning methods obtain best performance however, limited ground truth data for surgical scenes limits generalizability.
Dec 2023
We present an extension to masked autoencoders (MAE) which improves on the representations learnt by the model by explicitly encouraging the learning of higher scene-level features. We do this by: (i) the introduction of a perceptual similarity term between generated and real images (ii) incorporating several techniques from the adversarial trainin...
Dec 2022
Introduces SurgT, the first standardised benchmark for soft-tissue tracking in stereo endoscopic surgery, providing a large-scale dataset, novel evaluation metrics, and a public challenge to advance unsupervised deep learning methods for surgical tracking
Aug 2023
My PhD thesis introducing self-supervised generative models, diffusion-based video depth, and implicit scene-flow learning to achieve sub-millimetre, temporally consistent depth estimation for mixed-reality surgical guidance
May 2023
Accurate stereo depth estimation is crucial for 3D reconstruction in surgery. Self-supervised approaches are more preferable than supervised approaches when limited data is available for training but they can not learn clear discrete data representations. In this work, we propose a two-phase training procedure which entails: (1) Performing Contrast...
Sep 2022
An evolutionary neural architecture that automatically designs and optimises deep multimodal networks, learning joint representations of ICU time-series, clinical events, and medical text to predict deterioration trajectories
May 2022
A plethora of deep learning-based methods have been proposed for self-supervised monocular depth estimation. The majority of these models utilise a U-Net-based architecture for disparity estimation. However, this architecture may not be optimal, and as shown in self-supervised approaches, replacing standard U-Net encoder with more complex architect...
Feb 2022
Surgical robots rely on robust and efficient computer vision algorithms to be able to intervene in real-time. The main problem, however, is that the training or testing of such algorithms, especially when using deep learning techniques, requires large endoscopic datasets which are challenging to obtain, since they require expensive hardware, ethica...
Feb 2021
Deep reinforcement learning has the potential to train robots to perform complex tasks in the real world without requiring accurate models of the robot or its environment. A practical approach is to train agents in simulation, and then transfer them to the real world. One of the most popular methods for achieving this is to use domain randomisation...
Dec 2019









