Journal Article10.1007/s11548-025-03344-x
Video-based multi-target multi-camera tracking for postoperative phase recognition.
Franziska Jurosch,Janik Zeller,Lars Wagner,Ege Özsoy,Alissa Jell,Sven Kolb,D. Wilhelm +6 more
1
TL;DR: This study proposes a novel multi-target multi-camera tracking architecture for postoperative phase recognition, location tracking, and timestamp generation, achieving 84.9% traversal accuracy, 91.4% correct timestamp generation, and 92.0% patient tracking IDF1 in a simulated postoperative setting.
read more
Abstract: Deep learning methods are commonly used to generate context understanding to support surgeons and medical professionals. By expanding the current focus beyond the operating room (OR) to postoperative workflows, new forms of assistance are possible. In this article, we propose a novel multi-target multi-camera tracking (MTMCT) architecture for postoperative phase recognition, location tracking, and automatic timestamp generation. Three RGB cameras were used to create a multi-camera data set containing 19 reenacted postoperative patient flows. Patients and beds were annotated and used to train the custom MTMCT architecture. It includes bed and patient tracking for each camera and a postoperative patient state module to provide the postoperative phase, current location of the patient, and automatically generated timestamps. The architecture demonstrates robust performance for single- and multi-patient scenarios by embedding medical domain-specific knowledge. In multi-patient scenarios, the state machine representing the postoperative phases has a traversal accuracy of 84.9±6.0%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$84.9 \pm 6.0\%$$\end{document}, 91.4±1.5%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$91.4 \pm 1.5\%$$\end{document} of timestamps are generated correctly, and the patient tracking IDF1 reaches 92.0±3.6%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$92.0 \pm 3.6\%$$\end{document}. Comparative experiments show the effectiveness of using AFLink for matching partial trajectories in postoperative settings. As our approach shows promising results, it lays the foundation for real-time surgeon support, enhancing clinical documentation and ultimately improving patient care.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Spatio-Temporal Residual Attention Network for Satellite-Based Infrared Small Target Detection
Yan Chang,Decao Ma,Qisong Yang,Shaopeng Li,Daqiao Zhang,Yan Chang,Decao Ma,Qisong Yang,Shaopeng Li,Daqiao Zhang +9 more
TL;DR: This paper proposes a Spatio-Temporal Residual Attention Network for infrared small target detection in satellite videos, leveraging inter-frame residual guidance and spatio-temporal feature enhancement to improve detection accuracy and robustness in complex backgrounds.