Amir
Shinan Liu,Tarun Mangla,Ted Shaowang,Jinjin Zhao,John Paparrizos,Sanjay Krishnan,Nick Feamster +6 more
5
TL;DR: In this paper , the synthesis of video and network data for robust interaction recognition in connected environments is advocated for using machine learning-based approaches for activity recognition, where each labeled activity is associated with both a video capture and an accompanying network traffic trace.
read more
Abstract: Activity recognition using video data is widely adopted for elder care, monitoring for safety and security, and home automation. Unfortunately, using video data as the basis for activity recognition can be brittle, since models trained on video are often not robust to certain environmental changes, such as camera angle and lighting changes. There has been a proliferation of network-connected devices in home environments. Interactions with these smart devices are associated with network activity, making network data a potential source for recognizing these device interactions. This paper advocates for the synthesis of video and network data for robust interaction recognition in connected environments. We consider machine learning-based approaches for activity recognition, where each labeled activity is associated with both a video capture and an accompanying network traffic trace. We develop a simple but effective framework AMIR (Active Multimodal Interaction Recognition)1 that trains independent models for video and network activity recognition respectively, and subsequently combines the predictions from these models using a meta-learning framework. Whether in lab or at home, this approach reduces the amount of "paired" demonstrations needed to perform accurate activity recognition, where both network and video data are collected simultaneously. Specifically, the method we have developed requires up to 70.83% fewer samples to achieve 85% F1 score than random data collection, and improves accuracy by 17.76% given the same number of samples.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Deep learning for computer vision based activity recognition and fall detection of the elderly: a systematic review
F. Xavier Gaya-Morey,Cristina Manresa-Yee,NULL AUTHOR_ID +2 more
TL;DR: This systematic review synthesizes 151 studies on deep learning-based fall detection and human activity recognition for the elderly, analyzing DL models, datasets, and hardware configurations, and highlighting recent advancements and areas for improvement in Ambient Assisted Living systems.
6
Network Anatomy and Real-Time Measurement of Nvidia GeForce NOW Cloud Gaming
Minzhao Lyu,Sharat Chandra Madanapalli,Arun Vishwanath,Vijay Sivaraman +3 more
TL;DR: Network anatomy and real-time measurement of Nvidia GeForce NOW cloud gaming session to improve user experience and network capacity provisioning.
3
An Interactive Dive into Time-Series Anomaly Detection
Paul Boniol,John Paparrizos,Themis Palpanas +2 more
- 13 May 2024
TL;DR: This tutorial takes a holistic view of anomaly detection in time series, starting from the core definitions and taxonomies related to time series and anomaly types to an extensive description of the anomaly detection methods proposed by different communities in the literature.
MSAD: A deep dive into model selection for time series anomaly detection
Paparrizos John,Palpanas, Themis,Senellart, Pierre,Boniol Paul +3 more
TL;DR: This paper proposes MSAD, a model selection method for time series anomaly detection, evaluating 234 configurations of 16 base classifiers across 1980 time series, demonstrating that model selection outperforms individual anomaly detection methods with comparable execution time.
Beyond the Dimensions: A Structured Evaluation of Multivariate Time Series Distance Measures
Jens E. d’Hondt,Odysseas Papapetrou,John Paparrizos +2 more
- 13 May 2024
TL;DR: Structured evaluation of multivariate time series distance measures. Taxonomy based on how measures handle multiple variates. No single measure is superior.
References
Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields
Zhe Cao,Tomas Simon,Shih-En Wei,Yaser Sheikh +3 more
- 21 Jul 2017
TL;DR: Part Affinity Fields (PAFs) as discussed by the authors uses a nonparametric representation to learn to associate body parts with individuals in the image and achieves state-of-the-art performance on the MPII Multi-Person benchmark.
6.2K
SlowFast Networks for Video Recognition
Christoph Feichtenhofer,Haoqi Fan,Jitendra Malik,Kaiming He +3 more
- 01 Oct 2019
TL;DR: This work presents SlowFast networks for video recognition, which achieves strong performance for both action classification and detection in video, and large improvements are pin-pointed as contributions by the SlowFast concept.
Deep learning for sensor-based activity recognition: A survey
TL;DR: The recent advance of deep learning based sensor-based activity recognition is surveyed from three aspects: sensor modality, deep model, and application and detailed insights on existing work are presented and grand challenges for future research are proposed.
Activity Recognition in the Home Using Simple and Ubiquitous Sensors
Emmanuel Munguia Tapia,Stephen S. Intille,Kent Larson +2 more
- 21 Apr 2004
TL;DR: Preliminary results on a small dataset show that it is possible to recognize activities of interest to medical professionals such as toileting, bathing, and grooming with detection accuracies ranging from 25% to 89% depending on the evaluation criteria used.
Sensor-Based Activity Recognition
Liming Chen,Jesse Hoey,Chris D. Nugent,Diane J. Cook,Zhiwen Yu +4 more
- 01 Nov 2012
TL;DR: A comprehensive survey to examine the development and current status of various aspects of sensor-based activity recognition, making a primary distinction in this paper between data-driven and knowledge-driven approaches.
1.1K