Amir

doi:10.1145/3580818

Open AccessJournal Article10.1145/3580818

Amir

Shinan Liu, +6 more

- 27 Mar 2022

- Proceedings of the ACM on interactive, m...

- Vol. 7, Iss: 1, pp 1-26

5

TL;DR: In this paper , the synthesis of video and network data for robust interaction recognition in connected environments is advocated for using machine learning-based approaches for activity recognition, where each labeled activity is associated with both a video capture and an accompanying network traffic trace.

Abstract: Activity recognition using video data is widely adopted for elder care, monitoring for safety and security, and home automation. Unfortunately, using video data as the basis for activity recognition can be brittle, since models trained on video are often not robust to certain environmental changes, such as camera angle and lighting changes. There has been a proliferation of network-connected devices in home environments. Interactions with these smart devices are associated with network activity, making network data a potential source for recognizing these device interactions. This paper advocates for the synthesis of video and network data for robust interaction recognition in connected environments. We consider machine learning-based approaches for activity recognition, where each labeled activity is associated with both a video capture and an accompanying network traffic trace. We develop a simple but effective framework AMIR (Active Multimodal Interaction Recognition)1 that trains independent models for video and network activity recognition respectively, and subsequently combines the predictions from these models using a meta-learning framework. Whether in lab or at home, this approach reduces the amount of "paired" demonstrations needed to perform accurate activity recognition, where both network and video data are collected simultaneously. Specifically, the method we have developed requires up to 70.83% fewer samples to achieve 85% F1 score than random data collection, and improves accuracy by 17.76% given the same number of samples.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1007/s10489-024-05645-1

Deep learning for computer vision based activity recognition and fall detection of the elderly: a systematic review

F. Xavier Gaya-Morey, +2 more

- 08 Jul 2024

- Applied Intelligence

TL;DR: This systematic review synthesizes 151 studies on deep learning-based fall detection and human activity recognition for the elderly, analyzing DL models, datasets, and hardware configurations, and highlighting recent advancements and areas for improvement in Ambient Assisted Living systems.

...read moreread less

6

Book Chapter•10.1007/978-3-031-56249-5_3

Network Anatomy and Real-Time Measurement of Nvidia GeForce NOW Cloud Gaming

Minzhao Lyu, +3 more

- 01 Jan 2024

- Lecture Notes in Computer Science

TL;DR: Network anatomy and real-time measurement of Nvidia GeForce NOW cloud gaming session to improve user experience and network capacity provisioning.

...read moreread less

3

Journal Article•10.1109/icde60146.2024.00409

An Interactive Dive into Time-Series Anomaly Detection

Paul Boniol, +2 more

- 13 May 2024

TL;DR: This tutorial takes a holistic view of anomaly detection in time series, starting from the core definitions and taxonomies related to time series and anomaly types to an extensive description of the anomaly detection methods proposed by different communities in the literature.

...read moreread less

1

Journal Article•10.1007/s00778-025-00949-1

MSAD: A deep dive into model selection for time series anomaly detection

Paparrizos John, +3 more

- 25 Oct 2025

- The Vldb Journal

TL;DR: This paper proposes MSAD, a model selection method for time series anomaly detection, evaluating 234 configurations of 16 base classifiers across 1980 time series, demonstrating that model selection outperforms individual anomaly detection methods with comparable execution time.

...read moreread less

Journal Article•10.1109/icdew61823.2024.00020

Beyond the Dimensions: A Structured Evaluation of Multivariate Time Series Distance Measures

Jens E. d’Hondt, +2 more

- 13 May 2024

TL;DR: Structured evaluation of multivariate time series distance measures. Taxonomy based on how measures handle multiple variates. No single measure is superior.

...read moreread less

References

•Proceedings Article•10.1109/CVPR.2017.143

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Zhe Cao, +3 more

- 21 Jul 2017

TL;DR: Part Affinity Fields (PAFs) as discussed by the authors uses a nonparametric representation to learn to associate body parts with individuals in the image and achieves state-of-the-art performance on the MPII Multi-Person benchmark.

...read moreread less

6.2K

•Proceedings Article•10.1109/ICCV.2019.00630

SlowFast Networks for Video Recognition

Christoph Feichtenhofer, +3 more

- 01 Oct 2019

TL;DR: This work presents SlowFast networks for video recognition, which achieves strong performance for both action classification and detection in video, and large improvements are pin-pointed as contributions by the SlowFast concept.

...read moreread less

4.2K

•Journal Article•10.1016/J.PATREC.2018.02.010

Deep learning for sensor-based activity recognition: A survey

Jindong Wang, +4 more

- 01 Feb 2018

- Pattern Recognition Letters

TL;DR: The recent advance of deep learning based sensor-based activity recognition is surveyed from three aspects: sensor modality, deep model, and application and detailed insights on existing work are presented and grand challenges for future research are proposed.

...read moreread less

1.9K

Book Chapter•10.1007/978-3-540-24646-6_10

Activity Recognition in the Home Using Simple and Ubiquitous Sensors

Emmanuel Munguia Tapia, +2 more

- 21 Apr 2004

TL;DR: Preliminary results on a small dataset show that it is possible to recognize activities of interest to medical professionals such as toileting, bathing, and grooming with detection accuracies ranging from 25% to 89% depending on the evaluation criteria used.

...read moreread less

1.5K

Journal Article•10.1109/TSMCC.2012.2198883

Sensor-Based Activity Recognition

Liming Chen, +4 more

- 01 Nov 2012

TL;DR: A comprehensive survey to examine the development and current status of various aspects of sensor-based activity recognition, making a primary distinction in this paper between data-driven and knowledge-driven approaches.

...read moreread less

1.1K