Simin Li
6 Papers
11 Citations
Simin Li is an academic researcher. The author has contributed to research in topics: Computer science. The author has an hindex of 2, co-authored 5 publications.
Chat about Author
Papers
Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent Reinforcement Learning
Jun Guo,Yonghong Chen,Yihang Hao,Zixin Yin,Yin Yu,Simin Li +5 more
- 17 Apr 2022
TL;DR: The first robustness testing framework for c-MARL algorithms, motivated by Markov Decision Process, MARLSafe considers the robustness of c- MARL algorithms comprehensively from three aspects, namely state robustness, action robustness and reward robustness.
Hierarchical Perceptual Noise Injection for Social Media Fingerprint Privacy Protection
TL;DR: FingerSafe is the first to provide feasible biometric protection in both digital and realistic scenarios, and suppress the low-level local contrast stimulus by regularizing the response of Lateral Geniculate Nucleus.
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence
TL;DR: In this paper , a black-box attack called Adversarial Minority Influence (AMI) is proposed for cooperative multi-agent reinforcement learning (c-MARL) under adversarial attacks, which can be launched without knowing victim parameters.
Towards Benchmarking and Assessing Visual Naturalness of Physical World Adversarial Attacks
TL;DR: Zhang et al. as mentioned in this paper proposed a Dual Prior Alignment (DPA) network, which aims to embed human knowledge into model reasoning process by rating prior alignment and mimicking human gaze behavior by attentive prior alignment.
MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Simin Li,Ruixiao Xu,Jun Guo,Pu Feng,Jiakai Wang,Aishan Liu,Yaodong Yang,Xianglong Liu,Weifeng Lv +8 more
TL;DR: This work frames robustness as an inference problem and proves that minimizing mutual information between histories and actions implicitly maximizes a lower bound on robustness under certain assumptions, and proposes MIR2, which trains policy in routine scenarios and minimize Mutual Information as Robust Regularization.