Shie Mannor
33 Papers
7 Citations
Shie Mannor is an academic researcher. The author has contributed to research in topics: Computer science & Engineering. The author has an hindex of 4, co-authored 31 publications.
Chat about Author
Papers
CALM: Conditional Adversarial Latent Models for Directable Virtual Characters
TL;DR: In this article , a conditional adversarial latent model (CALM) is proposed to generate diverse and directable behaviors for user-controlled interactive virtual characters using imitation learning, which can capture the complexity and diversity of human motion.
Optimizing Tensor Network Contraction Using Reinforcement Learning
Eli A. Meirom,Haggai Maron,Shie Mannor,G. Chechik +3 more
- 18 Apr 2022
TL;DR: This work proposes a Reinforcement Learning (RL) approach combined with Graph Neural Networks (GNN) to address the contraction ordering problem and shows how a carefully implemented RL-agent that uses a GNN as the basic policy construct can address these challenges and obtain significant improve-ments over state-of-the-art techniques.
9
Policy Gradient for s-Rectangular Robust Markov Decision Processes
TL;DR: In this paper , the robust policy gradient method (RPG) for s-rectangular robust Markov Decision Processes (MDPs) is presented, where the adversarial kernel is a one-rank perturbation of the nominal kernel.
8
Explainability-based Trust Algorithm for electricity price forecasting models
Leena Heistrene,Ram Machlev,Michael W. Perl,Juri Belikov,Dmitry Baimel,Kfir Y. Levy,Shie Mannor,Yoash Levron +7 more
TL;DR: In this article , the authors proposed a trust algorithm for electricity price forecasting (EPF) users based on explainable artificial intelligence techniques, which generates trust scores that reflect the model's prediction quality for each new input.
6
Proceedings Article
The Geometry of Robust Value Functions
Kaixin Wang,Navdeep Kumar,Kuangqi Zhou,Bryan Hooi,Jiashi Feng,Shie Mannor +5 more
- 30 Jan 2022
TL;DR: In this paper , the robust value space is determined by a set of conic hypersurfaces, each of which contains the robust values of all policies that agree on one state.
4