A Sample-Efficient Actor-Critic Algorithm for Recommendation Diversification

doi:10.1049/CJE.2019.10.004

Open AccessJournal Article10.1049/CJE.2019.10.004

A Sample-Efficient Actor-Critic Algorithm for Recommendation Diversification

Shuang Li, +4 more

- 01 Jan 2020

- Chinese Journal of Electronics

- Vol. 29, Iss: 1, pp 89-96

7

TL;DR: A novel actor-critic reinforcement learning algorithm for recommendation diversification that acts as the ranking policy, while the introduced critic predicts the expected future rewards of each candidate action.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1049/CJE.2020.05.004

Multi-feedback Pairwise Ranking via Adversarial Training for Recommender

Jianfang Wang, +4 more

- 01 Jul 2020

- Chinese Journal of Electronics

TL;DR: A novel Multi-feedback pairwise ranking method via Adversarial training (AT-MPR) for recommender to enhance the robustness and overall performance in the event of rating pollution and outperforms state-of-the-art implicit feedback collaborative ranking models in two evaluation metrics.

...read moreread less

4

•Journal Article•10.1049/cje.2020.00.417

Intelligent Orchestrating of IoT Microservices Based on Reinforcement Learning

Yuqin Wu, +5 more

- 01 Sep 2022

- Chinese Journal of Electronics

4

•Journal Article•10.3934/mbe.2023067

Extractive text summarization model based on advantage actor-critic and graph matrix methodology.

Senqi Yang, +5 more

- 01 Jan 2023

- Mathematical Biosciences and Engineering

TL;DR: Zhang et al. as mentioned in this paper introduced an extractive text summarization model based on a graph matrix and advantage actor-critic (GA2C) method, where the decision-making network made decisions and sent the results to the evaluation network for scoring.

...read moreread less

3

Journal Article•10.48550/arXiv.2211.11869

Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks

Anton Dereventsov, +2 more

- 21 Nov 2022

- arXiv.org

TL;DR: In this paper , the authors examine the behavior of reinforcement learning systems in personalization environments and detail the differences in policy entropy associated with the type of learning algorithm utilized, showing that policy optimization agents often possess low-entropy policies during training, which in practice results in agents prioritizing certain actions and avoiding others.

...read moreread less

2

•Journal Article•10.1155/2023/5546795

Research and Application of Rock Burst Hazard Assessment of the Working Face Based on the CF-TOPSIS Method

Feng Zhu, +6 more

- 22 Jun 2023

- Shock and Vibration

TL;DR: In this article , an improved comprehensive weighting prediction (CF-TOPSIS) method was proposed to predict weight and grade indices for rock burst evaluation in underground coal mines, and the prediction results combined with field drill cutting methods and microseismic monitoring data verify the accuracy of the proposed method.

...read moreread less

1

References

•Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1988

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

39.7K

•Proceedings Article

Asynchronous methods for deep reinforcement learning

Volodymyr Mnih, +7 more

- 19 Jun 2016

TL;DR: A conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers and shows that asynchronous actor-critic succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes using a visual input.

...read moreread less

9.2K

•Proceedings Article

Continuous control with deep reinforcement learning

Timothy P. Lillicrap, +7 more

- 22 Jul 2016

TL;DR: In this paper, an actor-critic, model-free algorithm based on the deterministic policy gradient is proposed to operate over continuous action spaces, which is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain.

...read moreread less

6.5K

•Journal Article•10.1145/3130348.3130369

The use of MMR, diversity-based reranking for reordering documents and producing summaries

Jaime Carbinell, +1 more

- 01 Aug 1998

TL;DR: A method for combining query-relevance with information-novelty in the context of text retrieval and summarization and preliminary results indicate some benefits for MMR diversity ranking in document retrieval and in single document summarization.

...read moreread less

2.3K

Proceedings Article•10.1145/1390334.1390446

Novelty and diversity in information retrieval evaluation

Charles L. A. Clarke, +6 more

- 20 Jul 2008

TL;DR: This paper develops a framework for evaluation that systematically rewards novelty and diversity into a specific evaluation measure, based on cumulative gain, and demonstrates the feasibility of this approach using a test collection based on the TREC question answering track.

...read moreread less

1K