A Hierarchical Bayesian Factorization Model for Implicit and Explicit Feedback Data

doi:10.1007/978-3-319-69179-4_8

Book Chapter10.1007/978-3-319-69179-4_8

A Hierarchical Bayesian Factorization Model for Implicit and Explicit Feedback Data

ThaiBinh Nguyen, +2 more

- 05 Nov 2017

- pp 104-118

2

TL;DR: This paper presents a hierarchical Bayesian model that can infer the latent feature vectors of items directly from the implicit feedback when they cannot be obtained from the rating data, and infer the full posterior distributions of these parameters using a Gibbs sampling method.

Abstract: Matrix factorization (MF) is one of the most efficient methods for performing collaborative filtering. An MF-based method represents users and items by latent feature vectors that are obtained by decomposing the rating matrix of users to items. However, MF-based methods suffer from the cold-start problem: if no rating data are available for an item, the model cannot find a latent feature vector for that item, and thus cannot make a recommendation for it. In this paper, we present a hierarchical Bayesian model that can infer the latent feature vectors of items directly from the implicit feedback (e.g., clicks, views, purchases) when they cannot be obtained from the rating data. We infer the full posterior distributions of these parameters using a Gibbs sampling method. We show that the proposed method is strong with overfitting even if the model is very complex or the data are very sparse. Our experiments on real-world datasets demonstrate that our proposed method significantly outperforms competing methods on rating prediction tasks, especially for very sparse datasets.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Boosting the Rating Prediction with Click Data and Textual Contents.

ThaiBinh Nguyen, +1 more

- 21 Aug 2019

- arXiv: Information Retrieval

TL;DR: This paper develops TCMF (Textual Co Matrix Factorization) that learns the user and item representations jointly from the user-item matrix, textual contents and item co-click matrix built from click data.

...read moreread less

•Posted Content

Learning Representations from Product Titles for Modeling Shopping Transactions

Binh Nguyen, +1 more

- 03 Nov 2018

- arXiv: Information Retrieval

TL;DR: BASTEXT is an efficient model of shopping baskets and the texts associated with the products that can efficiently model millions of baskets and that it outperforms the state-of-the-art methods in the next product recommendation task.

...read moreread less

References

•Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 05 Dec 2013

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

24.1K

•Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013

- arXiv: Computation and Language

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.

...read moreread less

22.9K

•Proceedings Article

Algorithms for Non-negative Matrix Factorization

Daniel D. Lee, +1 more

- 01 Jan 2000

TL;DR: Two different multiplicative algorithms for non-negative matrix factorization are analyzed and one algorithm can be shown to minimize the conventional least squares error while the other minimizes the generalized Kullback-Leibler divergence.

...read moreread less

9K

•Proceedings Article

Distributed Representations of Sentences and Documents

Quoc V. Le, +1 more

- 21 Jun 2014

TL;DR: Paragraph Vector is an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and its construction gives the algorithm the potential to overcome the weaknesses of bag-of-words models.

...read moreread less

8.9K

•Proceedings Article

Probabilistic Matrix Factorization

Andriy Mnih, +1 more

- 03 Dec 2007

TL;DR: The Probabilistic Matrix Factorization (PMF) model is presented, which scales linearly with the number of observations and performs well on the large, sparse, and very imbalanced Netflix dataset and is extended to include an adaptive prior on the model parameters.

...read moreread less

4.7K

...

Expand

A Hierarchical Bayesian Factorization Model for Implicit and Explicit Feedback Data

Chat with Paper

AI Agents for this Paper

Citations

Boosting the Rating Prediction with Click Data and Textual Contents.

Learning Representations from Product Titles for Modeling Shopping Transactions

References

Distributed Representations of Words and Phrases and their Compositionality

Distributed Representations of Words and Phrases and their Compositionality

Algorithms for Non-negative Matrix Factorization

Distributed Representations of Sentences and Documents

Probabilistic Matrix Factorization

Related Papers (5)

Is Simple Better? Revisiting Non-Linear Matrix Factorization for Learning Incomplete Ratings

Two-step hybrid collaborative filtering using deep variational Bayesian autoencoders

Bayesian probabilistic multi-topic matrix factorization for rating prediction

Multi-task Learning for Bayesian Matrix Factorization

Neural Variational Hybrid Collaborative Filtering.