Product Knowledge Graph Embedding for E-commerce
Da Xu,Chuanwei Ruan,Evren Korpeoglu,Sushant Kumar,Kannan Achan +4 more
- 20 Jan 2020
- pp 672-680
TL;DR: The key entities are defined and summarize the pivotal product relations that are critical for general e-commerce applications including marketing, advertisement, search ranking and recommendation.
read more
Abstract: In this paper, we propose a new product knowledge graph (PKG) embedding approach for learning the intrinsic product relations as product knowledge for e-commerce. We define the key entities and summarize the pivotal product relations that are critical for general e-commerce applications including marketing, advertisement, search ranking and recommendation. We first provide a comprehensive comparison between PKG and ordinary knowledge graph (KG) and then illustrate why KG embedding methods are not suitable for PKG learning. We construct a self-attention-enhanced distributed representation learning model for learning PKG embeddings from raw customer activity data in an end-to-end fashion. We design an effective multi-task learning schema to fully leverage the multi-modal e-commerce data. The ¶oincare embedding is also employed to handle complex entity structures. We use a real-world dataset from \textslgrocery.walmart.com to evaluate the performances on knowledge completion, search ranking and recommendation. The proposed approach compares favourably to baselines in knowledge completion and downstream tasks.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Table 3: Summary of product catalog data. 
Table 2: Prediction for each task (relation) according to trained embeddings. 
Table 4: Testing performances on the knowledge completion tasks. The results are average over three runs and reported in %. The best performing method in each row is boldfaced, and the second best method in each row is underlined. The labels beneath the metric name corresponds to the labels in Figure 3. 
Table 5: Testing performance (in %) on next-impression recommendation and the search ranking task for queries that have encountered in training and new queries. The results are averaged over three runs. 
Figure 3: (a) Heatmap for the task correlations ρ (b) Test performances of different training schedules on all the tasks. The symbols for each task can be found beneath themetrics in Table 4 and 5. Micro-F1 and macro-F1 are divided by 4 for presentation purpose. 
Figure 1: Visual illustrations. (a) Example of product category hierarchy. (b) Sketched product knowledge graph.
Citations
•Posted Content
Inductive Representation Learning on Temporal Graphs.
TL;DR: The temporal graph attention (TGAT) layer is proposed to efficiently aggregate temporal-topological neighborhood features as well as to learn the time-feature interactions by developing a novel functional time encoding technique based on the classical Bochner's theorem from harmonic analysis.
534
A Survey on Knowledge Graph Embedding: Approaches, Applications and Benchmarks
TL;DR: This paper systematically introduces the existing state-of-the-art approaches and a variety of applications that benefit from these methods in knowledge graph embedding and introduces the advanced models that utilize additional semantic information to improve the performance of the original methods.
256
Domain knowledge graph-based research progress of knowledge representation
TL;DR: The research introduces the related concepts of the knowledge representation and analyzes knowledge representation of knowledge graphs by category, which includes some classical general knowledge graphs and several typical domain knowledge graphs.
107
•Book
Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases
Gerhard Weikum,Luna Dong,Simon Razniewski,Fabian M. Suchanek +3 more
- 12 Jul 2021
TL;DR: In this paper, the authors survey fundamental concepts and practical methods for creating and curating large-scale knowledge bases, including methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies.
•Posted Content
Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases
TL;DR: In this article, the authors survey fundamental concepts and practical methods for creating and curating large-scale knowledge bases, including methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies.
90
References
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Attention Is All You Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Łukasz Kaiser,Illia Polosukhin +7 more
- 01 Jan 2017
Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
51.8K
•Proceedings Article
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau,Kyunghyun Cho,Yoshua Bengio +2 more
- 01 Jan 2015
TL;DR: It is conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
25.7K
•Proceedings Article
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov,Ilya Sutskever,Kai Chen,Greg S. Corrado,Jeffrey Dean +4 more
- 05 Dec 2013
TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
•Posted Content
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: In this paper, the authors propose to use a soft-searching model to find the parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
20.9K