Augmenting Low-Resource Text Classification with Graph-Grounded Pre-training and Prompting

doi:10.1145/3539618.3591641

Open Access•Posted Content•10.1145/3539618.3591641

- 05 May 2023

33

TL;DR: This article proposes a graph-grounded pre-training and prompting (G2P2) model that addresses low-resource text classification with a two-pronged approach, in which three graph interaction-based contrastive strategies jointly pre-train a graph-text model.

Citations

Journal Article•10.48550/arxiv.2312.02783

Large Language Models on Graphs: A Comprehensive Survey

Bowen Jin, +5 more

- 05 Dec 2023

- arXiv.org

TL;DR: This survey provides a systematic review of scenarios and techniques related to large language models on graphs, covering LLM as Predictor, LLM as Encoder, and LLM as Aligner, and compares the advantages and disadvantages of the different schools of models.

60

Journal Article•10.1145/3626772.3657775

GraphGPT: Graph Instruction Tuning for Large Language Models

Jiabin Tang, +7 more

- 10 Jul 2024

27

Preprint•10.48550/arxiv.2405.08011

A Survey of Large Language Models for Graphs

Xubin Ren, +4 more

- 10 May 2024

TL;DR: A survey of large language models for graphs explores the latest state-of-the-art LLMs applied in graph learning and introduces a novel taxonomy to categorize existing methods based on framework design.

22

Journal Article•10.1145/3589335.3651476

Can we soft prompt LLMs for graph learning tasks?

Zheyuan Liu, +3 more

- 15 Feb 2024

- arXiv.org

TL;DR: The GraphPrompter framework unveils the substantial capabilities of LLMs as predictors in graph-related tasks, enabling researchers to utilize LLMs across a spectrum of real-world graph scenarios more effectively.

11

Preprint•10.48550/arxiv.2402.16024

HiGPT: Heterogeneous Graph Language Model

Jiabin Tang, +6 more

- 25 Feb 2024

TL;DR: HiGPT is a general large graph model designed to generalize heterogeneous graph learning models to diverse downstream tasks with distribution shifts in both node token sets and relation type heterogeneity.

11

References

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

94.2K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: BERT is a new language representation model designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; it can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

81.7K

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

51.8K

•Proceedings Article

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, +3 more

- 16 Jan 2013

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed and it is shown that these vectors provide state-of-the-art performance on the authors' test set for measuring syntactic and semantic word similarities.

27.5K

•Posted Content

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Yinhan Liu, +9 more

- 26 Jul 2019

- arXiv: Computation and Language

TL;DR: This paper finds that BERT was significantly undertrained and can match or exceed the performance of every model published after it; the best model achieves state-of-the-art results on GLUE, RACE and SQuAD.

26.2K
