Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation

doi:10.48550/arXiv.2304.07854

Journal Article10.48550/arXiv.2304.07854

Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation

Yunjie Ji, +5 more

- 16 Apr 2023

- arXiv.org

- Vol. abs/2304.07854

13

TL;DR: This article examined the influence of training data factors, including quantity, quality, and linguistic distribution, on model performance and provided valuable insights for the continued advancement of open-source chat models.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arxiv.2308.13416

SoTaNa: The Open-Source Software Development Assistant

Ensheng Shi, +8 more

- 25 Aug 2023

- arXiv.org

TL;DR: SoTaNa utilizes ChatGPT to generate high-quality instruction-based data for the domain of software engineering and employs a parameter-efficient fine-tuning approach to enhance the open-source foundation model, LLaMA.

...read moreread less

7

Journal Article•10.48550/arxiv.2311.09071

How Multilingual is Multilingual LLM?

Fei Yuan, +3 more

- 15 Nov 2023

- arXiv.org

TL;DR: This study evaluates the multilingual capacity of LLMs by conducting an exhaustive analysis across 101 languages, and classifies languages with similar characteristics into four distinct quadrants, shedding light on the rationale behind their categorization and offering actionable guidelines for tuning these languages.

...read moreread less

3

Journal Article•10.48550/arxiv.2401.03512

Token-free LLMs Can Generate Chinese Classical Poetry with More Accurate Format

Chengyue Yu, +4 more

- 07 Jan 2024

- arXiv.org

TL;DR: The finetuned token-free model, which is based on Qwen-chat-7B, is released, which can generate chinese classical poetry following complex instructions like LLMs (such as story paraphrasing), and also perform well in format.

...read moreread less

3

Journal Article•10.48550/arxiv.2310.07488

KwaiYiiMath: Technical Report

Jia-Yi Fu, +20 more

- 11 Oct 2023

- arXiv.org

TL;DR: The KwaiyiiMath is introduced, which enhances the mathematical reasoning abilities of KwaiYiiBase1, by applying Supervised Fine-Tuning (SFT) and Reinforced Learning from Human Feedback (RLHF), including on both English and Chinese mathematical tasks.

...read moreread less

1

Proceedings Article•10.18653/v1/2024.naacl-long.256

Flames: Benchmarking Value Alignment of LLMs in Chinese

Kexin Huang, +11 more

- 12 Nov 2023

TL;DR: This paper proposes Flames, a value alignment benchmark for large language models (LLMs) that evaluates their alignment with human values, particularly in the Chinese context, and finds most mainstream LLMs perform poorly on safety and fairness dimensions.

...read moreread less

References

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

•Posted Content

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Colin Raffel, +8 more

- 23 Oct 2019

- arXiv: Learning

TL;DR: This systematic study compares pre-training objectives, architectures, unlabeled datasets, transfer approaches, and other factors on dozens of language understanding tasks and achieves state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more.

...read moreread less

12.9K

Proceedings Article•10.48550/arXiv.2203.02155

Training language models to follow instructions with human feedback

Long Ouyang, +19 more

- 04 Mar 2022

TL;DR: The results show that fine-tuning with human feedback is a promising direction for aligning language models with human intent and showing improvements in truthfulness and reductions in toxic output generation while having minimal performance regressions on public NLP datasets.

...read moreread less

7.1K

Journal Article•10.48550/arXiv.2302.13971

LLaMA: Open and Efficient Foundation Language Models

Hugo Touvron, +13 more

- 27 Feb 2023

- arXiv.org

TL;DR: This article introduced LLaMA, a collection of foundation language models ranging from 7B to 65B parameters, and trained their models on trillions of tokens, and showed that it is possible to train state-of-the-art models using publicly available datasets exclusively, without resorting to proprietary and inaccessible datasets.

...read moreread less

6.6K

...

Expand