Jingwei Yi

8 Papers

2 Citations

Jingwei Yi is an academic researcher who has contributed to research in computer science. The author has an h-index of 1 and has co-authored 4 publications.

Papers

Journal Article•10.1038/s42256-023-00765-8

Defending ChatGPT against jailbreak attack via self-reminders

Yueqi Xie, +7 more

- 01 Dec 2023

- Nature Machine Intelligence

TL;DR: This work systematically documents the threats posed by jailbreak attacks, introduces and analyses a dataset for evaluating defensive interventions, and proposes the psychologically inspired self-reminder technique, which can efficiently and effectively mitigate jailbreaks without further training.

100

Proceedings Article•10.48550/arXiv.2305.10036

Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark

Wenjun Peng, +9 more

- 17 May 2023

TL;DR: This paper proposes an embedding watermark method that implants backdoors on embeddings, selecting a group of moderate-frequency words from a general text corpus to form a trigger set.

Journal Article•10.48550/arxiv.2312.14197

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models

Jingwei Yi, +7 more

- 21 Dec 2023

- arXiv.org

TL;DR: This work introduces the first benchmark, BIPIA, to measure the robustness of various LLMs and defenses against indirect prompt injection attacks. It proposes four black-box methods based on prompt learning and a white-box defense method based on fine-tuning with adversarial training, enabling LLMs to distinguish between instructions and external content and to ignore instructions in the external content.

Book•10.1145/3583780.3614991

Non-IID always Bad? Semi-Supervised Heterogeneous Federated Learning with Local Knowledge Enhancement

Chao Zhang, +9 more

- 21 Oct 2023

TL;DR: This paper proposes a semi-supervised heterogeneous federated learning method with local knowledge enhancement, called FedLoKe, which aims to train an accurate global model from both labeled and unlabeled local data with non-IID distributions.

Proceedings Article•10.48550/arXiv.2210.08809

Effective and Efficient Query-aware Snippet Extraction for Web Search

Jingwei Yi, +6 more

- 17 Oct 2022

TL;DR: This work proposes Ef-DeepQSE, an efficient version of DeepQSE that improves inference speed without affecting performance. It decomposes the query-aware snippet extraction task into two stages: a coarse-grained candidate sentence selection stage, where sentence representations can be cached, and a fine-grained relevance modeling stage.
