Concepedia

Publication | Open Access

Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection

33

Citations

0

References

2024

Year

Abstract

Jun Yan, Vikas Yadav, Shiyang Li, Lichang Chen, Zheng Tang, Hai Wang, Vijay Srinivasan, Xiang Ren, Hongxia Jin. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). 2024.