Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs With Minimal Human Interventions - researchr publication

researchr

You are not signed in
Sign in
Sign up

Jingxin Xu, Guoshun Nan, Sheng Guan, Sicong Leng, Yilian Liu, Zixiao Wang, YuYang Ma, Zhili Zhou, Yanzhao Hou, Xiaofeng Tao. Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs With Minimal Human Interventions. IEEE Transactions on Information Forensics and Security, 21:1409-1423, 2026. [doi]

Abstract is missing.

runs on WebDSL