Defending LLMs against jailbreak attacks through representation offset detection

Shuo Liu, Xiang Cheng, ZhenZhong Zheng, Sen Su. Defending LLMs against jailbreak attacks through representation offset detection. Inf. Process. Manage., 63(5):104662, 2026.

No references recorded for this publication.

No citations of this publication recorded.
