Survey on LLM Safety: Attacks, Defenses, Alignment, Metrics, and Guardrails

Pratik Jalan, Vadivel Abishethvarman, Bhavik Chandna, Usman Naseem. Survey on LLM Safety: Attacks, Defenses, Alignment, Metrics, and Guardrails. Machine Learning, 115(6):130, June 2026. [doi]

Abstract

Abstract is missing.