Information Theoretic Guarantees For Policy Alignment In Large Language Models

Youssef Mroueh, Apoorva Nitsure. Information Theoretic Guarantees For Policy Alignment In Large Language Models. Trans. Mach. Learn. Res., 2025, 2025. [doi]

Authors

Youssef Mroueh

This author has not been identified. Look up 'Youssef Mroueh' in Google

Apoorva Nitsure

This author has not been identified. Look up 'Apoorva Nitsure' in Google