LLM-GridEval: Measuring the Evaluation Validity Gap in Smart Grid Security with Adaptive LLM Attackers

Chenglong Fu, Shirin Besati, Miao Wang. LLM-GridEval: Measuring the Evaluation Validity Gap in Smart Grid Security with Adaptive LLM Attackers. In Proceedings of the 2026 ACM Sustainability Week, ACM Sustainability Week 2026, Banff, Alberta, Canada, June 22-25, 2026. Pages 100-109, ACM, 2026. doi:10.1145/3765611.3815146

@inproceedings{FuBW26,
  title = {LLM-GridEval: Measuring the Evaluation Validity Gap in Smart Grid Security with Adaptive LLM Attackers},
  author = {Chenglong Fu and Shirin Besati and Miao Wang},
  year = {2026},
  doi = {10.1145/3765611.3815146},
  url = {https://doi.org/10.1145/3765611.3815146},
  researchr = {https://researchr.org/publication/FuBW26},
  cites = {0},
  citedby = {0},
  pages = {100--109},
  booktitle = {Proceedings of the 2026 ACM Sustainability Week, ACM Sustainability Week 2026, Banff, Alberta, Canada, June 22-25, 2026},
  publisher = {ACM},
  isbn = {979-8-4007-2199-1},
}