Conference Paper (published)
Details
Citation
Even-Mendoza K, Brownlee A, Geiger A, Hanna C, Petke J, Sarro F & Sobania D (2025) LLM-Guided Genetic Improvement: Envisioning Semantic Aware Automated Software Evolution. In: IEEE/ACM International Conference on Automated Software engineering, Seoul, 16.11.2025-20.11.2025.
Abstract
Genetic Improvement (GI) of software automatically creates alternative software versions which are improved according to certain properties of interests (e.g., running-time). Search-based GI excels at navigating large program spaces, but operates primarily at the syntactic level. In contrast, Large Language Models (LLMs) offer semantic-aware edits, yet lack goal-directed feedback and control (which is instead a strength of GI). As such, we propose the investigation of a new research line on AI-powered GI aimed at incorporating semantic aware search. We take a first step at it by augmenting GI with the use of automated clustering of LLM edits. We provide initial empirical evidence that our proposal, dubbed PatchCat, allows us to automatically and effectively categorize LLM-suggested patches. PatchCat identified 18 different types of software patches and categorized newly suggested patches with high accuracy. It also enabled detecting NoOp edits in advance and, prospectively, to skip test suite execution to save resources in many cases. These results, coupled with the fact that PatchCat works with small, local LLMs, are a promising step toward interpretable, efficient, and green GI. We outline a rich agenda of future work and call for the community to join our vision of building a principled understanding of LLM-driven mutations, guiding the GI search process with semantic signals.
Keywords
Large language models; Genetic improvement
Status | Accepted |
---|---|
Conference | IEEE/ACM International Conference on Automated Software engineering |
Conference location | Seoul |
Dates |
People (1)
Associate Professor, Computing Science and Mathematics - Division