Rule-Based Assessment of Reinforcement Learning Practices Using Large Language Models

Content

Abstract
Authors
Shortfacts

Abstract

In the fast-evolving field of artificial intelligence, Reinforcement Learning (RL) plays a crucial role in developing agents that can make decisions. As these systems become increasingly complex, the need for standardized and automated training methods becomes apparent. This paper presents a rule-based framework that integrates Large Language Models (LLMs) and heuristic-based code detectors to ensure compliance with best practices in RL training pipelines. We define a set of architectural rules that target best practices in important areas of RL-based architectures, such as checkpoints, hyperparameter tuning, and agent configuration. We validated our approach through a large-scale industrial case study and ten open-source projects. The results show that LLM-based detectors generally outperform heuristic-based detectors, especially when handling more complex code patterns. This approach effectively identifies best practices with high precision and recall, demonstrating its practical applicability.

Top

Authors

Ntentos, Evangelos
Warnett, Stephen J.
Zdun, Uwe

Top

Shortfacts

Category	Paper in Conference Proceedings or in Workshop Proceedings (Paper)
Event Title	4th International Conference on AI Engineering – Software Engineering for AI (CAIN)
Divisions	Software Architecture
Subjects	Software Engineering
Event Location	Ottawa, Canada
Event Type	Conference
Event Dates	27 - 28 April 2025
Date	27 April 2025
Export

Top