Researchers Develop AI Model to Learn from Self-Criticism Using Reinforcement Learning

agents reinforcement-learning

2026-05-18 | Source: ArXiv | Original article

Researchers develop ICRL, a method to help AI models internalize self-critique. AI learns from mistakes with reinforcement learning.

Researchers have made a breakthrough in reinforcement learning, introducing a new approach called Internalized Critique Reinforcement Learning (ICRL). This method enables large language model-based agents to internalize self-critique, guiding them toward correct behavior even when external critique is removed. As we reported on May 17, machine learning systems have shown promise in classifying complex data, such as Ontario tribunal decisions. However, these systems often struggle to learn from their mistakes without external guidance. ICRL matters because it allows agents to "learn to learn" directly from experience, without relying on fine-tuning or weight updates. This approach has the potential to scale across environments and tasks, making it a significant advancement in the field. By internalizing self-critique, agents can develop a more disciplined and robust framework for complex reasoning trajectories. As this research continues to unfold, it will be important to watch how ICRL is applied in real-world scenarios. Will it improve the performance of large language models in tasks such as translational research, which we reported on May 14? Can ICRL help overcome the cognitive bottleneck of internal verbal critiques, particularly in high-complexity math tasks? The answers to these questions will be crucial in determining the impact of ICRL on the future of artificial intelligence and machine learning.

Sources

Back to AIPULSEN