Neurosymbolic C++ Verification using Rocq
Lennart Beringer | Jasper Haag | Rodolphe Lepigre | Gregory Malecha | Rayan Soban | Ehtesham Zahoor
SkyLabs AI*
|
ABSTRACT Neurosymbolic capabilities have revolutionized the entire software development process, from requirements engineering and code development to testing and maintenence. In this talk, we will present our work integrating agentic techniques into the Brick C++ verification platform. Our approach combines algorithmic reasoning techniques, e.g. concurrent separation logic with sophisticated automation, proof search strategies, etc., with higher-level insights and probabilistic inference provided by LLMs. Our experience demonstrates that combining these technologies enables scalable reasoning for real-world programs. The Formal Foundation The AI Layer The Rocq Agent Toolkit To facilitate the development of agents in Rocq, we developed the Rocq
While our current use focuses on verifications using Brick, the Rocq Agent Toolkit can be used Our presentation will include experimental results concerning the functional capabilities of agents built with these paradigms on Brick verification tasks drawn from real-world systems code in C++. * { lennart.beringer | jasper | rodolphe.lepigre | gregory | rayan | ehteseham.zahoor}@skylabs-ai.com
|
|
BIOS Dr. Lennart Beringer is a Senior Formal Methods Scientist at Skylabs AI working on the integration of agentic capabilities with conventional formal verification technology. His areas of expertise span interactive theorem proving, program logics, compiler correctness, proof-carrying code, and applied security. Prior to joining Skylabs AI in 2024, he was a co-developer of the Verified Software Toolchain at Princeton University and Associate Director of the NSF Expedition "The Science of Deep Specification". Dr. Gregory Malecha is the Director of Formal Methods at SkyLabs AI where he has been leading the integration of state-of-the-art formal verification techniques and artificial intelligence. He has been building and using tools for formal verification of modern software focusing on low-level, concurrent code and mainstream languages. Using this technology, his team and collaborators have carried out verification on sophisticated systems components ranging from hypervisors to virtual machine monitors and demonstrated how these proofs can be combined into end-to-end verified systems. Leveraging the capabilities of generative AI, Dr. Malecha is now working to accelerate the verification process and to put verification technology into the hands of everyday engineers and agents. |