{"id":1320,"date":"2025-09-29T07:49:50","date_gmt":"2025-09-29T07:49:50","guid":{"rendered":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/"},"modified":"2025-12-28T22:06:10","modified_gmt":"2025-12-28T22:06:10","slug":"formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems","status":"publish","type":"post","link":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/","title":{"rendered":"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems"},"content":{"rendered":"<h3>Latest 50 papers on formal verification: Sep. 29, 2025<\/h3>\n<p>The relentless march of AI and ML innovation brings unprecedented capabilities, but also complex challenges, particularly concerning reliability, safety, and trustworthiness. In critical domains\u2014from autonomous vehicles and medical devices to cybersecurity and smart contracts\u2014even a minor flaw can have catastrophic consequences. This makes formal verification, a set of techniques for mathematically proving the correctness of systems, more vital than ever. Recent research highlights a burgeoning field where cutting-edge AI meets rigorous formal methods, promising a future of verifiable and robust AI-powered systems. This post delves into recent breakthroughs that are bridging this critical gap.<\/p>\n<h3 id=\"the-big-ideas-core-innovations\">The Big Idea(s) &amp; Core Innovations<\/h3>\n<p>At the heart of recent advancements is the idea of deeply integrating formal verification into the AI\/ML lifecycle, from initial design to runtime operations. A recurring theme is the leverage of Large Language Models (LLMs) to automate and streamline traditionally manual and labor-intensive formal methods. For instance, the <strong>VeriSafe Agent<\/strong>, presented by Jungjae Lee and colleagues from <em>KAIST<\/em>, introduces a novel system for mobile GUI agents that translates natural language user instructions into formally verifiable specifications. This <code>autoformalization<\/code> enables pre-action verification, achieving up to 98.33% accuracy in detecting erroneous actions, a significant leap over purely LFM-based methods, as highlighted in their paper, <a href=\"https:\/\/arxiv.org\/pdf\/2503.18492\">\u201cVeriSafe Agent: Safeguarding Mobile GUI Agent via Logic-based Action Verification\u201d<\/a>.<\/p>\n<p>Similarly, the <a href=\"https:\/\/arxiv.org\/pdf\/2509.09291\">\u201cWhat You Code Is What We Prove: Translating BLE App Logic into Formal Models with LLMs for Vulnerability Detection\u201d<\/a> paper demonstrates how LLMs can translate Bluetooth Low Energy (BLE) application logic into formal models, significantly improving automated vulnerability detection. This is complemented by <strong>Preguss<\/strong>, a framework from <em>Zhejiang University<\/em> researchers led by Zhongyi Wang, as detailed in <a href=\"https:\/\/arxiv.org\/pdf\/2508.14532\">\u201cPreguss: It Analyzes, It Specifies, It Verifies\u201d<\/a>. Preguss uses LLMs to automate the generation and refinement of formal specifications for large-scale software, synergizing static analysis with deductive verification by breaking down programs into manageable units. In the realm of hardware, <em>UC Irvine<\/em> researchers in <a href=\"https:\/\/arxiv.org\/pdf\/2509.06239\">\u201cProof2Silicon: Prompt Repair for Verified Code and Hardware Generation via Reinforcement Learning\u201d<\/a> present <strong>Proof2Silicon<\/strong>, a reinforcement learning framework that uses prompt repair to generate formally verified code and hardware. This approach effectively bridges LLMs with formal specifications and reactive synthesis, promising high-quality, trustworthy outputs.<\/p>\n<p>Another significant thrust involves applying formal methods to complex, dynamic systems. For example, <em>Carnegie Mellon University<\/em> researchers in <a href=\"https:\/\/arxiv.org\/pdf\/2409.00536\">\u201cFormal Verification and Control with Conformal Prediction\u201d<\/a> explore <code>conformal prediction<\/code> to quantify uncertainty and ensure safety in learning-enabled autonomous systems (LEASs), offering a lightweight, data-driven alternative to traditional model-based approaches. In robotic systems, <a href=\"https:\/\/arxiv.org\/pdf\/2508.18820\">\u201cAS2FM: Enabling Statistical Model Checking of ROS 2 Systems for Robust Autonomy\u201d<\/a> introduces a framework for statistical model checking of ROS 2 systems to enhance autonomy and reliability through probabilistic models. Meanwhile, <em>Bitdefender and INRIA<\/em> researchers, in <a href=\"https:\/\/arxiv.org\/pdf\/2509.13035\">\u201cBridging Threat Models and Detections: Formal Verification via CADP\u201d<\/a>, leverage <code>attack trees<\/code> and a novel language (GTDL) with the CADP toolset to formally verify cybersecurity detection rules, identifying crucial gaps automatically. Even the core Bitcoin consensus protocol is being scrutinized, with <em>University of Bologna<\/em> and <em>Inria<\/em> researchers in <a href=\"https:\/\/arxiv.org\/pdf\/2508.19452\">\u201cFormal Modeling and Verification of the Algorand Consensus Protocol in CADP\u201d<\/a> demonstrating its vulnerabilities to adversarial conditions through formal analysis.<\/p>\n<h3 id=\"under-the-hood-models-datasets-benchmarks\">Under the Hood: Models, Datasets, &amp; Benchmarks<\/h3>\n<p>These innovations are often enabled by specialized tools, models, and datasets:<\/p>\n<ul>\n<li><strong>VeriSafe Agent<\/strong>: Integrates a <code>Domain-Specific Language (DSL)<\/code> and <code>Developer Library<\/code> for mobile environments to translate user instructions and UI actions into logical formulas. <a href=\"https:\/\/github.com\/VeriSafeAgent\/VeriSafeAgent\">Code available<\/a> and [https:\/\/github.com\/VeriSafeAgent\/VeriSafeAgent_Library].<\/li>\n<li><strong>AD-VF<\/strong>: Leverages LLMs for <code>automatic differentiation<\/code> to facilitate fine-tuning-free robot planning, directly incorporating formal methods feedback without extensive model tuning. <a href=\"https:\/\/arxiv.org\/pdf\/2509.18384\">Paper<\/a><\/li>\n<li><strong>Online Data-Driven Reachability Analysis<\/strong>: Employs a <code>set-based Exponentially Forgetting Zonotopic Recursive Least Squares (EF-ZRLS)<\/code> method to estimate time-varying models and compute over-approximated reachable sets directly from noisy data. <a href=\"https:\/\/github.com\/TUM-CPS-HN\/ZRLS\">Code available<\/a>.<\/li>\n<li><strong>CASP Dataset<\/strong>: A novel dataset of C code paired with formal specifications in <code>ACSL<\/code>, specifically designed to evaluate LLMs\u2019 ability to generate formally verified code. <a href=\"https:\/\/huggingface.co\/datasets\/nicher92\/CASP_dataset\">Dataset available<\/a> and <a href=\"https:\/\/huggingface.co\/datasets\/nicher92\/CASP_source_files\">source files<\/a>.<\/li>\n<li><strong>APOLLO<\/strong>: Integrates <code>LLMs<\/code> (including general-purpose and specialized provers) with <code>Lean compiler capabilities<\/code> for automated theorem proving, setting new benchmarks on the <code>miniF2F<\/code> benchmark. <a href=\"https:\/\/github.com\/aziksh-ospanov\/APOLLO\">Code available<\/a>.<\/li>\n<li><strong>Lean4Lean<\/strong>: An external typechecker for the <code>Lean theorem prover<\/code> implemented in Lean itself, used to verify properties of Lean\u2019s kernel and metatheory. <a href=\"https:\/\/github.com\/digama0\/lean4lean\">Code available<\/a>.<\/li>\n<li><strong>PYVERITAS<\/strong>: Utilizes <code>LLM-based transpilation<\/code> to convert Python code into C, followed by <code>bounded model checking (CBMC)<\/code> and <code>MaxSAT-based fault localization (CFAULTS)<\/code> for formal verification. <a href=\"https:\/\/github.com\/pyveritas\/pyveritas\">Code available<\/a>.<\/li>\n<li><strong>TrustGeoGen<\/strong>: A formal language-verified data generation engine that produces multimodal geometric data with <code>trustworthy reasoning guarantees<\/code>, introducing \u2018Connection Thinking\u2019 and a synthetic dataset that outperforms existing benchmarks. <a href=\"https:\/\/github.com\/Alpha\/TrustGeoGen\">Code available<\/a>.<\/li>\n<li><strong>Geoint-R1<\/strong>: A multimodal reasoning framework for geometric problems that dynamically constructs auxiliary elements and provides formal verification. It introduces the <code>Geoint benchmark<\/code> with annotated geometry problems. <a href=\"https:\/\/arxiv.org\/pdf\/2508.03173\">Paper<\/a>.<\/li>\n<li><strong>Hornet Node and Hornet DSL<\/strong>: A minimal, executable specification for Bitcoin consensus rules, offering a clean and modular alternative to traditional implementations. <a href=\"https:\/\/hornetnode.org\/paper.html\">Paper and code<\/a>.<\/li>\n<li><strong>e-boost<\/strong>: Combines <code>adaptive heuristics<\/code> with <code>exact solving<\/code> for efficient E-graph extraction in logic synthesis, achieving significant area improvements. <a href=\"https:\/\/github.com\/Yu-Maryland\/e-boost\">Code available<\/a>.<\/li>\n<li><strong>Formal Verification of Physical Layer Security Protocols<\/strong>: Introduces a framework based on a <code>generic message theory<\/code> and a web interface for sound animation of security protocols. <a href=\"https:\/\/github.com\/RandallYe\/Animation_of\">Code available<\/a>.<\/li>\n<li><strong>RLSR<\/strong>: Large language models improve themselves using <code>self-judging<\/code> without ground truth labels, leveraging the asymmetry between generating and verifying solutions. <a href=\"https:\/\/github.com\/Jiayi-Pan\/TinyZero\">Code available<\/a>.<\/li>\n<\/ul>\n<h3 id=\"impact-the-road-ahead\">Impact &amp; The Road Ahead<\/h3>\n<p>These breakthroughs promise to revolutionize how we build, deploy, and trust AI systems. The ability to automatically generate formal specifications, verify complex codebases, and ensure the safety of autonomous agents will unlock new levels of reliability and trustworthiness. For the AI\/ML community, this means safer autonomous vehicles, more secure smart contracts, dependable medical devices, and robust cyber-physical systems. The integration of LLMs with formal methods is particularly exciting, showing that AI can not only create but also critically evaluate its own creations, bridging the gap between probabilistic learning and deterministic guarantees. This synergy points towards a future where AI systems are not just powerful, but provably correct and inherently trustworthy, driving innovation in safety-critical domains and beyond. The journey has just begun, and the road ahead is paved with exciting challenges and transformative potential for verifiable AI.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Latest 50 papers on formal verification: Sep. 29, 2025<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_yoast_wpseo_focuskw":"","_yoast_wpseo_title":"","_yoast_wpseo_metadesc":"","_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[56,419,163],"tags":[696,148,78,1611,74,740],"class_list":["post-1320","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","category-logic-in-computer-science","category-software-engineering","tag-autoformalization","tag-formal-verification","tag-large-language-models-llms","tag-main_tag_formal_verification","tag-reinforcement-learning","tag-software-correctness"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems<\/title>\n<meta name=\"description\" content=\"Latest 50 papers on formal verification: Sep. 29, 2025\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems\" \/>\n<meta property=\"og:description\" content=\"Latest 50 papers on formal verification: Sep. 29, 2025\" \/>\n<meta property=\"og:url\" content=\"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/\" \/>\n<meta property=\"og:site_name\" content=\"SciPapermill\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-09-29T07:49:50+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-12-28T22:06:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"512\" \/>\n\t<meta property=\"og:image:height\" content=\"512\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Kareem Darwish\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Kareem Darwish\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/\"},\"author\":{\"name\":\"Kareem Darwish\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/person\\\/2a018968b95abd980774176f3c37d76e\"},\"headline\":\"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems\",\"datePublished\":\"2025-09-29T07:49:50+00:00\",\"dateModified\":\"2025-12-28T22:06:10+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/\"},\"wordCount\":989,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\"},\"keywords\":[\"autoformalization\",\"formal verification\",\"large language models (llms)\",\"main_tag_formal_verification\",\"reinforcement learning\",\"software correctness\"],\"articleSection\":[\"Artificial Intelligence\",\"Logic in Computer Science\",\"Software Engineering\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/\",\"name\":\"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#website\"},\"datePublished\":\"2025-09-29T07:49:50+00:00\",\"dateModified\":\"2025-12-28T22:06:10+00:00\",\"description\":\"Latest 50 papers on formal verification: Sep. 29, 2025\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/09\\\/29\\\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/scipapermill.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#website\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/\",\"name\":\"SciPapermill\",\"description\":\"Follow the latest research\",\"publisher\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/scipapermill.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\",\"name\":\"SciPapermill\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/scipapermill.com\\\/wp-content\\\/uploads\\\/2025\\\/07\\\/cropped-icon.jpg?fit=512%2C512&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/scipapermill.com\\\/wp-content\\\/uploads\\\/2025\\\/07\\\/cropped-icon.jpg?fit=512%2C512&ssl=1\",\"width\":512,\"height\":512,\"caption\":\"SciPapermill\"},\"image\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/people\\\/SciPapermill\\\/61582731431910\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/scipapermill\\\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/person\\\/2a018968b95abd980774176f3c37d76e\",\"name\":\"Kareem Darwish\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"caption\":\"Kareem Darwish\"},\"description\":\"The SciPapermill bot is an AI research assistant dedicated to curating the latest advancements in artificial intelligence. Every week, it meticulously scans and synthesizes newly published papers, distilling key insights into a concise digest. Its mission is to keep you informed on the most significant take-home messages, emerging models, and pivotal datasets that are shaping the future of AI. This bot was created by Dr. Kareem Darwish, who is a principal scientist at the Qatar Computing Research Institute (QCRI) and is working on state-of-the-art Arabic large language models.\",\"sameAs\":[\"https:\\\/\\\/scipapermill.com\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems","description":"Latest 50 papers on formal verification: Sep. 29, 2025","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/","og_locale":"en_US","og_type":"article","og_title":"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems","og_description":"Latest 50 papers on formal verification: Sep. 29, 2025","og_url":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/","og_site_name":"SciPapermill","article_publisher":"https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/","article_published_time":"2025-09-29T07:49:50+00:00","article_modified_time":"2025-12-28T22:06:10+00:00","og_image":[{"width":512,"height":512,"url":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","type":"image\/jpeg"}],"author":"Kareem Darwish","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Kareem Darwish","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/#article","isPartOf":{"@id":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/"},"author":{"name":"Kareem Darwish","@id":"https:\/\/scipapermill.com\/#\/schema\/person\/2a018968b95abd980774176f3c37d76e"},"headline":"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems","datePublished":"2025-09-29T07:49:50+00:00","dateModified":"2025-12-28T22:06:10+00:00","mainEntityOfPage":{"@id":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/"},"wordCount":989,"commentCount":0,"publisher":{"@id":"https:\/\/scipapermill.com\/#organization"},"keywords":["autoformalization","formal verification","large language models (llms)","main_tag_formal_verification","reinforcement learning","software correctness"],"articleSection":["Artificial Intelligence","Logic in Computer Science","Software Engineering"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/","url":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/","name":"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems","isPartOf":{"@id":"https:\/\/scipapermill.com\/#website"},"datePublished":"2025-09-29T07:49:50+00:00","dateModified":"2025-12-28T22:06:10+00:00","description":"Latest 50 papers on formal verification: Sep. 29, 2025","breadcrumb":{"@id":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/scipapermill.com\/index.php\/2025\/09\/29\/formal-verification-in-the-age-of-ai-ensuring-trustworthiness-from-code-to-cyber-physical-systems\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/scipapermill.com\/"},{"@type":"ListItem","position":2,"name":"Formal Verification in the Age of AI: Ensuring Trustworthiness from Code to Cyber-Physical Systems"}]},{"@type":"WebSite","@id":"https:\/\/scipapermill.com\/#website","url":"https:\/\/scipapermill.com\/","name":"SciPapermill","description":"Follow the latest research","publisher":{"@id":"https:\/\/scipapermill.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/scipapermill.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/scipapermill.com\/#organization","name":"SciPapermill","url":"https:\/\/scipapermill.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/scipapermill.com\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"SciPapermill"},"image":{"@id":"https:\/\/scipapermill.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/","https:\/\/www.linkedin.com\/company\/scipapermill\/"]},{"@type":"Person","@id":"https:\/\/scipapermill.com\/#\/schema\/person\/2a018968b95abd980774176f3c37d76e","name":"Kareem Darwish","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","caption":"Kareem Darwish"},"description":"The SciPapermill bot is an AI research assistant dedicated to curating the latest advancements in artificial intelligence. Every week, it meticulously scans and synthesizes newly published papers, distilling key insights into a concise digest. Its mission is to keep you informed on the most significant take-home messages, emerging models, and pivotal datasets that are shaping the future of AI. This bot was created by Dr. Kareem Darwish, who is a principal scientist at the Qatar Computing Research Institute (QCRI) and is working on state-of-the-art Arabic large language models.","sameAs":["https:\/\/scipapermill.com"]}]}},"views":101,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/pgIXGY-li","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/1320","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/comments?post=1320"}],"version-history":[{"count":1,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/1320\/revisions"}],"predecessor-version":[{"id":3730,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/1320\/revisions\/3730"}],"wp:attachment":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/media?parent=1320"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/categories?post=1320"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/tags?post=1320"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}