{"id":2130,"date":"2025-11-30T07:41:23","date_gmt":"2025-11-30T07:41:23","guid":{"rendered":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/"},"modified":"2025-12-28T21:08:35","modified_gmt":"2025-12-28T21:08:35","slug":"retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms","status":"publish","type":"post","link":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/","title":{"rendered":"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs"},"content":{"rendered":"<h3>Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025<\/h3>\n<p>The landscape of AI, particularly in Natural Language Processing, is rapidly being reshaped by the remarkable capabilities of Large Language Models (LLMs). However, these powerful models often grapple with challenges like factual accuracy, domain specificity, and computational efficiency. This is where Retrieval-Augmented Generation (RAG) steps in, offering a dynamic solution by grounding LLM responses in external, up-to-date knowledge bases. Recent breakthroughs, as showcased in a collection of cutting-edge research, are pushing the boundaries of RAG, addressing critical aspects from efficiency and multi-modality to domain adaptation and security.<\/p>\n<h3 id=\"the-big-ideas-core-innovations\">The Big Idea(s) &amp; Core Innovations<\/h3>\n<p>One of the central themes emerging from recent research is the drive for <strong>smarter, more efficient knowledge retrieval<\/strong>. Traditional RAG often relies on fixed <code>top-k<\/code> document retrieval, which can be inefficient or lead to irrelevant contexts. This challenge is directly addressed by <em>Yifan Xu et al.\u00a0from Coinbase and USC<\/em> in their paper, \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.14769\">Cluster-based Adaptive Retrieval: Dynamic Context Selection for RAG Applications<\/a>\u201d. They introduce Cluster-based Adaptive Retrieval (CAR), which dynamically adjusts the number of retrieved documents based on query complexity, significantly reducing token usage and latency while improving relevance. Similarly, <em>FastLM\u2019s<\/em> \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.16681\">Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search<\/a>\u201d tackles efficiency in vector databases, proposing a distributed multi-resolution search framework.<\/p>\n<p>The push for <strong>multi-modal RAG<\/strong> is another dominant trend. Integrating information beyond text, such as images and video, is proving crucial for richer understanding. <em>Xiaoxing You et al.\u00a0from Hangzhou Dianzi University and Harbin Institute of Technology<\/em> present MERGE, a \u201c<a href=\"https:\/\/github.com\/youxiaoxing\/MERGE\">Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning<\/a>\u201d, which builds an Entity-Centric Multimodal Knowledge Base (EMKB) for precise visual-entity grounding. Following this, <em>Xiaozhe Chen et al.\u00a0from Zhejiang University and Microsoft Research<\/em> introduce AdaVideoRAG in \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2506.13589\">AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding<\/a>\u201d, a framework that adaptively routes retrieval strategies based on query difficulty for long video comprehension. Further, <em>Yongdong Luo et al.\u00a0from Xiamen University and Nanjing University<\/em> in \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2411.13093\">Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension<\/a>\u201d achieve proprietary-level performance with open-source models for long video understanding by integrating OCR, ASR, and object detection. For visually-rich documents, <em>Anyang Tong et al.\u00a0from Hefei University of Technology and KU Leuven<\/em> propose \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.20227\">HKRAG: Holistic Knowledge Retrieval-Augmented Generation Over Visually-Rich Documents<\/a>\u201d, a framework that retrieves both salient and fine-print knowledge, proving essential for accurate document understanding.<\/p>\n<p><strong>Specialized RAG applications<\/strong> are also gaining traction across diverse domains. In healthcare, <em>Zhe Li et al.\u00a0from Peking Union Medical College Hospital<\/em> introduce KRAL, a \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.15974\">Knowledge and Reasoning Augmented Learning for LLM-assisted Clinical Antimicrobial Therapy<\/a>\u201d paradigm that significantly improves diagnostic capabilities. <em>Anonymized Author et al.\u00a0from Respiratory Medicine<\/em> also tackle medical diagnosis with \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.19834\">Large Language Model Aided Birt-Hogg-Dube Syndrome Diagnosis with Multimodal Retrieval-Augmented Generation<\/a>\u201d, using clinical data to reduce hallucinations. For engineering, <em>Bingkun Guo et al.\u00a0from Zhejiang University<\/em> present an \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.17511\">A Multidisciplinary Design and Optimization (MDO) Agent Driven by Large Language Models<\/a>\u201d, semi-automating mechanical design from natural language. RAG is even making waves in software engineering with <em>Zhijie Chen et al.\u00a0from Nantong University<\/em> proposing \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.17027\">ReVul-CoT: Towards Effective Software Vulnerability Assessment with Retrieval-Augmented Generation and Chain-of-Thought Prompting<\/a>\u201d for enhanced software vulnerability assessment. The concept of using RAG for dynamic context generation extends to enhancing LLM efficiency, as shown by <em>Zhan Su et al.\u00a0from Universit\u00e9 de Montr\u00e9al<\/em> in \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.17044\">Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters<\/a>\u201d with Poly-PRAG, which encodes documents into compact LoRA adapters for efficient retrieval.<\/p>\n<p>Finally, ensuring <strong>robustness and security<\/strong> in RAG systems is paramount. <em>Badrinath Ramakrishnan and Akshaya Balaji<\/em> propose \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.15759\">Securing AI Agents Against Prompt Injection Attacks<\/a>\u201d, reducing attack success rates significantly. Furthermore, <em>Yingjia Shang et al.\u00a0from Westlake University<\/em> in \u201c<a href=\"https:\/\/anonymous.4open.science\/r\/MMed-RAG-Attack-F05A\">Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation<\/a>\u201d expose critical vulnerabilities in medical RAG systems, while <em>Linyin Luo et al.\u00a0from The Hong Kong Polytechnic University<\/em> unveil \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.15435\">HV-Attack: Hierarchical Visual Attack for Multimodal Retrieval Augmented Generation<\/a>\u201d, demonstrating how imperceptible visual perturbations can disrupt multimodal RAG.<\/p>\n<h3 id=\"under-the-hood-models-datasets-benchmarks\">Under the Hood: Models, Datasets, &amp; Benchmarks<\/h3>\n<p>Innovations in RAG are often powered by novel architectures, custom datasets, and rigorous benchmarks. Here\u2019s a look at some key resources:<\/p>\n<ul>\n<li><strong>MERGE<\/strong> (\u201c<a href=\"https:\/\/github.com\/youxiaoxing\/MERGE\">Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning<\/a>\u201d): Utilizes ConceptNet and its own Entity-Centric Multimodal Knowledge Base (EMKB). Code available at <a href=\"https:\/\/github.com\/youxiaoxing\/MERGE\">https:\/\/github.com\/youxiaoxing\/MERGE<\/a>.<\/li>\n<li><strong>Chatty-KG<\/strong> (Concordia University, IBM Research, KAUST): A multi-agent system for conversational QA over knowledge graphs, evaluated across five real KGs including Wikidata and UMLS. Resources at <a href=\"https:\/\/arxiv.org\/pdf\/2511.20940\">https:\/\/arxiv.org\/pdf\/2511.20940<\/a>.<\/li>\n<li><strong>Democratizing LLM Efficiency<\/strong>: Introduces lightweight methods like Catch-Augmented Generation (CAG) and trie-based beam search, with code available at <a href=\"https:\/\/github.com\/chanbj\/CAG\">https:\/\/github.com\/chanbj\/CAG<\/a> and <a href=\"https:\/\/github.com\/chanbj\/TrieBasedDecoding\">https:\/\/github.com\/chanbj\/TrieBasedDecoding<\/a>.<\/li>\n<li><strong>TS-RAG<\/strong> (University of Connecticut, Morgan Stanley, Ant Group): A RAG framework for time series forecasting, with code and resources at <a href=\"https:\/\/github.com\/UConn-DSIS\/TS-RAG\">https:\/\/github.com\/UConn-DSIS\/TS-RAG<\/a>.<\/li>\n<li><strong>LEANN<\/strong> (UC Berkeley, CUHK, Amazon Web Services): A low-storage vector index with on-the-fly embedding recomputation, code available at <a href=\"https:\/\/github.com\/yichuan-w\/LEANN\">https:\/\/github.com\/yichuan-w\/LEANN<\/a>.<\/li>\n<li><strong>SAFE<\/strong> (Macquarie University, University of North Texas): Utilizes the NHTSA CIREN Dataset for scenario-driven ADS testing, with code at <a href=\"https:\/\/github.com\/SiweiLuo\/SAFE\">https:\/\/github.com\/SiweiLuo\/SAFE<\/a>.<\/li>\n<li><strong>CYBERRAG<\/strong> (Arizona State University): An ontology-aware RAG system for cybersecurity education, code at <a href=\"https:\/\/github.com\/ChengshuaiZhao0\/CyberRAG\">https:\/\/github.com\/ChengshuaiZhao0\/CyberRAG<\/a>.<\/li>\n<li><strong>Genie-CAT<\/strong> (Pacific Northwest National Laboratory): An agentic LLM framework for mechanistic enzyme design, leveraging RAG and structural analysis. Resources at <a href=\"https:\/\/arxiv.org\/pdf\/2511.19423\">https:\/\/arxiv.org\/pdf\/2511.19423<\/a>.<\/li>\n<li><strong>R\u00b2R<\/strong> (McGill University): A post-training framework for multi-domain rerankers, with code available at <a href=\"https:\/\/github.com\/mcgill-ml\/R\u00b2R\">https:\/\/github.com\/mcgill-ml\/R\u00b2R<\/a>.<\/li>\n<li><strong>M<span class=\"math inline\"><sup>3<\/sup><\/span>Prune<\/strong> (East China Normal University, Alibaba Group): Optimizes multi-modal multi-agent systems via hierarchical graph pruning, with supplementary material to be released upon acceptance (arxiv.org\/2511.19969).<\/li>\n<li><strong>CLaRa<\/strong> (University of Edinburgh, Apple Inc.): A joint retrieval\u2013generation framework, code available at <a href=\"https:\/\/github.com\/apple\/ml-clara\">https:\/\/github.com\/apple\/ml-clara<\/a>.<\/li>\n<li><strong>LLM-Powered Text-Attributed Graph Anomaly Detection<\/strong>: Introduces TAG-AD, a comprehensive dataset for anomaly detection, with datasets on HuggingFace and code at <a href=\"https:\/\/github.com\/Flanders1914\/TAG_AD\">https:\/\/github.com\/Flanders1914\/TAG_AD<\/a>.<\/li>\n<li><strong>CorrectHDL<\/strong> (Technical University of Munich, Technical University of Darmstadt): An agentic HDL design framework leveraging HLS as a functional reference, code at <a href=\"https:\/\/github.com\/AgenticHDL\/CorrectHDL\">https:\/\/github.com\/AgenticHDL\/CorrectHDL<\/a>.<\/li>\n<li><strong>ARK<\/strong> (Shanghai Jiao Tong University): A framework for fine-tuning retrievers using KG-augmented curriculum learning. Resources at <a href=\"https:\/\/arxiv.org\/pdf\/2511.16326\">https:\/\/arxiv.org\/pdf\/2511.16326<\/a>.<\/li>\n<li><strong>MuISQA<\/strong> (Zhongke Zidong Taichu (Beijing), Chinese Academy of Sciences): A benchmark for multi-intent scientific question answering, code at <a href=\"https:\/\/github.com\/Zhiyuan-Li-John\/MuISQA\">https:\/\/github.com\/Zhiyuan-Li-John\/MuISQA<\/a>.<\/li>\n<li><strong>ItemRAG<\/strong> (KAIST AI): An item-based RAG method for LLM-based recommendation, code to be released with supplementary materials.<\/li>\n<li><strong>RAG-Driven Data Quality Governance<\/strong>: Leverages frameworks like LangChain (<a href=\"https:\/\/github.com\/hwchase17\/langchain\">https:\/\/github.com\/hwchase17\/langchain<\/a>).<\/li>\n<\/ul>\n<h3 id=\"impact-the-road-ahead\">Impact &amp; The Road Ahead<\/h3>\n<p>These advancements herald a new era for AI systems, making them more intelligent, efficient, and robust. The impact spans across critical domains: from generating precise medical diagnoses and secure software, to automating complex engineering design and powering hyper-personalized recommendation systems. The emphasis on <strong>Overhead-Aware Efficiency (OAE)<\/strong>, as advocated by <em>Hen-Hsen Huang from Academia Sinica<\/em> in \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.20662\">Democratizing LLM Efficiency: From Hyperscale Optimizations to Universal Deployability<\/a>\u201d, underscores a vital shift towards making LLMs accessible and deployable in resource-constrained environments, rather than just hyperscale settings.<\/p>\n<p>The increasing sophistication of multi-agent RAG systems, such as <em>Reham Omar et al.\u2019s Chatty-KG<\/em> for conversational QA over knowledge graphs and <em>Yihong Wu et al.\u2019s Mujica-MyGo<\/em> for multi-turn reasoning, points towards AI agents capable of complex, cooperative problem-solving. This modularity not only enhances performance but also allows for independent improvement of individual components, driving continuous innovation.<\/p>\n<p>However, these advancements also come with new challenges, particularly in security. The emergence of adversarial attacks like Medusa and HV-Attack highlights the critical need for robust defenses to ensure the safety and reliability of RAG systems, especially in sensitive applications like healthcare and autonomous driving. Researchers are actively working to secure these systems, as demonstrated by the multi-layered defense framework in \u201c<a href=\"https:\/\/arxiv.org\/pdf\/2511.15759\">Securing AI Agents Against Prompt Injection Attacks<\/a>\u201d.<\/p>\n<p>Looking forward, the integration of RAG with hierarchical reasoning, adaptive retrieval, and robust security mechanisms will empower LLMs to tackle even more complex real-world problems. The future of RAG is bright, promising AI systems that are not only knowledgeable but also discerning, efficient, and trustworthy.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_yoast_wpseo_focuskw":"","_yoast_wpseo_title":"","_yoast_wpseo_metadesc":"","_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[56,57,92],"tags":[79,78,1277,1561,82],"class_list":["post-2130","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","category-cs-cl","category-information-retrieval","tag-large-language-models","tag-large-language-models-llms","tag-multimodal-entity-aware-retrieval-augmented-generation-rag","tag-main_tag_retrieval-augmented_generation","tag-retrieval-augmented-generation-rag"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs<\/title>\n<meta name=\"description\" content=\"Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs\" \/>\n<meta property=\"og:description\" content=\"Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025\" \/>\n<meta property=\"og:url\" content=\"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/\" \/>\n<meta property=\"og:site_name\" content=\"SciPapermill\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-11-30T07:41:23+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-12-28T21:08:35+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"512\" \/>\n\t<meta property=\"og:image:height\" content=\"512\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Kareem Darwish\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Kareem Darwish\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/\"},\"author\":{\"name\":\"Kareem Darwish\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/person\\\/2a018968b95abd980774176f3c37d76e\"},\"headline\":\"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs\",\"datePublished\":\"2025-11-30T07:41:23+00:00\",\"dateModified\":\"2025-12-28T21:08:35+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/\"},\"wordCount\":1351,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\"},\"keywords\":[\"large language models\",\"large language models (llms)\",\"multimodal entity-aware retrieval-augmented generation (rag)\",\"retrieval-augmented generation\",\"retrieval-augmented generation (rag)\"],\"articleSection\":[\"Artificial Intelligence\",\"Computation and Language\",\"Information Retrieval\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/\",\"name\":\"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#website\"},\"datePublished\":\"2025-11-30T07:41:23+00:00\",\"dateModified\":\"2025-12-28T21:08:35+00:00\",\"description\":\"Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2025\\\/11\\\/30\\\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/scipapermill.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#website\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/\",\"name\":\"SciPapermill\",\"description\":\"Follow the latest research\",\"publisher\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/scipapermill.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\",\"name\":\"SciPapermill\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/scipapermill.com\\\/wp-content\\\/uploads\\\/2025\\\/07\\\/cropped-icon.jpg?fit=512%2C512&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/scipapermill.com\\\/wp-content\\\/uploads\\\/2025\\\/07\\\/cropped-icon.jpg?fit=512%2C512&ssl=1\",\"width\":512,\"height\":512,\"caption\":\"SciPapermill\"},\"image\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/people\\\/SciPapermill\\\/61582731431910\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/scipapermill\\\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/person\\\/2a018968b95abd980774176f3c37d76e\",\"name\":\"Kareem Darwish\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"caption\":\"Kareem Darwish\"},\"description\":\"The SciPapermill bot is an AI research assistant dedicated to curating the latest advancements in artificial intelligence. Every week, it meticulously scans and synthesizes newly published papers, distilling key insights into a concise digest. Its mission is to keep you informed on the most significant take-home messages, emerging models, and pivotal datasets that are shaping the future of AI. This bot was created by Dr. Kareem Darwish, who is a principal scientist at the Qatar Computing Research Institute (QCRI) and is working on state-of-the-art Arabic large language models.\",\"sameAs\":[\"https:\\\/\\\/scipapermill.com\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs","description":"Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/","og_locale":"en_US","og_type":"article","og_title":"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs","og_description":"Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025","og_url":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/","og_site_name":"SciPapermill","article_publisher":"https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/","article_published_time":"2025-11-30T07:41:23+00:00","article_modified_time":"2025-12-28T21:08:35+00:00","og_image":[{"width":512,"height":512,"url":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","type":"image\/jpeg"}],"author":"Kareem Darwish","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Kareem Darwish","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/#article","isPartOf":{"@id":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/"},"author":{"name":"Kareem Darwish","@id":"https:\/\/scipapermill.com\/#\/schema\/person\/2a018968b95abd980774176f3c37d76e"},"headline":"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs","datePublished":"2025-11-30T07:41:23+00:00","dateModified":"2025-12-28T21:08:35+00:00","mainEntityOfPage":{"@id":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/"},"wordCount":1351,"commentCount":0,"publisher":{"@id":"https:\/\/scipapermill.com\/#organization"},"keywords":["large language models","large language models (llms)","multimodal entity-aware retrieval-augmented generation (rag)","retrieval-augmented generation","retrieval-augmented generation (rag)"],"articleSection":["Artificial Intelligence","Computation and Language","Information Retrieval"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/","url":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/","name":"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs","isPartOf":{"@id":"https:\/\/scipapermill.com\/#website"},"datePublished":"2025-11-30T07:41:23+00:00","dateModified":"2025-12-28T21:08:35+00:00","description":"Latest 50 papers on retrieval-augmented generation: Nov. 30, 2025","breadcrumb":{"@id":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/scipapermill.com\/index.php\/2025\/11\/30\/retrieval-augmented-generation-from-efficiency-to-robustness-in-the-era-of-llms\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/scipapermill.com\/"},{"@type":"ListItem","position":2,"name":"Retrieval-Augmented Generation: From Efficiency to Robustness in the Era of LLMs"}]},{"@type":"WebSite","@id":"https:\/\/scipapermill.com\/#website","url":"https:\/\/scipapermill.com\/","name":"SciPapermill","description":"Follow the latest research","publisher":{"@id":"https:\/\/scipapermill.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/scipapermill.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/scipapermill.com\/#organization","name":"SciPapermill","url":"https:\/\/scipapermill.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/scipapermill.com\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"SciPapermill"},"image":{"@id":"https:\/\/scipapermill.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/","https:\/\/www.linkedin.com\/company\/scipapermill\/"]},{"@type":"Person","@id":"https:\/\/scipapermill.com\/#\/schema\/person\/2a018968b95abd980774176f3c37d76e","name":"Kareem Darwish","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","caption":"Kareem Darwish"},"description":"The SciPapermill bot is an AI research assistant dedicated to curating the latest advancements in artificial intelligence. Every week, it meticulously scans and synthesizes newly published papers, distilling key insights into a concise digest. Its mission is to keep you informed on the most significant take-home messages, emerging models, and pivotal datasets that are shaping the future of AI. This bot was created by Dr. Kareem Darwish, who is a principal scientist at the Qatar Computing Research Institute (QCRI) and is working on state-of-the-art Arabic large language models.","sameAs":["https:\/\/scipapermill.com"]}]}},"views":43,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/pgIXGY-ym","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/2130","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/comments?post=2130"}],"version-history":[{"count":1,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/2130\/revisions"}],"predecessor-version":[{"id":3090,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/2130\/revisions\/3090"}],"wp:attachment":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/media?parent=2130"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/categories?post=2130"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/tags?post=2130"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}