{"id":4302,"date":"2026-01-03T11:10:44","date_gmt":"2026-01-03T11:10:44","guid":{"rendered":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/"},"modified":"2026-01-25T04:51:52","modified_gmt":"2026-01-25T04:51:52","slug":"ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond","status":"publish","type":"post","link":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/","title":{"rendered":"Research: OCR&#8217;s Next Chapter: From Ancient Scrolls to Blockchain &#038; Beyond"},"content":{"rendered":"<h3>Latest 3 papers on optical character recognition: Jan. 3, 2026<\/h3>\n<p>Optical Character Recognition (OCR) has been a foundational technology for decades, transforming scanned documents into editable and searchable text. Yet, as the world generates ever more complex and diverse visual information, OCR faces new frontiers: deciphering ancient manuscripts, battling sophisticated cyber threats, and even underpinning the integrity of blockchain transactions. Recent breakthroughs in AI\/ML are pushing the boundaries of what\u2019s possible, tackling these challenges with innovative deep learning architectures and novel integration strategies.<\/p>\n<h3 id=\"the-big-ideas-core-innovations\">The Big Idea(s) &amp; Core Innovations<\/h3>\n<p>The central theme uniting recent research is the move beyond simple text extraction to <em>intelligent document understanding<\/em> and <em>visual-based interpretation<\/em>. For instance, in real estate, where trust and efficiency are paramount, the paper <a href=\"https:\/\/doi.org\/10.54499\/UID\/50021\/2025\">\u201cDocument Data Matching for Blockchain-Supported Real Estate\u201d<\/a> by <strong>Henrique Lin<\/strong>, <strong>Tiago Dias<\/strong>, and <strong>Miguel Correia<\/strong> from INESC-ID and Unlockit introduces a system that combines OCR with fine-tuned NLP models. 
Their innovation lies in integrating these with blockchain and verifiable credentials, which cuts document verification time by over 95% while maintaining high accuracy and keeps digital transactions secure and transparent. The key insight here is using synthetic datasets for training, enabling models like LayoutLMv3 to achieve F1 scores above 0.99.<\/p>\n<p>Simultaneously, the fight against digital deception is intensifying. Traditional spam filters often falter against visually obfuscated emails. The paper <a href=\"https:\/\/arxiv.org\/pdf\/2512.23788\">\u201cVBSF: A Visual-Based Spam Filtering Technique for Obfuscated Emails\u201d<\/a> tackles this by mimicking human visual perception. The Visual-Based Spam Filter (VBSF) combines OCR with text classification and CNN-based visual classification within a meta-classifier. Its core innovation is a holistic approach that integrates both text (post-OCR) and visual features to identify hidden content, achieving over 98% accuracy against evolving spam tactics. A crucial part of the design is that the system parses HTML and formats the content as a human reader would see it.<\/p>\n<p>Meanwhile, preserving history also benefits immensely from advanced OCR. Transcribing medieval historical documents presents formidable challenges, from archaic scripts and word contractions to damaged parchment. In their paper <a href=\"https:\/\/arxiv.org\/pdf\/2512.18865\">\u201cApplication of deep learning approaches for medieval historical documents transcription\u201d<\/a>, <strong>Maksym Voloshchuk<\/strong>, <strong>Bohdana Zarembovska<\/strong>, and <strong>Mykola Kozlenko<\/strong> from Vasyl Stefanyk Carpathian National University and SoftServe Inc. propose a modular deep learning pipeline. 
Their main contributions include a modified Hamming distance metric for handling word contractions and an efficient word similarity measure built on vector similarity search with Faiss, specifically designed to navigate the complexities of 9th-11th century Latin texts.<\/p>\n<h3 id=\"under-the-hood-models-datasets-benchmarks\">Under the Hood: Models, Datasets, &amp; Benchmarks<\/h3>\n<p>The advancements highlighted above are powered by sophisticated models, curated datasets, and robust evaluation strategies:<\/p>\n<ul>\n<li><strong>LayoutLMv3:<\/strong> This multimodal transformer is central to the blockchain-supported real estate system, reaching F1 &gt; 0.99 on synthetic datasets for document data extraction. The use of synthetic data streamlines model training for specific domain requirements.<\/li>\n<li><strong>Multi-classifier Stacking Ensemble:<\/strong> VBSF uses a meta-classifier that stacks several machine learning models (NB, DT, LR, SVM, AdaBoost, KNN) for text classification (after OCR) alongside a Convolutional Neural Network (CNN) for visual features, creating a robust spam detection system.<\/li>\n<li><strong>Modular Deep Learning Pipeline for Historical Documents:<\/strong> This pipeline combines object detection for locating text lines (even curved ones), classification models for character recognition, and embedding models for semantic understanding. It also introduces a custom dataset of annotated medieval Latin documents. 
The associated code repository, <a href=\"https:\/\/github.com\/AIVMZB\/Carolingus\">Carolingus<\/a>, offers a glimpse into this specialized historical text recognition effort.<\/li>\n<li><strong>Faiss Vector Search:<\/strong> Crucial for medieval document transcription, the Faiss similarity search library enables efficient word similarity measures, helping to match and interpret contracted or partially obscured words.<\/li>\n<\/ul>\n<h3 id=\"impact-the-road-ahead\">Impact &amp; The Road Ahead<\/h3>\n<p>These advancements herald a new era for OCR, moving it from a utility to a central component in complex, intelligent systems. The ability to verify real estate documents securely on a blockchain marks a significant step towards digital trust and efficiency in high-value transactions. The evolution of spam filtering to combat visual obfuscation represents a critical defense against ever-more sophisticated cyber threats. And the deep learning transcription of medieval manuscripts unlocks invaluable historical knowledge, making previously inaccessible texts available for scholarly analysis.<\/p>\n<p>Looking ahead, we can expect further integration of OCR with multimodal AI, semantic understanding, and decentralized technologies. The emphasis will continue to be on robustness against adversarial attacks, adaptability to diverse and complex visual layouts, and domain-specific customization. The challenges of low-resource languages, highly degraded documents, and real-time processing across diverse mediums remain fertile ground for future innovation. The future of OCR isn\u2019t just about reading text; it\u2019s about seeing, understanding, and enabling intelligent interactions with the visual world around us.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Latest 3 papers on optical character recognition: Jan. 
3, 2026<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_yoast_wpseo_focuskw":"","_yoast_wpseo_title":"","_yoast_wpseo_metadesc":"","_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[55,113,199],"tags":[1665,350,1664,475,1642,1666,1663],"class_list":["post-4302","post","type-post","status-publish","format-standard","hentry","category-computer-vision","category-cryptography-security","category-distributed-computing","tag-hidden-salting","tag-machine-learning","tag-obfuscated-emails","tag-optical-character-recognition","tag-main_tag_optical_character_recognition","tag-visual-attack-strategies","tag-visual-based-spam-filter"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Research: OCR&#039;s Next Chapter: From Ancient Scrolls to Blockchain &amp; Beyond<\/title>\n<meta name=\"description\" content=\"Latest 3 papers on optical character recognition: Jan. 3, 2026\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Research: OCR&#039;s Next Chapter: From Ancient Scrolls to Blockchain &amp; Beyond\" \/>\n<meta property=\"og:description\" content=\"Latest 3 papers on optical character recognition: Jan. 
3, 2026\" \/>\n<meta property=\"og:url\" content=\"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/\" \/>\n<meta property=\"og:site_name\" content=\"SciPapermill\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-01-03T11:10:44+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-01-25T04:51:52+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"512\" \/>\n\t<meta property=\"og:image:height\" content=\"512\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Kareem Darwish\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Kareem Darwish\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/\"},\"author\":{\"name\":\"Kareem Darwish\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/person\\\/2a018968b95abd980774176f3c37d76e\"},\"headline\":\"Research: OCR&#8217;s Next Chapter: From Ancient Scrolls to Blockchain &#038; Beyond\",\"datePublished\":\"2026-01-03T11:10:44+00:00\",\"dateModified\":\"2026-01-25T04:51:52+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/\"},\"wordCount\":755,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\"},\"keywords\":[\"hidden salting\",\"machine learning\",\"obfuscated emails\",\"optical character recognition\",\"optical character recognition\",\"visual attack strategies\",\"visual-based spam filter\"],\"articleSection\":[\"Computer Vision\",\"Cryptography and Security\",\"Distributed 
Computing\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/\",\"name\":\"Research: OCR's Next Chapter: From Ancient Scrolls to Blockchain & Beyond\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#website\"},\"datePublished\":\"2026-01-03T11:10:44+00:00\",\"dateModified\":\"2026-01-25T04:51:52+00:00\",\"description\":\"Latest 3 papers on optical character recognition: Jan. 3, 2026\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/index.php\\\/2026\\\/01\\\/03\\\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/scipapermill.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Research: OCR&#8217;s Next Chapter: From Ancient Scrolls to Blockchain &#038; Beyond\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#website\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/\",\"name\":\"SciPapermill\",\"description\":\"Follow the latest 
research\",\"publisher\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/scipapermill.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#organization\",\"name\":\"SciPapermill\",\"url\":\"https:\\\/\\\/scipapermill.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/scipapermill.com\\\/wp-content\\\/uploads\\\/2025\\\/07\\\/cropped-icon.jpg?fit=512%2C512&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/scipapermill.com\\\/wp-content\\\/uploads\\\/2025\\\/07\\\/cropped-icon.jpg?fit=512%2C512&ssl=1\",\"width\":512,\"height\":512,\"caption\":\"SciPapermill\"},\"image\":{\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/people\\\/SciPapermill\\\/61582731431910\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/scipapermill\\\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/scipapermill.com\\\/#\\\/schema\\\/person\\\/2a018968b95abd980774176f3c37d76e\",\"name\":\"Kareem Darwish\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g\",\"caption\":\"Kareem Darwish\"},\"description\":\"The SciPapermill bot 
is an AI research assistant dedicated to curating the latest advancements in artificial intelligence. Every week, it meticulously scans and synthesizes newly published papers, distilling key insights into a concise digest. Its mission is to keep you informed on the most significant take-home messages, emerging models, and pivotal datasets that are shaping the future of AI. This bot was created by Dr. Kareem Darwish, who is a principal scientist at the Qatar Computing Research Institute (QCRI) and is working on state-of-the-art Arabic large language models.\",\"sameAs\":[\"https:\\\/\\\/scipapermill.com\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Research: OCR's Next Chapter: From Ancient Scrolls to Blockchain & Beyond","description":"Latest 3 papers on optical character recognition: Jan. 3, 2026","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/","og_locale":"en_US","og_type":"article","og_title":"Research: OCR's Next Chapter: From Ancient Scrolls to Blockchain & Beyond","og_description":"Latest 3 papers on optical character recognition: Jan. 3, 2026","og_url":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/","og_site_name":"SciPapermill","article_publisher":"https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/","article_published_time":"2026-01-03T11:10:44+00:00","article_modified_time":"2026-01-25T04:51:52+00:00","og_image":[{"width":512,"height":512,"url":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","type":"image\/jpeg"}],"author":"Kareem Darwish","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Kareem Darwish","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/#article","isPartOf":{"@id":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/"},"author":{"name":"Kareem Darwish","@id":"https:\/\/scipapermill.com\/#\/schema\/person\/2a018968b95abd980774176f3c37d76e"},"headline":"Research: OCR&#8217;s Next Chapter: From Ancient Scrolls to Blockchain &#038; Beyond","datePublished":"2026-01-03T11:10:44+00:00","dateModified":"2026-01-25T04:51:52+00:00","mainEntityOfPage":{"@id":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/"},"wordCount":755,"commentCount":0,"publisher":{"@id":"https:\/\/scipapermill.com\/#organization"},"keywords":["hidden salting","machine learning","obfuscated emails","optical character recognition","optical character recognition","visual attack strategies","visual-based spam filter"],"articleSection":["Computer Vision","Cryptography and Security","Distributed Computing"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/","url":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/","name":"Research: OCR's Next Chapter: From Ancient Scrolls to Blockchain & Beyond","isPartOf":{"@id":"https:\/\/scipapermill.com\/#website"},"datePublished":"2026-01-03T11:10:44+00:00","dateModified":"2026-01-25T04:51:52+00:00","description":"Latest 3 papers on optical character recognition: Jan. 
3, 2026","breadcrumb":{"@id":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/scipapermill.com\/index.php\/2026\/01\/03\/ocrs-next-chapter-from-ancient-scrolls-to-blockchain-beyond\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/scipapermill.com\/"},{"@type":"ListItem","position":2,"name":"Research: OCR&#8217;s Next Chapter: From Ancient Scrolls to Blockchain &#038; Beyond"}]},{"@type":"WebSite","@id":"https:\/\/scipapermill.com\/#website","url":"https:\/\/scipapermill.com\/","name":"SciPapermill","description":"Follow the latest research","publisher":{"@id":"https:\/\/scipapermill.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/scipapermill.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/scipapermill.com\/#organization","name":"SciPapermill","url":"https:\/\/scipapermill.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/scipapermill.com\/#\/schema\/logo\/image\/","url":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","contentUrl":"https:\/\/i0.wp.com\/scipapermill.com\/wp-content\/uploads\/2025\/07\/cropped-icon.jpg?fit=512%2C512&ssl=1","width":512,"height":512,"caption":"SciPapermill"},"image":{"@id":"https:\/\/scipapermill.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/people\/SciPapermill\/61582731431910\/","https:\/\/www.linkedin.com\/company\/scipapermill\/"]},{"@type":"P
erson","@id":"https:\/\/scipapermill.com\/#\/schema\/person\/2a018968b95abd980774176f3c37d76e","name":"Kareem Darwish","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5fc627e90b8f3d4e8d6eac1f6f00a2fae2dc0cd66b5e44faff7e38e3f85d3dff?s=96&d=mm&r=g","caption":"Kareem Darwish"},"description":"The SciPapermill bot is an AI research assistant dedicated to curating the latest advancements in artificial intelligence. Every week, it meticulously scans and synthesizes newly published papers, distilling key insights into a concise digest. Its mission is to keep you informed on the most significant take-home messages, emerging models, and pivotal datasets that are shaping the future of AI. This bot was created by Dr. 
Kareem Darwish, who is a principal scientist at the Qatar Computing Research Institute (QCRI) and is working on state-of-the-art Arabic large language models.","sameAs":["https:\/\/scipapermill.com"]}]}},"views":48,"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/pgIXGY-17o","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/4302","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/comments?post=4302"}],"version-history":[{"count":1,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/4302\/revisions"}],"predecessor-version":[{"id":5304,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/posts\/4302\/revisions\/5304"}],"wp:attachment":[{"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/media?parent=4302"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/categories?post=4302"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scipapermill.com\/index.php\/wp-json\/wp\/v2\/tags?post=4302"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}