Deep Neural Networks: From Security and Robustness to Interpretability and Efficiency
Latest 28 papers on deep neural networks: Jun. 27, 2026
Deep Neural Networks (DNNs) continue to push the boundaries of AI, powering everything from medical diagnosis to complex control systems. Yet, as their capabilities grow, so do the challenges surrounding their security, robustness, efficiency, and interpretability. Recent research offers exciting breakthroughs, tackling these multifaceted issues head-on. This digest dives into a collection of cutting-edge papers that are redefining our understanding and application of DNNs.
The Big Idea(s) & Core Innovations
The fundamental problems addressed by these papers span securing DNNs against sophisticated attacks, enhancing their reliability and interpretability, and optimizing their performance for real-world deployment. Researchers are innovating across the spectrum, from theoretical underpinnings to practical defense mechanisms and compression techniques.
For instance, the burgeoning field of AI security is seeing novel approaches like SpikeTimer: Exploring Active Copyright Protection in Spiking Neural Networks via Temporal Backdoor Regularization from Shanghai Jiao Tong University. This groundbreaking work introduces the first active copyright protection framework for Spiking Neural Networks (SNNs), leveraging temporal backdoor learning to embed authorization tokens. This means SNNs only fully activate with valid, time-bound tokens, effectively preventing unauthorized use – a significant leap beyond passive watermarking.
On the adversarial front, University of New South Wales researchers in their paper Tensor-Based Batch Fuzzing with Adaptive Perturbation Scaling for Deep Neural Networks revolutionize DNN testing. They introduce a batch fuzzing framework that parallelizes multiple specification instances, boosting throughput by up to 40x and finding 4x more violations. This is crucial for uncovering vulnerabilities efficiently. Complementing this, the dissertation Investigating The Security of Modern AI and Cloud Infrastructure by Andrew Adiletta uncovers critical vulnerabilities in AI and cloud systems, demonstrating cache side-channel attacks on LLMs and Rowhammer-based authentication bypasses. Furthermore, National Yang Ming Chiao Tung University tackles a stealthier threat with Quantization as a Malicious Task: Removing Quantization-Conditioned Backdoors via Task Arithmetic. They propose QVec, a parameter-space defense that reinterprets quantization-induced behavioral shifts as structured malicious task vectors, effectively neutralizing backdoors activated only after quantization.
Challenging long-held assumptions, Westlake University and Nankai University in Is Oracle Pruning the True Oracle? – A Sanity-Check of Neural Network Pruning with Retraining deliver a counterintuitive finding: pre-retraining performance is negligibly correlated with post-retraining performance in modern DNNs, debunking a 35-year-old oracle pruning foundation. This implies the retraining stage is far more critical than previously thought when designing pruning criteria. However, for practical compression, National School of Artificial Intelligence (ENSIA) presents Hierarchical Reinforcement Learning for Neural Network Compression (HiReLC): Pruning and Quantization, a framework achieving 6-7x compression ratios while maintaining accuracy, even gaining accuracy for certain CLIP models. Similarly, University of Science, Ho Chi Minh city in Hybrid Compression: Integrating Pruning and Quantization for Optimized Neural Networks achieves 10x FLOPs and parameter reduction with negligible accuracy drop by combining pruning, quantization, and Mixture of Experts.
Interpretability and robustness are also key themes. University of Luxembourg presents HEM: a margin-based loss for visual categorisation tasks, a novel loss function outperforming cross-entropy and specialized losses across five major vision tasks including adversarial robustness and unknown class rejection. For medical applications, Bar-Ilan University introduces Deep Neural Networks with Ordinal Loss for Medical Applications, a cost-sensitive ordinal cross-entropy (OCE) loss that explicitly models clinical risk and outshines existing ordinal methods for tasks like diabetic retinopathy grading. And for crucial explainability in mental health, MIT Media Lab with Expresso-AI: Explainable Video-Based Deep Learning Models for Depression Diagnosis uses DeepLift to correlate model attributions with facial expressions, revealing associations like nose wrinkling with severe depression.
From a theoretical standpoint, Purdue University and NYU Abu Dhabi in Theory of the Frequency Principle for General Deep Neural Networks rigorously prove the universal low-to-high frequency learning phenomenon (F-Principle) across all training stages and architectures. And for the elusive phenomenon of “grokking,” University of Potsdam offers Noise-Driven Escape from Metastable Phases explains Grokking in Deep Neural Networks, explaining it as noise-activated escape from metastable states, with escape times governed by Arrhenius kinetics. This offers a practical handle on controlling grokking delays.
Under the Hood: Models, Datasets, & Benchmarks
These innovations rely on, and in turn contribute to, a rich ecosystem of models, datasets, and benchmarks:
- SpikeTimer leverages temporal token embedding for SNNs, evaluated on the N-MNIST dataset. Code available: https://github.com/youngshallyx/SpikeTimer-Exploring-Active-Copyright-Protection-in-Spiking-Neural-Networks
- The Role of Input Dimensionality in the Emergence and Targeted Control of Adversarial Examples introduces hierarchical image datasets (e.g., ResynthDB) to study adversarial vulnerability across resolutions, using architectures like ResNets and ViTs. Code available: https://github.com/YaserGholizade/image augmenter
- Geometric and Information Compression of Representations in Deep Learning analyzes mutual information and neural collapse across 8 architectures and 5 datasets. Code available: https://github.com/link-er/information_geometric_compression
- HiReLC (for compression) utilizes a mixed PPO/A2C ensemble of agents, tested on DeiT-Small, CLIP ViT-B/32, ResNet18, and MobileNetV2 across CIFAR-10/100 and Tiny ImageNet.
- Deep Neural Networks with Ordinal Loss for Medical Applications introduces Ordinal Cross-Entropy (OCE) and validates it on DenseNet121, InceptionV3, and VGG19 for diabetic retinopathy grading using the APTOS 2019 Blindness Detection dataset. Code available: https://github.com/Taldvora1/Deep-Neural-Networks-with-Ordinal-Loss-for-Medical-Applications
- Expresso-AI fine-tunes R(2+1)D and R3D architectures on facial videos from the AVEC 2014 Depression Recognition dataset, using DeepLift for explanations. Code available: https://github.com/felmoreno1726/Expresso-AI
- Tensor-Based Batch Fuzzing implements its framework in PyTorch, evaluating on six networks and 400+ specifications across TrafficSigns, Cifar100, and TinyImageNet. Code available: https://github.com/SVF-tools/ACT
- Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization proposes CSAM, evaluated on CIFAR-10/100, ImageNet-1K, and their corruption benchmarks. Code available: https://drive.google.com/drive/folders/1O6up8Q7sdqekErGPmetIuMfEhsPZo-Hc?usp=sharing
- Sublinearly Structured Deep Neural Networks analyzes popular CNNs like AlexNet, VGGNet, ResNet, and GoogLeNet on datasets such as MNIST and CelebA.
- Efficient Network Inference via Hardware-Aware Architecture Search, Model Pruning & Quantization leverages MCUNet as a baseline, using zero-shot NAS for GNSS interference monitoring on iMXRT1062 MCU, Raspberry Pi Zero 2W, and Raspberry Pi 5.
- Hybrid Compression tests its multi-stage method on VGG16, ResNet18, InceptionV3, and DenseNet121 using CIFAR-10 and BloodMNIST, facilitated by the NNI framework. Code available: https://github.com/microsoft/nni
- Bypassing Minimization Bias (SIVE) is validated on toy models and deep neural networks, with code available: https://github.com/yingjiacai/SIVE
- SPINE (Fault Injection Profiler) evaluates QNNs on an ARM Cortex-M3 core using the SAT-6 aerial image dataset.
- GUMP-Net for medical image segmentation uses CTPelvic1K and proprietary datasets, building on YOLO and MedSAM frameworks. Code references: https://docs.ultralytics.com/, https://github.com/bowang-lab/MedSAM
- Analysing drivers and interdependencies in European electricity markets using XAI uses DNNs with SHAP/SSHAP on ENTSO-E Transparency Platform data.
- Neural Additive and Basis Models with Feature Selection and Interactions incorporates entmax into NAM/NBM, tested on high-dimensional tabular datasets. Code available: https://github.com/shiralab/NAM-FS
- Query-Efficient Video Adversarial Attack with Stylized Logo on Service Computing evaluates Stylized Logo Attack (SLA) on UCF-101, HMDB-51, Kinetics-400, and Kinetics-700.
Impact & The Road Ahead
These advancements herald a new era for deep neural networks, one where reliability, security, and efficiency are no longer afterthoughts but integral design principles. The ability to actively protect SNNs from piracy (SpikeTimer) and defend against sophisticated quantization-conditioned backdoors (Quantization as a Malicious Task) will bolster the trustworthiness of AI systems, especially in neuromorphic and edge computing. The dramatic improvements in DNN fuzzing (Tensor-Based Batch Fuzzing) and the insights into the fundamental fragility of cloud AI (Investigating The Security of Modern AI and Cloud Infrastructure) will lead to more robust testing and development of secure deployment practices.
The re-evaluation of oracle pruning (Is Oracle Pruning the True Oracle?) pushes us to rethink how we approach model compression, moving towards methods like hierarchical RL-based pruning (HiReLC) and hybrid compression strategies (Hybrid Compression) that account for the post-retraining performance. This is crucial for deploying performant, compact models on resource-constrained embedded systems, as exemplified by the work on GNSS interference monitoring (Efficient Network Inference).
The drive for interpretability, from medical diagnosis (Deep Neural Networks with Ordinal Loss) to mental health assessment (Expresso-AI) and even understanding energy markets (Analysing drivers and interdependencies), will enable broader adoption of AI in sensitive domains where trust and explainability are paramount. The theoretical breakthroughs clarifying the Frequency Principle (Theory of the Frequency Principle) and the physics behind grokking (Noise-Driven Escape from Metastable Phases) provide essential foundational knowledge, guiding the design of more efficient and predictable training regimes. Furthermore, understanding the expressivity of transformers through formal grammars (An expressivity analysis of hierarchical modelling) deepens our comprehension of their powerful linguistic capabilities. Collectively, these papers highlight a future where deep neural networks are not only powerful but also trustworthy, transparent, and universally accessible.
Share this content:
Discover more from SciPapermill
Subscribe to get the latest posts sent to your email.
Post Comment