Supernetwork-based efficient mapping of deep learning applications to mixed-precision hardware using model adaptation

Hadjer Benmeziane; Corey Lammie; Irem Boybat; Malte Rasch; Manuel Le Gallo; Athanasios Vasilopoulos; Hsinyu Tsai; Geoffrey W. Burr; Vijay Narayanan; Kaoutar El Maghraoui; Abu Sebastian

doi:10.1038/s41467-026-71071-1

AI 硬件与加速器突破级有讲解视频

策展与解读：DAST AI · 收录方法与内容透明度

发表时间: 2026-03-27
DOI: 10.1038/s41467-026-71071-1

收录解读

随着异构模拟-数字加速器逐渐成熟，真正的瓶颈不再只是单个硬件单元效率，而是如何把神经网络各层合理映射到不同精度、不同噪声特性的硬件上，同时兼顾能效和精度。这个问题如果靠逐模型、逐层手工搜索，成本极高，也很难形成稳定部署流程。

这篇论文提出 Mixed-Precision Supernetwork，把量化层和受模拟噪声影响的层统一放到一个 supernetwork 里联合训练，并结合 mapping-aware adaptation 与 hardware-aware architecture search，动态优化层级映射与模型适配。它的重点不是单纯混合精度，而是把模型适配和硬件映射合并为同一个搜索对象。

对 AI 硬件设计主线来说，这种方法有明显的 workflow 价值：它把 analog-digital heterogeneous accelerator 上的部署问题，从一次性工程调参提升为可复用的 supernetwork 搜索与适配框架。只要后续硬件族继续扩展，这类统一映射方法就会越来越重要。

它暂时还不是更高等级，因为论文主要展示的是一条很强的 mapping/search 路线，而不是对整个 AI 加速器设计空间的全面重写。换句话说，它已经是优秀的 reusable co-design 方法，但还没有达到更广的 architecture blueprint 层面。

原始摘要与中文对照

中文对照翻译

人工智能应用的迅速普及需要可扩展的解决方案，以在实际约束下高效运行。结合专用模拟和数字单元的异构加速器提供了局部化、节能的神经网络计算。然而，在这些平台上实现最佳性能需要通过优化的神经网络层映射来平衡能效和模型精度。为此，我们引入了Mixed-Precision Supernetwork，这是一个统一的框架，用于训练将量化层与模拟噪声敏感层无缝集成的混合精度超网络。Mixed-Precision Supernetwork整合了一种映射感知自适应策略，通过硬件感知架构搜索动态优化层分配，同时优化神经网络。这一双重创新使Mixed-Precision Supernetwork成为在异构加速器上高效部署深度学习模型的开创性方法。平均而言，Mixed-Precision Supernetwork生成的映射速度提高了约2.2倍，模型精度比纯模拟方法提高了约3.4%，同时通过将高达80%的模型权重映射到模拟硬件并保持全精度，提高了能效。

原始摘要

The rapid proliferation of Artificial Intelligence applications necessitates scalable solutions that perform efficiently under real-world constraints. Heterogeneous accelerators combining specialized analog and digital units offer localized, energy-efficient neural network computations. However, achieving optimal performance on these platforms requires balancing energy efficiency and model accuracy through optimized neural network layer mapping. To this end, we introduce Mixed-Precision Supernetwork, a unified framework for training mixed-precision supernetworks that seamlessly integrate quantized layers with analog noise-sensitive layers. Mixed-Precision Supernetwork incorporates a mapping-aware adaptation strategy, dynamically optimizing layer assignments while refining the neural network via hardware-aware architecture search. This dual innovation establishes Mixed-Precision Supernetwork as a groundbreaking approach for deploying deep learning models efficiently on heterogeneous accelerators. On average, Mixed-Precision Supernetwork produces mappings ~ 2.2 × faster and achieves a ~ 3.4% increase in model accuracy over a fully analog approach, while improving energy-efficiency by mapping up to 80% of the model’s weights to analog hardware while maintaining full-precision accuracy.

解读视频

视频观看页 B 站 YouTube

链接

论文链接

收录解读

原始摘要与中文对照

中文对照翻译

原始摘要

解读视频

相关论文

链接