没有合适的资源?快使用搜索试试~ 我知道了~
AIChip_Paper_List
共208个文件
md:180个
png:23个
gif:2个
需积分: 10 1 下载量 199 浏览量
2021-04-14
12:05:08
上传
评论
收藏 2.56MB ZIP 举报
温馨提示
AI芯片论文列表 目录 关于这个项目 该项目旨在帮助工程师,研究人员和学生轻松找到并学习与AI相关领域的良好思想和设计,例如在顶级架构会议上提出的AI / ML / DL加速器,芯片和系统( ISCA,MICRO,ASPLOS,HPCA)。 这个项目是由上海交通大学的高级计算机架构实验室(ACA Lab)与Biren Research合作发起的。 来自其他来源的文章正在添加。 如果您有任何意见或愿意提供帮助,请告诉我们。 标签清单 出于指导和搜索目的,将标记和/或注释分配给所有这些论文。 我们将使用以下标签来注释这些论文。 按时间顺序列出论文 我们列出了所有与AI相关的文章。 文章/幻灯片/注释的链接在每篇文章的标题下提供(如果有)。 更新正在进行中 伊斯卡 2020年 标签 -- 标题 作者 隶属关系 推理; SIMD 与服务器级CPU集成到x86 SoC中的高性能深度学习协处理器
资源推荐
资源详情
资源评论
收起资源包目录
AIChip_Paper_List (208个子文件)
8bd612a37e0268f0c39212b13afd5277.gif 66KB
35a2980b211f2f2788724ee39a08071b.gif 43KB
README.md 91KB
A Configurable Cloud-Scale DNN Processor for Real-Time AI.md 5KB
SnaPEA Predictive Early Activation for Reducing Computation in Deep Convolutional Neural Networks.md 5KB
Towards Pervasive and User Satisfactory CNN across GPU Microarchitectures.md 5KB
Euphrates Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision.md 5KB
Energy Efficient Architecture for Graph Analytics Accelerators.md 4KB
Towards Memory Friendly Long-Short Term Memory Networks(LSTMs) on Mobile GPUs.md 4KB
FA3C FPGA-Accelerated Deep Reinforcement Learning.md 4KB
UCNN Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition.md 4KB
Accelerating Markov Random Field Inference Using Molecular Optical Gibbs Sampling Units.md 4KB
GANAX A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks.md 4KB
Cambricon An Instruction Set Architecture for Neural Networks.md 4KB
Minerva Enabling Low-Power, Highly-Accurate Deep Neural Network Accelerators .md 4KB
Think Fast A Tensor Streaming Processor (TSP) for Accelerating Deep Learning Workloads.md 4KB
Astra Exploiting Predictability to Optimize Deep Learning.md 4KB
In-Datacenter Performance Analysis of a Tensor Processing Unit.md 4KB
Deep Learning Acceleration with Neuron-to-Memory.md 4KB
Shortcut Mining Exploiting Cross-layer Shortcut Reuse in DCNN Accelerators.md 4KB
Capuchin_Tensor-based GPU Memory Management for Deep Learning.md 4KB
Neurocube A Programmable Digital Neuromorphic Architecture with High-Density 3D Memory.md 3KB
DeepRecSys_A System for Optimizing End-to-End At-Scale Neural Recommendation Inference.md 3KB
SC-DCNN Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing.md 3KB
vDNN Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design.md 3KB
MaxNVM_Maximizing DNN Storage Density and Inference Efficiency with Sparse Encoding and Error Mitigation.md 3KB
Morph Flexible Acceleration for 3D CNN-based Video Understanding.md 3KB
SCALEDEEP A Scalable Compute Architecture for Learning and Evaluating Deep Networks.md 3KB
Communication Lower Bound in Convolution Accelerators.md 3KB
TensorDIMM A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning.md 3KB
DeftNN_Addressing Bottlenecks for DNN Execution on GPUs via Synapse Vector Elimination and Near-compute Data Fission.md 3KB
Cnvlutin Ineffectual-Neuron-Free Deep Neural Network Computing .md 3KB
Processing-in-Memory for Energy-efficient Neural Network Training A Heterogeneous Approach.md 3KB
DeepSigns An End-to-End Watermarking Framework for Ownership Protection of Deep Neural Networks.md 3KB
PUMA A Programmable Ultra-efcient Memristor-based Accelerator for Machine Learning Inference.md 3KB
Scalpel Customizing DNN Pruning to the Underlying Hardware Parallelism.md 3KB
Echo Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training.md 3KB
Bit-Tactical A SoftwareHardware Approach to Exploiting Value and Bit Sparsity in Neural Networks.md 3KB
HyPar_Towards Hybrid Parallelism for Deep Learning Accelerator Array.md 3KB
Alleviating Irregularity in Graph Analytics Acceleration-a Hardwar-Software Co-Design Approach.md 3KB
Fused-Layer CNN Accelerators.md 3KB
Prediction based Execution on Deep Neural Networks.md 3KB
DaDianNao A Machine-Learning Supercomputer.md 3KB
ShapeShifter Enabling Fine-Grain Data Width Adaptation in Deep Learning.md 3KB
FPSA_FPSA A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture.md 3KB
Compressing DMA Engine.md 3KB
HyGCN A GCN Accelerator with Hybrid Architecture.md 3KB
A Multi-Neural Network Acceleration Architecture.md 3KB
Prague High-Performance Heterogeneity-Aware Asynchronous Decentralized Training.md 3KB
RANA Towards Efficient Neural Acceleration with Refresh-Optimized Embedded DRAM .md 3KB
Maximizing CNN Accelerator Efficiency Through Resource Partitioning.md 3KB
CirCNN_Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices.md 3KB
ShiDianNao Shifting Vision Processing Closer to the Sensor.md 3KB
High-Performance Deep-Learning Coprocessor Integrated into x86 SoC with Server-Class CPUs.md 2KB
EDEN Enabling Energy-Efficient, High-Performance Deep Neural Network Inference Using Approximate DRAM.md 2KB
Diffy_a Deja vu-Free Differential Deep Neural Network Accelerator.md 2KB
From High-Level Deep Neural Models to FPGAs.md 2KB
Techniques for Reducing the Connected-Standby Energy Consumption of Mobile Devices.md 2KB
Bit Fusion-Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Network.md 2KB
PM3 Power Modeling and Power Management for Processing-in-Memory-HPCA'18-LX.md 2KB
VIBNN Hardware Acceleration of Bayesian Neural Networks.md 2KB
E-RNN_Design Optimization for Efficient Recurrent Neural Networks in FPGAs.md 2KB
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler.md 2KB
FlexFlow A Flexible Dataflow Accelerator Architecture for Convolutional Neural Networks.md 2KB
DianNao A Small-Footprint High-Throughput Accelerator for Ubiquitous Machine-Learning.md 2KB
ExTensor An Accelerator for Sparse Tensor Algebra.md 2KB
ZCOMP Reducing DNN Cross-Layer Memory Footprint Using Vector Extensions.md 2KB
NEUTRAMS-Neural Network Transformation and Co-design under Neuromorphic Hardware-Constraints-MICRO'16-LX.md 2KB
A Deep Reinforcement Learning Framework for Architectural Exploration_ A Routerless NoC Case Study.md 2KB
TABLA A Unified Template-based Framework for Accelerating Statistical Machine Learning.md 2KB
Towards Efficient Microarchitectural Design for Accelerating Unsupervised GAN-Based Deep Learning.md 2KB
RedEye Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision.md 2KB
GeneSys Enabling Continuous Learning through Neural Network Evolution in Hardware.md 2KB
Scale-Out Acceleration for Machine Learning.md 2KB
GraphR Accelerating Graph Processing Using ReRAM-LX.md 2KB
PuDianNao A Polyvalent Machine Learning Accelerator .md 2KB
EVA2_Exploiting Temporal Redundancy in Live Computer Vision.md 2KB
PuDianNao A Polyvalent Machine Learning Accelerator.md 2KB
DRQ Dynamic Region-based Quantization for Deep Neural Network Acceleration.md 2KB
Kelp QoS for Accelerated Machine Learning Systems.md 2KB
Gist-Efficient Data Encoding for Deep Neural Network Training.md 2KB
eCNN A Block-Based and Highly-Parallel CNN Accelerator for Edge Inference.md 2KB
eAP_ A Scalable and Efficient In-Memory Accelerator for Automata Processing.md 2KB
Computation Reuse in DNNs by Exploiting Input Similarity.md 2KB
PERMDNN Efficient Compressed DNN Architecture with Permuted Diagonal Matrices.md 2KB
SparTen A Sparse Tensor Accelerator for Convolutional Neural Networks.md 2KB
The dark side of DNN pruning.md 2KB
The Accelerator Wall Limits of Chip Specialization.md 2KB
TANGRAM Optimized Coarse-Grained Dataflow for Scalable NN Accelerators.md 2KB
Bit Prudent In-Cache Acceleration of Deep Convolutional Neural Networks.md 2KB
Wire-Aware Architecture and Dataflow for CNN Accelerators.md 2KB
MnnFast A Fast and Scalable System Architecture for Memory-Augmented Neural Network.md 2KB
FlexTensor An Automatic Schedule Exploration and Optimization Framework for Tensor Computation on Heterogeneous System.md 2KB
EFLOPS Algorithm and System Co-design for a High Performance Distributed Training Platform.md 2KB
A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron Superconducting Technology.md 2KB
SCOPE A Stochastic Computing Engine for DRAM-based In-situ Accelerator.md 2KB
Split-CNN Splitting Window-based Operations in Convolutional Neural Networks for Memory System Optimization..md 2KB
TrainBox-An Extreme-Scale Neural Network Training Server Architecture by Systematically Balancing Operations .md 2KB
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations Column Combining Under Joint Optimization.md 2KB
PRIME_A Novel Processing-in-memory Architecture for Neural Network Computation in ReRAM-based Main Memory.md 2KB
共 208 条
- 1
- 2
- 3
资源评论
得陇而望蜀者
- 粉丝: 32
- 资源: 4586
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功