DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yoo, Eunji | - |
dc.contributor.author | Park, Gunho | - |
dc.contributor.author | Min, Jung Gyu | - |
dc.contributor.author | Jung Kwon, Se | - |
dc.contributor.author | Park, Baeseong | - |
dc.contributor.author | Lee, Dongsoo | - |
dc.contributor.author | Lee, Youngjoo | - |
dc.date.accessioned | 2024-03-06T01:07:40Z | - |
dc.date.available | 2024-03-06T01:07:40Z | - |
dc.date.created | 2024-02-21 | - |
dc.date.issued | 2023-07-11 | - |
dc.identifier.uri | https://oasis.postech.ac.kr/handle/2014.oak/121304 | - |
dc.description.abstract | We present the energy-efficient TF-MVP architecture, a sparsity-aware transformer accelerator, by introducing novel algorithm-hardware co-optimization techniques. From the previous fine-grained pruning map, for the first time, the direction strength is developed to analyze the pruning patterns quantitatively, indicating the major pruning direction and size of each layer. Then, the mixed-length vector pruning (MVP) is proposed to generate the hardware-friendly pruned-transformer model, which is fully supported by our TF-MVP accelerator with the reconfigurable PE structure. Implemented in a 28nm CMOS technology, as a result, TF-MVP achieves 377 GOPs/W for accelerating GPT-2 small model by realizing 4096 multiply-accumulate operators, which is 2.09 times better than the state-of-the-art sparsity-aware transformer accelerator. | - |
dc.language | English | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.relation.isPartOf | 60th ACM/IEEE Design Automation Conference, DAC 2023 | - |
dc.relation.isPartOf | Proceedings - Design Automation Conference | - |
dc.title | TF-MVP: Novel Sparsity-Aware Transformer Accelerator with Mixed-Length Vector Pruning | - |
dc.type | Conference | - |
dc.type.rims | CONF | - |
dc.identifier.bibliographicCitation | 60th ACM/IEEE Design Automation Conference, DAC 2023 | - |
dc.citation.conferenceDate | 2023-07-09 | - |
dc.citation.conferencePlace | US | - |
dc.citation.title | 60th ACM/IEEE Design Automation Conference, DAC 2023 | - |
dc.contributor.affiliatedAuthor | Yoo, Eunji | - |
dc.contributor.affiliatedAuthor | Park, Gunho | - |
dc.contributor.affiliatedAuthor | Min, Jung Gyu | - |
dc.contributor.affiliatedAuthor | Lee, Youngjoo | - |
dc.description.journalClass | 1 | - |
dc.description.journalClass | 1 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
library@postech.ac.kr Tel: 054-279-2548
Copyrights © by 2017 Pohang University of Science ad Technology All right reserved.