EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference.
Tambe, Thierry.
Hooper, Coleman.
Pentecost, Lillian.
Jia, Tianyu.
Yang, En-Yu.
Donato, Marco.
Sanh, Victor.
Whatmough, Paul.
Rush, Alexander M.
Brooks, David.
Wei, Gu-Yeon.
2021