-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathCITATION.cff
72 lines (71 loc) · 2.51 KB
/
CITATION.cff
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: >-
Stella Nera: Achieving 161 TOp/s/W with Multiplier-free
DNN Acceleration based on Approximate Matrix
Multiplication
message: "If you use it in your work, please cite or reference :-)"
type: software
authors:
- given-names: Jannis
family-names: Schönleber
affiliation: ETH Zürich
orcid: "https://orcid.org/0009-0000-2242-5331"
- given-names: Lukas
family-names: Cavigelli
affiliation: Huawei Technologies Zürich Research Center
- given-names: Renzo
family-names: Andri
affiliation: Huawei Technologies Zürich Research Center
- given-names: Matteo
family-names: Perotti
affiliation: ETH Zürich
- given-names: Luca
family-names: Benini
affiliation: ETH Zürich
identifiers:
- type: url
value: "https://arxiv.org/abs/2311.10207"
- type: doi
value: 10.48550/arXiv.2311.10207
preferred-citation:
type: article
title: >-
Stella Nera: Achieving 161 TOp/s/W with Multiplier-free
DNN Acceleration based on Approximate Matrix
Multiplication
authors:
- given-names: Jannis
family-names: Schönleber
affiliation: ETH Zürich
orcid: "https://orcid.org/0009-0000-2242-5331"
- given-names: Lukas
family-names: Cavigelli
affiliation: Huawei Technologies Zürich Research Center
- given-names: Renzo
family-names: Andri
affiliation: Huawei Technologies Zürich Research Center
- given-names: Matteo
family-names: Perotti
affiliation: ETH Zürich
- given-names: Luca
family-names: Benini
affiliation: ETH Zürich
doi: "10.48550/arXiv.2311.10207"
repository-code: "https://github.com/joennlae/halutmatmul"
abstract: >-
The recent Maddness method approximates Matrix
Multiplication (MatMul) without the need for
multiplication by using a hash-based version of product
quantization (PQ). The hash function is a decision tree,
allowing for efficient hardware implementation, as
multiply-accumulate operations are replaced by decision
tree passes and LUT lookups. Stella Nera is the first
Maddness accelerator achieving 15x higher area efficiency
(GMAC/s/mm^2) and 25x higher energy efficiency (TMAC/s/W)
than direct MatMul accelerators in the same technology. In
a commercial 14 nm technology and scaled to 3 nm, we
achieve an energy efficiency of 161 TOp/s/[email protected] with a
Top-1 accuracy on CIFAR-10 of over 92.5% using ResNet9.
license: MIT