Skip to content

Commit

Permalink
Update paper link and citation
Browse files Browse the repository at this point in the history
  • Loading branch information
YWHyuk authored Nov 28, 2024
1 parent ecbbc02 commit 5166e24
Showing 1 changed file with 10 additions and 6 deletions.
16 changes: 10 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN in
- Use of ONNX graphs as DNN model specifications, enabling simulation of DNNs implemented in different deep learning frameworks (e.g., PyTorch and TensorFlow).
- Support language models that do not use ONNX graphs. Additionally, enable auto-regressive generation phases and iteration-level batching.

For more details, please refer to our [paper](https://arxiv.org/abs/2406.08051)!
For more details, please refer to our [paper](https://ieeexplore.ieee.org/document/10726822)!

![Speedup](/img/speedup.png)
**Figure description**: we compare the simulation speed of ONNXim to that of [Accel-sim](https://accel-sim.github.io/) (a GPU simulator with Tensor Core model) as GPUs are widely used for deep learning and such a GPU simulator can be used to study systems for deep learning. We also include [SMAUG](https://github.com/harvard-acc/smaug) in the comparison. On the x-axis, we vary the size of each dimension for an NxNxN GEMM operation.
Expand Down Expand Up @@ -186,10 +186,14 @@ This current version only supports GEMM, Conv, Attention, GeLU, LayerNorm operat
## Citation
If you use ONNXim for your research, please cite the following paper.
```
@article{ham2024onnxim,
title={ONNXim: A Fast, Cycle-level Multi-core NPU Simulator},
@ARTICLE{10726822,
author={Ham, Hyungkyu and Yang, Wonhyuk and Shin, Yunseon and Woo, Okkyun and Heo, Guseul and Lee, Sangyeop and Park, Jongse and Kim, Gwangsun},
journal={arXiv preprint arXiv:2406.08051},
year={2024}
}
journal={IEEE Computer Architecture Letters},
title={ONNXim: A Fast, Cycle-Level Multi-Core NPU Simulator},
year={2024},
volume={23},
number={2},
pages={219-222},
keywords={Random access memory;Computational modeling;Vectors;Kernel;Tensors;Runtime;Libraries;Deep learning;Artificial neural networks;Systolic arrays;DNN inference;multi-tenancy;NPU;ONNX;simulator},
doi={10.1109/LCA.2024.3484648}}
```

0 comments on commit 5166e24

Please sign in to comment.