中山大学 100 anniversary
国家超算广州中心 天河星逸
Chen Zhiguang
Yin Zhong
- 天河二号 top500 六连冠
- 天河星逸
- 超算网络香港专线
- “星光”超算应用平台 link
- 应用社区
- 典型应用
- 地球科学
- 海洋科学
- 宇宙科学:Juno宇宙粒子探测
- 生命科学
- 航空航天:C919全机气动优化
- 超算参数
- p2p bandwidth: 400Gbps
- p2p latency: 1.5us
- double flops: 20PFlops
- intel CPU node
- intel GPU node: A800x8, 1TB memory (CPU)
- FT CPU node信创
- 三维蝶形网络拓扑
- 上网代理
- 在线可视化:paraview, ncview, FlowNL doi-link
- 已购买部分商业软件:如Ansys
- 任务提交:容器、slurm
- vscode server
- UDT客户端传输
国家超算广州中心南沙分中心简介
- 网络专线
- CPU:
64cores * 2000
- GPU:
A800 * 500
- 报价
Junxian He, CSE HKUST, Compression Represents Intelligence Linearly
- arxiv-link Compression Represents Intelligence Linearly
- compression leads to intelligence
- how to encode text corpus with fewer bits in a lossless manner
- 登录“星光”超算应用平台 link
- 申请“星光”超算应用平台 link
- 申请HPC账号并绑定
- 在网页首页「可用集群」绑定
- 使用指南:在网页首页「指南」查看指南
- 登录节点:在网页首页「可用集群」点击“绑定的节点”
查询partition状态
yhinfo
查询作业状态
yhqueue
yhqueue --user=USERNAME #replace USERNAME with YOUR username
打印hello world
- 单节点单进程
yhrun -p ai -n1 echo "hello world"
- 单节点多进程
yhrun -p ai --ntasks=4 --label echo "hello world"
- 多节点多进程
yhrun -p ai --nodes=2 --ntasks=4 --label echo "hello world"
- 多节点多进程打印host
yhrun -p ai --nodes=2 --ntasks=4 --label /bin/hostname
查询nvidia-smi
yhrun -p ai -n1 nvidia-smi
Thu Jun 20 16:18:40 2024
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.104.12 Driver Version: 535.104.12 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA A800 80GB PCIe On | 00000000:4F:00.0 Off | 0 |
| N/A 31C P0 43W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 1 NVIDIA A800 80GB PCIe On | 00000000:50:00.0 Off | 0 |
| N/A 33C P0 45W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 2 NVIDIA A800 80GB PCIe On | 00000000:53:00.0 Off | 0 |
| N/A 33C P0 46W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 3 NVIDIA A800 80GB PCIe On | 00000000:57:00.0 Off | 0 |
| N/A 35C P0 48W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 4 NVIDIA A800 80GB PCIe On | 00000000:9C:00.0 Off | 0 |
| N/A 33C P0 47W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 5 NVIDIA A800 80GB PCIe On | 00000000:9D:00.0 Off | 0 |
| N/A 33C P0 45W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 6 NVIDIA A800 80GB PCIe On | 00000000:A0:00.0 Off | 0 |
| N/A 34C P0 47W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
| 7 NVIDIA A800 80GB PCIe On | 00000000:A4:00.0 Off | 0 |
| N/A 32C P0 47W / 300W | 2MiB / 81920MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| No running processes found |
+---------------------------------------------------------------------------------------+