ISPASS

Ubuntu20.04下使用GPGPU-Sim运行ISPASS2009benchmark

0. 前言

之前介绍了安装，现在就尝试跑一下 ISPASS'09 的那篇经典 paper，Analyzing CUDA workloads using a detailed GPU simulator 上的几个 benchamrk. 这篇文章1.现在已经870次引用了，很多工作都使用了其中的 benchmark

1. 安装 CUDA Toolkit and CUDA SDK

CUDA 5之后，SDK 和 Toolkit 都在一个包里面，可以参考 XX 中安装 CUDA 的步骤，在安装时除了 Toolkit 以外再勾选上 CUDA Samples 11.1。ISPASS'09 benchmark 会用到 build CUDA SDK 时创建的库，所以需要 CUDA SDK。

But, it looks like NVIDIA has messed up the webpages a bit, the CUDA Toolkit and the GPU Computing SDK pages point at each other, with neither offering the SDK.

所以看起来是需要切换 CUDA 版本

1.1 更换软链接

不过之前环境变量用的 cuda11.1 的地址而非软链接，现在替换成软链接

sudo rm -rf /usr/local/cuda  #删除之前生成的软链接
sudo ln -s /home/huweim/cuda/toolkit/4.2/cuda /usr/local/cuda #生成新的软链接

1.2 Check 环境变量的地址

export CUDA_INSTALL_PATH=/usr/local/cuda/toolkit/4.2/cuda
export PATH=$PATH:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:$CUDA_INSTALL_PATH/bin:$MPI_ROOT/bin
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$CUDA_INSTALL_PATH/lib64	#这个不用改

export NVIDIA_COMPUTE_SDK_LOCATION=~/cuda/sdk/4.2

1.3 查看版本信息

上述步骤全部没问题就可以弹出版本信息了，source ~/.bashrc 或者重启终端

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2012 NVIDIA Corporation
Built on Thu_Apr__5_00:24:31_PDT_2012
Cuda compilation tools, release 4.2, V0.2.1221

2. gcc, g++ 版本

CUDA 更换为4.2版本后，重新 build 模拟器得知 unsupported GNU version! gcc 4.7 and up are not supported!。之前在别人的博客也看到需要 4.7 以下版本 gcc，不过当时在 cuda11.1 版本下是可以用 gcc 4.8/5 跑模拟器的。现在需要跑一下 ISPASS'09 benchmark, 所以配置到符合要求的版本。

sudo add-apt-repository 'deb http://archive.ubuntu.com/ubuntu/ trusty main'
sudo add-apt-repository 'deb http://archive.ubuntu.com/ubuntu/ trusty universe'
sudo apt update
sudo apt install gcc-4.4

sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-4.4 50	#设置优先级
sudo update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-4.4 50	#设置优先级
sudo update-alternatives --config gcc	#输入编号选择gcc/g++版本
sudo update-alternatives --config g++	#输入编号选择gcc/g++版本
gcc -v	#Check 版本
g++ -v	#Check 版本

更换版本后不支持原子操作？编译代码时加上选项 -arch sm_20

2. Compile Nvidia SDK 4.2

sudo apt-get install libboost-dev libboost-system-dev libboost-filesystem-dev libboost-all-dev mpich2 binutils-gold libcuda1-304
sudo gedit ~/cuda/sdk/4.2/C/common/common.mk, line like “LIB += … ${OPENGLLIB} …. $(RENDERCHECKGLLIB) …” should have $(RENDERCHECKGLLIB) moved before ${OPENGLLIB}. There should be 3 lines like this, it may be line 271, 275 and 282.
- 将 line26 CUDA_INSTALL_PATH ?= /home/gpgpu-sim/cuda/toolkit/4.2/cuda 修改为 CUDA_INSTALL_PATH ?= /home/huweim/cuda/toolkit/4.2/cuda
  - 这一步应该不用了，用软链接代替路径即可
- 因为这个文件直接拷贝过来的，所以要设置为自己的安装路径
sudo gedit ~/cuda/sdk/4.2/CUDALibraries/common/common_cudalib.mk and do the same thing.

cd ~/cuda/sdk/4.2
Edit Makefile through sudo gedit ./Makefile. Comment all lines with CUDALibraries and OpenCL as we only want the application binaries. You comment by placing # in the front of the line.

make

2.1 Bug

遇到 bug

/usr/bin/ld: cannot find -lcudart
/usr/bin/ld: cannot find -lcufft

/usr/bin/ld: cannot find -lXi
/usr/bin/ld: cannot find -lXmu
/usr/bin/ld: cannot find -lglut

2.1.2 解决

建立 cudart 的软链接

sudo ln -s /usr/local/cuda/lib64/libcudart.so.4.2.9 /usr/lib/libcudart.so
sudo ln -s /usr/local/cuda/lib64/libcufft.so.4.2.9 /usr/lib/libcufft.so

#下面几个没有太大必要
#sudo ln -s /usr/local/cuda/lib64/libculas.so.4.2.9 /usr/lib/libculas.so
#sudo ln -s /usr/local/cuda/lib64/libcuspare.so.4.2.9 /usr/lib/libcufspare.so
#sudo ln -s /usr/local/cuda/lib64/libcuinj.so.4.2.9 /usr/lib/libcuinj.so
#sudo ln -s /usr/local/cuda/lib64/libcurand.so.4.2.9 /usr/lib/libcurand.so
#sudo ln -s /usr/local/cuda/lib64/libnpp.so.4.2.9 /usr/lib/libnpp.so
sudo rm /usr/lib/libculas.so

sudo apt-get install libxi-dev
sudo apt-get install libxmu-dev
sudo apt-get install freeglut3-dev

4. ISPASS2009 Benchmarks

git clone https://github.com/gpgpu-sim/ispass2009-benchmarks.git, we also upload it to the pan.shanghaitech.
sudo gedit ~/.bashrc, add export NVIDIA_COMPUTE_SDK_LOCATION=~/cuda/sdk/4.2
cd ispass2009-benchmarks
sudo gedit ./Makefile.ispass-2009
- comment out line 16 and 28. The AES and WP benchmark does not compiler readily.
  - ❗ Note not line 16 to 28, only this two lines
- Change BOOST_LIB to the following path: /usr/lib/x86_64-linux-gnu/
- Change BOOST_ROOT to the following path: /usr/include/boost/
- Change OPENMPI_BINDIR to the following path: /usr/bin/
sudo gedit ./DG/Makefile
- Line 54, change -I/opt/mpich/include to -I/usr/include/mpich
- Line 56, append -I/usr/include/mpich to end of the line
- Line 59-61, add .mpich2 to the end of each line. For example, NEWCC = $(OPENMPI_BINDIR)mpicc.mpich2
make -f Makefile.ispass-2009. Note: If you do a make -f Makefile.ispass-2009 clean, you may have to recompile the SDK.

4.1 DG

需要 openmpi 的环境

Ubuntu20.04下使用GPGPU-Sim运行 Rodinia

Rodinia

0. 前言

Rodinia 的类型还是更多，尝试将其编译好

1. 修改路径

修改 gpu-rodinia-master/common/make.config 中 CUDA_INSTALL_PATH，CUDA_LIB_DIR，SDK_DIR 为 your own path

~~对我来说只用修改 SDK_DIR~~

CUDA_LIB_DIR := /home/huweim/gpgpu-sim_distribution/lib/gcc-4.4.7/cuda-4020/release
CUDA_LIB_DIR := /home/huweim/gpgpu-sim_distribution/lib/gcc-4.4.7/cuda-11020/release #如果要修改版本
SDK_DIR = /home/huweim/cuda/sdk/4.2/C

1.1 sim_v4.0

如果使用 v4.0 版本的模拟器，~~需要在编译选项中加上 --cudart shared~~，编译选项是针对 nvcc，我们尝试使用 gcc 4.5, nvcc 4.2 来运行，官方文档中说 gpgpu-sim v4.0 是支持 nvcc 4.2 的。

在 common.mk 中添加对应的架构版本 SM_VERSION := sm_10 sm_11 sm_12 sm_13 sm_20

2. Data Set

惊喜地发现 data set 是空的，download from http://lava.cs.virginia.edu/Rodinia/download_links.htm

好家伙一个 G Data Set

3. 进入 cuda 文件夹逐一编译即可

3.0 找不到 -lcuda 大多数文件

原因就是没找到动态链接库，guess 因为我们没有安装驱动 ✔️

~~make.config 中 CUDA_LIB_DIR 改为 libcudart.so 所在的路径~~，需要的是 libcuda.so 而不是 libcudart.so

❌ ~~去官网下载驱动~~，电脑没有 NVIDIA GPU 无法安装 Driver

✔️ 去掉 -lcuda 编译选项，出现 undefined reference to symbol '__gxx_personality_v0@@CXXABI_1.3' 错误时，把 CC = gcc 更改为 CC = g++，或者添加编译选项 -lstdc++

暂不清楚去掉 -lcuda 编译选项后的影响

3.1 kmeans, leukocyte

BUG：提示缺少 .o

Solution：Makefile 这个 [ch] 会读到 .h，从而编译不出 kmeans.o，

%.o: %.[ch]
	$(CC) $(CC_FLAGS) $< -c
#改成后正确编译
%.o: %.[c]
	$(CC) $(CC_FLAGS) $< -c

kmeans： libgomp.so.1 => not found

$ apt-get install libgomp1

3.2 huffman, srad

架构版本改成 -arch=sm_20

3.3 hotspot3D

include 路径改一下

3.4 hybridsort

bucketsort.cu:10: fatal error: GL/glew.h: No such file or directory

看起来是没有 OpenGL

$ sudo apt-get install libglew-dev

4. Run

4.1 Why these benchmark so slow?

streamcluster, leu, fdtd2d 等等

Rodinia 大多数 benchmark 都跑得很慢，有的在服务器上两三天都没跑完，不知道是数据量太大还是编译过程除了问题？尽管 GPGPU-Sim 可以自行在达到一定的 cycle 数时停止，还是想知道为什么 Rodinia 如此之慢。

Parbiol

0. 前言

Ubuntu20.04 下使用 GPGPU-Sim 编译运行 Parboil

1. 下载和修改文件

1.1 下载

最好在官网下载后部署，Github 上的文件多了几个 benchmark，但是没有他们的数据集。

在官网下载好需要的目录，按照 README 中的要求部署，Parboil 中需要有 4 个目录，benchmarks，common，driver，datasets

1.2 Makefile.conf

进入 common 目录，里面有对应设备的一些 Makefile examples。主要使用的是cuda，check 文件 Makefile.conf.example-nvidia 中的 CUDA 路径，复制并改名为 Makefile.conf。

1.3 CUDA_LIB_PATH

修改为 gpgpu-sim 的中的 libcudart.so 路径即可

CUDA_LIB_PATH=/home/huweim/gpgpu-sim_distribution/lib/gcc-4.4.7/cuda-4020/release/

2. Data Set

download from http://impact.crhc.illinois.edu/parboil/parboil_download_page.aspx，在 1.1 的步骤中应该已经下载了，但是在 Github 中不带 data set，如果 data set 不全，使用他的脚本编译可能会报错，所以建议在官网下载。

包含 11 个 benchmark data set

3. Compile

3.1 Bug

这里就是之前说的 BUG，所以东西都去官网下载就不会有这些 BUG 了。

3.1.1 Dataset 不全

16 benchmark src，但是 dataset 只有11个

✔️ 解决方法，直接去官方网站下载，不在 github 上下载，11 benchmark 对应 11 dataset

3.1.2 gcc

error: identifier "__builtin_ia32_vec_set_v8hi" is undefined，网上也没有搜索到。尝试切换一下 gcc 版本，发现只能用 gcc 4.7 及以前的版本。

✔️ 解决方法，直接去官方网站下载，不在 github 上下载，11 benchmark 对应 11 dataset。这个时候 compile 也没有错误了。

3.2 Compile 命令

$ ./parboil compile benchmark_name platform_name
$ ./parboil compile bfs cuda
#bfs 可执行文件在目录 parboil_Dir/benchmarks/bfs/build/cuda_default/bfs

接下来逐一编译11个 benchmark 即可

4. Run

4.1 Run 命令

2021-12-05 13:20:02 ✔️ 搞定

⭐ 使用 GPGPU-Sim 需要在运行目录下放置 3 个 config 文件

GPGPU-Sim 配置文件放在 ~/Parboil/benchmarks/bfs 目录下即可，成功运行

$ cp ~/gpgpu-sim_distribution/configs/GTX480/* ~/Parboil/benchmarks/bfs/
$ ./parboil run bfs cuda 1M

5. Script

5.1 Config文件

每个 benchmark 目录下复制好3个 GPGPU-Sim 必备的 config 文件，怎么实现并行执行是一个考验。在不改动原来的脚本文件前提下，每次运行都是在 ~/Parboil/benchmarks/Benchmark_Name_Dir 目录下，所以在使用不同的调度策略启动前，需要将对应的 config 文件复制到此目录。而为了保证并行，需要在其启动后（不可能等待其执行完毕再复制），就复制另一种调度策略的 config 文件。

Polybench

0. 前言

Ubuntu20.04 下使用 GPGPU-Sim 编译运行 Polybench

1. 4.0 sim

在 CUDA/utilities/common.mk 中，nvcc 后面加上 –cudart shared

Tango

CUDA Code 似乎和 parboil 类似，都是读取当前目录下的 data 文件夹，因此需要在 tango 的目录中去运行，写一下脚本。

2022-03-18 14:32:07，确实如此，否则会出现 File Not Found

CUDA SDK

0. 前言

CUDA Samples 似乎没有放置在 toolkit 里面，需要单独安装使用

1. 排坑

实际没有链接到 CUDA 4.2，而是自己装的 CUDA 11.1 版本，所以报错

先尝试修改一下环境变量，简单地修改环境变量是不行的。切换 CUDA 版本还需要修改软链接

sudo rm -rf /usr/local/cuda	#删除之前创建的软链接
#sudo ln -s /home/huweim/cuda/toolkit/4.2/cuda /usr/local/cuda
sudo ln -s /usr/local/cuda11.4 /usr/local/cuda
nvcc -v