Hi Trinayan,
I have almost exclusively installed ROCm on machines without attached GPUs
of any kind (for running in gem5), so yes, you are right that it is not
necessary to have an AMD GPU installed. As Tony recommended, you should
definitely update the
paths in apu_se.py. For example, if you install ROCm in /opt/rocm/, then
you need to update the LD_LIBRARY_PATH lines starting here (
https://gem5.googlesource.com/amd/gem5/+/agutierr/master-gcn3-staging/configs/example/apu_se.py#458)
accordingly (including the directories where you have libunwind and gcc
installed). From the snippet you showed me, it appears that your problems
are related to this. After fixing this, you may run into the problems Tsung
faced, but for now this is the first thing you need to fix.
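As a rough sketch of the kind of change meant here (this is not the actual
contents of apu_se.py): the library search path can be assembled from an
install root instead of being hard-coded. The names ROCM_ROOT,
EXTRA_LIB_DIRS, and build_ld_library_path are hypothetical, and the
subdirectories and the gcc/libunwind location are assumptions; substitute
whatever actually exists on your machine.

```python
import os

# Hypothetical install locations; adjust to match your machine.
ROCM_ROOT = "/opt/rocm"
EXTRA_LIB_DIRS = [
    "/usr/lib/x86_64-linux-gnu",  # assumed location of libunwind and the gcc runtime
]


def build_ld_library_path(rocm_root, extra_dirs, subdirs=("lib", "hcc/lib")):
    """Return (value, missing): a colon-joined search path and the
    candidate directories that do not exist on this machine."""
    candidates = [os.path.join(rocm_root, s) for s in subdirs] + list(extra_dirs)
    missing = [d for d in candidates if not os.path.isdir(d)]
    return ":".join(candidates), missing


value, missing = build_ld_library_path(ROCM_ROOT, EXTRA_LIB_DIRS)
env_entry = "LD_LIBRARY_PATH=%s" % value
print(env_entry)
for d in missing:
    # A missing directory here usually means the path still needs adjusting.
    print("warning: %s does not exist" % d)
```

Any directory reported as missing is a path that the simulated process would
also fail to open, so it is worth fixing those before rerunning.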
Potentially related: I have never tried installing the ROCm stuff via
apt-get like Joe recommended, so I don't know if that will work or not. You
may end up needing to install the components manually by checking out the
repos on GitHub (e.g., https://github.com/RadeonOpenCompute/hcc) and
specifically checking out the roc-1.6.x branches, as the instructions
specify. But for now, please update the LD_LIBRARY_PATH variables like Tony
specified and we'll see what happens.
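To check whether the path update helped, one option is to pull the failing
paths out of the syscall trace and compare runs. The sketch below assumes
each failure appears on a single line in the form shown in the quoted output
(syscall open: failed -> path:<path>); failed_open_paths is a hypothetical
helper, not part of gem5.

```python
import re

# Matches trace lines such as:
# "71543500: system.cpu0: T0 : syscall open: failed -> path:/some/file (inferred from:/some/file)"
FAILED_OPEN = re.compile(r"syscall open: failed -> path:(\S+)")


def failed_open_paths(lines):
    """Return the unique paths of failed open() calls, in first-seen order."""
    seen = []
    for line in lines:
        m = FAILED_OPEN.search(line)
        if m and m.group(1) not in seen:
            seen.append(m.group(1))
    return seen


sample = [
    "61936500: system.cpu0: T0 : syscall open: failed -> path:/etc/ld.so.preload (inferred from:/etc/ld.so.preload)",
    "61936500: system.cpu0: T0 : syscall open returns -2",
    "71543500: system.cpu0: T0 : syscall open: failed -> path:/opt/rocm/hcc/bin/../lib/libdl.so.2 (inferred from:/opt/rocm/hcc/bin/../lib/libdl.so.2)",
]
for path in failed_open_paths(sample):
    print(path)
```

Some failures are expected: the dynamic loader probes /etc/ld.so.preload and
tries each search-path directory in turn, so what matters is whether a
library is never found in any directory.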
Thanks,
Matt
On Fri, Aug 31, 2018 at 2:19 PM Trinayan Baruah <***@gmail.com> wrote:
> Hi Matt,
>
> Thanks for the response. I installed version 1.6 based on the instructions
> provided by Joseph Greathouse here (
> https://github.com/RadeonOpenCompute/ROCm/issues/517). To be specific, I
> used the package titled apt_1.6.0.tar.bz2 linked there. I built a local
> package and installed it. I can see the files installed in /opt/rocm.
>
> Keep in mind that I am not using a machine that has an AMD GPU installed, as
> I don't think it is required to run the simulator. So although I can
> compile the apps from the ComputeApps repo, I cannot really run them there
> because the machine lacks a ROCm-supported GPU.
>
> At your recommendation I tried the debug flags and indeed I can see a lot
> of opens failing starting from this
>
> 61936500: system.cpu0: T0 : syscall open: failed ->
> path:/etc/ld.so.preload (inferred from:/etc/ld.so.preload)
>
> 61936500: system.cpu0: T0 : syscall open returns -2
>
> 71543500: system.cpu0: T0 : syscall open called w/arguments
> 140737488347968, 524288, 140737354129768, 140737351990768, 0,
> 140737488348511
>
> 71543500: system.cpu0: T0 : syscall open: failed ->
> path:/opt/rocm/hcc/bin/../lib/libdl.so.2 (inferred
> from:/opt/rocm/hcc/bin/../lib/libdl.so.2)
>
> I can share the full log with you if you need to have a look at it.
>
> I am not really sure why this is happening, but yes, a lot of open calls
> are failing.
>
> Finally, I wanted to point this out: apu_se.py seems to have some env
> variables pointing to some ROCm, gcc, and libunwind directories, and these
> paths seem to be user-specific to some extent. I am not sure if I need to
> modify them. In any case, let me know what the next natural step is to
> debug this problem.
>
> Thanks a lot for the help.
>
> Best,
> Trinayan