Discussion:
[gem5-users] Assertion during simulation with GCN3
Sungkeun Kim
2018-10-09 22:27:37 UTC
Permalink
Hi everyone,


I trying to run a lulesh-amp application(
https://github.com/AMDComputeLibraries/ComputeApps/tree/master/lulesh-amp)
using gem5 GCN3_X83. However, whenever I run the simulation, it's aborted
with following assertion.

3076817gem5.opt: build/GCN3_X86/gpu-compute/gpu_tlb.cc:1201: void
X86ISA::GpuTLB::handleTranslationReturn(Addr, X86ISA::GpuTLB::tlbOutcome,
PacketPtr): Assertion `new_entry' failed.

Is there anyone has suggestions to fix this problem? Following is my
simulation information.

############################### BACKTRACE ##################################
Program aborted at tick 307682795000
--- BEGIN LIBC BACKTRACE ---
./build/GCN3_X86/gem5.opt(_Z15print_backtracev+0x28)[0xca7ca8]
./build/GCN3_X86/gem5.opt(_Z12abortHandleri+0x46)[0xcba826]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x11390)[0x7fa112f69390]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x38)[0x7fa111971428]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x16a)[0x7fa11197302a]
/lib/x86_64-linux-gnu/libc.so.6(+0x2dbd7)[0x7fa111969bd7]
/lib/x86_64-linux-gnu/libc.so.6(+0x2dc82)[0x7fa111969c82]
./build/GCN3_X86/gem5.opt(_ZN6X86ISA6GpuTLB23handleTranslationReturnEmNS0_10tlbOutcomeEP6Packet+0x487)[0x11bb937]
./build/GCN3_X86/gem5.opt(_ZN6X86ISA6GpuTLB17translationReturnEmNS0_10tlbOutcomeEP6Packet+0x38f)[0x11bbd0f]
./build/GCN3_X86/gem5.opt(_ZN10EventQueue10serviceOneEv+0x11d)[0xcae4ed]
./build/GCN3_X86/gem5.opt(_Z9doSimLoopP10EventQueue+0x50)[0xcc6ee0]
./build/GCN3_X86/gem5.opt(_Z8simulatem+0xd1b)[0xcc7fcb]
./build/GCN3_X86/gem5.opt[0x1b0bd0a]
./build/GCN3_X86/gem5.opt[0xd10415]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalFrameEx+0x7852)[0x7fa113226772]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalCodeEx+0x85c)[0x7fa11335d05c]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalFrameEx+0x6ffd)[0x7fa113225f1d]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalCodeEx+0x85c)[0x7fa11335d05c]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalCode+0x19)[0x7fa11321eda9]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalFrameEx+0x613b)[0x7fa11322505b]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalCodeEx+0x85c)[0x7fa11335d05c]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalFrameEx+0x6ffd)[0x7fa113225f1d]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalCodeEx+0x85c)[0x7fa11335d05c]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyEval_EvalCode+0x19)[0x7fa11321eda9]
/usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0(PyRun_StringFlags+0x76)[0x7fa1132991f6]
./build/GCN3_X86/gem5.opt(_Z6m5MainiPPc+0x8f)[0xcb921f]
./build/GCN3_X86/gem5.opt(main+0x33)[0x8d3243]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf0)[0x7fa11195c830]
./build/GCN3_X86/gem5.opt(_start+0x29)[0x8f8239]
--- END LIBC BACKTRACE ---


########################### LAST FEW LOGS (GPUTLB enable)
##############################

307681898000: system.l2_tlb: This is a TLB miss
307681898000: system.l3_coalescer-port0: receiving pkt w/ req_cnt 3
307681898000: system.l3_coalescer-port0: coalescerFIFO[307681826000] now
has 2 coalesced reqs after push
307681898000: system.l2_tlb: Sent translation request to lower level TLB
for addr 0x1700014000
307681899000: system.l3_coalescer: triggered TLBCoalescer
processProbeTLBEvent
307681899000: system.l3_coalescer: coalescedReq_cnt is 2 for tick_index
307681826000
307681899000: system.l3_tlb: Translation req. for virt. page addr
0x1700013000
307681899000: system.l3_tlb: TLB Lookup for vaddr 0x1700013b7a.
307681899000: system.l3_tlb: In protected mode.
307681899000: system.l3_tlb: Paging enabled.
307681899000: system.l3_tlb: schedule translationReturnEvent @ curTick
307682049000
307681899000: system.l3_coalescer: system.l3_coalescer sending pkt w/
req_cnt 11
307681899000: system.l3_coalescer: Successfully sent TLB request for page
0x1700013000307681899000: system.l3_tlb: Translation req. for virt. page
addr 0x1700014000
307681899000: system.l3_tlb: TLB Lookup for vaddr 0x17000142ee.
307681899000: system.l3_tlb: In protected mode.
307681899000: system.l3_tlb: Paging enabled.
307681899000: system.l3_tlb: schedule translationReturnEvent @ curTick
307682049000
307681899000: system.l3_coalescer: system.l3_coalescer sending pkt w/
req_cnt 3
307681899000: system.l3_coalescer: Successfully sent TLB request for page
0x1700014000307682045000: system.l3_tlb: Triggered TLBEvent for addr
0x1700003000
307682045000: system.l3_tlb: This is a TLB miss
307682045000: system.l3_tlb: Last level TLB - start a page walk for addr
0x1700003000
307682045000: system.l3_tlb: Triggered TLBEvent for addr 0x1700002000
307682045000: system.l3_tlb: This is a TLB miss
307682045000: system.l3_tlb: Last level TLB - start a page walk for addr
0x1700002000
307682049000: system.l3_tlb: Triggered TLBEvent for addr 0x1700014000
307682049000: system.l3_tlb: This is a TLB miss
307682049000: system.l3_tlb: Last level TLB - start a page walk for addr
0x1700014000
307682049000: system.l3_tlb: Triggered TLBEvent for addr 0x1700013000
307682049000: system.l3_tlb: This is a TLB miss
307682049000: system.l3_tlb: Last level TLB - start a page walk for addr
0x1700013000
gem5.opt: build/GCN3_X86/gpu-compute/gpu_tlb.cc:1201: void
X86ISA::GpuTLB::handleTranslationReturn(Addr, X86ISA::GpuTLB::tlbOutcome,
PacketPtr): Assertion `new_entry' failed.
307682795000: system.l3_tlb: Triggered TLBEvent for addr 0x1700002000
307682795000: system.l3_tlb: Doing a page walk for address 0x1700002000
307682795000: system.l3_tlb: Translation Done - TLB Miss for addr
0x17000026f2
Program aborted at tick 307682795000

############################### DEV INFO ##################################

### gem5 ###
1. source location
git clone https://gem5.googlesource.com/amd/gem5 -b
agutierr/master-gcn3-staging

2. build command
scons -sQ -j4 ./build/GCN3_X86/gem5.opt

3. config file
configs/example/apu_se.py

### ROCM (version 1.6 to be compatible with gem5/GCN3) ###
1. packages source
downloaded from http://repo.radeon.com/rocm/archive/apt_1.6.0.tar.bz2

### Development Desktop info ###
1. processor
Intel(R) Core(TM) i7-7700 CPU @ 3.60GHz
2. kernel
Linux ksungkeun84-server 4.9.0-kfd-compute-rocm-rel-1.6-77 #1 SMP Wed Jun
28 07:30:27 CDT 2017 x86_64 x86_64 x86_64 GNU/Linux
3. OS
Ubuntu 16.04

### Application - lulesh-amp ###
1. source code
sample application from
https://github.com/AMDComputeLibraries/ComputeApps/tree/master/lulesh-amp

### Running Command ###
./build/GCN3_X86/gem5.opt configs/example/apu_se.py -c ./saxpy


### References http://gem5.org/GPU_Models




Regards,
Sungkeun Kim

Loading...