Hi,<div><br></div><div>I'm seeing a large difference in behavior when running the same GPU code as a regular user versus as root. As far as I can tell, the environments (PYTHONPATH, PATH) are essentially identical for the user and root, and both use the same relevant binaries. Is there some obvious step that's needed to let user code run properly on the GPU?</div>
<div><br></div><div>Here's some more info on the problem:</div><div>I'm running some tutorial code from Theano:</div><div><a href="http://deeplearning.net/software/theano/tutorial/using_gpu.html" target="_blank" style>http://deeplearning.net/software/theano/tutorial/using_gpu.html</a></div>
<div><br></div><div>When I run as root, I get output like this:</div><div><div style><div>[root@node002 ~]# THEANO_FLAGS=mode=FAST_RUN,device=cpu,floatX=float32 python2.6 thing.py</div><div>Setup took 0.873317956924 seconds</div>
<div>Looping 1000 times took 36.3292770386 seconds</div><div>Result is [ 1.23178029 1.61879337 1.52278066 ..., 2.20771813 2.29967761</div><div> 1.62323284]</div><div>Used the cpu</div></div><div style><br></div><div style>
<div>[root@node002 ~]# THEANO_FLAGS=mode=FAST_RUN,device=gpu0,floatX=float32 python2.6 thing.py</div><div>Using gpu device 0: Tesla M2050</div><div>Setup took 6.41229701042 seconds</div><div>Looping 1000 times took 1.73471212387 seconds</div>
<div>Result is [ 1.23178029 1.61879349 1.52278066 ..., 2.20771813 2.29967761</div><div> 1.62323296]</div><div>Used the gpu</div><div><br></div><div>But as a regular user, I get:</div><div><div><div><div><div>[render@node002 ~]$ THEANO_FLAGS=mode=FAST_RUN,device=cpu,floatX=float32 python2.6 thing.py</div>
<div>Setup took 0.591382980347 seconds</div><div>Looping 1000 times took 36.3925471306 seconds</div><div>Result is [ 1.23178029 1.61879337 1.52278066 ..., 2.20771813 2.29967761</div><div> 1.62323284]</div><div>Used the cpu</div>
</div></div><div><br></div><div>[render@node002 ~]$ THEANO_FLAGS=mode=FAST_RUN,device=gpu0,floatX=float32 python2.6 thing.py</div><div>Using gpu device 0: Tesla M2050</div><div>Setup took 186.97786808 seconds</div><div>Looping 1000 times took 182.75831604 seconds</div>
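<div>Result is [ 1.23178029 1.61879349 1.52278066 ..., 2.20771813 2.29967761</div><div> 1.62323296]</div><div>Used the gpu</div><div><br></div><div>So as a user, the GPU run is roughly 100 times slower than as root (about 183 seconds vs. 1.7 seconds for the loop, and 187 seconds vs. 6.4 seconds for setup), while the CPU timings match. The results are the same and there are no obvious errors.</div></div></div>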
<div><br></div><div>Let me know if I should provide more info or try something else.</div><div>Charles</div><div><br></div></div></div>