提交 e606d311 authored 作者: Amjad Almahairi's avatar Amjad Almahairi

adding CPU and GPU used in profiling

上级 22f0d5d5
...@@ -100,6 +100,8 @@ prediction on D ...@@ -100,6 +100,8 @@ prediction on D
# in the script, followed by a summary for all functions. # in the script, followed by a summary for all functions.
# We'll show here only the summary: # We'll show here only the summary:
Results were produced using Intel(R) Core(TM) i7-4820K CPU @ 3.70GHz
Function profiling Function profiling
================== ==================
Message: Sum of all(3) printed profiles at exit excluding Scan op profile. Message: Sum of all(3) printed profiles at exit excluding Scan op profile.
...@@ -173,7 +175,7 @@ Apply ...@@ -173,7 +175,7 @@ Apply
# 2.1 Profiling for CPU computations # 2.2 Profiling for GPU computations
# In your terminal, type: # In your terminal, type:
$ CUDA_LAUNCH_BLOCKING=1 THEANO_FLAGS=profile=True,device=gpu python using_gpu_solution_1.py $ CUDA_LAUNCH_BLOCKING=1 THEANO_FLAGS=profile=True,device=gpu python using_gpu_solution_1.py
...@@ -183,6 +185,8 @@ Used the gpu ...@@ -183,6 +185,8 @@ Used the gpu
target values for D target values for D
prediction on D prediction on D
Results were produced using GeForce GTX TITAN
# Profiling summary for all functions: # Profiling summary for all functions:
Function profiling Function profiling
...@@ -265,8 +269,6 @@ Apply ...@@ -265,8 +269,6 @@ Apply
# 3. Conclusions # 3. Conclusions
Facts:
Examine and compare 'Ops' summaries for CPU and GPU. Usually GPU ops 'GpuFromHost' and 'HostFromGpu' by themselves Examine and compare 'Ops' summaries for CPU and GPU. Usually GPU ops 'GpuFromHost' and 'HostFromGpu' by themselves
consume a large amount of extra time, but by making as few as possible data transfers between GPU and CPU, you can minimize their overhead. consume a large amount of extra time, but by making as few as possible data transfers between GPU and CPU, you can minimize their overhead.
Notice that each of the GPU ops consumes more time than its CPU counterpart. This is because the ops operate on small inputs; Notice that each of the GPU ops consumes more time than its CPU counterpart. This is because the ops operate on small inputs;
......
Markdown 格式
0%
您添加了 0 到此讨论。请谨慎行事。
请先完成此评论的编辑!
注册 或者 后发表评论