GK20A (Jetson)vsGeForce GTX 980
 

  CUDA Capability Major/Minor version number:    3.2

  Total amount of global memory:                 1892 MBytes (1984397312 bytes)

  ( 1) Multiprocessors, (192) CUDA Cores/MP:     192 CUDA Cores

  GPU Clock rate:                                852 MHz (0.85 GHz)

  Memory Clock rate:                             924 Mhz

  Memory Bus Width:                              64-bit

  L2 Cache Size:                                 131072 bytes

  Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)

  Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers

  Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       49152 bytes

  Total number of registers available per block: 32768

  Warp size:                                     32

  Maximum number of threads per multiprocessor:  2048

 

  CUDA Capability Major/Minor version number:    5.2

  Total amount of global memory:                 4096 MBytes (4294770688 bytes)

  (16) Multiprocessors, (128) CUDA Cores/MP:     2048 CUDA Cores

  GPU Max Clock rate:                            1216 MHz (1.22 GHz)

  Memory Clock rate:                             3505 Mhz

  Memory Bus Width:                              256-bit

  L2 Cache Size:                                 2097152 bytes

  Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)

  Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers

  Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       49152 bytes

  Total number of registers available per block: 65536

  Warp size:                                     32

  Maximum number of threads per multiprocessor:  2048

  Maximum number of threads per block:           1024

    
 ./bandwidthTest  
 

Host to Device Bandwidth, 1 Device(s)

 PINNED Memory Transfers

   Transfer Size (Bytes)    Bandwidth(MB/s)

   33554432            985.3

 

 Device to Host Bandwidth, 1 Device(s)

 PINNED Memory Transfers

   Transfer Size (Bytes)    Bandwidth(MB/s)

   33554432            3383.1

 

 Device to Device Bandwidth, 1 Device(s)  PINNED Memory Transfers

   Transfer Size (Bytes)    Bandwidth(MB/s)

   33554432            2603.8

 

Host to Device Bandwidth, 1 Device(s)

 PINNED Memory Transfers

   Transfer Size (Bytes)    Bandwidth(MB/s)

   33554432            5159.2

 

 Device to Host Bandwidth, 1 Device(s)

 PINNED Memory Transfers

   Transfer Size (Bytes)    Bandwidth(MB/s)

   33554432            3953.4

 

 Device to Device Bandwidth, 1 Device(s)  PINNED Memory Transfers

   Transfer Size (Bytes)    Bandwidth(MB/s)

   33554432            76000.5