NVIDIA GTX 1080 Ti Spec Query

 

After installing CUDA, build the deviceQuery sample under samples/1_Utilities with make and run it to print information about the device.


CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 1080 Ti"
 CUDA Driver Version / Runtime Version          9.2 / 9.0
 CUDA Capability Major/Minor version number:    6.1
 Total amount of global memory:                 11177 MBytes (11720130560 bytes)
 (28) Multiprocessors, (128) CUDA Cores/MP:     3584 CUDA Cores
 GPU Max Clock rate:                            1671 MHz (1.67 GHz)
 Memory Clock rate:                             5505 Mhz
 Memory Bus Width:                              352-bit
 L2 Cache Size:                                 2883584 bytes
 Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
 Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers
 Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers
 Total amount of constant memory:               65536 bytes
 Total amount of shared memory per block:       49152 bytes
 Total number of registers available per block: 65536
 Warp size:                                     32
 Maximum number of threads per multiprocessor:  2048
 Maximum number of threads per block:           1024
 Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
 Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
 Maximum memory pitch:                          2147483647 bytes
 Texture alignment:                             512 bytes
 Concurrent copy and kernel execution:          Yes with 2 copy engine(s)
 Run time limit on kernels:                     Yes
 Integrated GPU sharing Host Memory:            No
 Support host page-locked memory mapping:       Yes
 Alignment requirement for Surfaces:            Yes
 Device has ECC support:                        Disabled
 Device supports Unified Addressing (UVA):      Yes
 Supports Cooperative Kernel Launch:            Yes
 Supports MultiDevice Co-op Kernel Launch:      Yes
 Device PCI Domain ID / Bus ID / location ID:   0 / 1 / 0
 Compute Mode:
    < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 9.2, CUDA Runtime Version = 9.0, NumDevs = 1
Result = PASS
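
A few of the figures in this output can be cross-checked with simple arithmetic. The Python sketch below recomputes the total core count, the memory sizes in binary units, and a theoretical peak memory bandwidth. The function and constant names are hypothetical, and the bandwidth formula assumes two transfers per reported memory clock, which is a common convention and not something deviceQuery prints.

```python
# Values copied from the deviceQuery output above.
GLOBAL_MEM_BYTES = 11720130560
NUM_SMS = 28
CORES_PER_SM = 128
MEM_CLOCK_MHZ = 5505
BUS_WIDTH_BITS = 352
L2_BYTES = 2883584
SHARED_PER_BLOCK_BYTES = 49152


def total_cuda_cores(num_sms, cores_per_sm):
    """Total cores = multiprocessors x cores per multiprocessor."""
    return num_sms * cores_per_sm


def bytes_to_mib(n_bytes):
    """Convert bytes to MiB (2**20 bytes)."""
    return n_bytes / (1024 ** 2)


def bytes_to_kib(n_bytes):
    """Convert bytes to KiB (2**10 bytes)."""
    return n_bytes / 1024


def peak_bandwidth_gb_s(mem_clock_mhz, bus_width_bits, transfers_per_clock=2):
    """Theoretical peak bandwidth in GB/s (10**9 bytes per second).

    Assumption: the memory moves `transfers_per_clock` words of
    `bus_width_bits` bits per reported memory clock cycle.
    """
    bytes_per_transfer = bus_width_bits / 8
    return mem_clock_mhz * 1e6 * transfers_per_clock * bytes_per_transfer / 1e9
```

With the values above, `total_cuda_cores` gives 3584, matching the reported core count, and `bytes_to_mib(GLOBAL_MEM_BYTES)` gives 11177.1875, which deviceQuery prints as 11177 MBytes. Under the stated assumption, the peak bandwidth works out to about 484.44 GB/s.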
- Summary
1. Global memory                  : 11,177 MB
2. Number of multiprocessors(MP)  : 28
  → Thread blocks are distributed across these 28 MPs; each MP can hold more than one resident block if resources allow
3. Number of CUDA Cores/MP        : 128
  → Number of FP32 cores per MP; this is not a limit on threads per block (see item 10)
4. GPU Max Clock rate             : 1,671 MHz
5. Total constant memory          : 65,536 Bytes
6. Total shared memory/block      : 49,152 Bytes
7. Total number of registers/block: 65,536
8. Warp size                      : 32
  → Number of threads grouped into one warp, which the MP schedules and executes together
9. Max number of threads/MP       : 2,048
10. Max number of threads/block   : 1,024
11. Max dimension size of a block : (1024, 1024, 64)
12. Max dimension size of a grid  : (2147483647, 65535, 65535)
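
The limits in items 8 to 12 are the ones a kernel launch configuration has to respect. As a rough illustration, the Python sketch below checks a grid and block shape against them and estimates how many blocks of a given size fit on one MP when only the thread limit is considered. The helper names are hypothetical, and the estimate is an upper bound: it ignores registers, shared memory, and any architecture-specific cap on resident blocks.

```python
# Limits copied from the summary above.
WARP_SIZE = 32
MAX_THREADS_PER_SM = 2048
MAX_THREADS_PER_BLOCK = 1024
MAX_BLOCK_DIM = (1024, 1024, 64)
MAX_GRID_DIM = (2147483647, 65535, 65535)


def launch_config_errors(grid, block):
    """Return a list of limit violations for (x, y, z) grid and block shapes."""
    errors = []
    for axis, size, limit in zip("xyz", block, MAX_BLOCK_DIM):
        if not 1 <= size <= limit:
            errors.append(f"block.{axis}={size} outside 1..{limit}")
    threads = block[0] * block[1] * block[2]
    if threads > MAX_THREADS_PER_BLOCK:
        errors.append(f"{threads} threads per block exceeds {MAX_THREADS_PER_BLOCK}")
    for axis, size, limit in zip("xyz", grid, MAX_GRID_DIM):
        if not 1 <= size <= limit:
            errors.append(f"grid.{axis}={size} outside 1..{limit}")
    return errors


def warps_per_block(threads_per_block):
    """Warps needed for one block; a partially filled warp still counts as one."""
    return -(-threads_per_block // WARP_SIZE)  # ceiling division


def max_resident_blocks_by_threads(threads_per_block):
    """Upper bound on resident blocks per MP from the thread limit alone."""
    slots_per_block = warps_per_block(threads_per_block) * WARP_SIZE
    return MAX_THREADS_PER_SM // slots_per_block
```

For example, a block of (32, 32, 2) passes every per-dimension limit but is still rejected because it contains 2,048 threads, twice the per-block maximum. A block of 256 threads gives an upper bound of 8 resident blocks per MP from the thread limit alone.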