clinfo not detecting all 3 hd6950 gpus (only lists one)

Discussion in 'Videocards - AMD Radeon Drivers Section' started by joshpunb, Jul 5, 2012.

  1. joshpunb

    joshpunb New Member

    Messages:
    1
    Likes Received:
    0
    GPU:
    HD 6950 x3
    I have recently switched from Ubuntu 11.04 to OpenSUSE 12.1 because of issues with the ati catalyst drivers. Now on OpenSUSE I have the drivers correctly installed (using atiupgrade, which was the only installation method that seemed to work properly and was extremely simple to use!!). I will be using my system for research, which is why I need to get opencl to detect all of my gpus. It currently only detects one of the devices as an opencl device.

    Here is a list of connected graphics cards:

    Code:
    $ sudo /sbin/lspci | grep VGA
    root's password:
    01:00.0 VGA compatible controller: ATI Technologies Inc Cayman PRO [Radeon HD 6950]
    06:00.0 VGA compatible controller: ATI Technologies Inc Cayman PRO [Radeon HD 6950]
    07:00.0 VGA compatible controller: ATI Technologies Inc Cayman PRO [Radeon HD 6950]
    My current output from clinfo is:

    Code:
    $clinfo
    Number of platforms:                 1
      Platform Profile:                 FULL_PROFILE
      Platform Version:                 OpenCL 1.2 AMD-APP (938.1)
      Platform Name:                 AMD Accelerated Parallel Processing
      Platform Vendor:                 Advanced Micro Devices, Inc.
      Platform Extensions:                 cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
    
    
      Platform Name:                 AMD Accelerated Parallel Processing
    Number of devices:                 2
      Device Type:                     CL_DEVICE_TYPE_GPU
      Device ID:                     4098
      Board name:                     AMD Radeon HD 6900 Series 
      Device Topology:                 PCI[ B#1, D#0, F#0 ]
      Max compute units:                 22
      Max work items dimensions:             3
        Max work items[0]:                 256
        Max work items[1]:                 256
        Max work items[2]:                 256
      Max work group size:                 256
      Preferred vector width char:             16
      Preferred vector width short:             8
      Preferred vector width int:             4
      Preferred vector width long:             2
      Preferred vector width float:             4
      Preferred vector width double:         2
      Native vector width char:             16
      Native vector width short:             8
      Native vector width int:             4
      Native vector width long:             2
      Native vector width float:             4
      Native vector width double:             2
      Max clock frequency:                 800Mhz
      Address bits:                     32
      Max memory allocation:             268435456
      Image support:                 Yes
      Max number of images read arguments:         128
      Max number of images write arguments:         8
      Max image 2D width:                 8192
      Max image 2D height:                 8192
      Max image 3D width:                 2048
      Max image 3D height:                 2048
      Max image 3D depth:                 2048
      Max samplers within kernel:             16
      Max size of kernel argument:             1024
      Alignment (bits) of base address:         2048
      Minimum alignment (bytes) for any datatype:     128
      Single precision floating point capability
        Denorms:                     No
        Quiet NaNs:                     Yes
        Round to nearest even:             Yes
        Round to zero:                 Yes
        Round to +ve and infinity:             Yes
        IEEE754-2008 fused multiply-add:         Yes
      Cache type:                     None
      Cache line size:                 0
      Cache size:                     0
      Global memory size:                 1073741824
      Constant buffer size:                 65536
      Max number of constant args:             8
      Local memory type:                 Scratchpad
      Local memory size:                 32768
      Kernel Preferred work group size multiple:     64
      Error correction support:             0
      Unified memory for Host and Device:         0
      Profiling timer resolution:             1
      Device endianess:                 Little
      Available:                     Yes
      Compiler available:                 Yes
      Execution capabilities:                 
        Execute OpenCL kernels:             Yes
        Execute native function:             No
      Queue properties:                 
        Out-of-Order:                 No
        Profiling :                     Yes
      Platform ID:                     0x7f77b03200e0
      Name:                         Cayman
      Vendor:                     Advanced Micro Devices, Inc.
      Device OpenCL C version:             OpenCL C 1.2 
      Driver version:                 CAL 1.4.1741
      Profile:                     FULL_PROFILE
      Version:                     OpenCL 1.2 AMD-APP (938.1)
      Extensions:                     cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt 
    
    
      Device Type:                     CL_DEVICE_TYPE_CPU
      Device ID:                     4098
      Board name:                     
      Max compute units:                 6
      Max work items dimensions:             3
        Max work items[0]:                 1024
        Max work items[1]:                 1024
        Max work items[2]:                 1024
      Max work group size:                 1024
      Preferred vector width char:             16
      Preferred vector width short:             8
      Preferred vector width int:             4
      Preferred vector width long:             2
      Preferred vector width float:             4
      Preferred vector width double:         0
      Native vector width char:             16
      Native vector width short:             8
      Native vector width int:             4
      Native vector width long:             2
      Native vector width float:             4
      Native vector width double:             0
      Max clock frequency:                 800Mhz
      Address bits:                     64
      Max memory allocation:             4211426304
      Image support:                 Yes
      Max number of images read arguments:         128
      Max number of images write arguments:         8
      Max image 2D width:                 8192
      Max image 2D height:                 8192
      Max image 3D width:                 2048
      Max image 3D height:                 2048
      Max image 3D depth:                 2048
      Max samplers within kernel:             16
      Max size of kernel argument:             4096
      Alignment (bits) of base address:         1024
      Minimum alignment (bytes) for any datatype:     128
      Single precision floating point capability
        Denorms:                     Yes
        Quiet NaNs:                     Yes
        Round to nearest even:             Yes
        Round to zero:                 Yes
        Round to +ve and infinity:             Yes
        IEEE754-2008 fused multiply-add:         Yes
      Cache type:                     Read/Write
      Cache line size:                 64
      Cache size:                     65536
      Global memory size:                 16845705216
      Constant buffer size:                 65536
      Max number of constant args:             8
      Local memory type:                 Global
      Local memory size:                 32768
      Kernel Preferred work group size multiple:     1
      Error correction support:             0
      Unified memory for Host and Device:         1
      Profiling timer resolution:             1
      Device endianess:                 Little
      Available:                     Yes
      Compiler available:                 Yes
      Execution capabilities:                 
        Execute OpenCL kernels:             Yes
        Execute native function:             Yes
      Queue properties:                 
        Out-of-Order:                 No
        Profiling :                     Yes
      Platform ID:                     0x7f77b03200e0
      Name:                         AMD Phenom(tm) II X6 1090T Processor
      Vendor:                     AuthenticAMD
      Device OpenCL C version:             OpenCL C 1.2 
      Driver version:                 2.0 (sse2)
      Profile:                     FULL_PROFILE
      Version:                     OpenCL 1.2 AMD-APP (938.1)
      Extensions:                     cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt
    clinfo only lists one gpu... can anyone explain to me why clinfo is only detecting one gpu and how to have it recognize the other two?

    also the current output of fglrxinfo is:
    Code:
    $fglrxinfo 
    display: :0.0  screen: 0
    OpenGL vendor string: Advanced Micro Devices, Inc.
    OpenGL renderer string: AMD Radeon HD 6900 Series 
    OpenGL version string: 4.2.11733 Compatibility Profile Context
     

Share This Page