GPUPi (OPenCL Pi gpu Benchmark )

Matt26LFC · Apr 28, 2015

Anyone got any idea why this bench seems to think I'm running two 280Xs and not two 7970s?

Angantyr · Apr 28, 2015

Matt26LFC said: ↑

Anyone got any idea why this bench seems to think I'm running two 280Xs and not two 7970s?
Click to expand...

My 7950 was also registered / put into the same category as the R9 280.

2mat0 · Apr 28, 2015

Matt26LFC said: ↑

Anyone got any idea why this bench seems to think I'm running two 280Xs and not two 7970s?
Click to expand...

AMD's OpenCL driver only gives access to the codename of the GPU, which is the same for rebranded graphics cards.

Noisiv · May 8, 2015

Several diagnostics are still reporting "OpenCL 1.2". What's up with that?

GPU Caps Viewer:

Ver.: OpenCL 1.2 AMD-APP (1642.5)
-CL_DEVICE_VERSION:OpenCL 1.2 AMD-APP (1642.5)
Version/Profile: OpenCL 2.0 AMD-APP (1642.5) FULL_PROFILE

Lane · May 9, 2015

Noisiv said: ↑

Several diagnostics are still reporting "OpenCL 1.2". What's up with that?

GPU Caps Viewer:

Ver.: OpenCL 1.2 AMD-APP (1642.5)
-CL_DEVICE_VERSION:OpenCL 1.2 AMD-APP (1642.5)
Version/Profile: OpenCL 2.0 AMD-APP (1642.5) FULL_PROFILE
Click to expand...

need to check with the guys who have done thoses diagnostics softwares ? even with DX i see some funny things.

cowie · May 12, 2015

what driver to use with 290x on this bench? my titan x runs it faster and that should not be

Lane · May 12, 2015

cowie said: ↑

what driver to use with 290x on this bench? my titan x runs it faster and that should not be
Click to expand...

For be honest, i have not try this one since a good time, i see a new version is out ( 2.1.1 ). As for driver, sadly i dont know.

Personally i have install the separated complete AMD APP SDK 2.0, as i use the compilers for OpenCL accelerated render engine for Blender+Luxrender. (Luxcore + path OpenCL ). so i dont know if theres something who can change from driver to driver.

cowie · May 15, 2015

still thought it would be 10 second runs at 1300 or so on my 290x I have all the latest thing as you describe

lexer98 · May 24, 2015

Really unstable but i was able to run the benchmark ...

learmat · Jul 24, 2015

learmat said: ↑

980 @ 1660 and 8ghz memory : 32.552

****************

Device: NVIDIA GeForce GTX 980 (16 CUs, 1405 MHz)
OpenCL 1.1 CUDA 6.5.30 is ready.

Compiling OpenCL kernels ... done.

Calculating 1.000.000.000th digit of PI. 20 iterations.

Allocated device memory : 335546368 Bytes
Batch Size : 20M
Reduction Size : 64

00h 00m 00.197s Batch 1 finished.
00h 00m 00.929s Batch 2 finished.
00h 00m 01.604s Batch 3 finished.
00h 00m 02.930s Batch 4 finished.
00h 00m 05.914s Batch 5 finished.
00h 00m 08.608s Batch 6 finished.
00h 00m 09.290s Batch 7 finished.
00h 00m 09.965s Batch 8 finished.
00h 00m 11.282s Batch 9 finished.
00h 00m 14.223s Batch 10 finished.
00h 00m 16.880s Batch 11 finished.
00h 00m 17.563s Batch 12 finished.
00h 00m 18.238s Batch 13 finished.
00h 00m 19.566s Batch 14 finished.
00h 00m 22.553s Batch 15 finished.
00h 00m 25.248s Batch 16 finished.
00h 00m 25.932s Batch 17 finished.
00h 00m 26.608s Batch 18 finished.
00h 00m 27.925s Batch 19 finished.
00h 00m 30.867s Batch 20 finished.
00h 00m 33.455s PI value output -> 5895585A0

Device time for pi calculation: 32.552 s
Device time for memory reduction: 0.903 s
Click to expand...

Another run on Win 10:
980 @ 1505 and 8ghz memory : 30.030s + 1.005s

OpenCL 1.2 CUDA 7.5.9 is ready. Timer: HPET (14.32 MHz)

OpenCL GPU: NVIDIA GeForce GTX 980 (16 CUs, 1367 MHz)
Compiling OpenCL kernels ... done.

Calculating 1.000.000.000th digit of PI. 20 iterations.

Allocated device memory : 335546368 Bytes
Batch Size : 20M
Reduction Size : 64

00h 00m 00.715s Batch 1 finished.
00h 00m 01.404s Batch 2 finished.
00h 00m 02.459s Batch 3 finished.
00h 00m 05.158s Batch 4 finished.
00h 00m 07.840s Batch 5 finished.
00h 00m 08.518s Batch 6 finished.
00h 00m 09.191s Batch 7 finished.
00h 00m 10.231s Batch 8 finished.
00h 00m 12.891s Batch 9 finished.
00h 00m 15.530s Batch 10 finished.
00h 00m 16.208s Batch 11 finished.
00h 00m 16.885s Batch 12 finished.
00h 00m 17.933s Batch 13 finished.
00h 00m 20.642s Batch 14 finished.
00h 00m 23.330s Batch 15 finished.
00h 00m 24.012s Batch 16 finished.
00h 00m 24.689s Batch 17 finished.
00h 00m 25.734s Batch 18 finished.
00h 00m 28.401s Batch 19 finished.
00h 00m 31.043s PI value output -> 5895585A0

Statistics

Calculation + Reduction time: 30.030s + 1.005s

Extraordinary · Jul 24, 2015

Running Win 10

Error: High Precision Event Timer for time measurement not found!

This is mandatory for Windows 8 and higher to avoid skewed timing.
You'll find a guide on how to enable the HPET timer on your system
in our FAQ. Open about dialog in the menu above for the link.

--
Haven't disabled HPET, a Win 10 bug?

learmat · Jul 24, 2015

Extraordinary said: ↑

Running Win 10

Error: High Precision Event Timer for time measurement not found!

This is mandatory for Windows 8 and higher to avoid skewed timing.
You'll find a guide on how to enable the HPET timer on your system
in our FAQ. Open about dialog in the menu above for the link.

--
Haven't disabled HPET, a Win 10 bug?
Click to expand...

The value of useplatformclock should be "Yes". If it's not, you can fix this by running:
SHELL:
bcdedit /set useplatformclock yes

A reboot might be necessary afterwards.

Extraordinary · Jul 24, 2015

learmat said: ↑

The value of useplatformclock should be "Yes". If it's not, you can fix this by running:
SHELL:
bcdedit /set useplatformclock yes

A reboot might be necessary afterwards.
Click to expand...

Thanks, is there a command to check the status before changing it?

To see what it was defaulted to

EDIT - Found it: bcdedit /deletevalue useplatformclock

If you get an error (Which I did) it was disabled

---Working now---

OpenCL 1.2 CUDA 7.5.14 is ready. Timer: HPET (14.32 MHz)

OpenCL GPU: NVIDIA GeForce GTX 980 (16 CUs, 1367 MHz)
Compiling OpenCL kernels ... done.

Calculating 1.000.000.000th digit of PI. 20 iterations.

Allocated device memory : 335546368 Bytes
Batch Size : 20M
Reduction Size : 64

00h 00m 00.719s Batch 1 finished.
00h 00m 01.398s Batch 2 finished.
00h 00m 02.441s Batch 3 finished.
00h 00m 05.112s Batch 4 finished.
00h 00m 07.763s Batch 5 finished.
00h 00m 08.431s Batch 6 finished.
00h 00m 09.095s Batch 7 finished.
00h 00m 10.120s Batch 8 finished.
00h 00m 12.750s Batch 9 finished.
00h 00m 15.360s Batch 10 finished.
00h 00m 16.030s Batch 11 finished.
00h 00m 16.696s Batch 12 finished.
00h 00m 17.731s Batch 13 finished.
00h 00m 20.408s Batch 14 finished.
00h 00m 23.064s Batch 15 finished.
00h 00m 23.735s Batch 16 finished.
00h 00m 24.401s Batch 17 finished.
00h 00m 25.431s Batch 18 finished.
00h 00m 28.066s Batch 19 finished.
00h 00m 30.678s PI value output -> 5895585A0

Statistics

Calculation + Reduction time: 29.632s + 1.026s

FTLN · Jul 28, 2015

980 @ 1661

Cuda - Calculation + Reduction time: 30.305s + 0.934s
OpenCL - Calculation + Reduction time: 27.160s + 0.886s

------------------------------------------------------------------------

CUDA GPU: GeForce GTX 980
Kernel 1, Batch Size: 20M, Blocks: 20480, Threads: 1024
Kernel 2, Batch Size: 20M, Blocks: 27307, Threads: 768

Calculating 1.000.000.000th digit of PI. 20 iterations.

Allocated device memory : 335549456 Bytes
Batch Size : 20M
Reduction Size : 64

00h 00m 00.677s Batch 1 finished.
00h 00m 01.318s Batch 2 finished.
00h 00m 02.556s Batch 3 finished.
00h 00m 05.259s Batch 4 finished.
00h 00m 07.893s Batch 5 finished.
00h 00m 08.541s Batch 6 finished.
00h 00m 09.182s Batch 7 finished.
00h 00m 10.408s Batch 8 finished.
00h 00m 13.053s Batch 9 finished.
00h 00m 15.632s Batch 10 finished.
00h 00m 16.281s Batch 11 finished.
00h 00m 16.923s Batch 12 finished.
00h 00m 18.162s Batch 13 finished.
00h 00m 20.866s Batch 14 finished.
00h 00m 23.503s Batch 15 finished.
00h 00m 24.151s Batch 16 finished.
00h 00m 24.794s Batch 17 finished.
00h 00m 26.019s Batch 18 finished.
00h 00m 28.666s Batch 19 finished.
00h 00m 31.247s PI value output -> 5895585A0

Statistics

Calculation + Reduction time: 30.305s + 0.934s

OpenCL 1.2 CUDA 7.5.15 is ready.

OpenCL GPU: NVIDIA GeForce GTX 980 (16 CUs, 1481 MHz)
Compiling OpenCL kernels ... done.

Calculating 1.000.000.000th digit of PI. 20 iterations.

Allocated device memory : 335546368 Bytes
Batch Size : 20M
Reduction Size : 64

00h 00m 00.627s Batch 1 finished.
00h 00m 01.226s Batch 2 finished.
00h 00m 02.167s Batch 3 finished.
00h 00m 04.627s Batch 4 finished.
00h 00m 07.066s Batch 5 finished.
00h 00m 07.672s Batch 6 finished.
00h 00m 08.272s Batch 7 finished.
00h 00m 09.208s Batch 8 finished.
00h 00m 11.631s Batch 9 finished.
00h 00m 14.031s Batch 10 finished.
00h 00m 14.638s Batch 11 finished.
00h 00m 15.238s Batch 12 finished.
00h 00m 16.180s Batch 13 finished.
00h 00m 18.642s Batch 14 finished.
00h 00m 21.084s Batch 15 finished.
00h 00m 21.691s Batch 16 finished.
00h 00m 22.292s Batch 17 finished.
00h 00m 23.229s Batch 18 finished.
00h 00m 25.653s Batch 19 finished.
00h 00m 28.055s PI value output -> 5895585A0

Statistics

Calculation + Reduction time: 27.160s + 0.886s

Tugrul_512bit · Sep 6, 2015

HD7870 + r7_240 gives 65 seconds @ default settings. (doesnt use oc setting for a reason)

Log in or Sign up

GPUPi (OPenCL Pi gpu Benchmark )

Matt26LFC Ancient Guru

Angantyr Master Guru

2mat0 New Member

Noisiv Ancient Guru

Lane Guest

cowie Ancient Guru

Lane Guest

cowie Ancient Guru

lexer98 Guest

learmat Guest

Extraordinary Guest

learmat Guest

Extraordinary Guest

FTLN Member

Tugrul_512bit Guest

Share This Page