CUDA Device count = 1
Some properties of CUDA device 0:
=================================
Name: GeForce GTX 760
Compute capability: 3.0
Number of multiprocessors: 6
Total global memory: 2147155968 bytes
Shared Mem/Block: 49152 bytes
Shared Mem Access: 8 bytes
=================================
*********************
n_digits = 156
prec_words = 12, 12
MAX_PREC_WORDS = 145
n_words = 17
numElement = 307200
*********************
Prepare data.................................
done.
test_add ........................................
numElement = 307200, interval = 307200
numBlock = 2400, numThread = 128
interval memory layout...
*** GPU add: 0.003 sec ***
*** CPU add: 0.164 sec ***
*** The abs. of max. rel. error = 10 ^ 0 x 0 ***
*** The abs. of avg. rel. error = 10 ^ 0 x 0 ***
A sample when i = 164661
GOLD = 10 ^ 0 x 1.146514967712305809649915716387329890097697486394245497434308221497424464834325631
79364833248365700345339165485223330578732787146025929987360878945595128796
REF = 10 ^ 0 x 1.146514967712305809649915716387329890097697486394245497434308221497424464834325631
79364833248365700345339165485223330578732787146025929987360878945595128796
test_sub ........................................
numElement = 307200, interval = 307200
numBlock = 2400, numThread = 128
interval memory layout...
*** GPU sub: 0.003 sec ***
*** CPU sub: 0.185 sec ***
*** The abs. of max. rel. error = 10 ^ 0 x 0 ***
*** The abs. of avg. rel. error = 10 ^ 0 x 0 ***
A sample when i = 90752
GOLD = 10 ^ -1 x 2.24032445053941494471490869245516024230081070034160848188327014170345318734560570
536693998741659321929492623237323066365706675874447724150862042972514377446
REF = 10 ^ -1 x 2.24032445053941494471490869245516024230081070034160848188327014170345318734560570
536693998741659321929492623237323066365706675874447724150862042972514377446
test_mul ........................................
numElement = 307200, interval = 307200
numBlock = 2400, numThread = 128
interval memory layout...
*** GPU mul: 0.013 sec ***
*** CPU mul: 0.459 sec ***
*** The abs. of max. rel. error = 10 ^ 0 x 0 ***
*** The abs. of avg. rel. error = 10 ^ 0 x 0 ***
A sample when i = 123003
GOLD = 10 ^ -1 x 5.30056800732076570691327903160327288932904861807881356311266826572994142947323528
41976425418735331748966283710939580686946026614702874865642273292239809154
REF = 10 ^ -1 x 5.30056800732076570691327903160327288932904861807881356311266826572994142947323528
41976425418735331748966283710939580686946026614702874865642273292239809154
test_div ........................................
numElement = 307200, interval = 307200
numBlock = 2400, numThread = 128
interval memory layout...
*** GPU div: 0.018 sec ***
*** CPU div: 0.604 sec ***
*** The abs. of max. rel. error = 10 ^ 0 x 0 ***
*** The abs. of avg. rel. error = 10 ^ 0 x 0 ***
A sample when i = 194256
GOLD = 10 ^ 0 x 1.334185451621135878465590322550729345967927280447987755070190881191672203052509390
03208758406331862064461105327812246702639904274084665067687023580592219925
REF = 10 ^ 0 x 1.334185451621135878465590322550729345967927280447987755070190881191672203052509390
03208758406331862064461105327812246702639904274084665067687023580592219925
I've pushed garprec to 10000 digits (680 prec words), but it starts getting sketchy at those levels on my system.
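For anyone reading the log above, here is a small sketch of how its numbers relate to each other. It assumes the tests launch one thread per element, which is consistent with numBlock x numThread = numElement in the output but is not confirmed by the program's source. The helper names (`blocks_needed`, `speedup`) are made up for illustration, and the timings are copied straight from the log.

```python
# Values copied from the log; the helper names are illustrative only.
NUM_ELEMENT = 307200
NUM_THREAD = 128

def blocks_needed(num_element, num_thread):
    """Ceiling division: the smallest block count that covers every element,
    assuming one thread per element (an assumption, not taken from the source)."""
    return (num_element + num_thread - 1) // num_thread

# (GPU seconds, CPU seconds) as printed by each test in the log.
TIMINGS = {
    "add": (0.003, 0.164),
    "sub": (0.003, 0.185),
    "mul": (0.013, 0.459),
    "div": (0.018, 0.604),
}

def speedup(gpu_sec, cpu_sec):
    """CPU time divided by GPU time."""
    return cpu_sec / gpu_sec

if __name__ == "__main__":
    print("numBlock =", blocks_needed(NUM_ELEMENT, NUM_THREAD))
    for op, (gpu, cpu) in TIMINGS.items():
        print(f"{op}: roughly {speedup(gpu, cpu):.0f}x")
```

The timings are printed to only three decimal places, so the add and sub ratios (GPU time 0.003 sec) are coarse estimates; the mul and div ratios are a bit more trustworthy.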