Dhairya Malhotra
|
c6156f2384
Update Makefile.am for quadruple precision.
|
10 years ago |
Dhairya Malhotra
|
8c2b816f95
Fix multiply defined symbols.
|
10 years ago |
Dhairya Malhotra
|
01430964b3
Fix Phi compilation errors; add m4 script to detect quadruple precision support.
|
10 years ago |
Dhairya Malhotra
|
deacf73ca2
- Update Stokes double-layer potential to symmetric part of Stokes-dipole.
|
10 years ago |
Dhairya Malhotra
|
15eeec9134
fix errors.
|
10 years ago |
Dhairya Malhotra
|
b8dbad698b
Quadruple precision support.
|
10 years ago |
Dhairya Malhotra
|
03de09b5df
Add single-precision SSE support for V-list
|
10 years ago |
Dhairya Malhotra
|
2a168271f6
Optimize singular integration: cheb_integ(...)
|
10 years ago |
Dhairya Malhotra
|
8dde7b6a45
update example fmm_cheb.cpp
|
11 years ago |
Dhairya Malhotra
|
64e0edc334
load balancing, bug fixes.
|
11 years ago |
Dhairya Malhotra
|
a5ea2e19b0
update scripts (Helmholtz).
|
11 years ago |
Dhairya Malhotra
|
60d0823440
Merge branch 'feature/mpi-sparse' into develop
|
11 years ago |
Dhairya Malhotra
|
5e3a02609c
clean up
|
11 years ago |
Dhairya Malhotra
|
201d276277
update scripts.
|
11 years ago |
Dhairya Malhotra
|
189faf035c
Optimize 2:1 balance algorithm.
|
11 years ago |
Dhairya Malhotra
|
5d19e794c5
bug fixes, optimizations
|
11 years ago |
Dhairya Malhotra
|
a05dc803a5
Changes for Intel Phi native build.
|
11 years ago |
Dhairya Malhotra
|
6844053776
Add memory manager.
|
11 years ago |
Dhairya Malhotra
|
c5edaefd39
bug fixes, everything works!
|
11 years ago |
Dhairya Malhotra
|
aa0182ad59
.
|
11 years ago |
Dhairya Malhotra
|
43490daacd
.
|
11 years ago |
Dhairya Malhotra
|
240427df6b
temporary fix for Device2HostWait()
|
11 years ago |
Dhairya Malhotra
|
075cfdda7d
Clean up.
|
11 years ago |
Chenhan D. Yu
|
1cc35372fb
Last commit of feature/cuda from Chenhan
|
11 years ago |
Dhairya Malhotra
|
28f272943b
Permutations using shared memory.
|
11 years ago |
Chenhan D. Yu
|
069f848388
Bug fixed, counter, tmp_a, tmp_b.
|
11 years ago |
Chenhan D. Yu
|
36c01fec95
Optimize kernels in_perm_2d_k and out_perm_2d_k.
|
11 years ago |
Chenhan D. Yu
|
5f94b322e3
Eventually, this is a bug free version!!
|
11 years ago |
Chenhan D. Yu
|
0c7134e0dd
This is bug-version. GPU address wrap-over.
|
11 years ago |
Dhairya Malhotra
|
d802dd28bb
minor changes
|
11 years ago |