Developed a CUDA version of the FDTD method and achieved a 40x speedup. Implemented on an NVIDIA Quadro FX 3800 GPU, which has 192 streaming processors (SPs), 1 GB of global memory, and a memory bandwidth of 51.2 GB/s.
In this paper we investigate the effectiveness of alternating direction implicit (ADI) time-discretization schemes in the numerical solution of the three-dimensional Heston-Hull-White partial ...
This paper develops two local mesh-free methods for designing stencil weights and spatial discretization, respectively, for parabolic partial differential equations (PDEs) of ...