Thiele, Christopher (Rice University) | Araya-Polo, Mauricio (Shell International Exploration & Production, Inc.) | Alpak, Faruk Omer (Shell International Exploration & Production, Inc.) | Riviere, Beatrice (Rice University)
Direct numerical simulation of multiphase pore-scale flow is a computationally demanding task with strong requirements on time-to-solution for the prediction of relative permeabilities. In this paper, we describe the hybrid-parallel implementation of a two-phase two-component incompressible flow simulator using MPI, OpenMP, and general-purpose graphics processing units (GPUs), and we analyze its computational performance. In particular, we evaluate the parallel performance of GPU-based iterative linear solvers for this application, and we compare them to CPU-based implementations of the same solver algorithms. Simulations on real-life Berea sandstone micro-CT images are used to assess the strong scalability and computational performance of the different solver implementations and their effect on time-to-solution. Additionally, we use a Poisson problem to further characterize achievable strong and weak scalability of the GPU-based solvers in reproducible experiments. Our experiments show that GPU-based iterative solvers can greatly reduce time-to-solution in complex pore-scale simulations. On the other hand, strong scalability is currently limited by the unbalanced computing capacities of the host and the GPUs. The experiments with the Poisson problem indicate that GPU-based iterative solvers are efficient when weak scalability is desired. Our findings show that proper utilization of GPUs can help to make our two-phase pore-scale flow simulation computationally feasible in existing workflows.
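For readers unfamiliar with the two scalability notions used in the abstract above, the following is a minimal sketch of how strong and weak scaling efficiency are commonly computed from measured run times. It is not taken from the paper: the function names and the timings in the usage lines are hypothetical and serve only as illustration.

```python
def strong_scaling_efficiency(t_base, t_p, p_base, p):
    """Strong scaling: the total problem size is fixed.

    Efficiency is the measured speedup (t_base / t_p) divided by the
    ideal speedup (p / p_base), where p counts processing units
    (e.g. GPUs or MPI ranks).
    """
    return (t_base / t_p) / (p / p_base)


def weak_scaling_efficiency(t_base, t_p):
    """Weak scaling: the problem size per processing unit is fixed.

    Ideally the run time stays constant as units are added, so
    efficiency is simply the ratio of baseline to measured run time.
    """
    return t_base / t_p


# Hypothetical timings in seconds, for illustration only.
strong = strong_scaling_efficiency(t_base=100.0, t_p=30.0, p_base=1, p=4)
weak = weak_scaling_efficiency(t_base=10.0, t_p=12.5)
print(f"strong scaling efficiency: {strong:.2f}")
print(f"weak scaling efficiency:   {weak:.2f}")
```

An efficiency close to 1 indicates near-ideal scaling. Under these definitions, a solver can show good weak scaling while its strong scaling degrades, which is the pattern the abstract reports for the GPU-based solvers.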
Thiele, Christopher (Shell International E&P Inc.) | Araya-Polo, Mauricio (Shell International E&P Inc.) | Alpak, Faruk O. (Shell International E&P Inc.) | Riviere, Beatrice (Rice University) | Frank, Florian (Rice University)
HSS splits the linear system into a coarse-scale system of reduced size corresponding to the local mean values of the DG solution, and a set of
We propose a modified HSS algorithm (