Skip to main content

Showing 1–20 of 20 results for author: Araya-Polo, M

  1. arXiv:2404.04441  [pdf, other

    cs.DC

    Evaluation of Programming Models and Performance for Stencil Computation on Current GPU Architectures

    Authors: Baodi Shan, Mauricio Araya-Polo

    Abstract: Accelerated computing is widely used in high-performance computing. Therefore, it is crucial to experiment and discover how to better utilize GPUGPUs latest generations on relevant applications. In this paper, we present results and share insights about highly tuned stencil-based kernels for NVIDIA Ampere (A100) and Hopper (GH200) architectures. Performance results yield useful insights into the b… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  2. arXiv:2312.01267  [pdf, other

    cs.LG cs.DC q-bio.BM

    Distributed Reinforcement Learning for Molecular Design: Antioxidant case

    Authors: Huanyi Qin, Denis Akhiyarov, Sophie Loehle, Kenneth Chiu, Mauricio Araya-Polo

    Abstract: Deep reinforcement learning has successfully been applied for molecular discovery as shown by the Molecule Deep Q-network (MolDQN) algorithm. This algorithm has challenges when applied to optimizing new molecules: training such a model is limited in terms of scalability to larger datasets and the trained model cannot be generalized to different molecules in the same dataset. In this paper, a distr… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  3. arXiv:2311.06297  [pdf, other

    physics.chem-ph cs.LG

    STRIDE: Structure-guided Generation for Inverse Design of Molecules

    Authors: Shehtab Zaman, Denis Akhiyarov, Mauricio Araya-Polo, Kenneth Chiu

    Abstract: Machine learning and especially deep learning has had an increasing impact on molecule and materials design. In particular, given the growing access to an abundance of high-quality small molecule data for generative modeling for drug design, results for drug discovery have been promising. However, for many important classes of materials such as catalysts, antioxidants, and metal-organic frameworks… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  4. arXiv:2311.00107  [pdf, other

    physics.geo-ph cs.LG eess.SP

    Deep Compressed Learning for 3D Seismic Inversion

    Authors: Maayan Gelboim, Amir Adler, Yen Sun, Mauricio Araya-Polo

    Abstract: We consider the problem of 3D seismic inversion from pre-stack data using a very small number of seismic sources. The proposed solution is based on a combination of compressed-sensing and machine learning frameworks, known as compressed-learning. The solution jointly optimizes a dimensionality reduction operator and a 3D inversion encoder-decoder implemented by a deep convolutional neural network… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Presented at The International Meeting for Applied Geoscience & Energy (IMAGE23)

  5. arXiv:2310.04430  [pdf, other

    physics.geo-ph cs.CV cs.LG

    Joint inversion of Time-Lapse Surface Gravity and Seismic Data for Monitoring of 3D CO$_2$ Plumes via Deep Learning

    Authors: Adrian Celaya, Mauricio Araya-Polo

    Abstract: We introduce a fully 3D, deep learning-based approach for the joint inversion of time-lapse surface gravity and seismic data for reconstructing subsurface density and velocity models. The target application of this proposed inversion approach is the prediction of subsurface CO2 plumes as a complementary tool for monitoring CO2 sequestration deployments. Our joint inversion technique outperforms de… ▽ More

    Submitted 24 September, 2023; originally announced October 2023.

  6. arXiv:2309.04671  [pdf, other

    cs.DC

    A Portable Framework for Accelerating Stencil Computations on Modern Node Architectures

    Authors: Ryuichi Sai, John Mellor-Crummey, Jinfan Xu, Mauricio Araya-Polo

    Abstract: Finite-difference methods based on high-order stencils are widely used in seismic simulations, weather forecasting, computational fluid dynamics, and other scientific applications. Achieving HPC-level stencil computations on one architecture is challenging, porting to other architectures without sacrificing performance requires significant effort, especially in this golden age of many distinctive… ▽ More

    Submitted 7 July, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

  7. arXiv:2306.09648  [pdf, other

    cs.LG physics.ao-ph

    Learning CO$_2$ plume migration in faulted reservoirs with Graph Neural Networks

    Authors: Xin Ju, François P. Hamon, Gege Wen, Rayan Kanfar, Mauricio Araya-Polo, Hamdi A. Tchelepi

    Abstract: Deep-learning-based surrogate models provide an efficient complement to numerical simulations for subsurface flow problems such as CO$_2$ geological storage. Accurately capturing the impact of faults on CO$_2$ plume migration remains a challenge for many existing deep learning surrogate models based on Convolutional Neural Networks (CNNs) or Neural Operators. We address this challenge with a graph… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  8. arXiv:2304.11274  [pdf, other

    cs.MS

    Massively Distributed Finite-Volume Flux Computation

    Authors: Ryuichi Sai, Mathias Jacquelin, François P. Hamon, Mauricio Araya-Polo, Randolph R. Settgast

    Abstract: Designing large-scale geological carbon capture and storage projects and ensuring safe long-term CO2 containment - as a climate change mitigation strategy - requires fast and accurate numerical simulations. These simulations involve solving complex PDEs governing subsurface fluid flow using implicit finite-volume schemes widely based on Two-Point Flux Approximation (TPFA). This task is computation… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 10 pages excl. bibliography. Submitted to SuperComputing 2023

  9. Application of quantum-inspired generative models to small molecular datasets

    Authors: C. Moussa, H. Wang, M. Araya-Polo, T. Bäck, V. Dunjko

    Abstract: Quantum and quantum-inspired machine learning has emerged as a promising and challenging research field due to the increased popularity of quantum computing, especially with near-term devices. Theoretical contributions point toward generative modeling as a promising direction to realize the first examples of real-world quantum advantages from these technologies. A few empirical studies also demons… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: First version

    Journal ref: 2023 IEEE International Conference on Quantum Computing and Engineering (QCE), Bellevue, WA, USA, 2023, pp. 342-348

  10. arXiv:2211.08506  [pdf, other

    cs.CE cs.LG

    ParticleGrid: Enabling Deep Learning using 3D Representation of Materials

    Authors: Shehtab Zaman, Ethan Ferguson, Cecile Pereira, Denis Akhiyarov, Mauricio Araya-Polo, Kenneth Chiu

    Abstract: From AlexNet to Inception, autoencoders to diffusion models, the development of novel and powerful deep learning models and learning algorithms has proceeded at breakneck speeds. In part, we believe that rapid iteration of model architecture and learning techniques by a large community of researchers over a common representation of the underlying entities has resulted in transferable deep learning… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Published in the 2022 IEEE 18th International Conference on eScience (eScience)

  11. arXiv:2209.02850  [pdf, other

    cs.LG physics.geo-ph

    Inversion of Time-Lapse Surface Gravity Data for Detection of 3D CO$_2$ Plumes via Deep Learning

    Authors: Adrian Celaya, Bertrand Denel, Yen Sun, Mauricio Araya-Polo, Antony Price

    Abstract: We introduce three algorithms that invert simulated gravity data to 3D subsurface rock/flow properties. The first algorithm is a data-driven, deep learning-based approach, the second mixes a deep learning approach with physical modeling into a single workflow, and the third considers the time dependence of surface gravity monitoring. The target application of these proposed algorithms is the predi… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  12. arXiv:2207.14789  [pdf, other

    physics.geo-ph cs.LG eess.SP

    Encoder-Decoder Architecture for 3D Seismic Inversion

    Authors: Maayan Gelboim, Amir Adler, Yen Sun, Mauricio Araya-Polo

    Abstract: Inverting seismic data to build 3D geological structures is a challenging task due to the overwhelming amount of acquired seismic data, and the very-high computational load due to iterative numerical solutions of the wave equation, as required by industry-standard tools such as Full Waveform Inversion (FWI). For example, in an area with surface dimensions of 4.5km $\times$ 4.5km, hundreds of seism… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

  13. arXiv:2204.03775  [pdf, other

    cs.MS

    Massively scalable stencil algorithm

    Authors: Mathias Jacquelin, Mauricio Araya-Polo, Jie Meng

    Abstract: Stencil computations lie at the heart of many scientific and industrial applications. Unfortunately, stencil algorithms perform poorly on machines with cache based memory hierarchy, due to low re-use of memory accesses. This work shows that for stencil computation a novel algorithm that leverages a localized communication strategy effectively exploits the Cerebras WSE-2, which has no cache hierarc… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 10 pages excl. bibliography. Submitted to SuperComputing 2022

  14. arXiv:2009.04619  [pdf, other

    cs.DC

    Accelerating High-Order Stencils on GPUs

    Authors: Ryuichi Sai, John Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng

    Abstract: Stencil computations are widely used in HPC applications. Today, many HPC platforms use GPUs as accelerators. As a result, understanding how to perform stencil computations fast on GPUs is important. While implementation strategies for low-order stencils on GPUs have been well-studied in the literature, not all of proposed enhancements work well for high-order stencils, such as those used for seis… ▽ More

    Submitted 15 September, 2020; v1 submitted 9 September, 2020; originally announced September 2020.

  15. arXiv:2007.06048  [pdf, other

    cs.DC

    Minimod: A Finite Difference solver for Seismic Modeling

    Authors: Jie Meng, Andreas Atle, Henri Calandra, Mauricio Araya-Polo

    Abstract: This article introduces a benchmark application for seismic modeling using finite difference method, which is namedMiniMod, a mini application for seismic modeling. The purpose is to provide a benchmark suite that is, on one hand easy to build and adapt to the state of the art in programming models and changing high performance hardware landscape. On the other hand, the intention is to have a prox… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

  16. arXiv:2004.03040  [pdf, other

    cs.LG stat.ML

    Deep Neural Network Learning with Second-Order Optimizers -- a Practical Study with a Stochastic Quasi-Gauss-Newton Method

    Authors: Christopher Thiele, Mauricio Araya-Polo, Detlef Hohl

    Abstract: Training in supervised deep learning is computationally demanding, and the convergence behavior is usually not fully understood. We introduce and study a second-order stochastic quasi-Gauss-Newton (SQGN) optimization method that combines ideas from stochastic quasi-Newton methods, Gauss-Newton methods, and variance reduction to address this problem. SQGN provides excellent accuracy without the nee… ▽ More

    Submitted 30 June, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 8 pages, 3 figures; added reference to code, fixed formatting of title

    ACM Class: G.1.6; I.2.10; I.2.6

  17. arXiv:1608.00636  [pdf, ps, other

    cs.PF cs.DC math.NA

    A survey of sparse matrix-vector multiplication performance on large matrices

    Authors: Max Grossman, Christopher Thiele, Mauricio Araya-Polo, Florian Frank, Faruk O. Alpak, Vivek Sarkar

    Abstract: We contribute a third-party survey of sparse matrix-vector (SpMV) product performance on industrial-strength, large matrices using: (1) The SpMV implementations in Intel MKL, the Trilinos project (Tpetra subpackage), the CUSPARSE library, and the CUSP library, each running on modern architectures. (2) NVIDIA GPUs and Intel multi-core CPUs (supported by each software package). (3) The CSR, BSR, COO… ▽ More

    Submitted 1 August, 2016; originally announced August 2016.

    Comments: Rice Oil & Gas High Performance Computing Workshop. March 2016

  18. arXiv:1603.03971  [pdf, other

    cs.DC

    Performance Analysis and Optimization of a Hybrid Distributed Reverse Time Migration Application

    Authors: Sri Raj Paul, John Mellor-Crummey, Mauricio Araya-Polo, Detlef Hohl

    Abstract: Applications to process seismic data employ scalable parallel systems to produce timely results. To fully exploit emerging processor architectures, application will need to employ threaded parallelism within a node and message passing across nodes. Today, MPI+OpenMP is the preferred programming model for this task. However, tuning hybrid programs for clusters is difficult. Performance tools can he… ▽ More

    Submitted 12 March, 2016; originally announced March 2016.

    Comments: 2 page extended abstract presented at The International Conference for High Performance Computing, Networking, Storage and Analysis (SC) 2015 for ACM Student Research Competition

  19. arXiv:1506.05439  [pdf, other

    cs.LG cs.CV stat.ML

    Learning with a Wasserstein Loss

    Authors: Charlie Frogner, Chiyuan Zhang, Hossein Mobahi, Mauricio Araya-Polo, Tomaso Poggio

    Abstract: Learning to predict multi-label outputs is challenging, but in many problems there is a natural metric on the outputs that can be used to improve predictions. In this paper we develop a loss function for multi-label learning, based on the Wasserstein distance. The Wasserstein distance provides a natural notion of dissimilarity for probability measures. Although optimizing with respect to the exact… ▽ More

    Submitted 29 December, 2015; v1 submitted 17 June, 2015; originally announced June 2015.

    Comments: NIPS 2015; v3 updates Algorithm 1 and Equations 6, 8

  20. arXiv:cs/0606042  [pdf, ps, other

    cs.DS

    Enabling user-driven Checkpointing strategies in Reverse-mode Automatic Differentiation

    Authors: Laurent Hascoet, Mauricio Araya-Polo

    Abstract: This paper presents a new functionality of the Automatic Differentiation (AD) tool Tapenade. Tapenade generates adjoint codes which are widely used for optimization or inverse problems. Unfortunately, for large applications the adjoint code demands a great deal of memory, because it needs to store a large set of intermediates values. To cope with that problem, Tapenade implements a sub-optimal v… ▽ More

    Submitted 9 June, 2006; originally announced June 2006.