CAREER: Exploiting Parallel Heterogeneous Architectures to Enable Time-domain Astronomy in the LSST era

Overview

The Vera C. Rubin Observatory will carry out the Legacy Survey of Space and Time (LSST) over a ten year period. This project focuses on LSST supporting cyberinfrastructure (CI) in the context of Solar System science. The project will directly support the Solar System Notification and Alert Processing System (SNAPS), which is a Rubin-approved alert broker. SNAPS will send alerts the the astronomy community to enable other telescopes to perform follow-up observations of objects of interest in the Solar System.

To ensure that rapid follow-up opportunities are possible, two types of unsupervised learning algorithms are needed: (i) near real-time outlier detection that occurs during nighttime observing and alerts the community of interesting events on small bodies; and (ii) outlier detection that determines if a small body is intrinsically interesting relative to all other small bodies. Addressing these two goals necessitates the advancement of new outlier detection algorithms that address very different workloads, and thus present several interesting computational challenges. These new algorithms will be incorporated into SNAPS CI and made publicly available for incorporation into other alert brokers and standalone use.

The fast and scalable outlier detection algorithms are the missing step between Rubin data measurements and comprehensive scientific investigations for Solar System science and other science cases addressed by other alert brokers that require detecting objects exhibiting interesting (outlying) behavior.

The project will address new facets of parallel heterogeneous computing including:

  • Examining the potential of new application specific integrated circuits (ASICs), such as tensor and ray tracing cores found on recent generations of GPUs, for accelerating outlier detection tasks.
  • Examining the scalability of heterogeneous systems, including compute nodes with multi-core CPUs and GPUs, the latter of which are equipped with the abovementioned ASICs.
  • A systematic exploration of the algorithm design space to understand which hardware configurations and algorithm design choices are best suited for a given workload/science case.

NSF Award Information

NSF Grant No. 2042155

Broader Impact: Pedagogic Modules [Coming Soon]

Part of this project includes pedagogic modules for teaching general purpose computing on graphics processing units using data-intensive applications. The modules will also feature the use of ASICs found on GPUs (tensor and ray tracing cores). These modules are intended to be integrated into courses at the undergraduate and graduate levels. Once available, the modules will be posted at the following link. Website

Publications

  • Scalable Searches In High-Dimensional Spaces: Leveraging Multi- And Many-Core Architectures [PDF]
    Donnelly, B.
    PhD Dissertation, Northern Arizona University, 2025
  • Teaching Parallel and Distributed Computing using Data-Intensive Computing Modules [PDF]
    Gowanlock, M.
    To appear in the Journal of Parallel and Distributed Computing (JPDC)
    DOI: https://doi.org/10.1016/j.jpdc.2025.105093
  • Multi-Space Tree with Incremental Construction for GPU-Accelerated Range Queries [PDF]
    Donnelly, B. & Gowanlock, M.
    Proceedings of the 31st IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2024). pp. 132-142. DOI: https://doi.org/10.1109/HiPC62374.2024.00024.
  • GDBOD: Density-Based Outlier Detection Exploiting Efficient Tree Traversals on the GPU [PDF]
    Munugala, R. & Gowanlock, M.
    Proceedings of the 31st IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2024). pp. 111-121. DOI: https://doi.org/10.1109/HiPC62374.2024.00021
  • Asteroid Period Solutions from Combined Dense and Sparse Photometry [PDF]
    Gowanlock, M., Trilling, D. E., McNeill, A., Kramer, D., & Chernyavskaya, M.
    The Astronomical Journal, 168, 181. DOI: https://doi.org/10.3847/1538-3881/ad6cdd
  • The Solar System Notification Alert Processing System (SNAPS): Asteroid Population Outlier Detection [PDF]
    Gowanlock, M., Trilling, D. E., Kramer, D., Chernyavskaya, M., & McNeill, A.
    The Astronomical Journal, 168, 56. DOI: https://doi.org/10.3847/1538-3881/ad4da5
  • Efficient Euclidean Distance Calculations And Distance Similarity Searches: An Exaination Of Heterogeneous CPU, GPU, And Tensor Core Architectures [PDF]
    Gallet, B.
    PhD Dissertation, Northern Arizona University, 2023
  • Optimization and Comparison of Coordinate- and Metric-Based Indexes on GPUs for Distance Similarity Searches [PDF]
    Gowanlock, M, Gallet, B., & Donnelly, B.
    Proceedings of the International Conference on Computational Science 2023 (ICCS 2023), pp. 357-364. Cham: Springer Nature Switzerland. DOI: https://doi.org/10.1007/978-3-031-36021-3_37
  • The Solar System Notification Alert Processing System (SNAPS): Design, Architecture, and First Data Release (SNAPShot1) [PDF]
    Trilling, D. E., Gowanlock, M., Kramer, D., McNeill, A., Donnelly, B., Butler, N., & Kececioglu, J.
    The Astronomical Journal, 165, 111. DOI: https://doi.org/10.3847/1538-3881/acac7f
  • Leveraging GPU Tensor Cores for Double Precision Euclidean Distance Calculations [PDF]
    Gallet, B. & Gowanlock, M.
    Proceedings of the 2022 IEEE 29th International Conference on High Performance Computing, Data, and Analytics (HiPC), pp. 135–144. DOI: https://doi.org/10.1109/HiPC56025.2022.00029
  • CUDA-DClust+: Revisiting Early GPU-Accelerated DBSCAN Clustering Designs [PDF]
    Poudel, M., & Gowanlock, M.
    Proceedings of the 28th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2021), pp. 354–363. DOI: https://doi.org/10.1109/HiPC53243.2021.00049

Code

Most of the code in the publications above is publicly available in the repositories below.