Overview
The Vera C. Rubin Observatory will carry out the Legacy Survey of Space and Time (LSST) over a ten-year period. This project focuses on cyberinfrastructure (CI) that supports LSST in the context of Solar System science. The project will directly support the Solar System Notification and Alert Processing System (SNAPS), which is a Rubin-approved alert broker. SNAPS will send alerts to the astronomy community to enable other telescopes to perform follow-up observations of objects of interest in the Solar System.
To ensure that rapid follow-up opportunities are possible, two types of unsupervised learning algorithms are needed: (i) near real-time outlier detection that runs during nighttime observing and alerts the community to interesting events on small bodies; and (ii) outlier detection that determines whether a small body is intrinsically interesting relative to all other small bodies. Meeting these two goals requires new outlier detection algorithms that target very different workloads, which presents several interesting computational challenges. These new algorithms will be incorporated into SNAPS CI and made publicly available for incorporation into other alert brokers and for standalone use.
The fast and scalable outlier detection algorithms are the missing step between Rubin data measurements and comprehensive scientific investigations for Solar System science and other science cases addressed by other alert brokers that require detecting objects exhibiting interesting (outlying) behavior.
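As a rough illustration of goal (ii), the sketch below scores each object by the distance to its k-th nearest neighbor in a feature space, so that objects far from the rest of the population rank highest. This is a generic brute-force textbook approach, not the algorithm used in SNAPS; the function names, the feature vectors, and the parameters `k` and `top_n` are all hypothetical.

```python
import math


def knn_outlier_scores(points, k):
    """Return, for each point, the distance to its k-th nearest neighbor.

    Larger scores mean the point sits in a sparser region of the space.
    Brute force: O(n^2) distance computations, fine only for small inputs.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < len(points)")
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, in ascending order.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(dists[k - 1])
    return scores


def flag_outliers(points, k, top_n):
    """Return the indices of the top_n points with the largest scores."""
    scores = knn_outlier_scores(points, k)
    ranked = sorted(range(len(points)), key=lambda i: scores[i], reverse=True)
    return ranked[:top_n]


# Hypothetical two-dimensional feature vectors, one per object.
features = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]
outliers = flag_outliers(features, k=2, top_n=1)
```

The quadratic cost of the brute-force neighbor search is what makes indexing and GPU acceleration attractive for populations of millions of objects.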
The project will address new facets of parallel heterogeneous computing including:
- Examining the potential of new application-specific integrated circuits (ASICs), such as tensor and ray tracing cores found on recent generations of GPUs, for accelerating outlier detection tasks.
- Examining the scalability of heterogeneous systems, including compute nodes with multi-core CPUs and GPUs, the latter of which are equipped with the abovementioned ASICs.
- Systematically exploring the algorithm design space to understand which hardware configurations and algorithm design choices are best suited for a given workload or science case.
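To show why matrix-multiply hardware such as tensor cores can be relevant to distance-based outlier detection, the sketch below rewrites squared Euclidean distances using the identity ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a·b. The cross term is a matrix product, which is the operation tensor cores accelerate. This is a plain CPU illustration of the algebra under that assumption; it is not the method from any of the publications listed on this page, and the function name is hypothetical.

```python
def squared_distances(A, B):
    """Squared Euclidean distances between every row of A and every row of B.

    Uses ||a - b||^2 = ||a||^2 + ||b||^2 - 2 * (a . b), so the dominant
    cost is the cross-term matrix product A @ B^T.
    """
    norms_a = [sum(x * x for x in a) for a in A]
    norms_b = [sum(x * x for x in b) for b in B]
    # Cross term: entry [i][j] is the dot product of A[i] and B[j].
    cross = [[sum(x * y for x, y in zip(a, b)) for b in B] for a in A]
    # Clamp at zero because rounding can produce tiny negative values.
    return [
        [max(norms_a[i] + norms_b[j] - 2.0 * cross[i][j], 0.0)
         for j in range(len(B))]
        for i in range(len(A))
    ]


# Small hypothetical inputs.
A = [(0.0, 0.0), (3.0, 4.0)]
B = [(0.0, 0.0), (1.0, 1.0)]
D = squared_distances(A, B)
```

One trade-off of this formulation is that subtracting large, nearly equal terms can lose precision for points that are very close together, which is why the result is clamped at zero.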
NSF Award Information
Broader Impact: Pedagogic Modules [Coming Soon]
Part of this project includes pedagogic modules for teaching general-purpose computing on graphics processing units using data-intensive applications. The modules will also feature the use of ASICs found on GPUs (tensor and ray tracing cores). These modules are intended to be integrated into courses at the undergraduate and graduate levels. Once available, the modules will be posted on the project website.
Publications
- Multi-Space Tree with Incremental Construction for GPU-Accelerated Range Queries [PDF]
Donnelly, B. & Gowanlock, M.
To appear in the Proceedings of the 31st IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2024).
- GDBOD: Density-Based Outlier Detection Exploiting Efficient Tree Traversals on the GPU [PDF]
Munugala, R. & Gowanlock, M.
To appear in the Proceedings of the 31st IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2024).
- Asteroid Period Solutions from Combined Dense and Sparse Photometry [PDF]
Gowanlock, M., Trilling, D. E., McNeill, A., Kramer, D., & Chernyavskaya, M.
The Astronomical Journal, 168, 181. DOI: https://doi.org/10.3847/1538-3881/ad6cdd
- The Solar System Notification Alert Processing System (SNAPS): Asteroid Population Outlier Detection [PDF]
Gowanlock, M., Trilling, D. E., Kramer, D., Chernyavskaya, M., & McNeill, A.
The Astronomical Journal, 168, 56. DOI: https://doi.org/10.3847/1538-3881/ad4da5
- Optimization and Comparison of Coordinate- and Metric-Based Indexes on GPUs for Distance Similarity Searches [PDF]
Gowanlock, M., Gallet, B., & Donnelly, B.
Proceedings of the International Conference on Computational Science 2023 (ICCS 2023), pp. 357-364. Cham: Springer Nature Switzerland. DOI: https://doi.org/10.1007/978-3-031-36021-3_37
- The Solar System Notification Alert Processing System (SNAPS): Design, Architecture, and First Data Release (SNAPShot1) [PDF]
Trilling, D. E., Gowanlock, M., Kramer, D., McNeill, A., Donnelly, B., Butler, N., & Kececioglu, J.
The Astronomical Journal, 165, 111. DOI: https://doi.org/10.3847/1538-3881/acac7f
- Leveraging GPU Tensor Cores for Double Precision Euclidean Distance Calculations [PDF]
Gallet, B. & Gowanlock, M.
To appear in the Proceedings of the 29th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2022).
- CUDA-DClust+: Revisiting Early GPU-Accelerated DBSCAN Clustering Designs [PDF]
Poudel, M. & Gowanlock, M.
Proceedings of the 28th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC 2021), pp. 354–363. DOI: https://doi.org/10.1109/HiPC53243.2021.00049
Code
Most of the code in the publications above is publicly available in the repositories below.
- MiSTIC: Multi-Space Tree with Incremental Construction for GPU-Accelerated Range Queries
Authors: Brian Donnelly
Paper: HiPC 2024
Link: TBD
- GDBOD: Density-Based Outlier Detection Exploiting Efficient Tree Traversals on the GPU
Authors: Revanth Munugala and Mike Gowanlock
Paper: HiPC 2024
Link: https://github.com/revanthmunugala/GDBOD
- GPU-Accelerated Distance Similarity Self-Join for Low and High Dimensional Data
Authors: Mike Gowanlock, Benoit Gallet, Brian Donnelly, and Ben Karsin
Paper: ICCS 2023
Link: https://github.com/mgowanlock/gpu_self_join
- Leveraging GPU Tensor Cores for Double Precision Euclidean Distance Calculations
Authors: Benoit Gallet and Mike Gowanlock
Paper: HiPC 2022
Link: https://github.com/benoitgallet/ted-join-hipc22
- CUDA-DClust+: Revisiting Early GPU-Accelerated DBSCAN Clustering Designs
Authors: Madhav Poudel and Mike Gowanlock
Paper: HiPC 2021
Link: https://github.com/l3lackcurtains/fast-cuda-gpu-dbscan