Background
A space science operations center (SOC) is where scientists and data come together to do research and/or manage operations of a spacecraft. Traditionally, data is available to be analyzed or downloaded to a user’s local system for analysis. As data volumes are increasing rapidly with new generations of sensors, the process of bringing the data to each scientist for analysis is becoming unsustainable. The purpose of this research is to invert the process and push analysis software to the data.
Approach
Through a novel process known as “containerization” (where a functional piece of a system is distilled into a recipe and stored into an image), and the emergence of cloud technology, it should be possible to do many, if not all, elements of a space science operations center using a cloud-based solution. Our existing data analysis, data mining, and visualization software are to be modernized and containerized, and then deployed on the cloud. To aid in our research, as well as for security and potentially keep elements proprietary, a locally available cluster will be utilized as a “pseudo-cloud”.
Accomplishments
Portions of our visualization system have been modernized by migrating our build system to a new CMake build system. Other pieces of our development have also been containerized including our bug-tracking system and our continuous integration system. Significantly, the entirety of our generic SOC has been containerized into a multi-container system. What this means is each major piece of a SOC is within its own container which can be deployed anywhere, including commercial cloud service providers such as Amazon, or on our on-premises cluster computer. By using this hybrid approach, cost-savings can be utilized by gradually migrating to the cloud instead of doing an entire “lift-and-shift” approach to cloud development. This also allows for just utilizing the cloud for specific use cases such as computer speed or large disk arrays. Finally, as part of this research, the Heliophysics API (HAPI) for serving data to other requesting data centers has been developed for both the client and server.