HiPSCat Import#
Utility for ingesting large survey data into HiPSCat structure.
Installation#
We recommend installing in a virtual environment, like venv or conda. You may need to install or upgrade versions of dependencies to work with hipscat-import.
pip install hipscat-import
Tip
Installing on Mac
healpy
is a very necessary dependency for hipscat libraries at this time, but
native prebuilt binaries for healpy on Apple Silicon Macs
do not yet exist,
so it’s recommended to install via conda before proceeding to hipscat-import.
>> conda config --append channels conda-forge
>> conda install healpy
Setting up a pipeline#
For each type of dataset the hipscat-import tool can generate, there is an argument container class that you will need to instantiate and populate with relevant arguments.
See dataset-specific notes on arguments:
Catalog Import Arguments (most common)
Once you have created your arguments object, you pass it into the pipeline control, and then wait. Running within a main guard will potentially avoid some python threading issues with dask:
from dask.distributed import Client
from hipscat_import.pipeline import pipeline_with_client
def main():
args = ...
with Client(
n_workers=10,
threads_per_worker=1,
...
) as client:
pipeline_with_client(args, client)
if __name__ == '__main__':
main()
Acknowledgements#
This project is supported by Schmidt Sciences.
This project is based upon work supported by the National Science Foundation under Grant No. AST-2003196.
This project acknowledges support from the DIRAC Institute in the Department of Astronomy at the University of Washington. The DIRAC Institute is supported through generous gifts from the Charles and Lisa Simonyi Fund for Arts and Sciences, and the Washington Research Foundation.