ConversionArguments#
- class ConversionArguments#
Data class for holding conversion arguments. Mostly just inheriting from RuntimeArguments
Attributes
addl_hats_propertiesAny additional keyword arguments you would like to provide when writing the hats.properties file for the final HATS table.
catalog_pathconstructed output path for the catalog that will be something like <output_path>/<output_artifact_name>
completion_email_addressif provided, send an email to the indicated email address once the import pipeline has completed.
create_metadataCreate /dataset/_metadata parquet from all data partitions.
create_per_partition_statsCreate per_partition_statistics.parquet, based on footers from all data partitions.
create_thumbnailCreate /dataset/data_thumbnail.parquet from one row of each data partition.
dask_n_workersnumber of workers for the dask client
dask_threads_per_workernumber of threads per dask worker
dask_tmpdirectory for dask worker space.
delete_intermediate_parquet_filesshould we delete the smaller intermediate parquet files generated in the splitting stage, once the relevant reducing stage is complete?
delete_resume_log_filesshould we delete task-level done files once each stage is complete? if False, we will keep all done marker files at the end of the pipeline.
input_catalog_pathnpix_parquet_nameName of the pixel parquet file to be used when npix_suffix=/.
npix_suffixSuffix for pixel data.
output_artifact_nameshort, convenient name for the catalog
output_pathbase path where new catalog should be output
progress_barif true, a progress bar will be displayed for user feedback of map reduce progress
resumeIf True, we try to read any existing intermediate files and continue to run the pipeline where we left off.
resume_tmpdirectory for intermediate resume files, when needed.
row_group_kwargsadditional keyword arguments to use in creation of rowgroups when writing files to parquet.
should_write_skymapmain catalogs should contain skymap fits files
simple_progress_barif displaying a progress bar, use a text-only simple progress bar instead of widget.
skymap_alt_ordersAdditional alternative healpix orders to write a HEALPix skymap.
tmp_base_patheither tmp_dir or dask_dir, if those were provided by the user
tmp_dirpath for storing intermediate files
tmp_pathconstructed temp path - defaults to tmp_dir, then dask_tmp, but will create a new temp directory under catalog_path if no other options are provided
tqdm_kwargsAdditional arguments to pass to the tqdm progress bar.
write_table_kwargsadditional keyword arguments to use when writing files to parquet (e.g. compression schemes).
Methods
__init__([output_path, ...])extra_property_dict()Generate additional HATS properties for this import run as a dictionary.
resume_kwargs_dict()Convenience method to convert fields for resume functionality.
- __init__(output_path: str | Path | UPath | None = None, output_artifact_name: str = '', addl_hats_properties: dict | None = None, npix_suffix: str = '.parquet', npix_parquet_name: str | None = None, write_table_kwargs: dict | None = None, row_group_kwargs: dict | None = None, should_write_skymap: bool = True, skymap_alt_orders: list[int] | None = None, create_thumbnail: bool = False, create_metadata: bool = True, create_per_partition_stats: bool = False, tmp_dir: str | Path | UPath | None = None, resume: bool = True, progress_bar: bool = True, simple_progress_bar: bool = False, tqdm_kwargs: dict | None = None, dask_tmp: str | Path | UPath | None = None, dask_n_workers: int = 1, dask_threads_per_worker: int = 1, resume_tmp: str | Path | UPath | None = None, delete_intermediate_parquet_files: bool = True, delete_resume_log_files: bool = True, completion_email_address: str = '', catalog_path: UPath | None = None, tmp_path: UPath | None = None, tmp_base_path: UPath | None = None, input_catalog_path: str | Path | UPath | None = None) None#
- classmethod __new__(*args, **kwargs)#