WriteSourceTableTask¶
WriteSourceTableTask converts a table of sources measured on a calexp (dataset src) to a parquet file.
All data is copied without transformation, and column names are unchanged, except for the "id" column, which is replaced by a DataFrame index.
It is the first of three postprocessing tasks to convert a src table to a
per-visit Source Table that conforms to the standard data model. The second is
TransformSourceTableTask, and the third is ConsolidateSourceTableTask.
Processing summary¶
WriteSourceTableTask reads in the src table, calls its asAstropy method to obtain an Astropy table, converts that table to a DataFrame indexed by "id", and writes it out in parquet format.
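The handling of the "id" column can be illustrated with a minimal sketch that uses pandas only. It does not call the LSST stack or write parquet. The catalog contents and the helper name to_indexed_frame are hypothetical stand-ins, not part of the task's API.

```python
import pandas as pd

# Hypothetical stand-in for a small src catalog that has already been
# converted to a DataFrame; the values are made up for illustration.
src_like = pd.DataFrame(
    {
        "id": [1001, 1002, 1003],
        "coord_ra": [0.10, 0.11, 0.12],
        "coord_dec": [-0.50, -0.51, -0.52],
    }
)


def to_indexed_frame(df: pd.DataFrame) -> pd.DataFrame:
    """Move the "id" column into the index; other columns are unchanged."""
    return df.set_index("id", drop=True)


indexed = to_indexed_frame(src_like)

# The "id" values now label the rows instead of appearing as a column.
print(indexed.index.name)
print(list(indexed.columns))
```

A frame indexed this way can be written with pandas' DataFrame.to_parquet, which needs a parquet engine such as pyarrow to be installed; whether the task itself writes the file that way is not stated here.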
Python API summary¶
from lsst.pipe.tasks.postprocess import WriteSourceTableTask
class WriteSourceTableTask(*, config=None, log=None, initInputs=None, **kwargs)
    Write source table to DataFrame Parquet format
    ...
attribute config
    Access configuration fields and retargetable subtasks.
See also
See the WriteSourceTableTask API reference for complete details.
Butler datasets¶
When run through the runQuantum method, WriteSourceTableTask obtains datasets from the input Butler data repository and persists outputs to the output Butler data repository.
Note that configurations for WriteSourceTableTask, and its subtasks, affect what datasets are persisted and what their content is.
Input datasets¶
src
Full depth source catalog (lsst.afw.table) produced by ProcessCcdTask
Output datasets¶
source
Full depth source catalog (parquet)
Retargetable subtasks¶
No subtasks.
Configuration fields¶
connections¶
- Data type
lsst.pipe.base.config.WriteSourceTableConfigConnections
- Field type

Configurations describing the connections of the PipelineTask to datatypes
idGenerator¶
- Data type
lsst.meas.base._id_generator.DetectorVisitIdGeneratorConfig
- Field type

Configuration for how to generate catalog IDs from data IDs.
saveLogOutput¶
Flag to enable/disable saving of log output for a task, enabled by default.
saveMetadata¶
Flag to enable/disable metadata saving for a task, enabled by default.