DiaPipelineConnections¶
- class lsst.ap.association.DiaPipelineConnections(*, config: PipelineTaskConfig | None = None)¶
Bases:
PipelineTaskConnectionsButler connections for DiaPipelineTask.
Attributes Summary
Mapping holding all connection attributes.
Connection for output dataset.
Connection for output dataset.
Connection for output dataset.
Connection for output dataset.
Class used for declaring PipelineTask input connections
Class used for declaring PipelineTask input connections
Set of dimension names that define the unit of work for this task.
Class used for declaring PipelineTask input connections
Set with the names of all
InitInputconnection attributes.Set with the names of all
InitOutputconnection attributes.Set with the names of all
connectionTypes.Inputconnection attributes.Set with the names of all
Outputconnection attributes.Set with the names of all
PrerequisiteInputconnection attributes.Class used for declaring PipelineTask input connections
Class used for declaring PipelineTask input connections
Methods Summary
adjustQuantum(inputs, outputs, label, dataId)Override to make adjustments to
lsst.daf.butler.DatasetRefobjects in thelsst.daf.butler.core.Quantumduring the graph generation stage of the activator.buildDatasetRefs(quantum)Build
QuantizedConnectioncorresponding to inputQuantum.Attributes Documentation
- allConnections: Mapping[str, BaseConnection] = {'apdbMarker': Output(name='apdb_marker', storageClass='Config', doc='Marker dataset storing the configuration of the Apdb for each visit/detector. Used to signal the completion of the pipeline.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False), 'associatedDiaSources': Output(name='{fakesType}{coaddName}Diff_assocDiaSrc', storageClass='DataFrame', doc='Optional output storing the DiaSource catalog after matching, calibration, and standardization for insertion into the Apdb.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False), 'diaForcedSources': Output(name='{fakesType}{coaddName}Diff_diaForcedSrc', storageClass='DataFrame', doc='Optional output storing the forced sources computed at the diaObject positions.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False), 'diaObjects': Output(name='{fakesType}{coaddName}Diff_diaObject', storageClass='DataFrame', doc='Optional output storing the updated diaObjects associated to these sources.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False), 'diaSourceTable': Input(name='{fakesType}{coaddName}Diff_diaSrcTable', storageClass='DataFrame', doc='Catalog of calibrated DiaSources.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False, deferLoad=False, minimum=1, deferGraphConstraint=False), 'diffIm': Input(name='{fakesType}{coaddName}Diff_differenceExp', storageClass='ExposureF', doc='Difference image on which the DiaSources were detected.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False, deferLoad=False, minimum=1, deferGraphConstraint=False), 'exposure': Input(name='{fakesType}calexp', storageClass='ExposureF', doc='Calibrated exposure differenced with a template image during image differencing.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False, deferLoad=False, minimum=1, deferGraphConstraint=False), 'solarSystemObjectTable': Input(name='visitSsObjects', storageClass='DataFrame', doc='Catalog of SolarSolarSystem objects expected to be observable in this detectorVisit.', multiple=False, dimensions=('instrument', 'visit'), isCalibration=False, deferLoad=False, minimum=1, deferGraphConstraint=False), 'template': Input(name='{fakesType}{coaddName}Diff_templateExp', storageClass='ExposureF', doc='Warped template used to create `subtractedExposure`. Not PSF matched.', multiple=False, dimensions=('instrument', 'visit', 'detector'), isCalibration=False, deferLoad=False, minimum=1, deferGraphConstraint=False)}¶
Mapping holding all connection attributes.
This is a read-only view that is automatically updated when connection attributes are added, removed, or replaced in
__init__. It is also updated after__init__completes to reflect changes ininputs,prerequisiteInputs,outputs,initInputs, andinitOutputs.
- apdbMarker¶
Connection for output dataset.
- associatedDiaSources¶
Connection for output dataset.
- defaultTemplates = {'coaddName': 'deep', 'fakesType': ''}¶
- diaForcedSources¶
Connection for output dataset.
- diaObjects¶
Connection for output dataset.
- diaSourceTable¶
Class used for declaring PipelineTask input connections
- Parameters:
- name
str The default name used to identify the dataset type
- storageClass
str The storage class used when (un)/persisting the dataset type
- multiple
bool Indicates if this connection should expect to contain multiple objects of the given dataset type. Tasks with more than one connection with
multiple=Truewith the same dimensions may want to implementPipelineTaskConnections.adjustQuantumto ensure those datasets are consistent (i.e. zip-iterable) inPipelineTask.runQuantumand notify the execution system as early as possible of outputs that will not be produced because the corresponding input is missing.- dimensionsiterable of
str The
lsst.daf.butler.Butlerlsst.daf.butler.Registrydimensions used to identify the dataset type identified by the specified name- deferLoad
bool Indicates that this dataset type will be loaded as a
lsst.daf.butler.DeferredDatasetHandle. PipelineTasks can use this object to load the object at a later time.- minimum
bool Minimum number of datasets required for this connection, per quantum. This is checked in the base implementation of
PipelineTaskConnections.adjustQuantum, which raisesNoWorkFoundif the minimum is not met forInputconnections (causing the quantum to be pruned, skipped, or never created, depending on the context), andFileNotFoundErrorforPrerequisiteInputconnections (causing QuantumGraph generation to fail).PipelineTaskimplementations may provide customadjustQuantumimplementations for more fine-grained or configuration-driven constraints, as long as they are compatible with this minium.- deferGraphConstraint: `bool`, optional
If
True, do not include this dataset type’s existence in the initial query that starts the QuantumGraph generation process. This can be used to make QuantumGraph generation faster by avoiding redundant datasets, and in certain cases it can (along with careful attention to which tasks are included in the same QuantumGraph) be used to work around the QuantumGraph generation algorithm’s inflexible handling of spatial overlaps. This option has no effect when the connection is not an overall input of the pipeline (or subset thereof) for which a graph is being created, and it never affects the ordering of quanta.
- name
- Raises:
- TypeError
Raised if
minimumis greater than one butmultiple=False.- NotImplementedError
Raised if
minimumis zero for a regularInputconnection; this is not currently supported by our QuantumGraph generation algorithm.
- diffIm¶
Class used for declaring PipelineTask input connections
- Parameters:
- name
str The default name used to identify the dataset type
- storageClass
str The storage class used when (un)/persisting the dataset type
- multiple
bool Indicates if this connection should expect to contain multiple objects of the given dataset type. Tasks with more than one connection with
multiple=Truewith the same dimensions may want to implementPipelineTaskConnections.adjustQuantumto ensure those datasets are consistent (i.e. zip-iterable) inPipelineTask.runQuantumand notify the execution system as early as possible of outputs that will not be produced because the corresponding input is missing.- dimensionsiterable of
str The
lsst.daf.butler.Butlerlsst.daf.butler.Registrydimensions used to identify the dataset type identified by the specified name- deferLoad
bool Indicates that this dataset type will be loaded as a
lsst.daf.butler.DeferredDatasetHandle. PipelineTasks can use this object to load the object at a later time.- minimum
bool Minimum number of datasets required for this connection, per quantum. This is checked in the base implementation of
PipelineTaskConnections.adjustQuantum, which raisesNoWorkFoundif the minimum is not met forInputconnections (causing the quantum to be pruned, skipped, or never created, depending on the context), andFileNotFoundErrorforPrerequisiteInputconnections (causing QuantumGraph generation to fail).PipelineTaskimplementations may provide customadjustQuantumimplementations for more fine-grained or configuration-driven constraints, as long as they are compatible with this minium.- deferGraphConstraint: `bool`, optional
If
True, do not include this dataset type’s existence in the initial query that starts the QuantumGraph generation process. This can be used to make QuantumGraph generation faster by avoiding redundant datasets, and in certain cases it can (along with careful attention to which tasks are included in the same QuantumGraph) be used to work around the QuantumGraph generation algorithm’s inflexible handling of spatial overlaps. This option has no effect when the connection is not an overall input of the pipeline (or subset thereof) for which a graph is being created, and it never affects the ordering of quanta.
- name
- Raises:
- TypeError
Raised if
minimumis greater than one butmultiple=False.- NotImplementedError
Raised if
minimumis zero for a regularInputconnection; this is not currently supported by our QuantumGraph generation algorithm.
- dimensions: set[str] = {'detector', 'instrument', 'visit'}¶
Set of dimension names that define the unit of work for this task.
Required and implied dependencies will automatically be expanded later and need not be provided.
This may be replaced or modified in
__init__to change the dimensions of the task. After__init__it will be afrozensetand may not be replaced.
- exposure¶
Class used for declaring PipelineTask input connections
- Parameters:
- name
str The default name used to identify the dataset type
- storageClass
str The storage class used when (un)/persisting the dataset type
- multiple
bool Indicates if this connection should expect to contain multiple objects of the given dataset type. Tasks with more than one connection with
multiple=Truewith the same dimensions may want to implementPipelineTaskConnections.adjustQuantumto ensure those datasets are consistent (i.e. zip-iterable) inPipelineTask.runQuantumand notify the execution system as early as possible of outputs that will not be produced because the corresponding input is missing.- dimensionsiterable of
str The
lsst.daf.butler.Butlerlsst.daf.butler.Registrydimensions used to identify the dataset type identified by the specified name- deferLoad
bool Indicates that this dataset type will be loaded as a
lsst.daf.butler.DeferredDatasetHandle. PipelineTasks can use this object to load the object at a later time.- minimum
bool Minimum number of datasets required for this connection, per quantum. This is checked in the base implementation of
PipelineTaskConnections.adjustQuantum, which raisesNoWorkFoundif the minimum is not met forInputconnections (causing the quantum to be pruned, skipped, or never created, depending on the context), andFileNotFoundErrorforPrerequisiteInputconnections (causing QuantumGraph generation to fail).PipelineTaskimplementations may provide customadjustQuantumimplementations for more fine-grained or configuration-driven constraints, as long as they are compatible with this minium.- deferGraphConstraint: `bool`, optional
If
True, do not include this dataset type’s existence in the initial query that starts the QuantumGraph generation process. This can be used to make QuantumGraph generation faster by avoiding redundant datasets, and in certain cases it can (along with careful attention to which tasks are included in the same QuantumGraph) be used to work around the QuantumGraph generation algorithm’s inflexible handling of spatial overlaps. This option has no effect when the connection is not an overall input of the pipeline (or subset thereof) for which a graph is being created, and it never affects the ordering of quanta.
- name
- Raises:
- TypeError
Raised if
minimumis greater than one butmultiple=False.- NotImplementedError
Raised if
minimumis zero for a regularInputconnection; this is not currently supported by our QuantumGraph generation algorithm.
- initInputs: set[str] = frozenset({})¶
Set with the names of all
InitInputconnection attributes.See
inputsfor additional information.
- initOutputs: set[str] = frozenset({})¶
Set with the names of all
InitOutputconnection attributes.See
inputsfor additional information.
- inputs: set[str] = frozenset({'diaSourceTable', 'diffIm', 'exposure', 'solarSystemObjectTable', 'template'})¶
Set with the names of all
connectionTypes.Inputconnection attributes.This is updated automatically as class attributes are added, removed, or replaced in
__init__. Removing entries from this set will cause those connections to be removed after__init__completes, but this is supported only for backwards compatibility; new code should instead just delete the collection attributed directly. After__init__this will be afrozensetand may not be replaced.
- outputs: set[str] = frozenset({'apdbMarker', 'associatedDiaSources', 'diaForcedSources', 'diaObjects'})¶
Set with the names of all
Outputconnection attributes.See
inputsfor additional information.
- prerequisiteInputs: set[str] = frozenset({})¶
Set with the names of all
PrerequisiteInputconnection attributes.See
inputsfor additional information.
- solarSystemObjectTable¶
Class used for declaring PipelineTask input connections
- Parameters:
- name
str The default name used to identify the dataset type
- storageClass
str The storage class used when (un)/persisting the dataset type
- multiple
bool Indicates if this connection should expect to contain multiple objects of the given dataset type. Tasks with more than one connection with
multiple=Truewith the same dimensions may want to implementPipelineTaskConnections.adjustQuantumto ensure those datasets are consistent (i.e. zip-iterable) inPipelineTask.runQuantumand notify the execution system as early as possible of outputs that will not be produced because the corresponding input is missing.- dimensionsiterable of
str The
lsst.daf.butler.Butlerlsst.daf.butler.Registrydimensions used to identify the dataset type identified by the specified name- deferLoad
bool Indicates that this dataset type will be loaded as a
lsst.daf.butler.DeferredDatasetHandle. PipelineTasks can use this object to load the object at a later time.- minimum
bool Minimum number of datasets required for this connection, per quantum. This is checked in the base implementation of
PipelineTaskConnections.adjustQuantum, which raisesNoWorkFoundif the minimum is not met forInputconnections (causing the quantum to be pruned, skipped, or never created, depending on the context), andFileNotFoundErrorforPrerequisiteInputconnections (causing QuantumGraph generation to fail).PipelineTaskimplementations may provide customadjustQuantumimplementations for more fine-grained or configuration-driven constraints, as long as they are compatible with this minium.- deferGraphConstraint: `bool`, optional
If
True, do not include this dataset type’s existence in the initial query that starts the QuantumGraph generation process. This can be used to make QuantumGraph generation faster by avoiding redundant datasets, and in certain cases it can (along with careful attention to which tasks are included in the same QuantumGraph) be used to work around the QuantumGraph generation algorithm’s inflexible handling of spatial overlaps. This option has no effect when the connection is not an overall input of the pipeline (or subset thereof) for which a graph is being created, and it never affects the ordering of quanta.
- name
- Raises:
- TypeError
Raised if
minimumis greater than one butmultiple=False.- NotImplementedError
Raised if
minimumis zero for a regularInputconnection; this is not currently supported by our QuantumGraph generation algorithm.
- template¶
Class used for declaring PipelineTask input connections
- Parameters:
- name
str The default name used to identify the dataset type
- storageClass
str The storage class used when (un)/persisting the dataset type
- multiple
bool Indicates if this connection should expect to contain multiple objects of the given dataset type. Tasks with more than one connection with
multiple=Truewith the same dimensions may want to implementPipelineTaskConnections.adjustQuantumto ensure those datasets are consistent (i.e. zip-iterable) inPipelineTask.runQuantumand notify the execution system as early as possible of outputs that will not be produced because the corresponding input is missing.- dimensionsiterable of
str The
lsst.daf.butler.Butlerlsst.daf.butler.Registrydimensions used to identify the dataset type identified by the specified name- deferLoad
bool Indicates that this dataset type will be loaded as a
lsst.daf.butler.DeferredDatasetHandle. PipelineTasks can use this object to load the object at a later time.- minimum
bool Minimum number of datasets required for this connection, per quantum. This is checked in the base implementation of
PipelineTaskConnections.adjustQuantum, which raisesNoWorkFoundif the minimum is not met forInputconnections (causing the quantum to be pruned, skipped, or never created, depending on the context), andFileNotFoundErrorforPrerequisiteInputconnections (causing QuantumGraph generation to fail).PipelineTaskimplementations may provide customadjustQuantumimplementations for more fine-grained or configuration-driven constraints, as long as they are compatible with this minium.- deferGraphConstraint: `bool`, optional
If
True, do not include this dataset type’s existence in the initial query that starts the QuantumGraph generation process. This can be used to make QuantumGraph generation faster by avoiding redundant datasets, and in certain cases it can (along with careful attention to which tasks are included in the same QuantumGraph) be used to work around the QuantumGraph generation algorithm’s inflexible handling of spatial overlaps. This option has no effect when the connection is not an overall input of the pipeline (or subset thereof) for which a graph is being created, and it never affects the ordering of quanta.
- name
- Raises:
- TypeError
Raised if
minimumis greater than one butmultiple=False.- NotImplementedError
Raised if
minimumis zero for a regularInputconnection; this is not currently supported by our QuantumGraph generation algorithm.
Methods Documentation
- adjustQuantum(inputs, outputs, label, dataId)¶
Override to make adjustments to
lsst.daf.butler.DatasetRefobjects in thelsst.daf.butler.core.Quantumduring the graph generation stage of the activator.This implementation checks to make sure that the filters in the dataset are compatible with AP processing as set by the Apdb/DPDD schema.
- Parameters:
- inputs
dict Dictionary whose keys are an input (regular or prerequisite) connection name and whose values are a tuple of the connection instance and a collection of associated
DatasetRefobjects. The exact type of the nested collections is unspecified; it can be assumed to be multi-pass iterable and supportlenandin, but it should not be mutated in place. In contrast, the outer dictionaries are guaranteed to be temporary copies that are truedictinstances, and hence may be modified and even returned; this is especially useful for delegating tosuper(see notes below).- outputs
dict Dict of output datasets, with the same structure as
inputs.- label
str Label for this task in the pipeline (should be used in all diagnostic messages).
- data_id
lsst.daf.butler.DataCoordinate Data ID for this quantum in the pipeline (should be used in all diagnostic messages).
- inputs
- Returns:
- adjusted_inputs
dict Dict of the same form as
inputswith updated containers of inputDatasetRefobjects. Connections that are not changed should not be returned at all. Datasets may only be removed, not added. Nested collections may be of any multi-pass iterable type, and the order of iteration will set the order of iteration withinPipelineTask.runQuantum.- adjusted_outputs
dict Dict of updated output datasets, with the same structure and interpretation as
adjusted_inputs.
- adjusted_inputs
- Raises:
- ScalarError
Raised if any
InputorPrerequisiteInputconnection hasmultipleset toFalse, but multiple datasets.- NoWorkFound
Raised to indicate that this quantum should not be run; not enough datasets were found for a regular
Inputconnection, and the quantum should be pruned or skipped.- FileNotFoundError
Raised to cause QuantumGraph generation to fail (with the message included in this exception); not enough datasets were found for a
PrerequisiteInputconnection.
- buildDatasetRefs(quantum: Quantum) tuple[lsst.pipe.base.connections.InputQuantizedConnection, lsst.pipe.base.connections.OutputQuantizedConnection]¶
Build
QuantizedConnectioncorresponding to inputQuantum.- Parameters:
- quantum
lsst.daf.butler.Quantum Quantum object which defines the inputs and outputs for a given unit of processing.
- quantum
- Returns:
- retVal
tupleof (InputQuantizedConnection, OutputQuantizedConnection) Namespaces mapping attribute names (identifiers of connections) to butler references defined in the inputlsst.daf.butler.Quantum.
- retVal