pyspark.pipelines.append_flow#

pyspark.pipelines.append_flow(*, target, name=None, spark_conf=None)[source]#

Return a decorator on a query function to define a flow in a pipeline.

Parameters
  • name – The name of the flow. If unspecified, the query function’s name will be used.

  • target – The name of the dataset this flow writes to. Must be specified.

  • spark_conf – A dict whose keys are the conf names and values are the conf values. These confs will be set when the flow is executed; they can override confs set for the destination, for the pipeline, or on the cluster.