Writing Workflows
Workflow Structure
```yaml
description: "Process daily data"
schedule: "0 2 * * *"   # Optional: cron schedule
queue: "daily-jobs"     # Optional: assign to a global queue for concurrency control
params:                 # Runtime parameters
  - name: ENVIRONMENT
    type: string
    default: staging
    enum: [dev, staging, prod]
  - name: BATCH_SIZE
    type: integer
    default: 25
    minimum: 1
    maximum: 100
env:                    # Environment variables
  - DATE: "`date +%Y-%m-%d`"
  - DATA_DIR: /tmp/data
steps:                  # Workflow steps
  - command: echo "Processing ${ENVIRONMENT} for date ${DATE} with batch ${BATCH_SIZE}"
```

Parameter default values are literal. To compute a runtime default, use `eval:` on an inline rich param definition. See Parameters for precedence, fallback behavior, and typed validation.
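For instance, a runtime-computed default might look like this (a sketch assuming `eval:` accepts a shell expression, per the note above; `TARGET_DATE` is a hypothetical parameter name — see the Parameters guide for the exact syntax):

```yaml
params:
  - name: TARGET_DATE
    type: string
    eval: "date +%Y-%m-%d"   # evaluated at runtime instead of a literal default (assumed syntax)
```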
Base Configuration
Share common settings across all DAGs using base configuration:
```yaml
# ~/.config/dagu/base.yaml
env:
  - LOG_LEVEL: info
  - AWS_REGION: us-east-1
smtp:
  host: smtp.company.com
  port: "587"
  username: ${SMTP_USER}
  password: ${SMTP_PASS}
errorMail:
  from: alerts@company.com
  to: oncall@company.com
  attachLogs: true
histRetentionDays: 30   # Dagu deletes workflow history and logs older than this
queue: "default"        # Default queue for all DAGs (define in config.yaml)
```

DAGs automatically inherit these settings:
```yaml
# my-workflow.yaml
# Inherits all base settings
# Can override specific values:
env:
  - LOG_LEVEL: debug   # Override
  - CUSTOM_VAR: value  # Addition
steps:
  - command: echo "Processing"
```

Configuration precedence: System defaults → Base config → DAG config.
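Given the base file above, the effective environment for this DAG merges both sources; illustratively:

```yaml
# Effective env after merging base config and DAG config (illustrative):
env:
  - LOG_LEVEL: debug       # DAG config overrides base
  - AWS_REGION: us-east-1  # inherited from base
  - CUSTOM_VAR: value      # added by the DAG
```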
See Base Configuration for complete documentation on all available fields.
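Since the base file references a queue defined in the server's `config.yaml`, that definition might look like the following sketch — the field names here are assumptions, so check the Queues documentation for the exact schema:

```yaml
# config.yaml (server configuration) — illustrative; verify field names against the docs
queues:
  enabled: true
  config:
    - name: "default"
      maxActiveRuns: 2   # assumed field: limits concurrent DAG runs in this queue
```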
Guide Sections
- Basics - Steps, commands, dependencies
- Container - Run workflows in Docker containers
- Control Flow - Parallel execution, conditions, loops
- Data & Variables - Parameters, outputs, data passing
- Durable Execution - Step retries, default step retries, DAG retries
- Error Handling - Continue-on behavior, handlers, notifications
- Lifecycle Handlers - Cleanup and post-run steps
- Patterns - Composition patterns
- Secrets - External providers, resolution order, masking behavior
Complete Example
```yaml
schedule: "0 2 * * *"
params:
  - name: ENVIRONMENT
    type: string
    default: staging
    enum: [dev, staging, prod]
  - name: DRY_RUN
    type: boolean
    default: false
env:
  - DATE: "`date +%Y-%m-%d`"
  - DATA_DIR: /tmp/data/${DATE}
steps:
  - command: aws s3 cp s3://bucket/${DATE}.csv ${DATA_DIR}/
    retryPolicy:
      limit: 3
      intervalSec: 60
  - command: python validate.py ${DATA_DIR}/${DATE}.csv --env=${ENVIRONMENT} --dry-run=${DRY_RUN}
    continueOn:
      failure: false
  - parallel: [users, orders, products]
    command: python process.py --type=$ITEM --date=${DATE}
    output: RESULT_${ITEM}
  - command: python report.py --date=${DATE}
handlerOn:
  failure:
    command: echo "Notifying failure for ${DATE}"
```

Common Patterns
Sequential Pipeline
```yaml
steps:
  - command: echo "Extracting data"
  - command: echo "Transforming data"
  - command: echo "Loading data"
```

Parallel Processing
```yaml
steps:
  - parallel: [file1, file2, file3]
    run: process-file
    params: "FILE=${ITEM}"

---
# A child workflow for processing each file
# This can be in the same file separated by `---` or in a separate file
name: process-file
steps:
  - command: echo "Processing ${FILE}"
```
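The sequential and parallel patterns combine with `output:` (used in the complete example above) to pass data between steps; a minimal sketch, where `INPUT_FILE` is a hypothetical variable name:

```yaml
steps:
  - command: echo "/tmp/data/input.csv"
    output: INPUT_FILE                     # step stdout is captured into a variable
  - command: python validate.py ${INPUT_FILE}
```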