
Continuous Queries

Continuous queries downsample and aggregate time-series data into materialized views, reducing storage requirements while keeping historical data queryable.

Manual Execution Only

Automatic execution is an Arc Enterprise feature. In Arc OSS, continuous queries must be manually triggered via the API.

Overview

Continuous queries in Arc help you:

  • Downsample Data: Aggregate high-frequency data into lower-frequency summaries
  • Reduce Storage: Store aggregated data instead of raw metrics
  • Maintain History: Keep long-term trends without full granularity
  • Improve Query Performance: Query pre-aggregated data for faster results
  • Create Materialized Views: Automatically maintain aggregated datasets

How It Works

Continuous queries use DuckDB SQL to aggregate data from source measurements into destination measurements:

  1. Define Query: Specify aggregation logic using SQL
  2. Set Schedule: Configure time intervals for grouping (e.g., hourly, daily)
  3. Execute Manually: Trigger execution via API with start/end times
  4. Store Results: Write aggregated data to a new measurement
  5. Apply Retention: Optionally set custom retention for aggregated data

Architecture

Source Measurement (cpu)
        ↓
Continuous Query (AVG, MAX, MIN, etc.)
        ↓
Destination Measurement (cpu_hourly)
        ↓
Optional: Retention Policy

API Endpoints

Create Continuous Query

Define a new continuous query:

POST /api/v1/continuous_queries

Request Body:

{
  "name": "cpu_hourly_avg",
  "database": "telegraf",
  "source_measurement": "cpu",
  "destination_measurement": "cpu_hourly",
  "query": "SELECT time_bucket('1 hour', time) AS time, host, AVG(usage_idle) AS avg_usage_idle, AVG(usage_user) AS avg_usage_user, COUNT(*) AS sample_count FROM cpu GROUP BY time_bucket('1 hour', time), host",
  "interval": "1h",
  "retention_policy": "90d",
  "is_active": true
}

Parameters:

  • name (string, required): Unique query identifier
  • database (string, required): Target database name
  • source_measurement (string, required): Source measurement to aggregate
  • destination_measurement (string, required): Where to store results
  • query (string, required): DuckDB SQL aggregation query
  • interval (string, required): Time bucket interval (1m, 5m, 1h, 1d, etc.)
  • retention_policy (string, optional): Retention for aggregated data (e.g., 90d, 365d)
  • is_active (boolean, required): Enable/disable the query

List Continuous Queries

Retrieve all continuous queries:

GET /api/v1/continuous_queries

Response:

[
  {
    "id": "550e8400-e29b-41d4-a716-446655440000",
    "name": "cpu_hourly_avg",
    "database": "telegraf",
    "source_measurement": "cpu",
    "destination_measurement": "cpu_hourly",
    "interval": "1h",
    "retention_policy": "90d",
    "is_active": true,
    "created_at": "2024-01-15T10:30:00Z",
    "last_executed_at": "2024-01-20T02:00:00Z"
  }
]
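
For example, a quick sketch (assuming the same base URL and bearer token used in the examples below) that prints a one-line summary of each query:

import requests

# List all continuous queries and print a one-line summary for each
resp = requests.get(
    "http://localhost:8000/api/v1/continuous_queries",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
)
for cq in resp.json():
    print(
        f"{cq['name']}: {cq['source_measurement']} -> {cq['destination_measurement']} "
        f"every {cq['interval']} (active={cq['is_active']})"
    )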

Get Single Query

Retrieve a specific continuous query:

GET /api/v1/continuous_queries/{query_id}

Update Continuous Query

Update an existing continuous query:

PUT /api/v1/continuous_queries/{query_id}

Request Body: Same as create query
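
For example, to disable a query without deleting it, re-submit its full definition with is_active set to false. This is a sketch; it assumes the endpoint expects the complete body rather than a partial patch:

import requests

# Disable the query by re-submitting its full definition with is_active=false
requests.put(
    "http://localhost:8000/api/v1/continuous_queries/550e8400-e29b-41d4-a716-446655440000",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={
        "name": "cpu_hourly_avg",
        "database": "telegraf",
        "source_measurement": "cpu",
        "destination_measurement": "cpu_hourly",
        "query": "SELECT time_bucket('1 hour', time) AS time, host, AVG(usage_idle) AS avg_usage_idle, COUNT(*) AS sample_count FROM cpu GROUP BY time_bucket('1 hour', time), host",
        "interval": "1h",
        "retention_policy": "90d",
        "is_active": False
    }
)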

Delete Continuous Query

Remove a continuous query:

DELETE /api/v1/continuous_queries/{query_id}
Caution

Deleting a continuous query does not delete the destination measurement or its data. The aggregated data remains queryable.

Execute Continuous Query

Manually trigger a continuous query:

POST /api/v1/continuous_queries/{query_id}/execute

Request Body:

{
  "start_time": "2024-01-01T00:00:00Z",
  "end_time": "2024-01-31T23:59:59Z",
  "dry_run": false
}

Parameters:

  • start_time (string, required): Start timestamp (ISO 8601 format)
  • end_time (string, required): End timestamp (ISO 8601 format)
  • dry_run (boolean, optional): Test without writing data (default: false)

Response:

{
  "query_id": "550e8400-e29b-41d4-a716-446655440000",
  "rows_processed": 1000000,
  "rows_written": 720,
  "execution_time_ms": 2500,
  "time_range": {
    "start": "2024-01-01T00:00:00Z",
    "end": "2024-01-31T23:59:59Z"
  },
  "dry_run": false
}

View Execution History

View past executions of a continuous query:

GET /api/v1/continuous_queries/{query_id}/executions?limit=50

Response:

[
  {
    "execution_id": "abc123",
    "executed_at": "2024-01-20T02:00:00Z",
    "start_time": "2024-01-19T00:00:00Z",
    "end_time": "2024-01-20T00:00:00Z",
    "rows_processed": 86400,
    "rows_written": 24,
    "execution_time_ms": 1200,
    "status": "success"
  }
]
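
A small sketch that pulls recent executions and flags any that did not succeed, using the fields shown in the response above:

import requests

query_id = "550e8400-e29b-41d4-a716-446655440000"  # from the list endpoint above

# Fetch recent executions and report any that did not succeed
resp = requests.get(
    f"http://localhost:8000/api/v1/continuous_queries/{query_id}/executions",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    params={"limit": 50},
)
for execution in resp.json():
    if execution["status"] != "success":
        print(
            f"{execution['executed_at']}: {execution['status']} "
            f"({execution['start_time']} to {execution['end_time']})"
        )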

Query Syntax

Continuous queries use DuckDB SQL with time-series optimizations.

Use epoch_us() for timestamp conversion and date_trunc() for time bucketing:

SELECT
    date_trunc('hour', epoch_us(time)) AS time,
    host,
    AVG(usage_idle) AS avg_usage_idle,
    MAX(usage_user) AS max_usage_user,
    MIN(usage_system) AS min_usage_system,
    COUNT(*) AS sample_count
FROM cpu
GROUP BY date_trunc('hour', epoch_us(time)), host

Supported Aggregations

  • AVG() - Average values
  • SUM() - Sum of values
  • MIN() - Minimum value
  • MAX() - Maximum value
  • COUNT() - Row count
  • STDDEV() - Standard deviation
  • PERCENTILE_CONT() - Percentile calculations

Time Bucketing

Using date_trunc():

-- Hourly buckets
date_trunc('hour', epoch_us(time))

-- Daily buckets
date_trunc('day', epoch_us(time))

-- 5-minute buckets (requires rounding)
date_trunc('hour', epoch_us(time)) + INTERVAL '5 minutes' * floor(extract(minute from epoch_us(time)) / 5)
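
The 5-minute expression simply rounds each timestamp down to the nearest 5-minute boundary. For illustration, here is the same rounding logic in plain Python:

from datetime import datetime, timedelta

# Round a timestamp down to its 5-minute bucket, mirroring the SQL expression above
ts = datetime(2024, 1, 15, 10, 37, 42)
bucket = ts.replace(second=0, microsecond=0) - timedelta(minutes=ts.minute % 5)
print(bucket)  # 2024-01-15 10:35:00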

Including Sample Counts

Always include COUNT(*) to track how many raw samples each aggregate represents:

SELECT
    date_trunc('hour', epoch_us(time)) AS time,
    host,
    AVG(usage_idle) AS avg_usage_idle,
    COUNT(*) AS sample_count  -- Important for data quality
FROM cpu
GROUP BY date_trunc('hour', epoch_us(time)), host

Usage Examples

Example 1: Hourly CPU Metrics

Aggregate per-second CPU metrics into hourly averages:

import requests

# Create continuous query
response = requests.post(
    "http://localhost:8000/api/v1/continuous_queries",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={
        "name": "cpu_hourly",
        "database": "telegraf",
        "source_measurement": "cpu",
        "destination_measurement": "cpu_hourly",
        "query": """
            SELECT
                date_trunc('hour', epoch_us(time)) AS time,
                host,
                AVG(usage_idle) AS avg_usage_idle,
                AVG(usage_user) AS avg_usage_user,
                AVG(usage_system) AS avg_usage_system,
                MAX(usage_user) AS max_usage_user,
                COUNT(*) AS sample_count
            FROM cpu
            GROUP BY date_trunc('hour', epoch_us(time)), host
        """,
        "interval": "1h",
        "retention_policy": "365d",
        "is_active": True
    }
)

query_id = response.json()["id"]

# Execute for the last 30 days
from datetime import datetime, timedelta

end_time = datetime.utcnow()
start_time = end_time - timedelta(days=30)

result = requests.post(
    f"http://localhost:8000/api/v1/continuous_queries/{query_id}/execute",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={
        "start_time": start_time.isoformat() + "Z",
        "end_time": end_time.isoformat() + "Z"
    }
)

print(f"Processed {result.json()['rows_processed']} rows")
print(f"Generated {result.json()['rows_written']} aggregated rows")

Example 2: Daily Request Summary

Aggregate API request logs into daily summaries:

# Create daily request summary
response = requests.post(
    "http://localhost:8000/api/v1/continuous_queries",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={
        "name": "requests_daily",
        "database": "logs",
        "source_measurement": "api_requests",
        "destination_measurement": "api_requests_daily",
        "query": """
            SELECT
                date_trunc('day', epoch_us(time)) AS time,
                endpoint,
                status_code,
                COUNT(*) AS total_requests,
                AVG(response_time_ms) AS avg_response_time,
                MAX(response_time_ms) AS max_response_time,
                PERCENTILE_CONT(0.95) WITHIN GROUP (ORDER BY response_time_ms) AS p95_response_time
            FROM api_requests
            GROUP BY date_trunc('day', epoch_us(time)), endpoint, status_code
        """,
        "interval": "1d",
        "retention_policy": "730d",  # 2 years
        "is_active": True
    }
)

Example 3: 5-Minute Sensor Readings

Downsample IoT sensor data to 5-minute intervals:

# Create 5-minute sensor aggregation
response = requests.post(
    "http://localhost:8000/api/v1/continuous_queries",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={
        "name": "sensors_5min",
        "database": "iot",
        "source_measurement": "temperature",
        "destination_measurement": "temperature_5min",
        "query": """
            SELECT
                date_trunc('hour', epoch_us(time)) +
                    INTERVAL '5 minutes' * floor(extract(minute from epoch_us(time)) / 5) AS time,
                sensor_id,
                location,
                AVG(temperature) AS avg_temperature,
                MIN(temperature) AS min_temperature,
                MAX(temperature) AS max_temperature,
                COUNT(*) AS sample_count
            FROM temperature
            GROUP BY
                date_trunc('hour', epoch_us(time)) +
                    INTERVAL '5 minutes' * floor(extract(minute from epoch_us(time)) / 5),
                sensor_id,
                location
        """,
        "interval": "5m",
        "retention_policy": "90d",
        "is_active": True
    }
)

Example 4: Dry Run Testing

Test a continuous query before execution:

# Create the query
response = requests.post(
    "http://localhost:8000/api/v1/continuous_queries",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={...}
)

query_id = response.json()["id"]

# Test with dry run
dry_run = requests.post(
    f"http://localhost:8000/api/v1/continuous_queries/{query_id}/execute",
    headers={"Authorization": "Bearer YOUR_TOKEN"},
    json={
        "start_time": "2024-01-01T00:00:00Z",
        "end_time": "2024-01-02T00:00:00Z",
        "dry_run": True
    }
)

print(f"Would process {dry_run.json()['rows_processed']} rows")
print(f"Would generate {dry_run.json()['rows_written']} aggregated rows")

# If satisfied, execute for real
if dry_run.json()['rows_written'] > 0:
    result = requests.post(
        f"http://localhost:8000/api/v1/continuous_queries/{query_id}/execute",
        headers={"Authorization": "Bearer YOUR_TOKEN"},
        json={
            "start_time": "2024-01-01T00:00:00Z",
            "end_time": "2024-01-02T00:00:00Z",
            "dry_run": False
        }
    )

Storage Benefits

Continuous queries significantly reduce storage requirements:

Before Downsampling

Raw CPU metrics (1-second intervals):

  • 1 year = 31,536,000 rows per host
  • 10 hosts = 315,360,000 rows
  • Storage: ~20GB

After Downsampling to Hourly

Hourly aggregates:

  • 1 year = 8,760 rows per host
  • 10 hosts = 87,600 rows
  • Storage: ~50MB

Reduction: ~400x smaller while maintaining hourly trend visibility.
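
The arithmetic behind these figures is easy to verify:

# Back-of-the-envelope check for the numbers above
seconds_per_year = 365 * 24 * 3600   # 31,536,000 raw rows per host at 1-second resolution
hours_per_year = 365 * 24            # 8,760 hourly rows per host
hosts = 10

raw_rows = seconds_per_year * hosts      # 315,360,000
hourly_rows = hours_per_year * hosts     # 87,600

print(f"Row reduction: {raw_rows / hourly_rows:.0f}x")   # 3600x fewer rows
print(f"Storage reduction: {20_000 / 50:.0f}x")          # ~20 GB vs ~50 MB, roughly 400x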

Multi-Tier Strategy

Combine different granularities for optimal storage:

# Tier 1: Keep raw data for 7 days
# Tier 2: Hourly aggregates for 90 days
requests.post("/api/v1/continuous_queries", json={
    "name": "cpu_hourly",
    "interval": "1h",
    "retention_policy": "90d"
})

# Tier 3: Daily aggregates for 2 years
requests.post("/api/v1/continuous_queries", json={
    "name": "cpu_daily",
    "source_measurement": "cpu_hourly",  # Aggregate the hourly data
    "destination_measurement": "cpu_daily",
    "interval": "1d",
    "retention_policy": "730d"
})

# Use retention policy to delete raw data after 7 days
requests.post("/api/v1/retention", json={
    "database": "telegraf",
    "measurement": "cpu",
    "retention_days": 7
})

Best Practices

1. Start Conservative

Begin with longer intervals and adjust based on actual needs:

# Start with hourly
{"interval": "1h"}

# If too coarse, reduce to 15 minutes
{"interval": "15m"}

2. Preserve Source Data Initially

Keep raw data while testing aggregations:

# Create continuous query
create_query(...)

# Test aggregations thoroughly
execute_dry_run(...)
execute_for_real(...)

# Only after validation, apply retention to raw data
requests.post("/api/v1/retention", json={
    "measurement": "cpu",
    "retention_days": 30  # Keep raw for 30 days
})

3. Use Dry Run Extensively

Always test queries with dry run before full execution:

# Test on small time range first
dry_run(start="2024-01-01", end="2024-01-02")

# Gradually expand
dry_run(start="2024-01-01", end="2024-01-07")

# Finally, full execution
execute(start="2024-01-01", end="2024-12-31")

4. Include Sample Counts

Track the number of raw samples in each aggregate:

SELECT
    date_trunc('hour', epoch_us(time)) AS time,
    COUNT(*) AS sample_count,  -- Essential for data quality
    AVG(value) AS avg_value
FROM measurement
GROUP BY date_trunc('hour', epoch_us(time))

This helps identify:

  • Missing data (low sample counts)
  • Data quality issues
  • Unexpected patterns
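
For instance, with 1-second raw data rolled up into hourly buckets you expect roughly 3,600 samples per bucket. A minimal sketch that flags suspiciously sparse buckets (the rows here are illustrative):

# Flag hourly buckets whose sample_count is well below the expected ~3,600
# (1-second raw data aggregated into 1-hour buckets)
expected = 3600
rows = [
    {"time": "2024-01-15T10:00:00Z", "sample_count": 3598},
    {"time": "2024-01-15T11:00:00Z", "sample_count": 1204},  # likely a data gap
]
for row in rows:
    if row["sample_count"] < 0.9 * expected:
        print(f"Possible missing data at {row['time']}: only {row['sample_count']} samples")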

5. Monitor Execution Performance

Track continuous query execution times:

result = execute_query(...)

print(f"Execution time: {result['execution_time_ms']}ms")
print(f"Throughput: {result['rows_processed'] / (result['execution_time_ms'] / 1000):.0f} rows/sec")

# Alert if execution takes too long
if result['execution_time_ms'] > 60000:  # 1 minute
    print("Warning: Slow execution!")

6. Use Appropriate Intervals

Match intervals to data characteristics:

High-Frequency Data (IoT sensors at 1-second intervals):

  • 5-minute aggregates for recent analysis
  • Hourly aggregates for medium-term
  • Daily aggregates for long-term trends

Medium-Frequency Data (Application metrics at 1-minute intervals):

  • Hourly aggregates for recent analysis
  • Daily aggregates for long-term

Low-Frequency Data (Business metrics at hourly intervals):

  • Daily aggregates
  • Monthly aggregates for multi-year analysis

Troubleshooting

No Rows Written

Problem: Execution returns rows_written: 0.

Solutions:

  • Verify source measurement contains data in the specified time range
  • Check that the query syntax is correct
  • Ensure GROUP BY clause matches aggregation columns
  • Use dry run to inspect query results

Query Syntax Errors

Problem: Execution fails with SQL error.

Solutions:

  • Test the query directly using the /query endpoint
  • Verify column names exist in source measurement
  • Check for DuckDB-specific syntax requirements
  • Use epoch_us() for timestamp conversion

Slow Execution

Problem: Continuous query takes longer than expected.

Solutions:

  • Reduce the time range per execution
  • Ensure source measurement is properly compacted
  • Consider creating indexes on frequently grouped columns
  • Monitor DuckDB query performance

Duplicate Data

Problem: Re-running the query creates duplicate aggregates.

Solutions:

  • Delete destination measurement data before re-execution (see the combined helper sketch after this list):

    requests.post("/api/v1/delete", json={
        "database": "telegraf",
        "measurement": "cpu_hourly",
        "where": f"time >= '{start_time}' AND time <= '{end_time}'"
    })
  • Or use UPSERT semantics if supported (future feature)
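
Putting the two steps together, a small helper sketch that clears the destination window before re-running, reusing the /api/v1/delete and execute endpoints shown above:

import requests

BASE = "http://localhost:8000"
HEADERS = {"Authorization": "Bearer YOUR_TOKEN"}

def rerun_window(query_id, database, destination, start_time, end_time):
    # Clear previously written aggregates in the window to avoid duplicates
    requests.post(
        f"{BASE}/api/v1/delete",
        headers=HEADERS,
        json={
            "database": database,
            "measurement": destination,
            "where": f"time >= '{start_time}' AND time <= '{end_time}'",
        },
    )
    # Re-run the continuous query over the same window
    return requests.post(
        f"{BASE}/api/v1/continuous_queries/{query_id}/execute",
        headers=HEADERS,
        json={"start_time": start_time, "end_time": end_time},
    )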