Import
The InfluxDB Import Plugin enables seamless data import from InfluxDB v1, v2, or v3 instances to InfluxDB 3 Core/Enterprise. It provides comprehensive import capabilities with pause/resume functionality, progress tracking, conflict detection, and robust error handling. The plugin operates via HTTP endpoints, allowing you to start, pause, resume, cancel, and monitor imports through simple HTTP requests.
Key features:
- Import data from InfluxDB v1, v2, or v3 to InfluxDB 3
- Automatic data sampling for optimal batch sizing
- Resume interrupted imports from the last checkpoint
- Pause and cancel running imports
- Progress tracking and statistics
- Tag/field conflict detection and resolution
- Data type mismatch handling
- Configurable time ranges and table filtering
- Dry run mode for import planning (estimates, schema conflicts, configuration preview)
- Support for both token and username/password authentication
Software Requirements
- InfluxDB 3 Core/Enterprise: with the Processing Engine enabled.
- Source InfluxDB instance: InfluxDB v1.x, v2.x, or v3.x instance accessible via HTTP/HTTPS.
- Python packages:
requests
Installation steps
- Start InfluxDB 3 with the Processing Engine and the PLUGIN_DIR environment variable:
PLUGIN_DIR=~/.plugins influxdb3 serve \
--node-id node0 \
--object-store file \
--data-dir ~/.influxdb3 \
--plugin-dir ~/.plugins
- Install required Python packages:
influxdb3 install package requests
Configuration
Plugin parameters may be specified as key-value pairs in the --trigger-arguments flag (CLI) or the trigger_arguments field (API) when creating a trigger, or via the body of an HTTP request. This plugin also supports TOML configuration files, which can be specified using the config_file_path parameter.
Plugin metadata
This plugin includes a JSON metadata schema in its docstring that defines supported trigger types and configuration parameters. This metadata enables the InfluxDB 3 Explorer UI to display and configure the plugin.
Required parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| source_url | string | required | Source InfluxDB URL (with optional port, e.g., http://localhost:8086) |
| influxdb_version | integer | required | Source InfluxDB version: 1, 2, or 3 |
| source_database | string | required | Source database name to import from |
Authentication parameters (required - choose one method)
Method 1: Token-based authentication (InfluxDB v2 or v1 with token support)
| Parameter | Type | Required | Description |
|---|---|---|---|
| source_token | string | Yes | Authentication token for the source InfluxDB |
Method 2: Username/Password authentication (InfluxDB v1)
| Parameter | Type | Required | Description |
|---|---|---|---|
| source_username | string | Yes | Username for basic authentication (must use with password) |
| source_password | string | Yes | Password for basic authentication (must use with username) |
Note: You must provide EITHER source_token OR (source_username AND source_password together). Using both methods simultaneously will result in an error.
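The rule above can be sketched as a small validation helper (an illustrative function, not part of the plugin's actual API):

```python
def validate_auth(params: dict) -> str:
    """Return the auth method name, or raise ValueError if the combination
    is invalid. Mirrors the rule: exactly one of token OR username+password."""
    has_token = bool(params.get("source_token"))
    has_user = bool(params.get("source_username"))
    has_pass = bool(params.get("source_password"))

    if has_token and (has_user or has_pass):
        raise ValueError("Provide either source_token or username/password, not both")
    if has_token:
        return "token"
    if has_user and has_pass:
        return "basic"
    raise ValueError("Must provide either source_token or both "
                     "source_username and source_password")
```

For example, `validate_auth({"source_token": "t"})` returns `"token"`, while supplying a token together with a username raises an error.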
Optional parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
| dest_database | string | none | Destination database name in InfluxDB 3 (if not specified, uses the database where the trigger was created) |
| start_timestamp | string | none | Import start time (datetime format). If not specified, starts from the oldest data |
| end_timestamp | string | none | Import end time (datetime format). If not specified, imports to the newest data |
| query_interval_ms | integer | 100 | Delay between queries in milliseconds, to avoid overloading the source database |
| import_direction | string | "oldest_first" | Import direction: "oldest_first" or "newest_first" |
| target_batch_size | integer | 2000 | Target number of rows per query batch |
| table_filter | string | none | Dot-separated list of tables to import (e.g., "cpu.mem.disk"). If not specified, imports all tables |
| dry_run | boolean | false | If true, generates an import plan without processing data (shows estimates, schema conflicts, and configuration) |
TOML configuration
| Parameter | Type | Default | Description |
|---|---|---|---|
| config_file_path | string | none | TOML config file path relative to PLUGIN_DIR (required for TOML configuration) |
To use a TOML configuration file, set the PLUGIN_DIR environment variable and specify the config_file_path in the trigger arguments. This is in addition to the --plugin-dir flag when starting InfluxDB 3.
Example TOML configuration
https://github.com/influxdata/influxdb3_plugins/blob/main/influxdata/import/import_config.toml
For more information on using TOML configuration files, see the Using TOML Configuration Files section in the influxdb3_plugins/README.md.
Quick Start
HTTP trigger setup
Create an HTTP trigger to handle import requests:
influxdb3 create trigger \
--database mydb \
--plugin-filename gh:influxdata/import/import.py \
--trigger-spec "request:import" \
import_trigger
Enable the trigger:
influxdb3 enable trigger --database mydb import_trigger
The endpoint is registered at /api/v3/engine/import.
Examples
Example 1: Basic import with token authentication
Import all data from an InfluxDB v1 instance:
# Create and enable HTTP trigger
influxdb3 create trigger \
--database mydb \
--plugin-filename import.py \
--trigger-spec "request:import" \
import_trigger
influxdb3 enable trigger --database mydb import_trigger
# Start import via HTTP
curl -X POST http://localhost:8181/api/v3/engine/import?action=start \
-H "Content-Type: application/json" \
-d '{
"source_url": "http://localhost:8086",
"source_token": "my-super-secret-token",
"influxdb_version": 1,
"source_database": "telegraf",
"dest_database": "imported_data"
}'
Expected results
- Plugin connects to the source InfluxDB at http://localhost:8086 (port taken from the URL)
- Discovers all measurements in the telegraf database
- Estimates import time based on data sampling
- Imports all data into the imported_data database in InfluxDB 3
- Logs the import_id for tracking statistics
Example 2: Time-range import with table filtering
# Start import with time range and table filter
curl -X POST http://localhost:8181/api/v3/engine/import?action=start \
-H "Content-Type: application/json" \
-d '{
"source_url": "http://influxdb-source.example.com:8086",
"source_username": "admin",
"source_password": "my-password",
"influxdb_version": 1,
"source_database": "telegraf",
"dest_database": "production_metrics",
"start_timestamp": "2024-01-01T00:00:00Z",
"end_timestamp": "2024-12-31T23:59:59Z",
"table_filter": "cpu.mem.disk.network",
"import_direction": "newest_first",
"target_batch_size": 5000
}'
Expected results
- Imports only the cpu, mem, disk, and network measurements
- Processes data from January 1, 2024 to December 31, 2024
- Imports newest data first
- Uses larger batch size (5000 rows) for better performance
Example 3: Pause, check status, and resume import
# Start import (logs import_id, does not return it immediately)
curl -X POST http://localhost:8181/api/v3/engine/import?action=start \
-H "Content-Type: application/json" \
-d '{
"source_url": "http://localhost:8086",
"source_token": "my-token",
"influxdb_version": 2,
"source_database": "large_database",
"dest_database": "imported"
}'
# Find import_id from logs:
influxdb3 query --database _internal "SELECT log_text FROM system.processing_engine_logs WHERE trigger_name = 'import_trigger' AND log_text LIKE '%Starting import%' ORDER BY event_time DESC LIMIT 1"
# Set the import_id from logs
IMPORT_ID=""
# Pause import (e.g., during high-traffic hours)
curl -X POST "http://localhost:8181/api/v3/engine/import?action=pause&import_id=$IMPORT_ID"
# Check status after import completion (paused, cancelled, or completed)
curl "http://localhost:8181/api/v3/engine/import?action=status&import_id=$IMPORT_ID"
# Resume later
curl -X POST "http://localhost:8181/api/v3/engine/import?action=resume&import_id=$IMPORT_ID" \
-H "Content-Type: application/json" \
-d '{
"source_token": "my-token"
}'
Expected results
- Import starts and logs a unique import_id (check logs to obtain it)
- Import continues running in the background, logging progress
- Pause command stops import gracefully at current position
- Status endpoint returns comprehensive statistics **only after import completion** (paused, cancelled, or finished)
- Resume command continues from the exact point where it was paused and returns final results upon completion
Example 4: Dry run for import plan
curl -X POST http://localhost:8181/api/v3/engine/import?action=start \
-H "Content-Type: application/json" \
-d '{
"source_url": "http://localhost:8086",
"source_token": "my-token",
"influxdb_version": 1,
"source_database": "telegraf",
"dry_run": true
}'
Expected results
With dry_run: true, the plugin generates a comprehensive import plan **without processing any data**. It only performs:
- Schema inspection (tags and fields)
- Data sampling for time estimation
- Conflict detection
The response returns immediately with a detailed import plan:
{
"import_id": "abc123...",
"status": "dry_run_plan",
"source": {
"url": "http://localhost:8086",
"database": "telegraf",
"influxdb_version": 1
},
"destination": {
"database": "imported_data"
},
"time_range": {
"start": "all data",
"end": "all data"
},
"import_settings": {
"direction": "oldest_first",
"target_batch_size": 2000,
"query_interval_ms": 100
},
"tables": {
"total": 5,
"list": ["cpu", "mem", "disk", "network", "processes"],
"filtered": "all tables"
},
"estimated_import": {
"total_rows": 5000000,
"estimated_duration": "1 hour 15 minutes",
"estimated_duration_seconds": 4500,
"per_table_estimates": [
{
"measurement": "cpu",
"estimated_rows": 1000000,
"estimated_seconds": 900
},
{
"measurement": "mem",
"estimated_rows": 800000,
"estimated_seconds": 720
}
]
},
"schema_conflicts": {
"total": 2,
"details": [
{
"measurement": "cpu",
"type": "tag_field_conflict",
"conflicts": ["host", "region"],
"resolution": "Tags will be renamed with '_tag' suffix: host -> host_tag, region -> region_tag"
}
]
}
}
**Note**: Dry run mode is fast and lightweight - it does not query or process any actual data points, only metadata. Use it to:
- Preview import scope and estimates
- Identify schema conflicts before import
- Validate configuration and connectivity
- Plan import time windows
Code Overview
Files
- import.py: The main plugin code containing the HTTP request handler and import logic
- import_config.toml: Example TOML configuration file
Logging
Logs are stored in the _internal database in the system.processing_engine_logs table:
influxdb3 query --database _internal "SELECT * FROM system.processing_engine_logs WHERE trigger_name = 'import_trigger'"
Log columns:
- event_time: Timestamp of the log event
- trigger_name: Name of the trigger that generated the log
- log_level: Severity level (INFO, WARN, ERROR)
- log_text: Message describing the action or error
Import state tracking
The plugin creates several measurements to track import state:
import_config
Stores import configuration (credentials excluded for security).
influxdb3 query --database mydb "SELECT * FROM import_config WHERE import_id = 'your-import-id'"
import_state
Tracks per-table import progress.
influxdb3 query --database mydb "SELECT * FROM import_state WHERE import_id = 'your-import-id' ORDER BY time DESC"
import_pause_state
Stores pause/cancel state for controlling running imports.
influxdb3 query --database mydb "SELECT * FROM import_pause_state WHERE import_id = 'your-import-id' ORDER BY time DESC LIMIT 1"
Main functions
process_request(influxdb3_local, query_parameters, request_headers, request_body, args)
HTTP request handler that routes to appropriate import actions based on the action query parameter.
start_import(influxdb3_local, config, task_id)
Starts a new import process:
1. Performs pre-flight checks (connectivity, measurement discovery)
2. Estimates import time based on data sampling
3. Creates import configuration and state records
4. Initiates table-by-table import
import_table(influxdb3_local, config, import_id, measurement, start_time, end_time, task_id, ...)
Imports a single table:
1. Finds actual data boundaries within the specified range
2. Samples data to determine the optimal batch window size
3. Detects and resolves tag/field conflicts
4. Queries data in batches and converts it to line protocol
5. Writes to the destination database
6. Tracks progress and checks for pause/cancel signals
resume_import(influxdb3_local, import_id, task_id, ...)
Resumes an interrupted import:
1. Loads the saved import configuration
2. Identifies incomplete tables and their last checkpoint
3. Continues the import from checkpoint positions
4. Handles tables without a checkpoint (e.g., after a crash) by restarting from the beginning
get_import_stats(influxdb3_local, import_id, task_id)
Returns comprehensive statistics for an import, including overall status, per-table progress, timing information, and configuration.
check_source_connection(body_data, session)
Tests connectivity to a URL and identifies if it’s an InfluxDB instance (5-second timeout):
1. Validates that source_url is provided
2. Infers port from scheme if not specified (http→80, https→443)
3. Sends request to /ping endpoint
4. Returns success with version/build from X-Influxdb-* headers (v1/v2)
5. Falls back to cluster-uuid header detection for v3 (returns version: "3.x.x")
6. Returns failure with message if not InfluxDB or unreachable
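The header-based classification in steps 4–6 can be sketched as a pure function (a hypothetical helper for illustration; the plugin sends the actual `/ping` request via an HTTP session):

```python
def classify_ping_headers(headers: dict) -> dict:
    """Classify a /ping response's headers per the steps above:
    v1/v2 expose X-Influxdb-Version/X-Influxdb-Build, v3 is inferred
    from a cluster-uuid header, anything else is not InfluxDB."""
    # HTTP header names are case-insensitive; normalize for lookup
    lower = {k.lower(): v for k, v in headers.items()}
    version = lower.get("x-influxdb-version")
    if version:
        return {"success": True, "version": version,
                "build": lower.get("x-influxdb-build", "")}
    if "cluster-uuid" in lower:
        return {"success": True, "version": "3.x.x", "build": ""}
    return {"success": False, "message": "Not an InfluxDB instance"}
```

In practice this would be fed the headers from `requests.get(f"{source_url}/ping", timeout=5)`.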
get_source_databases_list(body_data, session)
Lists databases from source InfluxDB instance:
1. Validates required parameters
2. For v1: Executes SHOW DATABASES query, filters out _internal
3. For v2: Queries /api/v2/buckets API, filters out system buckets (prefixed with _)
4. Returns sorted list of database names
get_source_tables_list(body_data, session)
Lists tables/measurements from a source database:
1. Validates required parameters including source_database
2. For v1: Executes SHOW MEASUREMENTS query
3. For v2: Executes Flux schema.measurements() query (requires source_org)
4. Returns sorted list of table names
Key algorithms
Automatic batch sizing
The plugin samples data at different time intervals to determine optimal window size:
# Test intervals: 1 second, 1 minute, 1 hour, 1 day
# Calculate rows per second from samples
# Determine window size to achieve target_batch_size
optimal_window = target_batch_size / avg_rows_per_second
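The sampling arithmetic above can be sketched in Python (an illustrative helper under assumed inputs, not the plugin's actual function):

```python
def optimal_window_seconds(sample_counts: dict, target_batch_size: int) -> float:
    """Estimate a query window that yields roughly target_batch_size rows.

    sample_counts maps a sampled interval in seconds (e.g., 1, 60, 3600,
    86400) to the row count observed in that interval."""
    # Rows-per-second rate from each sample that actually contained data
    rates = [count / secs for secs, count in sample_counts.items() if count > 0]
    if not rates:
        # No data in any sample: fall back to the largest interval tried
        return float(max(sample_counts))
    avg_rows_per_second = sum(rates) / len(rates)
    return target_batch_size / avg_rows_per_second
```

For example, samples of 10 rows/second at both the 1-second and 1-minute intervals with a 2000-row target yield a 200-second window.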
Tag/field conflict resolution
When a column name exists as both tag and field in source data:
# Original data has conflict:
# tag: room
# field: room
# Plugin renames conflicting tag:
# tag: room_tag
# field: room (unchanged)
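A minimal sketch of this renaming rule (hypothetical helper, shown for illustration):

```python
def resolve_conflicts(tags: dict, fields: dict) -> tuple:
    """Rename any tag whose name collides with a field name by appending
    '_tag', matching the resolution described above. Fields are unchanged."""
    resolved_tags = {}
    for name, value in tags.items():
        key = f"{name}_tag" if name in fields else name
        resolved_tags[key] = value
    return resolved_tags, fields
```

So `resolve_conflicts({"room": "a1", "host": "web1"}, {"room": 21.5})` keeps the field `room` and emits the tag as `room_tag`.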
Resume checkpoint tracking
During import, the plugin saves checkpoints:
# Save paused_at_time (data timestamp, not record timestamp)
# On resume:
# 1. Load last paused_at_time
# 2. Add 1 microsecond offset to avoid duplicates
# 3. Continue import from (paused_at_time + 1µs)
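The resume offset in step 3 can be expressed directly (illustrative sketch, assuming checkpoints are held as `datetime` values):

```python
from datetime import datetime, timedelta

def resume_start_time(paused_at: datetime) -> datetime:
    """Where a resumed import continues: one microsecond past the saved
    checkpoint, so the last row already written is not re-imported."""
    return paused_at + timedelta(microseconds=1)
```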
Troubleshooting
Common issues
Issue: “Failed to connect to source database” error
Solution:
- Verify the source InfluxDB is running and accessible: curl http://<source_url>/ping
- Check network connectivity and firewall rules
- Verify credentials are correct
- For InfluxDB v2/v3, ensure you’re using token authentication
Issue: “Authentication error: Must provide either…” error
Solution: Choose one authentication method:
- For token: Provide only source_token
- For username/password: Provide both source_username AND source_password together
- Do not mix authentication methods
Issue: “Import already completed” when trying to resume
Solution:
- Check the import status: curl "http://localhost:8181/api/v3/engine/import?action=status&import_id=<import_id>"
- If truly incomplete, check for status discrepancies in the import_state table
- Start a new import if needed
Issue: Tag/field conflicts causing warnings
Solution: This is informational only. The plugin automatically renames conflicting tags with a _tag suffix:
- Original tag temperature → temperature_tag
- Field temperature remains unchanged
Issue: Slow import performance
Solution:
- Increase target_batch_size (e.g., from 2000 to 5000)
- Decrease query_interval_ms if the source can handle higher load
- Use table filtering to import tables in parallel using multiple triggers
- Check network latency between source and destination
Performance considerations
- Network bandwidth: Main bottleneck for large imports. Use local network when possible.
- Source database load: The plugin includes rate limiting (query_interval_ms) to avoid overwhelming the source.
- Batch size optimization: The plugin automatically samples data to determine the optimal batch size, but you can override it with target_batch_size.
- Connection pooling: The plugin uses an HTTP session with connection pooling for better performance.
- Retry logic: Built-in exponential backoff (1s → 2s → 4s → 8s → 16s) for transient errors.
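The backoff schedule above can be sketched as a generic retry wrapper (an illustrative helper, not the plugin's internal implementation):

```python
import time

def with_retries(fn, max_attempts: int = 5, base_delay: float = 1.0):
    """Call fn, retrying on any exception with exponential backoff:
    base_delay * 2**attempt, i.e. 1s, 2s, 4s, 8s, 16s by default."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # exhausted all attempts; surface the error
            time.sleep(base_delay * (2 ** attempt))
```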
HTTP Endpoint
The import plugin supports the following request types:
Start Import
Start a new import from source InfluxDB to InfluxDB 3.
Request: POST /api/v3/engine/import?action=start
Request body (JSON):
{
"source_url": "http://localhost:8086",
"source_token": "my-token",
"influxdb_version": 1,
"source_database": "telegraf",
"dest_database": "imported_data",
"start_timestamp": "2024-01-01T00:00:00Z",
"end_timestamp": "2024-12-31T23:59:59Z",
"table_filter": "cpu.mem.disk"
}
Get Import Status
Check the status and progress of an import.
Request: GET /api/v3/engine/import?action=status&import_id=<import_id>
Pause Import
Pause a running import to resume later.
Request: POST /api/v3/engine/import?action=pause&import_id=<import_id>
Note: Returns error if import is not found, already paused, or already cancelled.
Resume Import
Resume a paused or interrupted import.
Request: POST /api/v3/engine/import?action=resume&import_id=<import_id>
Request body (JSON):
{
"source_token": "my-token"
}
or
{
"source_username": "admin",
"source_password": "my-password"
}
or
`POST /api/v3/engine/import?action=resume&import_id=<import_id>&source_token=your_token`
Note: Authentication credentials are not stored for security reasons and must be provided when resuming. Returns error if import is not found, already cancelled, or already running.
Cancel Import
Cancel a running import. Cancelled imports cannot be resumed.
Request: POST /api/v3/engine/import?action=cancel&import_id=<import_id>
Note: Returns error if import is not found or already cancelled.
Test Connection
Test connectivity to a URL and identify if it’s an InfluxDB instance. Uses a 5-second timeout for fast feedback.
Request: POST /api/v3/engine/import?action=test_connection
Request body (JSON):
{
"source_url": "http://localhost:8086"
}
Note: If the port is omitted, it is inferred from the scheme (http → 80, https → 443).
Success response (InfluxDB v1/v2 detected):
{
"success": true,
"version": "2.7.0",
"build": "OSS"
}
Success response (InfluxDB v3 detected via cluster-uuid header):
{
"success": true,
"version": "3.x.x",
"build": ""
}
Note: InfluxDB v3 does not expose version headers without authentication. Detection uses the cluster-uuid header instead.
Failure response (not InfluxDB or unreachable):
{
"success": false,
"message": "Not an InfluxDB instance"
}
Failure response (InfluxDB requires authentication, version unknown):
{
"success": false,
"message": "Unable to determine InfluxDB version"
}
Note: When InfluxDB returns 401/403 without version headers, the connection test cannot determine the version. This typically means authentication is required. The instance is likely InfluxDB, but version detection requires valid credentials.
List Databases
Get list of databases from source InfluxDB instance.
Request: POST /api/v3/engine/import?action=databases
Request body (JSON):
{
"source_url": "http://localhost:8086",
"influxdb_version": 1,
"source_token": "my-token"
}
List Tables
Get list of tables/measurements from a source database.
Request: POST /api/v3/engine/import?action=tables
Request body (JSON):
{
"source_url": "http://localhost:8086",
"influxdb_version": 1,
"source_database": "telegraf",
"source_token": "my-token"
}
Note: For InfluxDB v2, include source_org in the request body.
Using TOML Configuration Files
This plugin supports using TOML configuration files to specify all plugin arguments.
Important Requirements
To use TOML configuration files, you must set the PLUGIN_DIR environment variable in the InfluxDB 3 host environment.
Setting Up TOML Configuration
- Start InfluxDB 3 with the PLUGIN_DIR environment variable set:
PLUGIN_DIR=~/.plugins influxdb3 serve \
--node-id node0 \
--object-store file \
--data-dir ~/.influxdb3 \
--plugin-dir ~/.plugins
- Copy the example TOML configuration file to your plugin directory:
cp import_config.toml ~/.plugins/
- Edit the TOML file to match your requirements:
# Required parameters
source_url = "http://localhost:8086"
influxdb_version = 1
source_database = "telegraf"
# Authentication (choose one method)
source_token = "my-token"
# Optional parameters
dest_database = "imported_data"
start_timestamp = "2024-01-01T00:00:00Z"
end_timestamp = "2024-12-31T23:59:59Z"
table_filter = "cpu.mem.disk"
- Create a trigger using the config_file_path argument:
influxdb3 create trigger \
--database mydb \
--plugin-filename import.py \
--trigger-spec "request:import" \
--trigger-arguments config_file_path=import_config.toml \
import_trigger
- Start import via HTTP (config from TOML file will be used as defaults, can be overridden in request body):
curl -X POST http://localhost:8181/api/v3/engine/import?action=start
Configuration Priority and Loading
The import plugin loads configuration from multiple sources with the following priority order (highest to lowest):
- HTTP Request Body (highest priority) - JSON parameters in POST request body
- TOML Configuration File - Parameters from the file specified in config_file_path
- Trigger Arguments - Parameters from --trigger-arguments when creating the trigger
- Environment Variables (lowest priority) - System environment variables
Configuration Loading Process
When an import starts, the plugin loads configuration in this order:
# 1. Start with environment variables (lowest priority)
IMPORT_SOURCE_URL, IMPORT_SOURCE_TOKEN, etc.
# 2. Override with trigger arguments (--trigger-arguments)
config_file_path=import_config.toml, source_url=http://localhost:8086, etc.
# 3. Override with TOML file contents (if config_file_path specified)
[from import_config.toml file]
# 4. Override with HTTP request body (highest priority)
{
"source_url": "http://localhost:8086",
...
}
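This "later sources override earlier ones" merge can be sketched in a few lines (illustrative helper, assuming each source is already parsed into a dict):

```python
def merged_config(env: dict, trigger_args: dict, toml_cfg: dict, body: dict) -> dict:
    """Apply the priority order described above: environment variables are
    the base, overridden in turn by trigger arguments, the TOML file, and
    finally the HTTP request body."""
    cfg = {}
    for source in (env, trigger_args, toml_cfg, body):
        # Skip None values so an explicit null never clobbers a lower layer
        cfg.update({k: v for k, v in source.items() if v is not None})
    return cfg
```

For example, a `source_url` in the request body wins over the same key in the TOML file or environment.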
Environment Variables Supported
The following environment variables can be used:
- IMPORT_SOURCE_URL → source_url
- IMPORT_SOURCE_TOKEN → source_token
- IMPORT_SOURCE_USERNAME → source_username
- IMPORT_SOURCE_PASSWORD → source_password
- IMPORT_SOURCE_DATABASE → source_database
- IMPORT_DEST_DATABASE → dest_database
- IMPORT_START_TIMESTAMP → start_timestamp
- IMPORT_END_TIMESTAMP → end_timestamp
Data Type Mismatch Handling
The plugin automatically handles data type mismatches that can occur in older InfluxDB versions where different nodes might have different field types for the same field name.
How It Works
- Schema Detection: At import start, the plugin queries the source database for field types using SHOW FIELD KEYS
- Runtime Type Checking: For each data point, the plugin checks whether the actual value type matches the expected field type
- Automatic Field Creation: If a type mismatch is detected, the plugin creates a new field with a type suffix
Supported Type Suffixes
When type mismatches occur, the plugin appends these suffixes:
- _string - for string values
- _integer - for integer values
- _float - for float values
- _boolean - for boolean values
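The suffixing rule can be sketched as a small helper (illustrative only; the suffix names follow the table above, the function name is hypothetical):

```python
def field_name_for_value(name: str, expected_type: str, value) -> str:
    """Return the field name to write: unchanged when the runtime value
    matches the expected schema type, otherwise suffixed with the actual
    type (bool is checked before int, since bool subclasses int)."""
    actual = {bool: "boolean", int: "integer",
              float: "float", str: "string"}[type(value)]
    if actual == expected_type:
        return name
    return f"{name}_{actual}"
```

For example, an integer arriving in a field the schema says is a float would be written to `<field>_integer`.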
Import best practices
- Use table filtering: Import critical tables first, then others in batches
- Plan for pauses: Pause during high-traffic hours if sharing infrastructure
- Verify data: Compare row counts and sample data after import
- Handle conflicts: Review log warnings about tag/field conflicts
Ready to get started?
Download InfluxDB 3 and have it running in minutes.