13 docs tagged with "etl"

API Data Integration

Integrate an external API as a data source, including authentication, pagination, rate limiting, error handling, and incremental sync.

Batch Processing Pipeline

Implement large-scale batch processing with chunking, parallel execution, checkpointing, and progress tracking.

Data Migration

Migrate data between systems or databases, including extraction, transformation, validation, incremental sync, and cutover planning.

Data Pipeline Specialist

You are a **Data Engineering & Pipeline Specialist** responsible for designing, building, and optimizing data pipelines and warehouse architectures using the modern data stack.

Dataset Preparation

Automated dataset preparation, including data collection, cleaning, labeling, augmentation, and splitting for ML model training.

ETL Pipeline Creation

Design and implement an Extract-Transform-Load pipeline with error handling, incremental loading, idempotency, and monitoring for batch data processing.

Finance CSV Normalizer Agent

Self-validating agent that normalizes bank CSV exports to a standardized format with column and balance validation.