Calabi Connect — Data Ingestion
Calabi Connect is the data ingestion hub. It provides a unified interface for creating, monitoring, and managing ELT pipelines from 90+ data sources into your data warehouse.
Architecture
Key Concepts
| Term | Description |
|---|---|
| Source | A configured connection to a data origin (e.g., Salesforce org, MySQL database, S3 bucket) |
| Destination | Where data is loaded (your configured data warehouse — Redshift, Snowflake, BigQuery, etc.) |
| Connection | A link between one source and one destination, with a sync schedule |
| Stream | An individual table or object within a connection (e.g., Account within Salesforce) |
| Sync Job | One execution of a connection — extracts and loads data |
| Workspace | Your organization's isolated Calabi Connect workspace (one per tenant) |
Sync Modes
| Mode | Description | Best For |
|---|---|---|
| Full Refresh — Overwrite | Replaces all destination data each sync | Small tables, reference data |
| Full Refresh — Append | Appends all source records each sync | Audit/history tables |
| Incremental — Append | Only syncs new records (by cursor field) | Large tables with clear timestamps |
| Incremental — Append+Deduped | Syncs new records + deduplicates by primary key | Recommended for most tables |
For Salesforce, HubSpot, or any source with millions of records, Full Refresh downloads everything every time. Use Incremental to sync only what changed since the last run.
Available Connectors
Calabi Connect supports 90+ connectors on Professional tier, including:
CRM: Salesforce, HubSpot, Pipedrive, Zoho Finance: Stripe, QuickBooks, Xero, NetSuite Marketing: Mailchimp, Klaviyo, Google Ads, Facebook Ads Databases: PostgreSQL, MySQL, MongoDB, DynamoDB, SQL Server, and more Files: S3, Google Sheets, SFTP, Azure Blob APIs: REST API (generic), GraphQL, GitHub, Jira, Slack Analytics: Google Analytics 4, Mixpanel, Amplitude
Enterprise tier adds custom connector development support.