Calabi Connect — Data Ingestion

Professional+

Calabi Connect is the data ingestion hub. It provides a unified interface for creating, monitoring, and managing ELT pipelines from 90+ data sources into your data warehouse.

Architecture

Key Concepts

| Term | Description |
|---|---|
| Source | A configured connection to a data origin (e.g., Salesforce org, MySQL database, S3 bucket) |
| Destination | Where data is loaded: your configured data warehouse (Redshift, Snowflake, BigQuery, etc.) |
| Connection | A link between one source and one destination, with a sync schedule |
| Stream | An individual table or object within a connection (e.g., Account within Salesforce) |
| Sync Job | One execution of a connection, which extracts and loads data |
| Workspace | Your organization's isolated Calabi Connect workspace (one per tenant) |
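
The relationships between these terms can be sketched as a small data model. This is an illustration only, not the Calabi Connect API: every class, field, and value below (including the `schedule` string and the `status` values) is a hypothetical name chosen to mirror the definitions above.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Source:
    name: str  # e.g. "Salesforce org"


@dataclass
class Destination:
    name: str  # e.g. "Snowflake"


@dataclass
class Stream:
    name: str  # e.g. "Account"


@dataclass
class SyncJob:
    connection_name: str
    status: str  # hypothetical values: "succeeded" or "failed"


@dataclass
class Connection:
    # One source, one destination, and a sync schedule.
    name: str
    source: Source
    destination: Destination
    schedule: str  # hypothetical free-form schedule, e.g. "every 6 hours"
    streams: List[Stream] = field(default_factory=list)
    jobs: List[SyncJob] = field(default_factory=list)

    def run_sync(self) -> SyncJob:
        # A real job would extract and load each stream; here we only record it.
        job = SyncJob(connection_name=self.name, status="succeeded")
        self.jobs.append(job)
        return job


@dataclass
class Workspace:
    # One workspace per tenant, holding that tenant's connections.
    tenant: str
    connections: List[Connection] = field(default_factory=list)


workspace = Workspace(tenant="acme")
crm_to_warehouse = Connection(
    name="salesforce-to-snowflake",
    source=Source("Salesforce org"),
    destination=Destination("Snowflake"),
    schedule="every 6 hours",
    streams=[Stream("Account")],
)
workspace.connections.append(crm_to_warehouse)
job = crm_to_warehouse.run_sync()
print(job.status, len(crm_to_warehouse.jobs))
```

Each call to `run_sync` adds one Sync Job to the connection's history, which matches the definition of a Sync Job as one execution of a connection.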

Sync Modes

| Mode | Description | Best For |
|---|---|---|
| Full Refresh - Overwrite | Replaces all destination data each sync | Small tables, reference data |
| Full Refresh - Append | Appends all source records each sync | Audit/history tables |
| Incremental - Append | Syncs only new records (by cursor field) | Large tables with clear timestamps |
| Incremental - Append+Deduped | Syncs new records and deduplicates by primary key | Recommended for most tables |
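
To make the difference between the four modes concrete, the sketch below applies each one to in-memory lists of records. It is a simplified model, not how Calabi Connect implements syncing: the function names, the `updated_at` cursor field, and the `id` primary key are all assumptions made for illustration.

```python
from typing import Any, Dict, List, Optional

Record = Dict[str, Any]


def full_refresh_overwrite(source: List[Record], destination: List[Record]) -> List[Record]:
    # Destination is replaced by the current source contents.
    return list(source)


def full_refresh_append(source: List[Record], destination: List[Record]) -> List[Record]:
    # Every source record is appended again on each sync.
    return destination + list(source)


def incremental_append(source: List[Record], destination: List[Record],
                       cursor_field: str, last_cursor: Optional[Any]) -> List[Record]:
    # Only records whose cursor value is past the last synced value are appended.
    new = [r for r in source if last_cursor is None or r[cursor_field] > last_cursor]
    return destination + new


def incremental_append_deduped(source: List[Record], destination: List[Record],
                               cursor_field: str, last_cursor: Optional[Any],
                               primary_key: str) -> List[Record]:
    # Append new records, then keep only the latest version of each primary key.
    merged = incremental_append(source, destination, cursor_field, last_cursor)
    latest: Dict[Any, Record] = {}
    for record in merged:
        current = latest.get(record[primary_key])
        if current is None or record[cursor_field] >= current[cursor_field]:
            latest[record[primary_key]] = record
    return list(latest.values())


# First sync: two accounts. Second sync: account 1 was edited, account 3 is new.
first = [{"id": 1, "name": "Acme", "updated_at": 1},
         {"id": 2, "name": "Globex", "updated_at": 2}]
second = [{"id": 1, "name": "Acme Corp", "updated_at": 3},
          {"id": 2, "name": "Globex", "updated_at": 2},
          {"id": 3, "name": "Initech", "updated_at": 4}]

overwrite = full_refresh_overwrite(second, list(first))
append = full_refresh_append(second, list(first))
inc_append = incremental_append(second, list(first), "updated_at", last_cursor=2)
inc_deduped = incremental_append_deduped(second, list(first), "updated_at",
                                         last_cursor=2, primary_key="id")
print(len(overwrite), len(append), len(inc_append), len(inc_deduped))
```

In this example the deduped destination ends with one row per `id`, and account 1 carries its updated name, whereas Incremental - Append keeps both the old and the new version of that row.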
Always use Incremental - Append+Deduped for large sources

For Salesforce, HubSpot, or any source with millions of records, Full Refresh downloads everything every time. Use Incremental to sync only what changed since the last run.
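
A rough back-of-the-envelope calculation shows why this matters. The sketch below counts how many rows are read from the source over a series of syncs; the table size, growth rate, and number of syncs are made-up figures, and it assumes every change is a newly added row.

```python
def rows_read(initial_rows: int, new_rows_per_sync: int, syncs: int, incremental: bool) -> int:
    """Total rows read from the source across `syncs` runs (simplified model)."""
    total = 0
    table_size = initial_rows
    for sync_index in range(syncs):
        if incremental and sync_index > 0:
            # After the first run, only rows added since the last sync are read.
            total += new_rows_per_sync
        else:
            # Full Refresh (and the first incremental run) reads the whole table.
            total += table_size
        table_size += new_rows_per_sync
    return total


# Hypothetical source: 1,000,000 rows, growing by 1,000 rows between syncs, 30 syncs.
full = rows_read(1_000_000, 1_000, 30, incremental=False)
incr = rows_read(1_000_000, 1_000, 30, incremental=True)
print(full, incr)  # 30435000 1029000
```

Under these assumptions, Full Refresh reads roughly 30 times as many rows as Incremental over the same period.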

Available Connectors

Calabi Connect supports 90+ connectors on the Professional tier, including:

CRM: Salesforce, HubSpot, Pipedrive, Zoho
Finance: Stripe, QuickBooks, Xero, NetSuite
Marketing: Mailchimp, Klaviyo, Google Ads, Facebook Ads
Databases: PostgreSQL, MySQL, MongoDB, DynamoDB, SQL Server, and more
Files: S3, Google Sheets, SFTP, Azure Blob
APIs: REST API (generic), GraphQL, GitHub, Jira, Slack
Analytics: Google Analytics 4, Mixpanel, Amplitude

The Enterprise tier adds support for custom connector development.

Next Steps