Definition
A Data Stream in Salesforce Data Cloud is an ingestion pipeline that brings data from an external source into Data Cloud. Data Streams define the source system, the data schema (mapping source fields to Data Cloud fields), the refresh schedule (real-time, batch, or incremental), and the target Data Model Object. They are the primary mechanism for connecting external data sources like databases, cloud storage, APIs, and Salesforce orgs to Data Cloud.
Real-World Example
When a platform engineer at NovaScale needs to streamline operations, they turn to Data Stream to enhance the organization's Salesforce footprint with additional functionality. By leveraging Data Stream, the team avoids building a custom solution from scratch, saving months of development time while gaining enterprise-grade features out of the box.
Why Data Stream Matters
A Data Stream in Salesforce Data Cloud is an ingestion pipeline that pulls data from an external source into Data Cloud. Configuring a Data Stream involves picking the source system (a database, cloud storage bucket, API endpoint, or another Salesforce org), defining the data schema by mapping source fields to Data Cloud DMO fields, picking a refresh strategy (real-time streaming, scheduled batch, or incremental updates), and specifying the target DMO that will receive the data.
Data Streams are the entry point for external data into Data Cloud. Without them, Data Cloud has nothing to unify. The variety of supported sources (Salesforce orgs, databases, cloud storage, APIs, files, partner connectors) lets organizations pull in customer data from wherever it lives and bring it into the canonical model. The choice of refresh strategy matters: real-time streams support use cases like personalization that need fresh data within seconds, while batch streams are appropriate for less time-sensitive data that updates daily or weekly. Setting up Data Streams thoughtfully is one of the highest-leverage Data Cloud activities.
How Organizations Use Data Stream
- •NovaScale — Set up Data Streams from their CRM, e-commerce platform, and email marketing tool, all flowing into the Individual DMO. The streams refresh on schedules appropriate to each source's update frequency.
- •TerraForm Tech — Built a real-time Data Stream from their production database to Data Cloud for personalization use cases. Customer behavior events flow into Data Cloud within seconds, enabling real-time recommendations.
- •Wanderlust Travel — Uses scheduled Data Streams for nightly bulk loads from their data warehouse. The nightly cadence is appropriate for analytics use cases that don't need real-time freshness.
