Concepts

Understanding the core building blocks of Airweave.

Source

A Source is a data integration that knows how to connect to and extract data from external systems. Sources define authentication requirements, configuration options, and the specific entities they can extract.

Examples: Gmail, GitHub, Notion, PostgreSQL, Stripe

Collection

A Collection is a searchable knowledge base composed of one or more source connections. Collections provide a unified search interface across all connected data sources.

  • Contains multiple source connections
  • Provides unified search API endpoint
  • Maintains vector embeddings for semantic search

Source Connection

A Source Connection links a specific source to a collection with configured credentials and settings. Each connection represents an authenticated instance of a source within a collection.

White Labels

White Labels enable customization of the Airweave interface for different organizations or use cases. They control branding, available sources, and platform behavior.

  • Custom branding and styling
  • Source filtering and permissions
  • Organization-specific configurations

Entities

Entities are the structured data objects extracted from sources and stored in collections. Each source defines its own entity types based on the data it can extract.

  • Threads, messages, and attachments from Gmail
  • Repositories, files, and commits from GitHub
  • Pages, databases, and blocks from Notion
  • Tables, rows, and schemas from PostgreSQL