Google Docs
Google Docs
Configuration
Google Docs source connector integrates with Google Drive API to extract Google Docs.
Connects to your Google Drive account to retrieve Google Docs documents. Documents are exported as DOCX and processed through Airweave’s file processing pipeline to enable full-text semantic search across document content.
The connector handles:
- Document listing and filtering
- Content export and download (DOCX format)
- Metadata preservation (ownership, sharing, timestamps)
- Incremental sync via Drive Changes API
Authentication
This connector uses OAuth 2.0 with custom credentials. You need to provide your OAuth application’s Client ID and Client Secret, then complete the OAuth consent flow.
- Create an OAuth application in your provider’s developer console
- Enter your Client ID and Client Secret when configuring the connection
- Complete the OAuth consent flow
Configuration Options
The following configuration options are available for this connector:
Google Docs configuration schema.
Include documents that have been moved to trash. Defaults to False.
Include documents shared with you by others. Defaults to True.
Data Models
The following data models are available for this connector:
GoogleDocsDocumentEntity
Schema for a Google Docs document.
Represents a Google Doc file retrieved via the Google Drive API. The document content is exported as DOCX and processed through Airweave’s file processing pipeline to create searchable chunks.
Reference: https://developers.google.com/drive/api/v3/reference/files https://developers.google.com/drive/api/guides/manage-downloads