Google Docs

Google Docs logo

Google Docs

Configuration

Google Docs source connector integrates with Google Drive API to extract Google Docs.

Connects to your Google Drive account to retrieve Google Docs documents. Documents are exported as DOCX and processed through Airweave’s file processing pipeline to enable full-text semantic search across document content.

The connector handles:

  • Document listing and filtering
  • Content export and download (DOCX format)
  • Metadata preservation (ownership, sharing, timestamps)
  • Incremental sync via Drive Changes API

Authentication

This connector uses OAuth 2.0 with custom credentials. You need to provide your OAuth application’s Client ID and Client Secret, then complete the OAuth consent flow.

OAuth Setup Required
  1. Create an OAuth application in your provider’s developer console
  2. Enter your Client ID and Client Secret when configuring the connection
  3. Complete the OAuth consent flow

Configuration Options

The following configuration options are available for this connector:

Configuration Parameters

Google Docs configuration schema.

include_trashed
bool

Include documents that have been moved to trash. Defaults to False.

include_shared
boolDefaults to

Include documents shared with you by others. Defaults to True.

Data Models

The following data models are available for this connector:

Schema for a Google Docs document.

Represents a Google Doc file retrieved via the Google Drive API. The document content is exported as DOCX and processed through Airweave’s file processing pipeline to create searchable chunks.

Reference: https://developers.google.com/drive/api/v3/reference/files https://developers.google.com/drive/api/guides/manage-downloads

FieldTypeDescription
titleOptional[str]Display title of the document (without .docx extension).
descriptionOptional[str]Optional description of the document.
starredboolWhether the user has starred this document.
trashedboolWhether the document is in the trash.
explicitly_trashedboolWhether the document was explicitly trashed by the user.
sharedboolWhether the document is shared with others.
shared_with_me_timeOptional[Any]Time when this document was shared with the user.
sharing_userOptional[Dict[str, Any]]User who shared this document.
ownersList[Dict[str, Any]]Owners of the document.
permissionsOptional[List[Dict[str, Any]]]Permissions for this document.
parentsList[str]IDs of parent folders containing this document.
web_view_linkOptional[str]Link to open the document in Google Docs editor.
icon_linkOptional[str]Link to the document’s icon.
created_timeOptional[Any]When the document was created.
modified_timeOptional[Any]When the document was last modified.
modified_by_me_timeOptional[Any]Last time the user modified the document.
viewed_by_me_timeOptional[Any]Last time the user viewed the document.
versionOptional[int]Version number of the document.
export_mime_typeOptional[str]MIME type used for exporting the document content (DOCX).