Oxagen Docs

Google Drive

Ingest Google Docs, Sheets, Slides, and PDFs into the workspace knowledge graph. File metadata and extracted text feed the LLM extractor so the topics, people, and concepts in your documents become typed nodes.

Google Drive logo

Google Drive

Docs, Sheets, Slides, PDFs → `document` nodes + extracted entities.

The Drive connector scans Google Workspace native files (Docs, Sheets, Slides) plus PDFs and plain-text files in the authenticated user's Drive. Each file becomes a document node with owner, timestamps, and a content-type-aware text export feeding the LLM extractor — so the entities mentioned in your documents land as typed person / organization / concept / task / date nodes wired back to the source document.

What gets ingested

SourceNode typeEdge typeDirection
Drive file (Doc / Sheet / Slide / PDF / text)document
File ownerpersonowns / owned_byPerson ↔ Document
Entity extracted from exported textvariesmentionsDocument → Entity

Real use cases

  • Find the deck — semantic search across exported Slides text retrieves the right deck by topic, not just by filename.
  • Document-aware briefings"summarize every Drive document that mentions Acme" runs as one graph traversal followed by an LLM rollup against the embeddings.
  • Source-grounded answers — when an agent answers a workspace question, the source document nodes are linked so the user can click through to the original Drive file.

Settings

KeyTypeDefaultDescription
backfill_daysnumber365How far back to scan. Subsequent syncs are incremental (only files modified since the last run).
extract_contentbooleantrueWhen false, files are stored as metadata-only nodes — no LLM extraction.
include_sharedbooleantrueWhen false, only files the user owns are scanned.
max_file_bytesnumber10,000,000 (10 MB)Files whose exported text exceeds this size are stored as metadata-only nodes.
mime_typesstring[]Docs, Sheets, Slides, PDF, plain textRestrict to specific content types.
folder_idstring""When set, only files directly inside this folder are scanned. Use the folder picker in the connection drawer.

OAuth scopes

  • drive.readonly — read file metadata and content
  • userinfo.email — resolve the connecting user's email

Read-only — no upload, no edit, no delete.


Get started free · Connectors overview

On this page