Oxagen Docs

Gmail

Ingest email subjects, bodies, and attachments into the workspace knowledge graph. People, companies, and topics mentioned across your inbox become typed nodes your AI agents can traverse.

Gmail logo

Gmail

Inbox → typed people, threads, attachments, and topics.

The Gmail connector turns your inbox into structured graph data. Every message inside the configured backfill window becomes an email node; senders and recipients resolve to person nodes (deduplicated by lowercased email address across every Google connector); attachments above the minimum size threshold are parsed (PDF, DOCX, plain text) and feed the same LLM extractor that processes the body so the entities mentioned land as typed nodes.

What gets ingested

SourceNode typeEdge typeDirection
Email messageemail
Senderpersonfrom / reverse sentEmail → Person
Recipient (To / Cc)personto / reverse receivedEmail → Person
Attachmentattachmenthas_attachmentEmail → Attachment
Entity extracted from body or attachment textvaries (person / organization / concept / date / task)mentionsEmail → Entity

Subjects, bodies, and parsed attachment text feed the LLM extractor — people, companies, dates, projects, and topics referenced across your inbox become first-class nodes your agents can query.

Real use cases

  • Account intelligenceMATCH (org:organization {name: 'Acme'})<-[:mentions]-(e:email) returns every thread that talked about an account, including ones you weren't directly on.
  • Inbox-aware action items — combined with the Zoom or Calendar connectors, a person node bridges email + meetings, so "Alice agreed to send the report in email and confirmed in our meeting" is one entity, two pieces of evidence.
  • Find that thread — semantic search over the body embeddings retrieves emails by meaning, not just by subject keyword.

Settings

KeyTypeDefaultDescription
backfill_daysnumber90How many days of inbox history to scan on the first sync.
extract_bodybooleantrueWhen false, only the email container is stored — no entity extraction from the body.
extract_attachmentsbooleantrueWhen false, attachments are stored as metadata-only nodes.
min_attachment_bytesnumber51,200 (50 KB)Attachments smaller than this skip extraction — avoids parsing tracking-pixel-sized images.
max_attachment_bytesnumber20,000,000 (20 MB)Attachments larger than this are stored as metadata-only nodes — avoids runaway LLM costs on huge decks.
max_message_bytesnumber1,000,000 (1 MB)Bodies larger than this are truncated before extraction.
exclude_labelsstring[]["SPAM", "TRASH"]Gmail labels to skip. Common additions: Promotions, Updates.

OAuth scopes

  • gmail.readonly — read messages and metadata; no send, no modify, no delete
  • userinfo.email — resolve the connecting user's email for nickname purposes

Tokens are encrypted at rest and refresh automatically. Revoking the grant at Google revokes our copy.


Get started free · Connectors overview

On this page