DocsAgentsDrive & artifacts

Drive & artifacts.

Where the agent reads from, where it writes, and how files move through a run. Four areas — drive, workspace, artifacts, knowledge — and a clean lifecycle between them.

§ 01Drive

The drive is the user's file area for a project. It's where uploads land and where the agent reads from. Each project has its own drive; drives do not cross between projects.

Supported formats

Common business and analysis formats are supported out of the box.

Validation

Every upload is validated by magic-number signature, not just file extension. A .pdf that isn't actually a PDF is rejected at upload time. Extensions are allowlisted — formats outside the supported list are refused outright rather than silently uploaded as opaque blobs.

The original filename is retained as metadata. Internally the file may be stored under a normalized identifier, but the agent and the UI see the name you uploaded.

CautionFiles that fail signature validation are rejected with a typed error, not quarantined. If an upload is bouncing, check that the file is actually the format its extension claims.

Full-text extraction

On upload, every supported file goes through a text-extraction pipeline so the agent can read it as text rather than as a blob:

Extracted text is the substrate the agent reasons over. The original file remains available for download and for tools that need the raw bytes.

§ 02Workspace

The workspace is the agent's scratch space during a run. Intermediate files, working copies, partial outputs, anything the agent produces along the way that isn't the final deliverable.

The workspace is the agent's, not the user's. You don't drop files into it directly. The agent writes here through tool calls and reads from here as the run progresses. Between runs the workspace is cleared unless something has been promoted to an artifact.

§ 03Artifacts

An artifact is a deliverable — the part of a run that survives. Where the workspace is scratch, artifacts are output.

The agent saves an artifact through save_output(). Each artifact has:

Artifacts are listed on the run record in the workspace UI and accessible programmatically. They are downloadable as files, viewable inline (where the format allows), and persistent — they survive session expiry, login cycles, and time.

§ 04Knowledge base

The knowledge base is the agent's per-session memory. It captures, in a structured way, the things the agent has read or seen during the session: file contents it has loaded, terminal output it has run, browser pages it has visited, chat messages it has exchanged.

The knowledge base is what lets the agent answer follow-up questions without re-reading the same files. "What was the renewal clause again?" doesn't trigger a fresh read of the contract — it queries the knowledge base.

Knowledge base lifecycle:

§ 05Listing, downloading, deleting outputs

Outputs (artifacts) are exposed as a small REST surface. Three endpoints cover the lifecycle.

List outputs

GET/api/outputs

Returns the artifacts visible to the caller, scoped to their projects. Supports filtering by project, run, and time range.

Fetch a single output

GET/api/outputs/:id

Returns the metadata and content of a specific artifact. Use this to download the file or stream it inline.

Delete an output

DELETE/api/outputs/:id

Soft-deletes an artifact. The artifact is removed from listings; the run record still references it but the content is no longer retrievable.

NoteDeletes are scoped to the owning user. You cannot delete an artifact you do not own, and admin deletes are recorded in the audit trail.

§ 06File preview & viewing

Both drive files and run artifacts can be previewed inline in the workspace UI without downloading.

For unsupported formats the workspace falls back to a download link with the original filename and content type.

§ 07Where to go next