Dynamiq
Knowledge Bases

Create a Knowledge Base

Create a Knowledge Base with your choice of splitter settings, embedding provider, and vector store — Dynamiq generates the ingestion workflow for you.

Creating a Knowledge Base takes a name and (optionally) a few advanced choices: how documents are split, which provider embeds them, and where the vectors are stored. Dynamiq generates a complete ingestion workflow from those choices, so the Knowledge Base is ready to accept files immediately.

Create from the UI

Open Knowledge Bases and start creation

In your project, open Knowledge Bases and click the create button. The Create a knowledge base dialog opens.

The Create a knowledge base dialog with the Name field and Advanced settings accordion

Name it

Enter a Name. If the defaults suit you — character splitting, Cohere embeddings, Dynamiq-managed vector storage — you can click Create right now and skip the rest.

(Optional) Tune the document splitter

Expand Advanced settings. Under Document splitter, choose how documents are chunked:

  • Split byCharacter, Word, Sentence, Page, Passage, or Title.
  • Split length — how many units per chunk (defaults to 1024 for character splitting; each split-by mode has its own sensible default, e.g. 200 words or 10 sentences).
  • Split overlap — how many units consecutive chunks share (defaults to 256 for character splitting).
Advanced settings expanded showing Document splitter, Document embedder, and vector storage options

(Optional) Pick the embedder

Under Document embedder, select the Embedder, its Connection, and the Model. Available embedders:

  • OpenAI Document Embedder
  • Bedrock Document Embedder
  • Cohere Document Embedder (default, with model embed-v4.0)
  • Hugging Face Document Embedder
  • Mistral Document Embedder
  • IBM watsonx Document Embedder
  • Gemini Document Embedder
  • VertexAI Document Embedder

The Connection dropdown is pre-filled with your organization's system connection for the selected provider when one exists; use + New connection to add your own credentials instead. See Create a Connection.

(Optional) Choose vector storage

Use default vector storage is on by default — Dynamiq stores vectors in managed storage (a Weaviate-backed vector store) with no setup. Toggle it off to bring your own store and configure Storage, Connection, and Index name. Available writers:

  • Weaviate Writer
  • Pinecone Writer
  • Milvus Writer
  • Chroma Writer
  • Qdrant Writer
  • Elasticsearch Writer
  • OpenSearch Writer
  • pgvector Writer

Create

Click Create. You land on the new Knowledge Base's page, ready to add content on the Files and Integrations tabs.

Prefer full control over the pipeline? Click Manual configuration instead — it opens the ingestion workflow editor where you build the flow node by node. See Customize the Ingestion Workflow.

Choose your embedder and chunking strategy deliberately: retrieval embeds queries with the same embedder used at ingestion, so switching providers later means reprocessing existing items.

What gets created behind the scenes

The dialog's choices are compiled into an ingestion workflow — a real Workflow you can open on the Knowledge Base's Workflow tab. The generated flow has four stages:

StageNodeWhat it does
Pre-processingmulti-file-converterRoutes each file by type to a converter: LLM image converter, PDF, PPTX, DOCX, and text converters, with an unstructured-file converter as fallback
Chunkingdocument-splitterSplits documents using your Split by / Split length / Split overlap settings
Vectorizationdocument embedderEmbeds each chunk with your selected provider and model
Storagevector store writerUpserts vectors into the configured store; the workflow output reports the upserted_count
The Workflow tab showing the generated ingestion flow grouped into Pre-processing, Chunking, Vectorization, and Storage stages

The Knowledge Base also gets its own hostname (shown on its page) that serves the ingestion and POST /v1/documents/search retrieval endpoints — see Knowledge Base API.

Create via the management API

POST /v1/knowledgebases creates a Knowledge Base programmatically. The payload requires name, project_id, and the full ingestion workflow definition (flow and flow_ui); description and runtime_id are optional:

curl -X POST "https://api.getdynamiq.ai/v1/knowledgebases" \
  -H "Authorization: Bearer $DYNAMIQ_ACCESS_KEY" \
  -H "Content-Type: application/json" \
  -d @knowledgebase.json

Because flow and flow_ui describe the entire ingestion workflow graph, the practical path is to create the Knowledge Base in the UI and use the API for everything afterwards — uploading items, managing sources, and searching. Those endpoints are covered in Data Sources and Knowledge Base API.

Next steps

On this page