Vespa cluster - Primary components

In a Vespa cluster, the primary components you define in services.xml are:

<admin> – for management and control.
<container> – for serving HTTP (queries/feeds/APIs) and stateless services.
<content> – for stateful services like document storage, indexing, and search.

Let’s deep dive into each with purpose, internal structure, and advanced configuration.

✅ 1. <admin> Component – Management and Monitoring

🔷 Purpose:

The admin component configures admin server, monitoring, metrics, and cluster orchestration (e.g., config servers, cluster controllers).

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Internal Roles:

Sub-component	Purpose
adminserver	Single node responsible for admin UI and coordination
configservers	Config distribution across nodes (required in multi-node setup)
cluster-controllers	Supervises content nodes (health, state mgmt)
metrics	Exposes metrics to Prometheus/Telegraf/etc.

✅ 2. <container> Component – Stateless HTTP/Query/Feed/Processing Engine

🔷 Purpose:

Runs:

Query services (REST/JSON)
Feed ingestion
Document processing
Custom HTTP applications
Search and ranking logic

🔷 Basic Example:

🔷 Advanced Configuration:

<container id="query-container" version="1.0">
<search/>
<document-api/>
<processing/>
<document-processing cluster="feed"/>

<rest-api>
<binding>http://*/custom-endpoint</binding>
</rest-api>

<nodes>
<node hostalias="query-node-1"/>
<node hostalias="query-node-2"/>
</nodes>
</container>

🔷 Key Elements and Their Roles:

Element	Purpose
<search/>	Enables query handling and ranking
<document-api/>	Allows feed operations (document PUT/REMOVE/etc.)
<processing/>	Enables request/response processing chains
<document-processing cluster="feed"/>	Associates processing pipeline with specific content cluster
<rest-api>	Add custom REST endpoints
<nodes>	Deploy container nodes (can scale independently)

You can have multiple containers: one for feeding, one for querying, one for admin APIs, etc.

✅ 3. <content> Component – Stateful Document Storage, Indexing, Search

🔷 Purpose:

Stores and indexes Vespa documents.
Performs vector search, full-text search, filtering, etc.
Manages replication, distribution, and persistence.

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Key Elements and Roles:

Element	Purpose
<engine><proton/></engine>	Core search engine (always proton)
<documents>	Defines document types handled
<distribution><hash/></distribution>	Hash-based sharding
<redundancy>	Replication factor (how many copies of data)
<visibility-delay>	Ensures soft commit delay before doc is visible
<tuning>	Fine-grain performance/resource tuning
<nodes>	Storage and search nodes

🔄 Communication Flow Summary:

[User HTTP Request]
↓
[Container Node]
- REST API
- Feed / Search
↓
[Content Node(s)]
- Stores/indexes docs
- Runs queries/searches
↑
[Admin/Config Server]
- Cluster state
- Orchestration

🔍 Use Case-Based Deployment Design:

Use Case	Component Setup
High QPS Read	Scale <container> (with <search/>) horizontally
High Feed Ingestion	Separate feed <container> with <document-api/> and <document-processing>
Document Vector Search	Use <content> with mode="index" and ANN field types
Monitoring	Add <metrics> inside <admin> with Prometheus support
Blue/Green Deployment	Run parallel <content> clusters and switch container endpoints

Share on Facebook Share on Twitter

Vespa cluster - Primary components

✅ 1. <admin> Component – Management and Monitoring

🔷 Purpose:

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Internal Roles:

✅ 2. <container> Component – Stateless HTTP/Query/Feed/Processing Engine

🔷 Purpose:

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Key Elements and Their Roles:

✅ 3. <content> Component – Stateful Document Storage, Indexing, Search

🔷 Purpose:

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Key Elements and Roles:

🔄 Communication Flow Summary:

🔍 Use Case-Based Deployment Design:

Popular Posts

Category

Stay Connected

Sidebar Ads

Contact Form

Vespa cluster - Primary components

✅ 1. <admin> Component – Management and Monitoring

🔷 Purpose:

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Internal Roles:

✅ 2. <container> Component – Stateless HTTP/Query/Feed/Processing Engine

🔷 Purpose:

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Key Elements and Their Roles:

✅ 3. <content> Component – Stateful Document Storage, Indexing, Search

🔷 Purpose:

🔷 Basic Example:

🔷 Advanced Configuration:

🔷 Key Elements and Roles:

🔄 Communication Flow Summary:

🔍 Use Case-Based Deployment Design:

You Might Also Like

Popular Posts

Category

Stay Connected

Sidebar Ads

Contact Form