Solr: Replica Management and Autoscaling

Apache Solr is a powerful, open-source search platform used to build scalable search applications. As your data grows, managing replicas and ensuring high availability becomes critical. This post covers two important aspects of scaling Solr clusters: Replica Management and Autoscaling.

📌 What Are Solr Replicas?

In Solr, a replica is a copy of a shard. Each Solr collection is split into shards, and each shard can have one or more replicas.

NRT Replica (Near Real Time): Most commonly used; supports reads and writes.
TLOG Replica: Useful for recovery from transaction logs; ensures durability.
PULL Replica: Read-only replica that pulls index changes from a leader.

Replica types help balance performance, fault tolerance, and recovery time depending on your use case.

🔁 Replica Placement and Management

Solr allows administrators to control how and where replicas are placed. Best practices include:

Distribute replicas across different nodes for high availability.
Use maxShardsPerNode to limit how many shards a node can handle.
Set replicationFactor appropriately based on your desired fault tolerance (e.g., 3 for 2-node failure tolerance).

Example command to create a collection with 2 shards and 2 replicas per shard:

curl http://localhost:8983/solr/admin/collections?action=CREATE&name=mycollection&numShards=2&replicationFactor=2

⚙️ Solr Autoscaling (Pre-Solr 9.x)

Earlier versions of Solr included an autoscaling framework (now deprecated). It allowed users to define triggers, policies, and preferences to manage node capacity and auto-place replicas based on defined rules.

Sample policy configuration:

{
  "policy": {
    "replica": {
      "node": "not:node1",
      "cores": "<=4"
    }
  }
}

Though deprecated, the concepts are still useful if you're running older Solr versions (e.g., Solr 8.x).

🚀 Solr Autoscaling with Kubernetes or External Orchestrators

In modern deployments (Solr 9+), SolrCloud typically runs on Kubernetes or cloud platforms where autoscaling is handled externally. Here's how you can achieve autoscaling:

Horizontal Pod Autoscaler (Kubernetes): Scale Solr pods based on CPU/memory usage or custom metrics.
Node Autoscaler: Automatically add/remove nodes and reassign replicas with help from Solr's REBALANCELEADERS or MOVEREPLICA APIs.
Solr Operator (K8s): Use the Solr Operator to manage Solr clusters declaratively with CRDs. It supports scaling and rolling upgrades.

Command to move a replica:

curl "http://localhost:8983/solr/admin/collections?action=MOVEREPLICA&collection=mycollection&shard=shard1&fromNode=solr1&toNode=solr3"

✅ Best Practices

Monitor node capacity and CPU/memory metrics continuously.
Balance replicas evenly across nodes and availability zones.
Automate replica rebalancing in response to cluster changes.
Use Solr's APIs or cloud-native tooling for orchestration.

Final Thoughts

Replica management and autoscaling are key components in running a reliable and scalable Solr cluster. Whether you're using Solr's internal capabilities or leveraging cloud-native tools like Kubernetes, it's important to plan for failure recovery, load distribution, and dynamic resource management.

Have questions about SolrCloud scaling or setting up a production-grade search cluster? Let us know in the comments!

DEVOPSZONES

Recent blogs

Solr: Replica Management and Autoscaling