# Azure Cache for Redis
Azure Cache for Redis is a fully managed, in-memory data store based on Redis. It’s used for caching, session management, leaderboards, rate limiting, and real-time messaging — the same use cases as AWS ElastiCache Redis.
## Why Caching?

```
Without cache:
Client ──► App Server ──► Database (50-100 ms)

With cache:
Client ──► App Server ──► Redis (1-2 ms) ──hit──► return cached data
                             │
                             └──miss──► Database ──► store in Redis ──► return
```

Caching reduces database load, lowers latency, and improves scalability.
| Tier | Description | Size | SLA | Key Features |
|---|---|---|---|---|
| Basic | Single node, no SLA | 250 MB – 53 GB | None | Development/testing only |
| Standard | Primary + replica | 250 MB – 53 GB | 99.9% | Replication, failover |
| Premium | Clustered, VNet, persistence | 6 GB – 1.2 TB | 99.9% | Clustering, geo-replication, data persistence |
| Enterprise | Redis Enterprise (RediSearch, RedisJSON, etc.) | Up to 10+ TB | 99.999% | Redis modules, active-active geo-replication |
| Enterprise Flash | Redis Enterprise on NVMe + RAM | Very large datasets | 99.999% | Cost-effective for large data |
## Choosing a Tier

| Scenario | Tier |
|---|---|
| Local development, testing | Basic |
| Production with HA | Standard |
| Large datasets, clustering, VNet | Premium |
| Advanced data structures (JSON, search, time series) | Enterprise |
## Creating a Cache

```bash
# Create a Standard C1 cache (1 GB, primary + replica).
# The non-SSL port (6379) is disabled by default.
az redis create \
  --resource-group myapp-rg \
  --name myapp-cache \
  --location eastus \
  --sku Standard \
  --vm-size C1 \
  --minimum-tls-version 1.2
```
```bash
# Get connection details
az redis show \
  --resource-group myapp-rg \
  --name myapp-cache \
  --query "{host: hostName, port: sslPort}" -o table
```
```bash
# Get access key
az redis list-keys \
  --resource-group myapp-rg \
  --name myapp-cache \
  --query "primaryKey" -o tsv
```

## Connecting to Redis
### Python
```python
import redis

r = redis.Redis(
    host="myapp-cache.redis.cache.windows.net",
    port=6380,
    password="<access-key>",
    ssl=True,
    decode_responses=True,
)

# Basic operations
r.set("user:123:name", "Alice", ex=3600)  # set with a 1-hour TTL
name = r.get("user:123:name")             # "Alice"
```

### With Managed Identity (Entra ID Authentication)
```python
import redis
from azure.identity import DefaultAzureCredential

credential = DefaultAzureCredential()
token = credential.get_token("https://redis.azure.com/.default").token

r = redis.Redis(
    host="myapp-cache.redis.cache.windows.net",
    port=6380,
    username="<object-id-of-managed-identity>",
    password=token,
    ssl=True,
    decode_responses=True,
)
```

### Node.js
```javascript
const Redis = require("ioredis");

const redis = new Redis({
  host: "myapp-cache.redis.cache.windows.net",
  port: 6380,
  password: "<access-key>",
  tls: { servername: "myapp-cache.redis.cache.windows.net" },
});

await redis.set("session:abc", JSON.stringify({ userId: 123 }), "EX", 1800);
const session = JSON.parse(await redis.get("session:abc"));
```

## Caching Patterns
### Cache-Aside (Lazy Loading)
The most common pattern: the application checks the cache first and falls back to the database on a miss:
```python
import json

def get_product(product_id):
    # 1. Check cache
    cached = r.get(f"product:{product_id}")
    if cached:
        return json.loads(cached)

    # 2. Cache miss: fetch from database
    product = db.query("SELECT * FROM products WHERE id = %s", product_id)

    # 3. Store in cache with TTL
    r.set(f"product:{product_id}", json.dumps(product), ex=600)  # 10-minute TTL

    return product
```

Pros: only caches data that is actually requested. Cons: the first request is always slow (cache miss), and data can be stale until the TTL expires.
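One practical refinement: if many keys are populated at the same moment with the same TTL, they all expire together and the database absorbs a burst of misses. A common mitigation is to add random jitter to the TTL. A minimal sketch; `ttl_with_jitter` is an illustrative helper, not part of redis-py:

```python
import random

def ttl_with_jitter(base_seconds: int, spread: float = 0.1) -> int:
    """Return the base TTL shifted by up to +/- `spread` (10% by default)
    to stagger expirations and avoid synchronized cache misses."""
    delta = int(base_seconds * spread)
    return base_seconds + random.randint(-delta, delta)

# e.g. r.set(f"product:{product_id}", json.dumps(product), ex=ttl_with_jitter(600))
```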
### Write-Through

Write to the cache and the database at the same time:
```python
def update_product(product_id, data):
    # Write to database
    db.execute("UPDATE products SET ... WHERE id = %s", product_id)
    # Write to cache
    r.set(f"product:{product_id}", json.dumps(data), ex=600)
```

Pros: the cache is always up to date. Cons: write latency increases, and unused data may fill the cache.
### Write-Behind (Write-Back)

Write to the cache immediately and flush to the database asynchronously:
```python
def update_product(product_id, data):
    # Write to cache
    r.set(f"product:{product_id}", json.dumps(data))
    # Queue a database write
    r.lpush("db:write-queue", json.dumps({"table": "products", "id": product_id, "data": data}))
```

Pros: fastest writes. Cons: risk of data loss if the cache crashes before the flush, plus added complexity.
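The flush side of write-behind is a separate worker that drains the queue. A minimal sketch under assumptions: `drain_write_queue` is a hypothetical helper, `r` is a redis-py client, and `db` is any object with an `execute()` method; a production worker would use a blocking `brpop()`, batching, and retry handling:

```python
import json

def drain_write_queue(r, db, queue="db:write-queue", max_items=None):
    """Pop queued writes (oldest first, since writers use LPUSH) and
    apply each one to the database. Returns the number applied."""
    applied = 0
    while max_items is None or applied < max_items:
        raw = r.rpop(queue)  # a long-running worker would use brpop()
        if raw is None:
            break            # queue drained
        item = json.loads(raw)
        db.execute("UPDATE products SET data = %s WHERE id = %s",
                   json.dumps(item["data"]), item["id"])
        applied += 1
    return applied
```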
## Cache Invalidation

| Strategy | When to Use |
|---|---|
| TTL (Time-to-Live) | Simple, good enough for most cases |
| Explicit delete | Delete cache key when data changes |
| Pub/Sub | Notify other services to invalidate their local caches |
| Event-driven | Use Event Grid or Service Bus to trigger cache invalidation |
```python
# Explicit invalidation on update
def update_product(product_id, data):
    db.execute("UPDATE products SET ...")
    r.delete(f"product:{product_id}")  # next read will re-cache
```

## Common Use Cases
### Session Storage
Section titled “Session Storage”import uuid, json
def create_session(user_id): session_id = str(uuid.uuid4()) r.set(f"session:{session_id}", json.dumps({ "user_id": user_id, "created": "2026-02-17T10:00:00Z" }), ex=1800) # 30-minute session return session_id
def get_session(session_id): data = r.get(f"session:{session_id}") if data: r.expire(f"session:{session_id}", 1800) # sliding expiration return json.loads(data) return NoneLeaderboard
```python
# Add scores
r.zadd("leaderboard:daily", {"player-alice": 1500, "player-bob": 1200})

# Get top 10 (redis-py returns scores as floats)
top_10 = r.zrevrange("leaderboard:daily", 0, 9, withscores=True)
# [("player-alice", 1500.0), ("player-bob", 1200.0), ...]

# Get player rank
rank = r.zrevrank("leaderboard:daily", "player-alice")  # 0 (first place)
```

### Rate Limiting
Section titled “Rate Limiting”def is_rate_limited(user_id, limit=100, window=60): key = f"ratelimit:{user_id}" current = r.incr(key) if current == 1: r.expire(key, window) return current > limitPub/Sub (Real-Time Messaging)
### Pub/Sub (Real-Time Messaging)

```python
# Publisher
r.publish("notifications", json.dumps({
    "user_id": 123,
    "message": "Your order has shipped"
}))
```
```python
# Subscriber (in a separate process)
pubsub = r.pubsub()
pubsub.subscribe("notifications")
for message in pubsub.listen():
    if message["type"] == "message":
        handle_notification(json.loads(message["data"]))
```

### Distributed Locking
```python
def acquire_lock(lock_name, timeout=10):
    """Acquire a distributed lock using Redis SET NX."""
    lock_key = f"lock:{lock_name}"
    acquired = r.set(lock_key, "locked", nx=True, ex=timeout)
    return acquired

def release_lock(lock_name):
    r.delete(f"lock:{lock_name}")

# Usage
if acquire_lock("process-order-123"):
    try:
        process_order(123)
    finally:
        release_lock("process-order-123")
```
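One known pitfall in the sketch above: `release_lock` deletes the key unconditionally, so if the lock expired mid-operation and another process acquired it, the first process deletes the second one's lock. The standard fix (described in the Redis documentation on single-instance locking) is to store a random token at acquire time and delete only when the token still matches, atomically via Lua. A sketch; the helper names are illustrative:

```python
import uuid

UNLOCK_LUA = """
if redis.call('GET', KEYS[1]) == ARGV[1] then
  return redis.call('DEL', KEYS[1])
end
return 0
"""

def acquire_lock_token(r, lock_name, timeout=10):
    """Return a unique token on success, or None if the lock is held."""
    token = str(uuid.uuid4())
    if r.set(f"lock:{lock_name}", token, nx=True, ex=timeout):
        return token
    return None

def release_lock_safe(r, lock_name, token):
    """Delete the lock only if we still own it (atomic check-and-delete)."""
    unlock = r.register_script(UNLOCK_LUA)
    return unlock(keys=[f"lock:{lock_name}"], args=[token]) == 1
```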
## Clustering (Premium Tier)

Premium tier supports Redis Cluster for horizontal scaling:
```bash
# Create a Premium cache with 3 shards
az redis create \
  --resource-group myapp-rg \
  --name myapp-cache-premium \
  --location eastus \
  --sku Premium \
  --vm-size P1 \
  --shard-count 3
```

Data is automatically distributed across shards by hash slot. More shards mean more throughput and memory.
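The hash slot for a key is the CRC16 of the key modulo 16384, with one exception: a non-empty `{...}` hash tag means only the tag is hashed, so related keys can be forced onto the same shard. A sketch of the slot calculation; `crc16_xmodem` and `key_slot` are illustrative helpers (Redis uses the CRC16-CCITT/XMODEM variant), not part of any client library:

```python
def crc16_xmodem(data: bytes) -> int:
    """CRC16-CCITT (XMODEM), the checksum Redis Cluster uses for key slots."""
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            crc = ((crc << 1) ^ 0x1021) & 0xFFFF if crc & 0x8000 else (crc << 1) & 0xFFFF
    return crc

def key_slot(key: str) -> int:
    """Return the Redis Cluster hash slot (0-16383) for a key.
    With a non-empty {...} hash tag, only the tag is hashed, so
    user:{123}:name and user:{123}:email land on the same shard."""
    start = key.find("{")
    if start != -1:
        end = key.find("}", start + 1)
        if end != -1 and end != start + 1:  # ignore empty tags like "{}"
            key = key[start + 1:end]
    return crc16_xmodem(key.encode()) % 16384
```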
## Geo-Replication

Link two Premium caches for cross-region replication:

```bash
# Link primary (East US) to secondary (West Europe) for geo-replication
az redis server-link create \
  --name myapp-cache-primary \
  --resource-group myapp-rg \
  --server-to-link /subscriptions/.../myapp-cache-secondary \
  --replication-role Secondary
```

## Data Persistence (Premium Tier)
| Type | Description | RPO |
|---|---|---|
| RDB snapshots | Point-in-time snapshots to Azure Storage | Minutes (snapshot interval) |
| AOF (Append Only File) | Log every write operation | ~1 second |
```bash
# Enable RDB persistence (snapshot every 15 minutes)
az redis update \
  --resource-group myapp-rg \
  --name myapp-cache-premium \
  --set "redisConfiguration.rdb-backup-enabled=true" \
  --set "redisConfiguration.rdb-backup-frequency=15" \
  --set "redisConfiguration.rdb-storage-connection-string=<connection-string>"
```

## VNet Integration (Premium Tier)
Inject the cache into your VNet for private access:
```bash
az redis create \
  --resource-group myapp-rg \
  --name myapp-cache-private \
  --location eastus \
  --sku Premium \
  --vm-size P1 \
  --subnet-id /subscriptions/.../subnets/redis-subnet
```

With VNet integration, the cache is accessible only from within the VNet; there is no public endpoint.
## Monitoring

| Metric | What to Watch |
|---|---|
| Cache hit ratio | Should be >90%; low ratio means inefficient caching |
| Used memory | Approaching max → eviction starts |
| Connected clients | Unexpected spikes may indicate connection leaks |
| Server load | >80% sustained → scale up or out |
| Evicted keys | Keys removed due to memory pressure |
| Latency | P99 should be <5 ms for same-region access |
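The hit ratio in the table above is hits / (hits + misses). It can be derived from the cacheHits and cacheMisses metrics, or client-side from the `keyspace_hits` and `keyspace_misses` counters in Redis INFO stats. A sketch of the arithmetic; `cache_hit_ratio` is an illustrative helper:

```python
def cache_hit_ratio(hits: int, misses: int) -> float:
    """Hit ratio = hits / (hits + misses); 0.0 when there is no traffic."""
    total = hits + misses
    return hits / total if total else 0.0

# e.g. with redis-py:
# stats = r.info("stats")
# ratio = cache_hit_ratio(stats["keyspace_hits"], stats["keyspace_misses"])
```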
```bash
# View cache metrics
az monitor metrics list \
  --resource /subscriptions/.../Microsoft.Cache/redis/myapp-cache \
  --metric "cacheHits,cacheMisses,usedmemory,serverLoad" \
  --interval PT5M
```

## Azure Cache for Redis vs AWS ElastiCache
| Feature | Azure Cache for Redis | AWS ElastiCache Redis |
|---|---|---|
| Managed | Fully managed | Fully managed |
| Clustering | Premium tier | Yes (cluster mode) |
| Geo-replication | Premium (active-passive), Enterprise (active-active) | Global Datastore (active-passive) |
| Modules | Enterprise tier (RediSearch, RedisJSON, RedisTimeSeries) | Not supported |
| Entra ID auth | Yes | IAM auth |
| VNet integration | Premium tier (VNet injection) | VPC subnets |
| Persistence | RDB + AOF (Premium) | RDB + AOF |
## Key Takeaways

- Azure Cache for Redis is a fully managed in-memory data store for sub-millisecond access.
- Cache-aside is the most common pattern — check cache first, fall back to database on miss.
- Common use cases: session storage, leaderboards (sorted sets), rate limiting (INCR + EXPIRE), pub/sub, distributed locks.
- Standard tier provides HA with replication; Premium adds clustering, VNet, persistence, and geo-replication.
- Enterprise tier unlocks Redis modules (RediSearch, RedisJSON) and active-active geo-replication.
- Monitor cache hit ratio (>90%) and server load (<80%) to ensure healthy performance.