Background
We use cache invalidation to address cache coherence, i.e., to have updated data reflected as soon as possible, but cache inconsistency can still occur.
In this post, I would like to point out these inconsistencies and look at how we can solve them, along with the trade-offs involved.
Wrong Cache for a Short Period of Time
Async by Cache Aside
Due to the asynchronous nature of the invalidation, there is a short period of time during which the data is stale.
DB: (x=1) => (x=2)
- App writes to database (x=2)
- App reads old data from cache (x=1)
- inconsistency occurs here
- Cache gets invalidated (x=nil)
As a result, stale data is obtained, since the App reads the old value from the cache (x=1) before the invalidation lands.
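Below is a minimal sketch of this write path, where in-memory dicts stand in for the DB and the cache and a background thread stands in for the asynchronous invalidation; the names db, cache, and invalidate_async are illustrative, not from any particular library.

```python
import threading
import time

db = {"x": 1}      # source of truth
cache = {"x": 1}   # cache-aside copy

def invalidate_async(key):
    """Simulate an asynchronous invalidation (e.g. via a message queue)."""
    def _invalidate():
        time.sleep(0.1)          # network / queue delay
        cache.pop(key, None)     # cache gets invalidated (x=nil)
    threading.Thread(target=_invalidate).start()

# App writes to database (x=2) and schedules the invalidation
db["x"] = 2
invalidate_async("x")

# Until the invalidation lands, readers still see the old value
print(cache.get("x"))   # 1    -> stale read during the async window
time.sleep(0.2)
print(cache.get("x"))   # None -> cache miss, the next read repopulates from the DB
```

The staleness here is bounded by the invalidation delay, which is why this case is usually tolerable.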
Wrong Cache
Sync by App
DB: (x=1) => (x=2)
- App2 reads from database (x=1)
- App1 writes to database (x=2)
- Cache gets invalidated (x=nil)
- App2 updates old data to cache (x=1)
As a result, stale data is obtained, because updating the DB and invalidating the cache are not atomic with respect to App2's read-then-repopulate sequence.
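The race can be replayed deterministically by splitting the usual cache-aside read path into its two non-atomic steps (read DB, then write back to cache); the dicts below are illustrative stand-ins.

```python
db = {"x": 1}
cache = {}                       # x is currently not cached

# App2 begins a cache-aside read: cache miss, so it reads the DB first...
app2_value = db["x"]             # App2 reads from database (x=1)

# ...but before it can write that value back, App1's write sneaks in:
db["x"] = 2                      # App1 writes to database (x=2)
cache.pop("x", None)             # Cache gets invalidated (x=nil)

# App2 now finishes its read path and repopulates the cache with the old value:
cache["x"] = app2_value          # App2 updates old data to cache (x=1)

print(db["x"], cache["x"])       # 2 1 -> stale value sits in the cache until it expires
```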
Master-slave DB
In master-slave database setups, wrong data can be written to the cache:
DB: (x=1) => (x=2)
- App1 writes to master db (x=2)
- Replication completed on slave 2 (x=2)
- Cache gets invalidated (x=nil)
- App2 reads old data from slave 1 (Replication not completed yet) (x=1)
- App2 writes old data to cache (x=1)
DB: x=2, Cache: x=1
As a result, stale data ends up in the cache, and subsequent requests will read the old value from the cache until it expires!
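The same kind of deterministic replay works here too, with plain dicts standing in for the master, the lagging slave, the up-to-date slave, and the cache:

```python
master = {"x": 1}
slave1 = {"x": 1}                 # replication to this slave is lagging
slave2 = {"x": 1}
cache = {"x": 1}

master["x"] = 2                   # App1 writes to master db (x=2)
slave2["x"] = master["x"]         # Replication completed on slave 2 (x=2)
cache.pop("x", None)              # Cache gets invalidated (x=nil)

stale = slave1["x"]               # App2 reads old data from slave 1 (x=1)
cache["x"] = stale                # App2 writes old data to cache (x=1)

slave1["x"] = master["x"]         # replication finally catches up, too late

print(master["x"], cache["x"])    # 2 1 -> subsequent reads hit the stale cache entry
```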
Summary
Basically, the problem in all of these cases is reading stale data and writing it back into the cache.
So, how about updating the cache directly instead of invalidating it?
Update Cache with Master DB
DB: x=1 => x=2
- App1 writes x=1 to db
- App2 writes x=2 to db
- App2 writes x=2 to cache
- App1 writes x=1 to cache
As a result, DB: x=2 but Cache: x=1 -> wrong cache!
Possible Solution:
- Acquire a distributed lock before writing to the DB, and release it only after the cache has also been updated, so the two writes are serialized per key and the race condition is avoided (see the sketch after this list)
Potential issue:
- What if the write to the cache fails while the write to the DB has succeeded?
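As a rough sketch of the lock-based approach, assuming a Redis-based lock via redis-py (the key names, TTL, and write_db callback are illustrative); a production lock would also need an atomic release, e.g. a token check inside a Lua script:

```python
import uuid
import redis

r = redis.Redis()

def write_with_lock(key, value, write_db, ttl=10):
    """Serialize 'write DB then write cache' per key with a best-effort Redis lock."""
    lock_key = f"lock:{key}"
    token = str(uuid.uuid4())
    # Acquire: SET lock:key token NX EX ttl
    if not r.set(lock_key, token, nx=True, ex=ttl):
        raise RuntimeError("could not acquire lock, retry later")
    try:
        write_db(key, value)        # write to the DB first
        r.set(key, value)           # then update the cache while still holding the lock
    finally:
        # Best-effort release; the get/compare/delete below is not atomic,
        # so a real implementation should do it in a Lua script.
        if r.get(lock_key) == token.encode():
            r.delete(lock_key)
```

With the lock held across both writes, two concurrent writers can no longer interleave as in the timeline above; the failure case from the list (cache write fails after the DB write succeeded) still needs a retry, or a short TTL on the cache entry as a backstop.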
Any Other Solutions?
Group Requests by Server
Make sure that read/write requests for the same entity fall on the same server/connection.
- E.g. GetUser with UserId=123 will always go to server 123
- OK as long as your traffic is evenly distributed; otherwise, it might create hot spots
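One simple way to implement this routing is to hash the entity key onto a fixed list of servers; here is a sketch (the server names are made up, and a real deployment would more likely use consistent hashing so that resizing the pool does not reshuffle every key):

```python
import hashlib

SERVERS = ["cache-0", "cache-1", "cache-2"]   # illustrative pool

def pick_server(entity_id: str) -> str:
    """Route every request for the same entity to the same server."""
    digest = hashlib.md5(entity_id.encode()).hexdigest()
    return SERVERS[int(digest, 16) % len(SERVERS)]

print(pick_server("user:123"))   # always the same server for user 123
print(pick_server("user:456"))   # a possibly different, but stable, server
```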
Synchronous Replication
- Use database tools that support synchronous replication
- Prevents the slave-delay problems that cause wrong-cache issues
  - MySQL Cluster
  - MySQL Group Replication
- Performance might suffer
More Complex Solution - Mark Stale Data
Strategy used at Facebook
- App1 deletes x in cache and sets flag rx on cache to indicate stale data
- App1 writes to master
- App2 tries to read x from cache.
- If rx is present, it indicates that x has just been updated, so App2 reads from the master instead.
- App2 writes the new value to cache
- When replication completes on the slave, x and the flag rx are deleted from the cache
This solves both the wrong-cache and the slave-delay problems, because recently updated data is read directly from the master.
As a result, however, the master node may become a bottleneck.
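Here is a sketch of that marker flow under the same in-memory model used earlier; rx is the stale-data flag from the steps above, and the function names are illustrative rather than Facebook's actual API.

```python
master = {"x": 1}
slave = {"x": 1}       # lags behind the master
cache = {"x": 1}

def write(key, value):
    cache.pop(key, None)              # App1 deletes x in cache...
    cache["r" + key] = 1              # ...and sets flag rx to mark the data as stale
    master[key] = value               # App1 writes to master

def read(key):
    if ("r" + key) in cache:          # rx present: x has just been updated,
        value = master[key]           # so read from the master instead of a slave
    elif key in cache:
        return cache[key]
    else:
        value = slave[key]
    cache[key] = value                # write the fresh value back to cache
    return value

def on_replication_complete(key):
    slave[key] = master[key]          # slave catches up...
    cache.pop(key, None)              # ...then x and the flag rx are deleted from cache
    cache.pop("r" + key, None)

write("x", 2)
print(read("x"))                      # 2, served from the master while the flag is set
on_replication_complete("x")
print(read("x"))                      # 2, now safe to serve from the cache/slave again
```

Note that while the flag is set, every read of x goes to the master, which is exactly why the master can become the bottleneck mentioned above.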
Reference
- https://www.usenix.org/system/files/conference/nsdi13/nsdi13-final170_update.pdf
- https://mp.weixin.qq.com/s?__biz=MjM5ODYxMDA5OQ==&mid=404087915&idx=1&sn=075664193f334874a3fc87fd4f712ebc&scene=21#wechat_redirect
- https://mp.weixin.qq.com/s?__biz=MjM5ODYxMDA5OQ==&mid=404202261&idx=1&sn=1b8254ba5013952923bdc21e0579108e&scene=21