Search…
⌃K
Links

Storage and Redundancy

This section describes the different storage and redundancy options

Introduction

Data redundancy in the field of streaming can be a bit misleading. As written on the station page, in message brokers, data is not preserved for an infinite time but for a defined period based on certain conditions like ingested time, size, and the number of messages within a station.
When data resides in the broker, it will be redundant and removed only when crossing the defined retention policy.

The object behind the station - Stream

Each station implements a stream object that contains the messages stored in the station. It is up to the user to define which type of storage will this stream object be saved.

Storage tiering

Memphis offers a range of storage types that you can choose from based on your workload's data access, resiliency, frequency, and cost requirements, and configured per station.

Tier 1 (Hot storage)

The first type of storage each message will initially be stored at.
The options are Memory or Disk. Each with its strengths and weaknesses.
  • Memory. For faster performance. Due to its nature as a volatile type of storage, the risk of losing data in case of failure is higher because it resides in the broker's memory, and in the case of a station without configured replicas, data can be lost.
Stream object as it construct and stored
Ack process
  • Disk. For higher availability. Disk storage might be slower than memory, but it offers greater availability and resiliency to broker failures.
Ack process

Tier 2 (Cold storage)

The common pattern of message brokers is to delete messages after passing the defined retention policy, like time/size/number of messages.
Memphis offers a 2nd storage tier for longer, possibly infinite retention for stored messages.
Each message that expels from the station will automatically migrate to the 2nd storage tier.
  • S3 (Object storage) Built to store and retrieve any amount of data from anywhere using S3 protocol. Object storage offers different storage classes with different costs and performance requirements. Popular S3-based storage providers are: AWS S3, MinIO, IBM Cloud Object Storage, and more.