
Can Ceph clients connect to their local node?


I have an idea for an application that I'd like to build and one of the requirements is a globally replicated filesystem. Things like Ceph and GlusterFS exist, but I'm not sure they meet my particular use case.

  • Let's say I have 3 app servers in 3 different regions [US, Europe, Asia]
  • Then I have a 3 node Ceph setup with 1 node in each of those regions [US, Europe, Asia]
  • Can I have each app server connect directly to the Ceph node in their region or do I have to go through some centralized orchestration node?

I ask because I want to keep file system latency to a minimum and just use Ceph to synchronize changes between all the nodes. If I can't connect directly to the "local" node, I think latency would be quite high.

Any help understanding this would be greatly appreciated!

GeraldSchneider: Ceph needs a pretty low latency between nodes. It is doubtful that you will be able to reach such a low latency on intercontinental links.

Jack Slingerland: Ah, that's too bad, but it definitely makes sense. Thanks for the insight, @GeraldSchneider.

It depends on the type of data access: Ceph can store data as block devices (RBD), as an S3 object store (RGW), or as a filesystem (CephFS). I assume CephFS here, since you mentioned Ceph alongside GlusterFS and both provide a filesystem abstraction.

In a three-node configuration, Ceph would run one or more OSD daemons at each site (one per disk drive). Data is striped across the OSDs in the cluster, and your CephFS client (kernel, FUSE, or Windows) computes algorithmically which node to store data on; no gateway is needed. The full mechanism is too long to explain here, but it is essentially a distributed hash table mapping, with additional state (the cluster map) kept server-side by the MON daemons.
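
To make that direct data path concrete, here is a minimal sketch using the Python librados bindings (python3-rados). The conf path and the pool name "mypool" are placeholders, not something from your setup: the client only needs the MON addresses from ceph.conf to fetch the cluster map, and the reads and writes then go straight to the OSDs.

    # Minimal sketch of direct client-to-OSD access via librados (python3-rados).
    # The conf path and pool name are placeholders for this example.
    import rados

    cluster = rados.Rados(conffile='/etc/ceph/ceph.conf')   # MON addresses come from ceph.conf
    cluster.connect()                                       # contacts the MONs to fetch the cluster map

    ioctx = cluster.open_ioctx('mypool')                    # 'mypool' is a placeholder pool name
    ioctx.write_full('demo-object', b'written straight to the OSDs')
    print(ioctx.read('demo-object'))                        # the read also goes directly to the OSDs

    ioctx.close()
    cluster.shutdown()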

The CephFS data path is direct, from your client to the OSDs, with no gateway interposed.

The filesystem uses an additional daemon type, the MDS, which stores your filesystem metadata. Metadata operations (e.g. creating a directory) go to the MDS instead of the OSDs.
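
If you want to see the metadata path in isolation, here is a similar sketch using the Python libcephfs bindings (python3-cephfs); again, the conf path and directory name are just placeholders. Creating and stat'ing a directory is served by the MDS, while any file data you subsequently write would go to the OSDs.

    # Minimal sketch of metadata operations through libcephfs (python3-cephfs).
    # The conf path and directory name are placeholders for this example.
    import cephfs

    fs = cephfs.LibCephFS(conffile='/etc/ceph/ceph.conf')   # MONs from ceph.conf, as before
    fs.mount()                                              # opens an MDS session for metadata

    fs.mkdir('/demo-dir', 0o755)                            # directory create -> handled by the MDS
    print(fs.stat('/demo-dir'))                             # metadata read    -> also served by the MDS

    fs.unmount()
    fs.shutdown()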

However, and this is specific to your intended use case: Ceph is a synchronous storage system, and its performance declines the farther apart you stretch the nodes. It is generally recommended to keep a stretched configuration within about 10 ms of round-trip latency between nodes. In other words, Ceph clusters like to live in one datacenter, but you can stretch them across a city or a small country if you have very good links.
