Score:2

ceph stuck in active+remapped+backfill_toofull after lvextend an OSD's volume


I have a two-OSD Ceph cluster. The initial size of the backing volumes was 16GB. I shut the OSDs down, ran `lvextend` on both, and started the OSDs again. Now `ceph osd df` shows:

ceph osd df

But `ceph -s` shows 50 PGs stuck in active+remapped+backfill_toofull: ceph -s

I tried to understand the mechanism by reading about the CRUSH algorithm, but it seems a lot of effort and background knowledge is required. I would appreciate it if anyone could explain this behaviour (why the cluster is stuck in toofull even though free space increased significantly) and help me resolve this state.

Martian2020: do you have `ceph osd df` output from before you "did a lvextend on both" and the free space "increased significantly"?

Martian2020: and I might say 90% use is high.

Ahmad Ahmadi: If I understand you correctly, yes, the cluster had data, and the data is still available to clients.

Martian2020: No, you wrote "Now ceph osd df shows:". I meant the output of `ceph osd df` as in your question, but from before you "did a lvextend on both".

Ahmad Ahmadi: No, unfortunately.

Martian2020: Why did you do `lvextend`? What metrics pointed you to that? Also, maybe you recall some numbers, like %USE before.

Ahmad Ahmadi: The usage was about 11GB of 32GB. I wanted to increase the size of the cluster, so I did `lvextend`, apparently the wrong way.

Ahmad Ahmadi: Let us [continue this discussion in chat](https://chat.stackexchange.com/rooms/131964/discussion-between-ahmad-ahmadi-and-martian2020).
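For reference, growing a BlueStore OSD after an `lvextend` normally also requires telling BlueStore about the new device size. A sketch of the procedure, assuming OSD id 0 and the default data path (the VG/LV names are hypothetical — adjust everything for your cluster):

```shell
# Stop the OSD before touching its backing device (assumed id: 0).
systemctl stop ceph-osd@0

# Grow the backing LV first (hypothetical VG/LV names):
lvextend -L +16G /dev/ceph-vg/osd-0

# Tell BlueStore to expand onto the newly available space:
ceph-bluestore-tool bluefs-bdev-expand --path /var/lib/ceph/osd/ceph-0

# Bring the OSD back up.
systemctl start ceph-osd@0
```

Without the `bluefs-bdev-expand` step, BlueStore may keep using the old device size even though the LV itself is larger.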
Score:0
ec flag

Your RAW USE is several times larger than your DATA. Note: I haven't tried this solution myself; it is just what I've found.

Re: Raw use 10 times higher than data use

Probably the first thing to check is if you have objects that are under the min_alloc size. Those objects will result in wasted space as they will use the full min_alloc size.

Similar advice: https://stackoverflow.com/questions/68185503/ceph-df-octopus-shows-used-is-7-times-higher-than-stored-in-erasure-coded-pool/68186461#68186461

This is related to bluestore_min_alloc_size_hdd=64K (default on Octopus).

If you are using Erasure Coding, data is broken up into smaller chunks, each of which takes a full 64K on disk.
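To see why small objects inflate RAW USE, here is a minimal sketch (my own illustration, not Ceph code) of the round-up-to-min_alloc effect, using the 64 KiB Octopus default for `bluestore_min_alloc_size_hdd`:

```python
# Model of BlueStore allocation rounding: every object (or EC chunk)
# consumes a whole number of min_alloc-sized units on disk.
MIN_ALLOC = 64 * 1024  # bytes; bluestore_min_alloc_size_hdd default on Octopus

def on_disk_size(object_size: int, min_alloc: int = MIN_ALLOC) -> int:
    """Space consumed on disk: object size rounded up to a multiple of min_alloc."""
    if object_size == 0:
        return 0
    blocks = -(-object_size // min_alloc)  # ceiling division
    return blocks * min_alloc

# Example: 1000 objects of 4 KiB each hold ~3.9 MiB of data
# but consume ~62.5 MiB of raw space: a 16x inflation.
data = 1000 * 4 * 1024
raw = 1000 * on_disk_size(4 * 1024)
print(data, raw, raw / data)  # → 4096000 65536000 16.0
```

This is why a pool full of sub-64K objects (or EC chunks) can show RAW USE many times higher than DATA.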

Another way to try to fix the problem is to follow the instructions shown on your second screenshot:

add storage if this doesn't resolve itself
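If the PGs stay in backfill_toofull even though free space is available, the usual knobs to inspect are the OSD full ratios. A sketch of the relevant commands (the 0.95 value is illustrative, not a recommendation — revert it once the cluster is healthy):

```shell
# Show the current full / backfillfull / nearfull ratios:
ceph osd dump | grep ratio

# Show per-OSD size, use, and %USE to confirm the new capacity is visible:
ceph osd df

# Temporarily raise the backfillfull threshold so backfill can proceed
# (illustrative value; lower it back afterwards):
ceph osd set-backfillfull-ratio 0.95
</imports>
```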
