ceph - 新的 ceph 集群保持 pg 过小+对等且 rbd ls 卡住
问题描述
我用 1mon 1mds 1mgr 和 15osd 创建了一个新的 ceph 集群。设置后一切正常,但 pg 状态保持过小+对等。所有磁盘都是新安装的独立 XFS,大小范围从 3TB 到 4TB,没有分区表。所有 osd 日志显示没有任何用处。
这是我的ceph -s
日志:
cluster:
id: 19e50b60-31b0-467a-8ea9-6c37742a1f77
health: HEALTH_WARN
Reduced data availability: 8 pgs inactive
Degraded data redundancy: 8 pgs undersized
1 monitors have not enabled msgr2
services:
mon: 1 daemons, quorum wuminghan-K600-1G (age 25m)
mgr: wuminghan-K600-1G(active, since 24m)
osd: 15 osds: 15 up (since 23m), 15 in (since 8h)
data:
pools: 1 pools, 8 pgs
objects: 0 objects, 0 B
usage: 15 GiB used, 135 GiB / 150 GiB avail
pgs: 100.000% pgs not active
8 undersized+peered
这是我的ceph.conf
:
[global]
fsid = 19e50b60-31b0-467a-8ea9-6c37742a1f77
mon initial members = wuminghan-K600-1G
mon host = 192.168.0.237
public network = 192.168.0.0/24
auth cluster required = cephx
auth service required = cephx
auth client required = cephx
osd journal size = 1024
osd pool default size = 1
osd pool default min size = 1
osd pool default pg num = 333
osd pool default pgp num = 333
osd crush chooseleaf type = 1
mon allow pool delete = true
[mgr.wuminghan-K600-1G]
host = wuminghan-K600-1G
[mds.wuminghan-K600-1G]
host = wuminghan-K600-1G
[osd.0]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2112b4c78
debug osd = 20
debug filestore = 20
[osd.1]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee26601e571
debug osd = 20
debug filestore = 20
[osd.2]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee266717eb3
debug osd = 20
debug filestore = 20
[osd.3]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee266806f32
debug osd = 20
debug filestore = 20
[osd.4]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee266808610
debug osd = 20
debug filestore = 20
[osd.5]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee266808651
debug osd = 20
debug filestore = 20
[osd.6]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee266808b36
debug osd = 20
debug filestore = 20
[osd.7]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bb578f3e
debug osd = 20
debug filestore = 20
[osd.8]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bb57915c
debug osd = 20
debug filestore = 20
[osd.9]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bbc75bb4
debug osd = 20
debug filestore = 20
[osd.10]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bbd63771
debug osd = 20
debug filestore = 20
[osd.11]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bbd63795
debug osd = 20
debug filestore = 20
[osd.12]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bbd64ee9
debug osd = 20
debug filestore = 20
[osd.13]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bbd64fe8
debug osd = 20
debug filestore = 20
[osd.14]
host = wuminghan-K600-1G
devs = /dev/disk/by-id/wwn-0x50014ee2bbd65340
debug osd = 20
debug filestore = 20
每一步之后都是手动设置ceph官方文档。之前也尝试过使用 ceph-deploy 进行设置,但没有成功。
可以成功创建池。但是运行rbd ls
或rbd pool init rbd
将永远挂起而没有输出。
解决方案
推荐阅读
- javascript - 程序不在 JavaScript 中枚举
- javascript - 从特定列中删除排序
- arrays - 解析 JSON 数组的第一个元素(从谷歌表格获取数据)
- angular - Angular 显示默认页面。怎么修?
- reactjs - 如何使用 reactjs 将值存储在组件外部的一个变量中
- php - 尝试在 localhost php 中发送电子邮件时出错
- python - CVXPY 是否支持 trace(XT@A@X),其中 X、A 都是矩阵?
- javascript - 如何解决javascript中的异步等待问题?
- javascript - 从 promise 导出变量然后阻塞
- node.js - 谷歌语音到文本无法在 nodejs 上运行