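The cluster has two nodes, both running Elasticsearch 5.4.3.
node-5 has 4 cores and 8 GB of RAM, with a relatively small disk.
node-6 has 8 cores and 16 GB of RAM, with a larger disk.
Indices are split by date, with hot/cold separation. I now need to move the old cold data to the other node, the one with the larger disk, but manually relocating a shard fails and the shard never arrives. The shard is not large, only about 15 GB. At a network transfer rate of 40mb, the network transfer part finished after roughly half an hour; after that the recovery just hung without completing, and it finally failed with "no activity after [30m]".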
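As a rough sanity check on the numbers above, here is a small back-of-the-envelope calculation. It assumes the shard is exactly 15 GiB and that the copy phase took exactly 30 minutes; the post does not say whether "40mb" means megabytes or megabits per second, so both readings are computed. The variable and function names are only illustrative.

```python
# Back-of-the-envelope transfer-time estimate for the shard described above.
# Assumptions (not confirmed by the post): 15 GiB shard, 30 min copy phase.
GIB = 1024 ** 3
MIB = 1024 ** 2

shard_bytes = 15 * GIB


def transfer_minutes(size_bytes, rate_bytes_per_sec):
    """Minutes needed to copy size_bytes at a constant rate."""
    return size_bytes / rate_bytes_per_sec / 60


# Reading 1: "40mb" means 40 megabytes per second.
minutes_at_40_mbytes = transfer_minutes(shard_bytes, 40 * MIB)

# Reading 2: "40mb" means 40 megabits per second (8 bits per byte).
minutes_at_40_mbits = transfer_minutes(shard_bytes, 40 * MIB / 8)

# Effective rate implied by "about half an hour" for the copy phase.
observed_rate_mib = shard_bytes / (30 * 60) / MIB

print(f"40 MB/s: {minutes_at_40_mbytes:.1f} min")
print(f"40 Mb/s: {minutes_at_40_mbits:.1f} min")
print(f"implied rate: {observed_rate_mib:.2f} MiB/s")
```

Under these assumptions neither reading gives exactly half an hour, so the figures in the post are best treated as approximate.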
POST /_cluster/reroute
{
  "commands": [
    {
      "move": {
        "index": "userbehaviorportrait_201905",
        "shard": 1,
        "from_node": "node-5",
        "to_node": "node-6"
      }
    }
  ]
}
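For anyone scripting the same move for several shards, the request body above could be generated roughly like this. This is only a sketch: `build_move_command` is a hypothetical helper name, and the code just builds and prints the JSON without contacting a cluster.

```python
import json


def build_move_command(index, shard, from_node, to_node):
    """Build a /_cluster/reroute body with a single "move" command.

    Hypothetical helper; it mirrors the request shown in the post.
    """
    return {
        "commands": [
            {
                "move": {
                    "index": index,
                    "shard": shard,
                    "from_node": from_node,
                    "to_node": to_node,
                }
            }
        ]
    }


body = build_move_command("userbehaviorportrait_201905", 1, "node-5", "node-6")

# The printed JSON can be sent with POST /_cluster/reroute.
print(json.dumps(body, indent=2))
```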
[WARN ][o.e.i.c.IndicesClusterStateService] [node-6] [[userbehaviorportrait_201905][1]] marking and sending shard failed due to [failed recovery]
org.elasticsearch.indices.recovery.RecoveryFailedException: [userbehaviorportrait_201905][1]: Recovery failed from {node-5}{ABnr-JZNRbStmmhg_ujWRA}{Anx5G5cGR4OyWniKdI7xlg}{xxx.xxx.xxx.xxx}{xxx.xxx.xxx.xxx} into {node-6}{cxxx.xxx.xxx.xxx}{xxx.xxx.xxx.xxx}{xxx.xxx.xxx.xxx} (no activity after [30m])
at org.elasticsearch.indices.recovery.RecoveriesCollection$RecoveryMonitor.doRun(RecoveriesCollection.java:288) [elasticsearch-5.4.3.jar:5.4.3]
at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:638) [elasticsearch-5.4.3.jar:5.4.3]
at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) [elasticsearch-5.4.3.jar:5.4.3]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_91]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_91]
at java.lang.Thread.run(Thread.java:745) [?:1.8.0_91]
Caused by: org.elasticsearch.ElasticsearchTimeoutException: no activity after [30m]
... 6 more
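The key facts in the trace above are the index, the shard number, the source and target nodes, and the failure reason. A minimal sketch for pulling them out of the `RecoveryFailedException` line follows; the regular expression is written against the exact line in this post and is an assumption about the message layout, not an official log format.

```python
import re

# The exception line from the post, with addresses masked as in the original.
LOG_LINE = (
    "org.elasticsearch.indices.recovery.RecoveryFailedException: "
    "[userbehaviorportrait_201905][1]: Recovery failed from "
    "{node-5}{ABnr-JZNRbStmmhg_ujWRA}{Anx5G5cGR4OyWniKdI7xlg}"
    "{xxx.xxx.xxx.xxx}{xxx.xxx.xxx.xxx} "
    "into {node-6}{cxxx.xxx.xxx.xxx}{xxx.xxx.xxx.xxx}{xxx.xxx.xxx.xxx} "
    "(no activity after [30m])"
)

# Pattern derived from the line above (an assumption, not a documented format).
PATTERN = re.compile(
    r"\[(?P<index>[^\]]+)\]\[(?P<shard>\d+)\]: Recovery failed from "
    r"\{(?P<source>[^}]+)\}.* into \{(?P<target>[^}]+)\}.*"
    r"\((?P<reason>[^)]*)\)"
)


def parse_recovery_failure(line):
    """Return index, shard, source, target and reason, or None if no match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    info = match.groupdict()
    info["shard"] = int(info["shard"])
    return info


info = parse_recovery_failure(LOG_LINE)
print(info)
```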
Could any expert here offer some pointers?
1 reply
haloom