nhc:LBNL节点运行状况检查

上传者: 42128015 | 上传时间: 2023-03-23 16:14:19 | 文件大小: 141KB | 文件类型: ZIP
LBNL节点运行状况检查(NHC) TORQUE,Slurm和其他调度程序/资源管理器提供了对每个计算节点执行的定期“节点运行状况检查”,以验证该节点是否正常运行。 可以将确定为“不正常”的节点标记为“已关闭”或“脱机”,以防止计划作业或在其上运行作业。 通过减少由于配置错误,硬件故障等导致的可预防的作业故障,这有助于提高群集的可靠性和吞吐量。 尽管许多站点都创建了自己的脚本来实现此功能,但绝大多数站点都是一次性的工作,很少关注扩展性,灵活性,可靠性,速度或重用性。 开发人员创建了这个项目,以试图改变这一状况。 LBNL节点运行状况检查(NHC)具有多种设计功能,使其与大多数本地解决方案区分开来: 可靠-为了防止单线程脚本执行导致挂起,将子命令的执行保持在绝对最低限度,并且如果检查时间过长,则使用看门狗计时器终止检查。 快速-几乎完全以本机bash (2.x或更高版本)实施。 减少

文件下载

资源详情

[{"title":"( 47 个子文件 141KB ) nhc:LBNL节点运行状况检查","children":[{"title":"nhc-master","children":[{"title":".gitignore <span style='color:#111;'> 157B </span>","children":null,"spread":false},{"title":"COPYING <span style='color:#111;'> 31B </span>","children":null,"spread":false},{"title":"bench","children":[{"title":"nhc-bench <span style='color:#111;'> 478B </span>","children":null,"spread":false},{"title":"Makefile.am <span style='color:#111;'> 109B </span>","children":null,"spread":false}],"spread":true},{"title":"README.md <span style='color:#111;'> 104.90KB </span>","children":null,"spread":false},{"title":"nhc.logrotate <span style='color:#111;'> 95B </span>","children":null,"spread":false},{"title":"test","children":[{"title":"test_lbnl_fs.nhc <span style='color:#111;'> 12.45KB </span>","children":null,"spread":false},{"title":"test_lbnl_hw.nhc <span style='color:#111;'> 10.88KB </span>","children":null,"spread":false},{"title":"test_lbnl_job.nhc <span style='color:#111;'> 322B </span>","children":null,"spread":false},{"title":"nhc-test <span style='color:#111;'> 7.36KB </span>","children":null,"spread":false},{"title":"test_lbnl_moab.nhc <span style='color:#111;'> 343B </span>","children":null,"spread":false},{"title":"test_lbnl_dmi.nhc <span style='color:#111;'> 12.47KB </span>","children":null,"spread":false},{"title":"test_common.nhc <span style='color:#111;'> 13.72KB </span>","children":null,"spread":false},{"title":"test_lbnl_cmd.nhc <span style='color:#111;'> 2.19KB </span>","children":null,"spread":false},{"title":"test_lbnl_file.nhc <span style='color:#111;'> 10.44KB </span>","children":null,"spread":false},{"title":"test_lbnl_ps.nhc <span style='color:#111;'> 20.33KB </span>","children":null,"spread":false},{"title":"test_zzz_bash_sanity.nhc <span style='color:#111;'> 744B </span>","children":null,"spread":false},{"title":"test_lbnl_net.nhc <span style='color:#111;'> 8.31KB </span>","children":null,"spread":false},{"title":"test_lbnl_nv.nhc <span style='color:#111;'> 4.21KB </span>","children":null,"spread":false},{"title":"Makefile.am <span style='color:#111;'> 506B </span>","children":null,"spread":false},{"title":"shut.inc.sh <span style='color:#111;'> 5.03KB </span>","children":null,"spread":false}],"spread":false},{"title":"LICENSE <span style='color:#111;'> 2.45KB </span>","children":null,"spread":false},{"title":"ChangeLog <span style='color:#111;'> 49.77KB </span>","children":null,"spread":false},{"title":"nhc-test.conf <span style='color:#111;'> 30B </span>","children":null,"spread":false},{"title":"nhc-wrapper <span style='color:#111;'> 12.50KB </span>","children":null,"spread":false},{"title":"autogen.sh <span style='color:#111;'> 216B </span>","children":null,"spread":false},{"title":"contrib","children":[{"title":"nhc.cron <span style='color:#111;'> 756B </span>","children":null,"spread":false}],"spread":true},{"title":"nhc <span style='color:#111;'> 23.13KB </span>","children":null,"spread":false},{"title":"configure.ac <span style='color:#111;'> 952B </span>","children":null,"spread":false},{"title":"RELEASE_NOTES.txt <span style='color:#111;'> 2.38KB </span>","children":null,"spread":false},{"title":"nhc.conf <span style='color:#111;'> 6.19KB </span>","children":null,"spread":false},{"title":"scripts","children":[{"title":"lbnl_nv.nhc <span style='color:#111;'> 1.22KB </span>","children":null,"spread":false},{"title":"lbnl_cmd.nhc <span style='color:#111;'> 8.09KB </span>","children":null,"spread":false},{"title":"lbnl_job.nhc <span style='color:#111;'> 3.72KB </span>","children":null,"spread":false},{"title":"lbnl_hw.nhc <span style='color:#111;'> 14.80KB </span>","children":null,"spread":false},{"title":"lbnl_net.nhc <span style='color:#111;'> 12.91KB </span>","children":null,"spread":false},{"title":"common.nhc <span style='color:#111;'> 19.88KB </span>","children":null,"spread":false},{"title":"lbnl_fs.nhc <span style='color:#111;'> 19.24KB </span>","children":null,"spread":false},{"title":"lbnl_moab.nhc <span style='color:#111;'> 4.72KB </span>","children":null,"spread":false},{"title":"lbnl_ps.nhc <span style='color:#111;'> 31.01KB </span>","children":null,"spread":false},{"title":"lbnl_file.nhc <span style='color:#111;'> 13.70KB </span>","children":null,"spread":false},{"title":"lbnl_dmi.nhc <span style='color:#111;'> 7.71KB </span>","children":null,"spread":false}],"spread":false},{"title":"helpers","children":[{"title":"node-mark-online <span style='color:#111;'> 5.28KB </span>","children":null,"spread":false},{"title":"node-mark-offline <span style='color:#111;'> 5.26KB </span>","children":null,"spread":false}],"spread":false},{"title":"lbnl-nhc.spec.in <span style='color:#111;'> 2.53KB </span>","children":null,"spread":false},{"title":"nhc-genconf <span style='color:#111;'> 16.07KB </span>","children":null,"spread":false},{"title":"Makefile.am <span style='color:#111;'> 1.40KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明