Machine Learning Techniques to Perform Predictive Analytics of Task Queues Guided by Slurmстатья

Информация о цитировании статьи получена из Scopus, Web of Science
Дата последнего поиска статьи во внешних источниках: 25 апреля 2019 г.

Работа с статьей


[1] Rezaei M., Salnikov A. Machine learning techniques to perform predictive analytics of task queues guided by slurm // 2018 Global Smart Industry Conference (GloSIC). — IEEE, 2018. — P. 1–6. Dealing with resource allocation is one of the most critical problems in high performance computing (HPC). The jobs, which cannot get enough resources, are most likely destined to fail. In this paper we suggest a new approach to predict whether the demanded CPUs and time slots for jobs are sufficient. To do so, we train a machine learning (ML) system, based on the collection of statistical data from the reference queue systems. Our ML predicts required resources for jobs at the time of job submission so that jobs won’t fail due to the lack of resources. This machine learning uses supervised learning and it includes regression and classification tasks. Our results show that the accuracy of prediction is highly associated with prior information before submitting jobs. This information can be used to train our machine learning system better than before. [ DOI ]

Публикация в формате сохранить в файл сохранить в файл сохранить в файл сохранить в файл сохранить в файл сохранить в файл скрыть