A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2021; you can also visit the original URL.
The file type is application/pdf
.
Proactive Container Auto-scaling for Cloud Native Machine Learning Services
2020
2020 IEEE 13th International Conference on Cloud Computing (CLOUD)
Understanding the resource usage behaviors of the ever-increasing machine learning workloads are critical to cloud providers offering Machine Learning (ML) services. Capable of auto-scaling resources for customer workloads can significantly improve resource utilization, thus greatly reducing the cost. Here we leverage the AI4DL framework [1] to characterize workload and discover resource consumption phases. We advance the existing technology to an incremental phase discovery method that applies
doi:10.1109/cloud49709.2020.00070
fatcat:nylyu5iz3nhsvpk6rv7y25wype