We consider data analytics workloads on distributed architectures, in particular clusters of commodity machines. To find a job partitioning that minimizes running time, a cost model, which we more accurately refer to as makespan model, is needed. In attempting to find the simplest possible, but sufficiently accurate, such model, we explore piecewise linear functions of input, output, and computational complexity. They are abstract in the sense that they capture fundamental algorithm properties, but do not require explicit modeling of system and implementation details such as the number of disk accesses. We show how the simplified functional structure can be exploited by directly integrating the model into the makespan optimization process, reducing complexity by orders of magnitude. Experimental results provide evidence of good prediction quality and successful makespan optimization across a variety of cluster architectures.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6289186 | PMC |
http://dx.doi.org/10.1007/978-3-319-64283-3_11 | DOI Listing |
Sci Rep
December 2024
Department of computer science and applications, Maharshi Dayanand University, Rohtak, India.
PLoS One
November 2024
Department of Mathematics and Statistics, College of Science, Taif University, Taif, Saudi Arabia.
As a new computing resources distribution platform, cloud technology greatly influenced society with the conception of on-demand resource usage through virtualization technology. Virtualization technology allows physical resource usage in a way that will enable multiple end-users to have similar hardware infrastructure. In the cloud, many challenges exist on the provider side due to the expectations of clients.
View Article and Find Full Text PDFSci Rep
November 2024
Hubei Key Laboratory of Hydroelectric Machinery Design & Maintenance, China Three Gorges University, Yichang, Hubei, China.
Sci Rep
October 2024
Department of Software Engineering, College of Computing, Umm Al-Qura University, Makkah, 21961, Saudi Arabia.
Efficient scheduling of scientific workflows in hybrid cloud-edge environments is crucial for optimizing resource utilization and minimizing completion time. In this study, we evaluate various scheduling algorithms, emphasizing the Modified Firefly Optimization Algorithm (ModFOA) and comparing it with established methods such as Ant Colony Optimization (ACO), Genetic Algorithm (GA), and Particle Swarm Optimization (PSO). We investigate key performance metrics, including makespan, resource utilization, and energy consumption, across both cloud and edge configurations.
View Article and Find Full Text PDFNetwork
October 2024
Department of Computer Engineering, Dr. D. Y. Patil Institute of Technology, Pune, Pimpri-Chinchwad, India.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!