AI Article Synopsis

  • This research focuses on optimizing data analytics workloads in distributed systems, specifically clusters of standard machines.
  • It introduces a simplified makespan model, leveraging piecewise linear functions to streamline job partitioning without getting bogged down in complex system details.
  • Experimental results demonstrate that this approach significantly improves prediction accuracy and enhances makespan optimization for various cluster setups.

Article Abstract

We consider data analytics workloads on distributed architectures, in particular clusters of commodity machines. To find a job partitioning that minimizes running time, a cost model, which we more accurately refer to as makespan model, is needed. In attempting to find the simplest possible, but sufficiently accurate, such model, we explore piecewise linear functions of input, output, and computational complexity. They are abstract in the sense that they capture fundamental algorithm properties, but do not require explicit modeling of system and implementation details such as the number of disk accesses. We show how the simplified functional structure can be exploited by directly integrating the model into the makespan optimization process, reducing complexity by orders of magnitude. Experimental results provide evidence of good prediction quality and successful makespan optimization across a variety of cluster architectures.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6289186PMC
http://dx.doi.org/10.1007/978-3-319-64283-3_11DOI Listing

Publication Analysis

Top Keywords

makespan optimization
8
case abstract
4
abstract cost
4
cost models
4
models distributed
4
distributed execution
4
execution analytics
4
analytics operators
4
operators consider
4
consider data
4

Similar Publications

Article Synopsis
  • * DMCS enhances efficiency by quickly processing time-sensitive tasks on fog nodes and handling resource-heavy computations in the cloud, leading to reduced costs and energy use.
  • * While effective, DMCS faces challenges like computational overhead and scalability, with future enhancements focusing on integrating machine learning and expanding its application in various computing environments.
View Article and Find Full Text PDF

As a new computing resources distribution platform, cloud technology greatly influenced society with the conception of on-demand resource usage through virtualization technology. Virtualization technology allows physical resource usage in a way that will enable multiple end-users to have similar hardware infrastructure. In the cloud, many challenges exist on the provider side due to the expectations of clients.

View Article and Find Full Text PDF

Research on low carbon welding scheduling based on production process.

Sci Rep

November 2024

Hubei Key Laboratory of Hydroelectric Machinery Design & Maintenance, China Three Gorges University, Yichang, Hubei, China.

Article Synopsis
  • The study addresses the high energy consumption of welding workshops and aims to reduce carbon emissions during the production process by developing a bi-objective scheduling model that minimizes both completion time and carbon output.
  • An improved Grey Wolf Optimizer (IGWO) is introduced to tackle the multi-objective scheduling problem, incorporating strategies to enhance diversity, convergence, and local optimization.
  • The results demonstrated that the IGWO algorithm significantly lowered carbon emissions to 3.85E + 05 while reducing the completion time to 842.14, outperforming other algorithms like NSGA-II and GWO.
View Article and Find Full Text PDF

Enhancing workflow efficiency with a modified Firefly Algorithm for hybrid cloud edge environments.

Sci Rep

October 2024

Department of Software Engineering, College of Computing, Umm Al-Qura University, Makkah, 21961, Saudi Arabia.

Efficient scheduling of scientific workflows in hybrid cloud-edge environments is crucial for optimizing resource utilization and minimizing completion time. In this study, we evaluate various scheduling algorithms, emphasizing the Modified Firefly Optimization Algorithm (ModFOA) and comparing it with established methods such as Ant Colony Optimization (ACO), Genetic Algorithm (GA), and Particle Swarm Optimization (PSO). We investigate key performance metrics, including makespan, resource utilization, and energy consumption, across both cloud and edge configurations.

View Article and Find Full Text PDF
Article Synopsis
  • A new way to use cloud resources better can help save money and reduce waste.
  • Regular methods for planning cloud resources don't work as well with the fast-paced changes in managing assets.
  • A new algorithm, called Hybrid Lemurs-based Gannet Optimization Algorithm (HL-GOA), was created to make scheduling tasks and placing Virtual Machines easier and more efficient.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!