Modern Data-Intensive Scalable Computing (DISC) systems are designed to process data through batch jobs that execute programs (e.g., queries) compiled from a high-level language. These programs are often developed interactively by posing ad-hoc queries over the base data until a desired result is generated. We observe that there can be significant overlap in the structure of these queries used to derive the final program. Yet, each successive execution of a slightly modified query is performed anew, which can significantly increase the development cycle. Vega is an Apache Spark framework that we have implemented for optimizing a series of similar Spark programs, likely originating from a development or exploratory data analysis session. Spark developers (e.g., data scientists) can leverage Vega to significantly reduce the amount of time it takes to re-execute a modified Spark program, reducing the overall time to market for their Big Data applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5386325PMC
http://dx.doi.org/10.1145/2987550.2987565DOI Listing

Publication Analysis

Top Keywords

data
5
optimizing interactive
4
interactive development
4
development data-intensive
4
data-intensive applications
4
applications modern
4
modern data-intensive
4
data-intensive scalable
4
scalable computing
4
computing disc
4

Similar Publications

A Japanese woman with Li-Fraumeni syndrome in her 40s underwent comprehensive genetic profiling accompanied by germline data using the Oncoguide NCC Oncopanel, but no germline pathogenic variants in the tumor suppressor gene TP53 were detected. However, careful examination of additional data in the report suggested the presence of a large TP53 deletion. Custom targeting next-generation sequencing and nanopore sequencing revealed a 3.

View Article and Find Full Text PDF

Background: Whether a detected virus or bacteria is a pathogen that may require treatment, or is merely a commensal 'passenger', remains confusing for many infections. This confusion is likely to increase with the wider use of multi-pathogen PCR.

Objectives: To propose a new statistical procedure to analyse and present data from case-control studies clarifying the probability of causality.

View Article and Find Full Text PDF

Background: This special section underscores the potential of multimodal measurement approaches to transform psychotherapy research. A multimodal approach provides a more comprehensive understanding than any single modality (type of collected information) can provide on its own.

Methods: Traditionally, clinicians and researchers have relied on their intuition, experience, and training to integrate different types of information in a psychotherapy session/treatment.

View Article and Find Full Text PDF

Point-of-care ultrasound in the diagnosis of hepatic gas gangrene.

J Ultrasound

January 2025

Argentinian Critical Care Ultrasonography Association (ASARUC), Buenos Aires, Argentina.

Hepatic gas gangrene (HGG) is a rare but life-threatening condition typically caused by anaerobic bacteria such as Clostridium perfringens, though Gram-negative bacteria like Escherichia coli and Klebsiella species have also been implicated. Traditionally diagnosed via computed tomography (CT), point-of-care ultrasound (POCUS) has emerged as a valuable tool in critical care settings for its non-invasive, bedside utility. We report the case of a 51-year-old female with choledochal syndrome secondary to cholangiocarcinoma who developed HGG following left extended hepatectomy and biliary reconstruction.

View Article and Find Full Text PDF

Pros and cons of surgical versus conservative management for head and neck paraganglioma: a real-world data analysis.

Endocrine

January 2025

Centro di Ricerca e Innovazione sulle Patologie Surrenaliche, AOU Careggi, Florence, Italy.

Purpose: To compare functional deficits associated to surgery with those caused by the growth of the head and neck paragangliomas (HNPGLs).

Methods: 72 patients with HNPGLs were included. Patients were divided in group A (49 patients undergoing surgery) and group B (23 patients following a wait and see approach).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!