Submodularity of Distributed Join Computation.

Proc ACM SIGMOD Int Conf Manag Data

Northeastern University, Boston, USA.

Published: June 2018

AI Article Synopsis

  • The study focuses on how to efficiently compute distributed equi-joins in databases when there's an imbalance in data distribution (join-attribute skew), leading to uneven workload.
  • Addressing this skew requires finer partitioning of data, but this also means duplicating input, which creates a balance between overall load and the variance in that load.
  • The researchers demonstrate that minimizing load variance with certain constraints can be framed as a specific optimization problem, allowing for effective algorithms that significantly enhance performance, even under deterministic load assignments.

Article Abstract

We study distributed equi-join computation in the presence of join-attribute skew, which causes load imbalance. Skew can be addressed by more fine-grained partitioning, at the cost of input duplication. For random load assignment, e.g., using a hash function, fine-grained partitioning creates a tradeoff between load expectation and variance. We show that minimizing load variance subject to a constraint on expectation is a monotone submodular maximization problem with Knapsack constraints, hence admitting provably near-optimal greedy solutions. In contrast to previous work on formal optimality guarantees, we can prove this result also for self-joins and more general load functions defined as weighted sum of input and output. We further demonstrate through experiments that this theoretical result leads to an effective algorithm for the problem of minimizing running time, even when load is assigned deterministically.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6404978PMC
http://dx.doi.org/10.1145/3183713.3183728DOI Listing

Publication Analysis

Top Keywords

fine-grained partitioning
8
load
6
submodularity distributed
4
distributed join
4
join computation
4
computation study
4
study distributed
4
distributed equi-join
4
equi-join computation
4
computation presence
4

Similar Publications

Aiming at the problem that the existing human skeleton behavior recognition methods are insensitive to human local movements and show inaccurate recognition in distinguishing similar behaviors, a multi-scale spatio-temporal graph convolution method incorporating multi-granularity features is proposed for human behavior recognition. Firstly, a skeleton fine-grained partitioning strategy is proposed, which initializes the skeleton data into data streams of different granularities. An adaptive cross-scale feature fusion layer is designed using a normalized Gaussian function to perform feature fusion among different granularities, guiding the model to focus on discriminative feature representations among similar behaviors through fine-grained features.

View Article and Find Full Text PDF

The use of auditory stimuli in rehabilitation to target walking has been evidenced in persons with neurological conditions. The methodologies focus on the synchronisation of persons' steps to auditory stimuli showing that the type of stimuli and tempi significantly affect the synchronisation. However, the dynamic of the interaction over time between the motor system and the auditory stimuli, i.

View Article and Find Full Text PDF

Background: The present contribution reexamines the geometry of a segment of a presumably long-lived fault in Svalbard, the Balliolbreen Fault segment of the Billefjorden Fault Zone, along which presumably two basement terranes of Svalbard accreted in the early-mid Paleozoic after thousands of kilometers strike-slip displacement.

Methods: We performed structural fieldwork to Billefjorden in central Spitsbergen and interpreted satellite images.

Results: Field observations demonstrate that the Balliolbreen Fault formed as a top-west thrust fault in the early Cenozoic and that weak sedimentary units such as shales of the Lower Devonian Wood Bay Formation and coals of the uppermost Devonian-Mississippian Billefjorden Group partitioned deformation, resulting in significant contrast in deformation intensity between stratigraphic units.

View Article and Find Full Text PDF
Article Synopsis
  • Humans can categorize visual information into specific groups, with previous fMRI studies highlighting how the brain distinguishes between broad categories (like animate vs. inanimate) and individual objects.
  • Recent research used fMRI coupled with multiple examples of 48 different mammals to examine this further, aiming to clarify the distinctions between fine-grained and coarse-scale representations.
  • The findings suggest fMRI primarily captures visual-specific and general category information, but it can also identify subtle differences between individual objects, challenging earlier assumptions about the level of detail provided by fMRI data.
View Article and Find Full Text PDF

CK-ATTnet: Medical image segmentation network based on convolutional kernel attention.

Comput Biol Med

December 2024

School of Data Science and Artifical Intelligence, Wenzhou University of Technology, Wenzhou 325035, China. Electronic address:

The medical image partition model has a wide range of application prospects in medical diagnosis and treatment and has become an important auxiliary method to improve the diagnostic level by medical imaging analysis. After the feature extraction ability of the convolutional neural network (CNN) reached a bottleneck, the form of feature extraction represented by Transformer has made significant achievements in the medical image domain in recent years. However, the structure of Transformer is relatively fixed, the cost of computer resources is large, and it is difficult to adjust the model structure according to the complex medical imaging segmentation task.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!