Data-intensive applications are becoming commonplace in all science disciplines. They comprise a rich set of sub-domains such as data engineering, deep learning, and machine learning. These applications are built around efficient data abstractions and operators suited to their respective domains. Often, the lack of a clear definition of data structures and operators in the field has led to separate implementations that do not work well together. The architecture that we recently proposed identifies a set of data structures, operators, and an execution model for creating rich data applications, linking all aspects of data engineering and data science together efficiently. This paper elaborates on and illustrates this architecture using an end-to-end application in which deep learning and data engineering parts work together. Our analysis shows that the proposed system architecture is better suited to high performance computing environments than current big data processing systems. Furthermore, our proposed system emphasizes the importance of efficient, compact data structures, such as the Apache Arrow tabular data representation, that are defined for high performance. Thus, the system integration we propose scales a sequential computation to a distributed computation while retaining optimum performance along with a highly usable application programming interface.
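To make the idea of scaling a sequential table operator to a distributed one more concrete, the following is a minimal, standard-library-only sketch. It is not the authors' system and does not use Apache Arrow: the column-oriented `dict` table, the function names (`group_sum`, `partition`, `distributed_group_sum`), and the sample data are all hypothetical and chosen for illustration. It assumes an operator whose partial results can be merged, so the same sequential code can run on each partition.

```python
from collections import defaultdict

# Hypothetical columnar table: column name -> list of values (equal lengths).
# This only mimics a column-oriented layout; it is not Apache Arrow.

def group_sum(table, key, value):
    """Sequential operator: sum the `value` column grouped by the `key` column."""
    totals = defaultdict(float)
    for k, v in zip(table[key], table[value]):
        totals[k] += v
    return dict(totals)

def partition(table, n):
    """Split a columnar table row-wise into n partitions (round-robin)."""
    return [{name: col[i::n] for name, col in table.items()} for i in range(n)]

def distributed_group_sum(table, key, value, n_workers):
    """Run the same sequential operator on each partition, then merge the partial sums.

    The loop below runs in one process; in a real deployment each partition
    could be handled by a separate worker before the merge step.
    """
    merged = defaultdict(float)
    for part in partition(table, n_workers):
        for k, partial in group_sum(part, key, value).items():
            merged[k] += partial
    return dict(merged)

# Illustrative data (invented for this sketch).
table = {
    "sensor": ["a", "b", "a", "c", "b", "a"],
    "reading": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
}

sequential_result = group_sum(table, "sensor", "reading")
distributed_result = distributed_group_sum(table, "sensor", "reading", n_workers=3)
```

In this sketch the partitioned version returns the same grouping as the sequential one because summation can be merged across partitions; operators without that property would need a different combine step.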

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8860100
DOI: http://dx.doi.org/10.3389/fdata.2021.756041
