Background: In biomedical applications, valuable data is often split between owners who cannot openly share the data because of privacy regulations and concerns. Training machine learning models on the joint data without violating privacy is a major technology challenge that can be addressed by combining techniques from machine learning and cryptography. When collaboratively training machine learning models with the cryptographic technique named secure multi-party computation, the price paid for keeping the data of the owners private is an increase in computational cost and runtime. A careful choice of machine learning techniques, algorithmic and implementation optimizations are a necessity to enable practical secure machine learning over distributed data sets. Such optimizations can be tailored to the kind of data and Machine Learning problem at hand.

Methods: Our setup involves secure two-party computation protocols, along with a trusted initializer that distributes correlated randomness to the two computing parties. We use a gradient descent based algorithm for training a logistic regression like model with a clipped ReLu activation function, and we break down the algorithm into corresponding cryptographic protocols. Our main contributions are a new protocol for computing the activation function that requires neither secure comparison protocols nor Yao's garbled circuits, and a series of cryptographic engineering optimizations to improve the performance.

Results: For our largest gene expression data set, we train a model that requires over 7 billion secure multiplications; the training completes in about 26.90 s in a local area network. The implementation in this work is a further optimized version of the implementation with which we won first place in Track 4 of the iDASH 2019 secure genome analysis competition.

Conclusions: In this paper, we present a secure logistic regression training protocol and its implementation, with a new subprotocol to securely compute the activation function. To the best of our knowledge, we present the fastest existing secure multi-party computation implementation for training logistic regression models on high dimensional genome data distributed across a local area network.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7818577PMC
http://dx.doi.org/10.1186/s12920-020-00869-9DOI Listing

Publication Analysis

Top Keywords

machine learning
24
logistic regression
16
activation function
12
genome analysis
8
data
8
training machine
8
learning models
8
secure
8
secure multi-party
8
multi-party computation
8

Similar Publications

A prediction model for electrical strength of gaseous medium based on molecular reactivity descriptors and machine learning method.

J Mol Model

January 2025

Hubei Key Laboratory·for High-Efficiency-Utilization of Solar Energy and Operation, Control of Energy-Storage System, Hubei-University of Technology, Wuhan, 430068, China.

Context: Ionization and adsorption in gas discharge are similar to electrophilic and nucleophilic reactions. The molecular descriptors characterizing reactions such as electrostatic potential descriptors are useful in predicting the electrical strength of environmentally friendly gases. In this study, descriptors of 73 molecules are employed for correlation analysis with electrical strength.

View Article and Find Full Text PDF

Predicting fall parameters from infant skull fractures using machine learning.

Biomech Model Mechanobiol

January 2025

Department of Mechanical Engineering, University of Utah, Salt Lake City, UT, 84112, USA.

When infants are admitted to the hospital with skull fractures, providers must distinguish between cases of accidental and abusive head trauma. Limited information about the incident is available in such cases, and witness statements are not always reliable. In this study, we introduce a novel, data-driven approach to predict fall parameters that lead to skull fractures in infants in order to aid in determinations of abusive head trauma.

View Article and Find Full Text PDF

Role of immune cell homeostasis in research and treatment response in hepatocellular carcinoma.

Clin Exp Med

January 2025

Department of Thoracic Surgery, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200127, China.

Introduction Recently, immune cells within the tumor microenvironment (TME) have become crucial in regulating cancer progression and treatment responses. The dynamic interactions between tumors and immune cells are emerging as a promising strategy to activate the host's immune system against various cancers. The development and progression of hepatocellular carcinoma (HCC) involve complex biological processes, with the role of the TME and tumor phenotypes still not fully understood.

View Article and Find Full Text PDF

The brain undergoes atrophy and cognitive decline with advancing age. The utilization of brain age prediction represents a pioneering methodology in the examination of brain aging. This study aims to develop a deep learning model with high predictive accuracy and interpretability for brain age prediction tasks.

View Article and Find Full Text PDF

Risk-taking is a concerning yet prevalent issue during adolescence and can be life-threatening. Examining its etiological sources and evolving pathways helps inform strategies to mitigate adolescents' risk-taking behavior. Studies have found that unfavorable environmental factors, such as adverse childhood experiences (ACEs), are associated with momentary levels of risk-taking in adolescents, but little is known about whether ACEs shape the developmental trajectory of risk-taking.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!