Motivation: Rapid, low-cost genome sequencing has enabled widespread use of genomic data in research studies and personalized customer applications, where genomic data is shared in public databases. Although the identities of the participants are anonymized in these databases, sensitive information about individuals can still be inferred. One such piece of information is kinship.

Results: We define two routes through which kinship privacy can leak and propose a technique to protect kinship privacy against these risks while maximizing the utility of the shared data. The method systematically identifies minimal portions of genomic data to mask as new participants are added to the database. Choosing the positions to hide is cast as an optimization problem in which the number of masked positions is minimized subject to privacy constraints that ensure familial relationships are not revealed. We evaluate the proposed technique on real genomic data. Results indicate that concurrent sharing of data pertaining to a parent and an offspring poses a high risk to kinship privacy, whereas sharing data from more distant relatives together is often safer. We also show that the arrival order of family members has a high impact on the level of privacy risk and on the utility of the shared data.
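The masking idea can be illustrated with a minimal sketch. It is not the method from the paper or its repository: it assumes a toy identity-by-state similarity as a stand-in for a real kinship estimator, a single already-shared relative, and a hypothetical privacy threshold. Genotypes are coded as minor-allele counts (0, 1, 2), and the names `ibs_similarity` and `greedy_mask` are illustrative only.

```python
def ibs_similarity(g1, g2, masked):
    """Mean identity-by-state over unmasked SNPs (toy stand-in for a kinship estimator)."""
    visible = [i for i in range(len(g1)) if i not in masked]
    if not visible:
        return 0.0  # nothing is shared, so nothing can be inferred
    # Per-site score: 1.0 for identical genotypes, 0.5 for one differing allele, 0.0 for two.
    return sum(1 - abs(g1[i] - g2[i]) / 2 for i in visible) / len(visible)


def greedy_mask(g_new, g_shared, threshold):
    """Mask positions of the newcomer until similarity to the shared genome <= threshold.

    The threshold is a hypothetical privacy constraint chosen for illustration.
    """
    masked = set()
    # Sites where the two genotypes agree most inflate the similarity most; hide those first.
    order = sorted(range(len(g_new)), key=lambda i: abs(g_new[i] - g_shared[i]))
    for i in order:
        if ibs_similarity(g_new, g_shared, masked) <= threshold:
            break
        masked.add(i)
    return masked


if __name__ == "__main__":
    newcomer = [0, 1, 2, 1, 0, 2]   # genotype of the participant about to join
    relative = [0, 1, 2, 0, 1, 2]   # genotype already in the database
    hidden = greedy_mask(newcomer, relative, threshold=0.7)
    print(sorted(hidden))  # [0, 1, 2]
```

In this toy setting, three of the six positions must be hidden before the similarity drops to the threshold. A realistic formulation would need to handle several relatives at once and a proper kinship statistic, which is why the abstract frames the choice of positions as a constrained optimization problem rather than a single greedy pass.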

Availability And Implementation: https://github.com/tastanlab/Kinship-Privacy.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
http://dx.doi.org/10.1093/bioinformatics/btx568
