Publications by Murat Kantarcıoglu | LitMetric

Publications by authors named "Murat Kantarcıoglu"

Page 1 of 3

Not the Models You Are Looking For: Traditional ML Outperforms LLMs in Clinical Prediction Tasks.

Katherine E Brown Chao Yan Zhuohang Li Xinmeng Zhang Benjamin X Collins

medRxiv

December 2024

Objectives: To determine the extent to which current Large Language Models (LLMs) can serve as substitutes for traditional machine learning (ML) as clinical predictors using data from electronic health records (EHRs), we investigated various factors that can impact their adoption, including overall performance, calibration, fairness, and resilience to privacy protections that reduce data fidelity.

Materials And Methods: We evaluated GPT-3.5, GPT-4, and ML (as gradient-boosting trees) on clinical prediction tasks in EHR data from Vanderbilt University Medical Center and MIMIC IV.

View Article and Find Full Text PDF

Supporting COVID-19 Disparity Investigations with Dynamically Adjusting Case Reporting Policies.

J Thomas Brown Zhiyu Wan Aris Gkoulalas-Divanis Murat Kantarcioglu Bradley A Malin

AMIA Annu Symp Proc

May 2023

Data access limitations have stifled COVID-19 disparity investigations in the United States. Though federal and state legislation permits publicly disseminating de-identified data, methods for de-identification, including a recently proposed dynamic policy approach to pandemic data sharing, remain unproved in their ability to support pandemic disparity studies. Thus, in this paper, we evaluate how such an approach enables timely, accurate, and fair disparity detection, with respect to potential adversaries with varying prior knowledge about the population.

View Article and Find Full Text PDF

A Representativeness-informed Model for Research Record Selection from Electronic Medical Record Systems.

Victor A Borza Ellen Wright Clayton Murat Kantarcioglu Yevgeniy Vorobeychik Bradley A Malin

AMIA Annu Symp Proc

May 2023

Scientific and clinical studies have a long history of bias in recruitment of underprivileged and minority populations. This underrepresentation leads to inaccurate, inapplicable, and non-generalizable results. Electronic medical record (EMR) systems, which now drive much research, often poorly represent these groups.

View Article and Find Full Text PDF

A game theoretic approach to balance privacy risks and familial benefits.

Jia Guo Ellen Wright Clayton Murat Kantarcioglu Yevgeniy Vorobeychik Myrna Wooders

Sci Rep

April 2023

As recreational genomics continues to grow in its popularity, many people are afforded the opportunity to share their genomes in exchange for various services, including third-party interpretation (TPI) tools, to understand their predisposition to health problems and, based on genome similarity, to find extended family members. At the same time, these services have increasingly been reused by law enforcement to track down potential criminals through family members who disclose their genomic information. While it has been observed that many potential users shy away from such data sharing when they learn that their privacy cannot be assured, it remains unclear how potential users' valuations of the service will affect a population's behavior.

View Article and Find Full Text PDF

Implicit Incentives Among Reddit Users to Prioritize Attention Over Privacy and Reveal Their Faces When Discussing Direct-to-Consumer Genetic Test Results: Topic and Attention Analysis.

Yongtai Liu Zhijun Yin Zhiyu Wan Chao Yan Weiyi Xia

JMIR Infodemiology

August 2022

Background: As direct-to-consumer genetic testing services have grown in popularity, the public has increasingly relied upon online forums to discuss and share their test results. Initially, users did so anonymously, but more recently, they have included face images when discussing their results. Various studies have shown that sharing images on social media tends to elicit more replies.

View Article and Find Full Text PDF

Coffee-Derived Exosome-Like Nanoparticles: Are They the Secret Heroes?

Murat Kantarcıoğlu Gülşen Yıldırım Pınar Akpınar Oktar Serpil Yanbakan Zeynep Büşra Özer

Turk J Gastroenterol

February 2023

Background: Regular coffee consumption has beneficial and preventative effects on liver and chronic neurodegenerative diseases. However, the studies performed with the ingredients found in coffee beverages have not clarified the responsible mechanisms. Exosomes are small, membrane-coated cargo packages secreted by prokaryote and eukaryote cells.

View Article and Find Full Text PDF

Blockchain networks: Data structures of Bitcoin, Monero, Zcash, Ethereum, Ripple, and Iota.

Cuneyt Gurcan Akcora Yulia R Gel Murat Kantarcioglu

Wiley Interdiscip Rev Data Min Knowl Discov

November 2021

Blockchain is an emerging technology that has enabled many applications, from cryptocurrencies to digital asset management and supply chains. Due to this surge of popularity, analyzing the data stored on blockchains poses a new critical challenge in data science. To assist data scientists in various analytic tasks for a blockchain, in this tutorial, we provide a systematic and comprehensive overview of the fundamental elements of blockchain network models.

View Article and Find Full Text PDF

New therapeutic players on the horizon: Edible plant derived exosomes.

Murat Kantarcioglu

Hepatol Forum

September 2021

View Article and Find Full Text PDF

Robust Transparency Against Model Inversion Attacks.

Yasmeen Alufaisan Murat Kantarcioglu Yan Zhou

IEEE Trans Dependable Secure Comput

August 2020

Transparency has become a critical need in machine learning (ML) applications. Designing transparent ML models helps increase trust, ensure accountability, and scrutinize fairness. Some organizations may opt-out of transparency to protect individuals' privacy.

View Article and Find Full Text PDF

Publisher Correction: Sociotechnical safeguards for genomic data privacy.

Zhiyu Wan James W Hazel Ellen Wright Clayton Yevgeniy Vorobeychik Murat Kantarcioglu

Nat Rev Genet

July 2022

View Article and Find Full Text PDF

De-identifying Socioeconomic Data at the Census Tract Level for Medical Research Through Constraint-based Clustering.

Yongtai Liu Douglas Conway Zhiyu Wan Murat Kantarcioglu Yevgeniy Vorobeychik

AMIA Annu Symp Proc

April 2022

Numerous studies have shown that a person's health status is closely related to their socioeconomic status. It is evident that incorporating socioeconomic data associated with a patient's geographic area of residence into clinical datasets will promote medical research. However, most socioeconomic variables are unique in combination and are affiliated with small geographical regions (e.

View Article and Find Full Text PDF

Sociotechnical safeguards for genomic data privacy.

Zhiyu Wan James W Hazel Ellen Wright Clayton Yevgeniy Vorobeychik Murat Kantarcioglu

Nat Rev Genet

July 2022

Recent developments in a variety of sectors, including health care, research and the direct-to-consumer industry, have led to a dramatic increase in the amount of genomic data that are collected, used and shared. This state of affairs raises new and challenging concerns for personal privacy, both legally and technically. This Review appraises existing and emerging threats to genomic data privacy and discusses how well current legal frameworks and technical safeguards mitigate these concerns.

View Article and Find Full Text PDF

Dynamically adjusting case reporting policy to maximize privacy and public health utility in the face of a pandemic.

J Thomas Brown Chao Yan Weiyi Xia Zhijun Yin Zhiyu Wan

J Am Med Inform Assoc

April 2022

Objective: Supporting public health research and the public's situational awareness during a pandemic requires continuous dissemination of infectious disease surveillance data. Legislation, such as the Health Insurance Portability and Accountability Act of 1996 and recent state-level regulations, permits sharing deidentified person-level data; however, current deidentification approaches are limited. Namely, they are inefficient, relying on retrospective disclosure risk assessments, and do not flex with changes in infection rates or population demographics over time.

View Article and Find Full Text PDF

Using game theory to thwart multistage privacy intrusions when sharing data.

Zhiyu Wan Yevgeniy Vorobeychik Weiyi Xia Yongtai Liu Myrna Wooders

Sci Adv

December 2021

Person-specific biomedical data are now widely collected, but its sharing raises privacy concerns, specifically about the re-identification of seemingly anonymous records. Formal re-identification risk assessment frameworks can inform decisions about whether and how to share data; current techniques, however, focus on scenarios where the data recipients use only one resource for re-identification purposes. This is a concern because recent attacks show that adversaries can access multiple resources, combining them in a stage-wise manner, to enhance the chance of an attack’s success.

View Article and Find Full Text PDF

Leveraging blockchain for immutable logging and querying across multiple sites.

Mustafa Safa Ozdayi Murat Kantarcioglu Bradley Malin

BMC Med Genomics

July 2020

Background: Blockchain has emerged as a decentralized and distributed framework that enables tamper-resilience and, thus, practical immutability for stored data. This immutability property is important in scenarios where auditability is desired, such as in maintaining access logs for sensitive healthcare and biomedical data. However, the underlying data structure of blockchain, by default, does not provide capabilities to efficiently query the stored data.

View Article and Find Full Text PDF

Biomedical Research Cohort Membership Disclosure on Social Media.

Yongtai Liu Chao Yan Zhijun Yin Zhiyu Wan Weiyi Xia

AMIA Annu Symp Proc

June 2020

To accelerate medical knowledge discovery, an increasing number of research programs are gathering and sharing data on a large number of participants. Due to the privacy concerns and legal restrictions on data sharing, these programs apply various strategies to mitigate privacy risk. However, the activities of participants and research program sponsors, particularly on social media, might reveal an individual's membership in a study, making it easier to recognize participants' records and uncover the information they have yet to disclose.

View Article and Find Full Text PDF

Detecting the Presence of an Individual in Phenotypic Summary Data.

Yongtai Liu Zhiyu Wan Weiyi Xia Murat Kantarcioglu Yevgeniy Vorobeychik

AMIA Annu Symp Proc

December 2019

As the quantity and detail of association studies between clinical phenotypes and genotypes grows, there is a push to make summary statistics widely available. Genome wide summary statistics have been shown to be vulnerable to the inference of a targeted individual's presence. In this paper, we show that presence attacks are feasible with phenome wide summary statistics as well.

View Article and Find Full Text PDF

Research Challenges at the Intersection of Big Data, Security and Privacy.

Murat Kantarcioglu Elena Ferrari

Front Big Data

February 2019

View Article and Find Full Text PDF

An Open Source Tool for Game Theoretic Health Data De-Identification.

Fabian Prasser James Gaupp Zhiyu Wan Weiyi Xia Yevgeniy Vorobeychik

AMIA Annu Symp Proc

April 2019

Biomedical data continues to grow in quantity and quality, creating new opportunities for research and data-driven applications. To realize these activities at scale, data must be shared beyond its initial point of collection. To maintain privacy, healthcare organizations often de-identify data, but they assume worst-case adversaries, inducing high levels of data corruption.

View Article and Find Full Text PDF

It's all in the timing: calibrating temporal penalties for biomedical data sharing.

Weiyi Xia Zhiyu Wan Zhijun Yin James Gaupp Yongtai Liu

J Am Med Inform Assoc

January 2018

Objective: Biomedical science is driven by datasets that are being accumulated at an unprecedented rate, with ever-growing volume and richness. There are various initiatives to make these datasets more widely available to recipients who sign Data Use Certificate agreements, whereby penalties are levied for violations. A particularly popular penalty is the temporary revocation, often for several months, of the recipient's data usage rights.

View Article and Find Full Text PDF

Uptake Patterns of Untreated Primary Gastrointestinal Extranodal Lymphomas on Initial Staging F-FDG PET/CT and Metabolic Tumor Parameters.

Engin Alagöz Kürşat Okuyucu Semra İnce Murat Kantarcıoğlu Şükrü Özaydın

Mol Imaging Radionucl Ther

October 2017

Objective: Non-Hodgkin's lymphomas arising from tissues other than primary lymphatic sites are classified as primary extranodal lymphomas (PEL). PELs of the gastrointestinal system (PGISL) originate from the lymphatic tissues within the gastrointestinal tract. The prognostic value of F-FDG PET/CT in lymphomas is high in terms of both overall survival (OS) and disease-free survival (DFS).

View Article and Find Full Text PDF

Controlling the signal: Practical privacy protection of genomic data sharing through Beacon services.

Zhiyu Wan Yevgeniy Vorobeychik Murat Kantarcioglu Bradley Malin

BMC Med Genomics

July 2017

Background: Genomic data is increasingly collected by a wide array of organizations. As such, there is a growing demand to make summary information about such collections available more widely. However, over the past decade, a series of investigations have shown that attacks, rooted in statistical inference methods, can be applied to discern the presence of a known individual's DNA sequence in the pool of subjects.

View Article and Find Full Text PDF

Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach.

Zhiyu Wan Yevgeniy Vorobeychik Weiyi Xia Ellen Wright Clayton Murat Kantarcioglu

Am J Hum Genet

February 2017

Emerging scientific endeavors are creating big data repositories of data from millions of individuals. Sharing data in a privacy-respecting manner could lead to important discoveries, but high-profile demonstrations show that links between de-identified genomic data and named persons can sometimes be reestablished. Such re-identification attacks have focused on worst-case scenarios and spurred the adoption of data-sharing practices that unnecessarily impede research.

View Article and Find Full Text PDF

Is nonalcoholic fatty liver disease a risk factor for chronic kidney disease?

Kadir Ozturk Hakan Demirci Omer Kurt Murat Kantarcioglu

Eur J Gastroenterol Hepatol

May 2016

View Article and Find Full Text PDF

Pentraxin 3 Is a Predictor for Fibrosis and Arterial Stiffness in Patients with Nonalcoholic Fatty Liver Disease.

Kadir Ozturk Omer Kurt Tolga Dogan Alptug Ozen Hakan Demirci

Gastroenterol Res Pract

March 2016

Objective. The aim of the present study was to investigate whether pentraxin 3 (PTX3) can be a new noninvasive marker for prediction of liver fibrosis in patients with NAFLD. We also aimed to evaluate the relationship between PTX3 and atherosclerosis in patients with NAFLD.

View Article and Find Full Text PDF