Publications by Yun Joo Yoo | LitMetric

Publications by authors named "Yun Joo Yoo"

Supervised diagnostic classification of cognitive attributes using data augmentation.

Ji-Young Yoon, Gahgene Gweon, Yun Joo Yoo

PLoS One

January 2024

Over recent decades, machine learning, an integral subfield of artificial intelligence, has revolutionized diverse sectors, enabling data-driven decisions with minimal human intervention. In particular, the field of educational assessment emerges as a promising area for machine learning applications, where students can be classified and diagnosed using their performance data. The objectives of Diagnostic Classification Models (DCMs), which provide a suite of methods for diagnosing students' cognitive states in relation to the mastery of necessary cognitive attributes for solving problems in a test, can be effectively addressed through machine learning techniques.

Enhancement of the Anticancer Ability of Natural Killer Cells through Allogeneic Mitochondrial Transfer.

Seong-Hoon Kim, Mi-Jin Kim, Mina Lim, Jihye Kim, Hyunmin Kim, Yun-Joo Yoo

Cancers (Basel)

June 2023

An in vitro culture period of at least 2 weeks is required to produce sufficient natural killer (NK) cells for immunotherapy, which are the key effectors in hematological malignancy treatment. Mitochondrial damage and fragmentation reduce the NK cell immune surveillance capacity. Thus, we hypothesized that the transfer of healthy mitochondria to NK cells could enhance their anticancer effects.

gpart: human genome partitioning and visualization of high-density SNP data by identifying haplotype blocks.

Sun Ah Kim, Myriam Brossard, Delnaz Roshandel, Andrew D Paterson, Shelley B Bull, Yun Joo Yoo

Bioinformatics

November 2019

Summary: For the analysis of high-throughput genomic data produced by next-generation sequencing (NGS) technologies, researchers need to identify linkage disequilibrium (LD) structure in the genome. In this work, we developed an R package gpart which provides clustering algorithms to define LD blocks or analysis units consisting of SNPs. The visualization tool in gpart can display the LD structure and gene positions for up to 20 000 SNPs in one image.
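
As a rough illustration of the pairwise LD measure that block-detection tools of this kind build on, the sketch below computes r² between two biallelic SNPs from phased haplotypes coded 0/1. It is not the gpart API; the function name `ld_r2` and the 0/1 haplotype coding are assumptions made only for this example.

```python
def ld_r2(hap_a, hap_b):
    """Squared correlation (r^2) between two biallelic SNPs.

    hap_a, hap_b: equal-length sequences of 0/1 alleles, one entry per
    phased haplotype (hypothetical input format for this sketch).
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype vectors must be non-empty and equal length")

    p_a = sum(hap_a) / n                      # frequency of allele 1 at SNP A
    p_b = sum(hap_b) / n                      # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n

    d = p_ab - p_a * p_b                      # LD coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0                            # monomorphic SNP: r^2 undefined, 0 by convention here
    return d * d / denom
```

Two SNPs that always co-occur give r² = 1, while independent SNPs give r² = 0; an LD block is, loosely, a run of SNPs whose pairwise values stay high.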

Fusion of a highly N-glycosylated polypeptide increases the expression of ER-localized proteins in plants.

Hyangju Kang, Youngmin Park, Yongjik Lee, Yun-Joo Yoo, Inhwan Hwang

Sci Rep

March 2018

Plants represent promising systems for producing various recombinant proteins. One key area of focus for improving this technology is developing methods for producing recombinant proteins at high levels. Many methods have been developed to increase the transcript levels of recombinant genes.

Prolines in Transit Peptides Are Crucial for Efficient Preprotein Translocation into Chloroplasts.

Dong Wook Lee, Yun-Joo Yoo, Md Abdur Razzak, Inhwan Hwang

Plant Physiol

January 2018

Chloroplasts import many preproteins that can be classified based on their physicochemical properties. The cleavable N-terminal transit peptide (TP) of chloroplast preproteins contains all the information required for import into chloroplasts through Toc/Tic translocons. The question of whether and how the physicochemical properties of preproteins affect TP-mediated import into chloroplasts has not been elucidated.

A new haplotype block detection method for dense genome sequencing data based on interval graph modeling of clusters of highly correlated SNPs.

Sun Ah Kim, Chang-Sung Cho, Suh-Ryung Kim, Shelley B Bull, Yun Joo Yoo

Bioinformatics

February 2018

Motivation: Linkage disequilibrium (LD) block construction is required for research in population genetics and genetic epidemiology, including specification of sets of single nucleotide polymorphisms (SNPs) for analysis of multi-SNP based association and identification of haplotype blocks in high density sequencing data. Existing methods based on a narrow sense definition do not allow intermediate regions of low LD between strongly associated SNP pairs and tend to split high density SNP data into small blocks having high between-block correlation.

Results: We present Big-LD, a block partition method based on interval graph modeling of LD bins which are clusters of strong pairwise LD SNPs, not necessarily physically consecutive.

Evolution of rubisco complex small subunit transit peptides from algae to plants.

Md Abdur Razzak, Dong Wook Lee, Yun-Joo Yoo, Inhwan Hwang

Sci Rep

August 2017

Chloroplasts evolved from a free-living cyanobacterium acquired by the ancestor of all photosynthetic eukaryotes, including algae and plants, through a single endosymbiotic event. During endosymbiotic conversion, the majority of genes in the endosymbiont were transferred to the host nucleus and many of the proteins encoded by these genes must therefore be transported into the chloroplast after translation in the cytosol. Chloroplast-targeted proteins contain a targeting signal, named the transit peptide (TP), at the N-terminus.

The Prenylated Rab GTPase Receptor PRA1.F4 Contributes to Protein Exit from the Golgi Apparatus.

Myoung Hui Lee, Yun-Joo Yoo, Dae Heon Kim, Nguyen Hong Hanh, Yun Kwon

Plant Physiol

July 2017

Prenylated Rab acceptor1 (PRA1) functions in the recruitment of prenylated Rab proteins to their cognate organelles. Arabidopsis contains a large number of proteins belonging to the AtPRA1 family. However, their physiological roles remain largely unknown.

Effects of Single Nucleotide Polymorphism Marker Density on Haplotype Block Partition.

Sun Ah Kim, Yun Joo Yoo

Genomics Inform

December 2016

Many researchers have found that one of the most important characteristics of the structure of linkage disequilibrium is that the human genome can be divided into non-overlapping block partitions in which only a small number of haplotypes are observed. The location and distribution of haplotype blocks can be seen as a population property influenced by population genetic events such as selection, mutation, recombination and population structure. In this study, we investigate the effects of the density of markers relative to the full set of all polymorphisms in the region on the results of haplotype partitioning for five popular haplotype block partition methods: three methods in Haploview (confidence interval, four gamete test, and solid spine), MIG++ implemented in PLINK 1.
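
The four gamete test named above can be illustrated with a minimal sketch: for a pair of biallelic SNPs, seeing all four possible two-locus haplotypes (gametes) is taken as evidence of historical recombination between them. This is a generic illustration, not Haploview's implementation; the function name `has_four_gametes`, the 0/1 allele coding, and the `min_freq` threshold parameter are assumptions for this example.

```python
from collections import Counter


def has_four_gametes(hap_a, hap_b, min_freq=0.0):
    """Return True if all four gametes (00, 01, 10, 11) are observed.

    hap_a, hap_b: equal-length sequences of 0/1 alleles on phased
    haplotypes (hypothetical input format).
    min_freq: a gamete counts as observed only if its frequency is
    strictly greater than this value (assumed parameter).
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype vectors must be non-empty and equal length")

    counts = Counter(zip(hap_a, hap_b))       # count each two-locus haplotype
    observed = [g for g, c in counts.items() if c / n > min_freq]
    return len(observed) == 4
```

A block partition built on this idea would extend a block while no SNP pair inside it shows all four gametes.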

Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure.

Yun Joo Yoo, Lei Sun, Julia G Poirier, Andrew D Paterson, Shelley B Bull

Genet Epidemiol

February 2017

By jointly analyzing multiple variants within a gene, instead of one at a time, gene-based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive.

Interactions between Transmembrane Helices within Monomers of the Aquaporin AtPIP2;1 Play a Crucial Role in Tetramer Formation.

Yun-Joo Yoo, Hyun Kyung Lee, Wonhee Han, Dae Heon Kim, Myoung Hui Lee

Mol Plant

July 2016

Aquaporin (AQP) is a water channel protein found in various subcellular membranes of both prokaryotic and eukaryotic cells. The physiological functions of AQPs have been elucidated in many organisms. However, understanding their biogenesis remains elusive, particularly regarding how they assemble into tetramers.

Clique-Based Clustering of Correlated SNPs in a Gene Can Improve Performance of Gene-Based Multi-Bin Linear Combination Test.

Yun Joo Yoo, Sun Ah Kim, Shelley B Bull

Biomed Res Int

June 2016

Gene-based analysis of multiple single nucleotide polymorphisms (SNPs) in a gene region is an alternative to single SNP analysis. The multi-bin linear combination test (MLC) proposed in previous studies utilizes the correlation among SNPs within a gene to construct a gene-based global test. SNPs are partitioned into clusters of highly correlated SNPs, and the MLC test statistic quadratically combines linear combination statistics constructed for each cluster.

Gene-based multiple regression association testing for combined examination of common and low frequency variants in quantitative trait analysis.

Yun Joo Yoo, Lei Sun, Shelley B Bull

Front Genet

November 2013

Multi-marker methods for genetic association analysis can be performed for common and low frequency SNPs to improve power. Regression models are an intuitive way to formulate multi-marker tests. In previous studies we evaluated regression-based multi-marker tests for common SNPs, and through identification of bins consisting of correlated SNPs, developed a multi-bin linear combination (MLC) test that is a compromise between a 1 df linear combination test and a multi-df global test.

Combined linkage and association analyses identify a novel locus for obesity near PROX1 in Asians.

Hyun-Jin Kim, Yun Joo Yoo, Young Seok Ju, Seungbok Lee, Sung-Il Cho

Obesity (Silver Spring)

November 2013

Objective: Although genome-wide association studies (GWAS) have substantially contributed to understanding the genetic architecture, unidentified variants for complex traits remain an issue. One of the efficient approaches is the improvement of the power of GWAS scan by weighting P values with prior linkage signals. Our objective was to identify the novel candidates for obesity in Asian populations by using gene-mapping strategies that combine linkage and association analyses.

A family-based association study after genome-wide linkage analysis identified two genetic loci for renal function in a Mongolian population.

Hansoo Park, Hyun-Jin Kim, Seungbok Lee, Yun Joo Yoo, Young Seok Ju

Kidney Int

February 2013

The estimated glomerular filtration rate is a well-known measure of renal function and is widely used to follow the course of disease. Although there have been several investigations establishing the genetic background contributing to renal function, Asian populations have rarely been used in these genome-wide studies. Here, we aimed to find candidate genetic determinants of renal function in 1007 individuals from 73 extended families of Mongolian origin.

Mitochondrial targeting of the Arabidopsis F1-ATPase γ-subunit via multiple compensatory and synergistic presequence motifs.

Sumin Lee, Dong Wook Lee, Yun-Joo Yoo, Owen Duncan, Young Jun Oh

Plant Cell

December 2012

The majority of mitochondrial proteins are encoded in the nuclear genome and imported into mitochondria posttranslationally from the cytosol. An N-terminal presequence functions as the signal for the import of mitochondrial proteins. However, the functional information in the presequence remains elusive.

Comprehensive genomic analyses associate UGT8 variants with musical ability in a Mongolian population.

Hansoo Park, Seungbok Lee, Hyun-Jin Kim, Young Seok Ju, Jong-Yeon Shin, Yun Joo Yoo

J Med Genet

December 2012

Background: Musical abilities such as recognising music and singing performance serve as means for communication and are instruments in sexual selection. Specific regions of the brain have been found to be activated by musical stimuli, but these have rarely been extended to the discovery of genes and molecules associated with musical ability.

Methods: A total of 1008 individuals from 73 families were enrolled and a pitch-production accuracy test was applied to determine musical ability.

Real data examples in statistical methods papers: Tremendously valuable, and also tremendously misvalued.

K Y Williams, Yun Joo Yoo, Amit Patki, David B Allison

Stat Interface

January 2011

When a statistical methods paper is submitted to a journal for publication, examples in which the method is applied to real data are highly encouraged by many journals and in some cases are explicitly demanded. In this commentary, we argue that real data examples serve several useful purposes. However, we also argue that in many cases, particularly in the fields of genetics and genomics, there is an implicit or explicit expectation for examples to support purposes for which they are ill-suited and furthermore that these inappropriate expectations have negative consequences for the field.

Small heat shock protein Hsp17.8 functions as an AKR2A cofactor in the targeting of chloroplast outer membrane proteins in Arabidopsis.

Dae Heon Kim, Zheng-Yi Xu, Yun Jeong Na, Yun-Joo Yoo, Junho Lee

Plant Physiol

September 2011

Plastid proteins that are encoded by the nuclear genome and synthesized in the cytosol undergo posttranslational targeting to plastids. Ankyrin repeat protein 2A (AKR2A) and AKR2B were recently shown to be involved in the targeting of proteins to the plastid outer envelope. However, it remains unknown whether other factors are involved in this process.

The first Irish genome and ways of improving sequence accuracy.

Young Seok Ju, Yun Joo Yoo, Jong-Il Kim, Jeong-Sun Seo

Genome Biol

February 2011

Whole-genome sequencing of an Irish person reveals hundreds of thousands of novel genomic variants. Imputation using previously known information improves the accuracy of low-read-depth sequencing.

Discovery of common Asian copy number variants using integrated high-resolution array CGH and massively parallel DNA sequencing.

Hansoo Park, Jong-Il Kim, Young Seok Ju, Omer Gokcumen, Ryan E Mills, Yun Joo Yoo, Katayoon Darvishi

Nat Genet

May 2010

Copy number variants (CNVs) account for the majority of human genomic diversity in terms of base coverage. Here, we have developed and applied a new method to combine high-resolution array comparative genomic hybridization (CGH) data with whole-genome DNA sequencing data to obtain a comprehensive catalog of common CNVs in Asian individuals. The genomes of 30 individuals from three Asian populations (Korean, Chinese and Japanese) were interrogated with an ultra-high-resolution array CGH platform containing 24 million probes.

Transmission-ratio distortion in the Framingham Heart Study.

Andrew D Paterson, Daryl Waggott, Arne Schillert, Claire Infante-Rivard, Shelley B Bull, Yun Joo Yoo

BMC Proc

December 2009

Transmission-ratio distortion (TRD) is a phenomenon in which the segregation of alleles does not obey Mendel's laws. As a simple example, a recessive locus that results in fetal lethality will result in live-born individuals sharing more alleles at this locus than expected under Mendel's laws. This could result in apparent linkage of the phenotype of 'being alive' to such a chromosomal region.
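
One simple way to check for departure from Mendelian 50:50 transmission is a 1-df chi-square test on counts of an allele transmitted versus not transmitted by heterozygous parents. The sketch below shows that generic test only; it is not necessarily the analysis used in the paper, and the function name `transmission_ratio_test` and its count inputs are assumptions for this example.

```python
import math


def transmission_ratio_test(transmitted, untransmitted):
    """Chi-square test (1 df) of equal transmission.

    transmitted / untransmitted: number of times heterozygous parents
    did or did not pass a given allele to offspring (hypothetical inputs).
    Returns (chi2, p_value).
    """
    n = transmitted + untransmitted
    if n == 0:
        raise ValueError("at least one informative transmission is required")

    # Under Mendel's laws each count has expectation n / 2.
    chi2 = (transmitted - untransmitted) ** 2 / n

    # Survival function of a chi-square variable with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value
```

For instance, 60 transmissions against 40 non-transmissions gives a statistic of 4.0, which sits near the conventional 0.05 significance boundary.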

Region-based analysis in genome-wide association study of Framingham Heart Study blood lipid phenotypes.

Jennifer L Asimit, Yun Joo Yoo, Daryl Waggott, Lei Sun, Shelley B Bull

BMC Proc

December 2009

Due to the high-dimensionality of single-nucleotide polymorphism (SNP) data, region-based methods are an attractive approach to the identification of genetic variation associated with a certain phenotype. A common approach to defining regions is to identify the most significant SNPs from a single-SNP association analysis, and then use a gene database to obtain a list of genes proximal to the identified SNPs. Alternatively, regions may be defined statistically, via a scan statistic.

Genome-wide association analyses of North American Rheumatoid Arthritis Consortium and Framingham Heart Study data utilizing genome-wide linkage results.

Yun Joo Yoo, Dushanthi Pinnaduwage, Daryl Waggott, Shelley B Bull, Lei Sun

BMC Proc

December 2009

The power of genome-wide association studies can be improved by incorporating information from previous study findings, for example, results of genome-wide linkage analyses. Weighted false-discovery rate (FDR) control can incorporate genome-wide linkage scan results into the analysis of genome-wide association data by assigning single-nucleotide polymorphism (SNP) specific weights. Stratified FDR control can also be applied by stratifying the SNPs into high and low linkage strata.
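
The weighted FDR idea can be sketched as a weighted Benjamini-Hochberg procedure: each P value is divided by a SNP-specific weight (normalized to average 1) before the usual step-up rule is applied. This is a minimal sketch of the general technique, not the authors' code; how weights are derived from linkage results is not shown, and the function name `weighted_bh` and its parameters are assumptions for this example.

```python
def weighted_bh(p_values, weights, alpha=0.05):
    """Weighted Benjamini-Hochberg step-up procedure.

    p_values: per-SNP association P values.
    weights: non-negative per-SNP weights (e.g. from prior linkage
             evidence; hypothetical input), rescaled here to mean 1.
    Returns a list of booleans, True where the SNP is rejected.
    """
    m = len(p_values)
    if m == 0 or m != len(weights):
        raise ValueError("p_values and weights must be non-empty and equal length")
    mean_w = sum(weights) / m
    if mean_w <= 0:
        raise ValueError("weights must have a positive mean")

    norm_w = [w / mean_w for w in weights]
    # Weighted P values; a zero weight means the SNP can never be rejected.
    adj = [min(1.0, p / w) if w > 0 else 1.0 for p, w in zip(p_values, norm_w)]

    order = sorted(range(m), key=lambda i: adj[i])
    k_max = 0
    for rank, i in enumerate(order, start=1):
        if adj[i] <= alpha * rank / m:
            k_max = rank                      # largest rank passing the threshold

    rejected = [False] * m
    for i in order[:k_max]:
        rejected[i] = True
    return rejected
```

With equal weights this reduces to ordinary Benjamini-Hochberg; up-weighting a SNP in a linkage region lowers its effective P value at the cost of down-weighting SNPs elsewhere.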

Human leukocyte antigen class I genotypes in relation to heterosexual HIV type 1 transmission within discordant couples.

Jianming Tang, Wenshuo Shao, Yun Joo Yoo, Ilene Brill, Joseph Mulenga

J Immunol

August 2008

Differences in immune control of HIV-1 infection are often attributable to the highly variable HLA class I molecules that present viral epitopes to CTL. In our immunogenetic analyses of 429 HIV-1 discordant Zambian couples (infected index partners paired with cohabiting seronegative partners), several HLA class I variants in index partners were associated with contrasting rates and incidence of HIV-1 transmission within a 12-year study period. In particular, A*3601 on the A*36-Cw*04-B*53 haplotype was the most unfavorable marker of HIV-1 transmission by index partners, while Cw*1801 (primarily on the A*30-Cw*18-B*57 haplotype) was the most favorable, irrespective of the direction of transmission (male to female or female to male) and other commonly recognized cofactors of infection, including age and GUI.
