Publications by Kosuke Imai | LitMetric

Publications by authors named "Kosuke Imai"

Estimating Average Treatment Effects With Support Vector Machines.

Alexander Tarr Kosuke Imai

Stat Med

February 2025

Support vector machine (SVM) is one of the most popular classification algorithms in the machine learning literature. We demonstrate that SVM can be used to balance covariates and estimate average causal effects under the unconfoundedness assumption. Specifically, we adapt the SVM classifier as a kernel-based weighting procedure that minimizes the maximum mean discrepancy between the treatment and control groups while simultaneously maximizing effective sample size.

A summer bridge program for first-generation low-income students stretches academic ambitions with no adverse impacts on first-year GPA.

Rebecca A Johnson Tyler Simko Kosuke Imai

Proc Natl Acad Sci U S A

December 2024

Article Synopsis

A significant amount of research highlights the unique challenges that first-generation, low-income (FGLI) students face as "hidden minorities" in elite college environments.
Existing studies indicate that brief psychological interventions can help address some of these challenges, leading universities to invest in more comprehensive programs aimed at both changing mindsets and reducing structural disadvantages in academic preparation for FGLI students.
A randomized trial of a summer bridge program showed positive outcomes, including increased enrollment in nonintroductory courses and a shift toward taking classes for a grade, demonstrating the program's effectiveness in integrating FGLI students into selective academic communities, despite no significant changes in first-year GPAs or withdrawal rates.

Evaluating bias and noise induced by the U.S. Census Bureau's privacy protection methods.

Christopher T Kenny Cory McCartan Shiro Kuriwaki Tyler Simko Kosuke Imai

Sci Adv

May 2024

The U.S. Census Bureau faces a difficult trade-off between the accuracy of Census statistics and the protection of individual information.

Census officials must constructively engage with independent evaluations.

Christopher T Kenny Cory McCartan Tyler Simko Kosuke Imai

Proc Natl Acad Sci U S A

March 2024

Pd-Catalyzed Stereoselective Construction of Benzo-Fused Decalines with a Quaternary Carbon.

Hideo Setsumasa Kosuke Imai Ikumi Kobayashi Masahisa Nakada

Org Lett

November 2023

The Pd-catalyzed stereoselective construction of decalins with one-carbon units bearing heteroatoms at the ring junction is described. The Pd-catalyzed cyclization of silyl enol ether resulted in exclusive formation of the isomer (89%, >100/1 /). On the contrary, Pd-catalyzed carboiodination and carboborylation (with oxidative workup) provided products in 56% yield (1/>100 /) and 69% yield (1/11 /), respectively.

Widespread partisan gerrymandering mostly cancels nationally, but reduces electoral competition.

Christopher T Kenny Cory McCartan Tyler Simko Shiro Kuriwaki Kosuke Imai

Proc Natl Acad Sci U S A

June 2023

Congressional district lines in many US states are drawn by partisan actors, raising concerns about gerrymandering. To separate the partisan effects of redistricting from the effects of other factors including geography and redistricting rules, we compare possible party compositions of the US House under the enacted plan to those under a set of alternative simulated plans that serve as a nonpartisan baseline. We find that partisan gerrymandering is widespread in the 2020 redistricting cycle, but most of the electoral bias it creates cancels at the national level, giving Republicans two additional seats on average.

Researchers need better access to US Census data.

Cory McCartan Tyler Simko Kosuke Imai

Science

June 2023

Race and ethnicity data for first, middle, and surnames.

Evan T R Rosenman Santiago Olivella Kosuke Imai

Sci Data

May 2023

We provide the largest compiled publicly available dictionaries of first, middle, and surnames for the purpose of imputing race and ethnicity using, for example, Bayesian Improved Surname Geocoding (BISG). The dictionaries are based on the voter files of six U.S.

14th Annual University of Pennsylvania Conference on statistical issues in clinical trials/subgroup analysis in clinical trials: Opportunities and challenges (afternoon panel discussion).

Kosuke Imai Michael Rosenblum Mark Rothmann

Clin Trials

August 2023

Addressing census data problems in race imputation via fully Bayesian Improved Surname Geocoding and name supplements.

Kosuke Imai Santiago Olivella Evan T R Rosenman

Sci Adv

December 2022

Prediction of individuals' race and ethnicity plays an important role in studies of racial disparity. Bayesian Improved Surname Geocoding (BISG), which relies on detailed census information, has emerged as a leading methodology for this prediction task. Unfortunately, BISG suffers from two data problems.
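
As a rough illustration of the standard BISG calculation the abstract refers to, the sketch below combines a surname-based prior with geographic information through Bayes' rule, assuming surname and location are conditionally independent given race. The function name and all probabilities are hypothetical, chosen only to make the arithmetic visible; this is not the fully Bayesian extension the article proposes.

```python
def bisg_posterior(p_race_given_surname, p_geo_given_race):
    """Standard BISG update: P(race | surname, geo) is proportional to
    P(race | surname) * P(geo | race), assuming surname and geography
    are conditionally independent given race."""
    unnormalized = {
        race: p_race_given_surname[race] * p_geo_given_race[race]
        for race in p_race_given_surname
    }
    total = sum(unnormalized.values())
    return {race: value / total for race, value in unnormalized.items()}


# Hypothetical inputs (not taken from any census table or name dictionary).
p_race_given_surname = {"white": 0.60, "black": 0.30, "hispanic": 0.10}
p_geo_given_race = {"white": 0.001, "black": 0.004, "hispanic": 0.002}

posterior = bisg_posterior(p_race_given_surname, p_geo_given_race)
for race, prob in posterior.items():
    print(f"{race}: {prob:.3f}")
```

In this toy case the geographic term shifts most of the probability mass from the surname-favored category to the one that is relatively more common in the location.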

Simulated redistricting plans for the analysis and evaluation of redistricting in the United States.

Cory McCartan Christopher T Kenny Tyler Simko George Garcia Kevin Wang Kosuke Imai

Sci Data

November 2022

This article introduces the 50STATESIMULATIONS, a collection of simulated congressional districting plans and underlying code developed by the Algorithm-Assisted Redistricting Methodology (ALARM) Project. The 50STATESIMULATIONS allow for the evaluation of enacted and other congressional redistricting plans in the United States. While the use of redistricting simulation algorithms has become standard in academic research and court cases, any simulation analysis requires non-trivial efforts to combine multiple data sets, identify state-specific redistricting criteria, implement complex simulation algorithms, and summarize and visualize simulation outputs.

Statistical inference and power analysis for direct and spillover effects in two-stage randomized experiments.

Zhichao Jiang Kosuke Imai Anup Malani

Biometrics

September 2023

Two-stage randomized experiments have become an increasingly popular experimental design for causal inference when the outcome of one unit may be affected by the treatment assignments of other units in the same cluster. In this paper, we provide a methodological framework for general tools of statistical inference and power analysis for two-stage randomized experiments. Under the randomization-based framework, we consider the estimation of a new direct effect of interest as well as the average direct and spillover effects studied in the literature.

The use of differential privacy for census data and its impact on redistricting: The case of the 2020 U.S. Census.

Christopher T Kenny Shiro Kuriwaki Cory McCartan Evan T R Rosenman Tyler Simko Kosuke Imai

Sci Adv

October 2021

Census statistics play a key role in public policy decisions and social science research. However, given the risk of revealing individual information, many statistical agencies are considering disclosure control methods based on differential privacy, which add noise to tabulated data. Unlike other applications of differential privacy, however, census statistics must be postprocessed after noise injection to be usable.

Propensity score-based methods for causal inference in observational studies with non-binary treatments.

Shandong Zhao David A van Dyk Kosuke Imai

Stat Methods Med Res

March 2020

A sensitivity analysis for missing outcomes due to truncation by death under the matched-pairs design.

Kosuke Imai Zhichao Jiang

Stat Med

September 2018

The matched-pairs design enables researchers to efficiently infer causal effects from randomized experiments. In this paper, we exploit the key feature of the matched-pairs design and develop a sensitivity analysis for missing outcomes due to truncation by death, in which the outcomes of interest (e.g.

Redefine statistical significance.

Daniel J Benjamin James O Berger Magnus Johannesson Brian A Nosek E-J Wagenmakers Kosuke Imai

Nat Hum Behav

January 2018

Comment on Pearl: Practical implications of theoretical results for causal mediation analysis.

Kosuke Imai Luke Keele Dustin Tingley Teppei Yamamoto

Psychol Methods

December 2014

Mediation analysis has been extensively applied in psychological and other social science research. A number of methodologists have recently developed a formal theoretical framework for mediation analysis from a modern causal inference perspective. In Imai, Keele, and Tingley (2010), we have offered such an approach to causal mediation analysis that formalizes identification, estimation, and sensitivity analysis in a single framework.

Wiskott-Aldrich syndrome presenting with a clinical picture mimicking juvenile myelomonocytic leukaemia.

Ayami Yoshimi Yoshiro Kamachi Kosuke Imai Nobuhiro Watanabe Hisaya Nakadate

Pediatr Blood Cancer

May 2013

Background: Wiskott-Aldrich syndrome (WAS) is a rare X-linked immunodeficiency caused by defects of the WAS protein (WASP) gene. Patients with WAS typically demonstrate micro-thrombocytopenia.

Procedures: The report describes seven male infants with WAS that initially presented with leukocytosis, monocytosis, and myeloid and erythroid precursors in the peripheral blood (PB) and dysplasia in the bone marrow (BM), which was initially indistinguishable from juvenile myelomonocytic leukaemia (JMML).

GATA-2 anomaly and clinical phenotype of a sporadic case of lymphedema, dendritic cell, monocyte, B- and NK-cell (DCML) deficiency, and myelodysplasia.

Hiroyuki Ishida Kosuke Imai Kenichi Honma Shin-Ichi Tamura Toshihiko Imamura

Eur J Pediatr

August 2012

A Japanese patient presented with lymphedema, severe Varicella zoster, and Salmonella infection, recurrent respiratory infections, panniculitis, monocytopenia, B- and NK-cell lymphopenia, and myelodysplasia. The phenotype was a mixture of the monocytopenia and mycobacterial infection (MonoMAC) and Emberger syndromes. Sequencing of the GATA-2 cDNA revealed the heterozygous missense mutation 1187 G > A.

Using Potential Outcomes to Understand Causal Mediation Analysis: Comment on.

Kosuke Imai Booil Jo Elizabeth A Stuart

Multivariate Behav Res

September 2011

In this commentary, we demonstrate how the potential outcomes framework can help understand the key identification assumptions underlying causal mediation analysis. We show that this framework can lead to the development of alternative research design and statistical analysis strategies applicable to the longitudinal data settings considered by Maxwell, Cole, and Mitchell (2011).

A general approach to causal mediation analysis.

Kosuke Imai Luke Keele Dustin Tingley

Psychol Methods

December 2010

Traditionally in the social sciences, causal mediation analysis has been formulated, understood, and implemented within the framework of linear structural equation models. We argue and demonstrate that this is problematic for 3 reasons: the lack of a general definition of causal mediation effects independent of a particular statistical model, the inability to specify the key identification assumption, and the difficulty of extending the framework to nonlinear models. In this article, we propose an alternative approach that overcomes these limitations.

Public policy for the poor? A randomised assessment of the Mexican universal health insurance programme.

Gary King Emmanuela Gakidou Kosuke Imai Jason Lakin Ryan T Moore

Lancet

April 2009

Background: We assessed aspects of Seguro Popular, a programme aimed to deliver health insurance, regular and preventive medical care, medicines, and health facilities to 50 million uninsured Mexicans.

Methods: We randomly assigned treatment within 74 matched pairs of health clusters-ie, health facility catchment areas-representing 118 569 households in seven Mexican states, and measured outcomes in a 2005 baseline survey (August, 2005, to September, 2005) and follow-up survey 10 months later (July, 2006, to August, 2006) in 50 pairs (n=32 515). The treatment consisted of encouragement to enrol in a health-insurance programme and upgraded medical facilities.

Variance identification and efficiency analysis in randomized experiments under the matched-pair design.

Kosuke Imai

Stat Med

October 2008

In his 1923 landmark article, Neyman introduced randomization-based inference to estimate average treatment effects from experiments under the completely randomized design. Under this framework, Neyman considered the statistical estimation of the sample average treatment effect and derived the variance of the standard estimator using the treatment assignment mechanism as the sole basis of inference. In this paper, I extend Neyman's analysis to randomized experiments under the matched-pair design where experimental units are paired based on their pre-treatment characteristics and the randomization of treatment is subsequently conducted within each matched pair.
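
To make the matched-pair setting concrete, here is a minimal sketch of the textbook difference-in-means estimator and the commonly used variance estimator based on within-pair differences. The data and the function name are hypothetical, and this is the conventional estimator rather than necessarily the one the article derives or recommends.

```python
from statistics import mean, variance


def matched_pair_estimate(pairs):
    """Estimate the average treatment effect from matched pairs.

    `pairs` is a list of (treated_outcome, control_outcome) tuples,
    one tuple per matched pair.
    """
    diffs = [treated - control for treated, control in pairs]
    n_pairs = len(diffs)
    estimate = mean(diffs)
    # Conventional variance estimator: sample variance of the
    # within-pair differences (n - 1 divisor) divided by the number of pairs.
    var_hat = variance(diffs) / n_pairs
    return estimate, var_hat


# Hypothetical outcomes for four matched pairs.
pairs = [(5, 3), (7, 6), (4, 4), (9, 5)]
estimate, var_hat = matched_pair_estimate(pairs)
print(f"estimate = {estimate:.3f}, variance = {var_hat:.4f}")
```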

On the Estimation of Disability-Free Life Expectancy: Sullivan's Method and Its Extension.

Kosuke Imai Samir Soneji

J Am Stat Assoc

January 2007

A rapidly aging population, such as the United States today, is characterized by the increased prevalence of chronic impairment. Robust estimation of disability-free life expectancy (DFLE), or healthy life expectancy, is essential for examining whether additional years of life are spent in good health and whether life expectancy is increasing faster than the decline of disability rates. Over 30 years since its publication, Sullivan's method remains the most widely used method to estimate DFLE.
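
For readers unfamiliar with Sullivan's method, the following sketch shows the basic calculation in its usual textbook form: person-years lived in each age interval are weighted by the share of people free of disability, summed, and divided by the number of survivors at the starting age. All numbers and names are hypothetical, and the extension the article develops is not shown.

```python
def sullivan_dfle(survivors_at_start, person_years, disability_prevalence):
    """Disability-free life expectancy at the starting age.

    survivors_at_start: life-table survivors at the starting age (l_x).
    person_years: person-years lived in each age interval from that age on (L_a).
    disability_prevalence: proportion disabled in each interval (pi_a).
    """
    disability_free_years = sum(
        (1.0 - prevalence) * years
        for years, prevalence in zip(person_years, disability_prevalence)
    )
    return disability_free_years / survivors_at_start


# Hypothetical abridged life table with three age intervals.
survivors = 100_000
person_years = [480_000, 400_000, 200_000]
prevalence = [0.10, 0.25, 0.50]

dfle = sullivan_dfle(survivors, person_years, prevalence)
life_expectancy = sum(person_years) / survivors
print(f"life expectancy = {life_expectancy:.2f}, DFLE = {dfle:.2f}")
```

Comparing the two printed figures shows how many of the expected remaining years are spent free of disability under these assumed inputs.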
