Publications by authors named "Ningzhong Shi"

This paper primarily analyzes the one-parameter generalized logistic (1PGlogit) model, which is a generalized model containing other one-parameter item response theory (IRT) models. The essence of the 1PGlogit model is the introduction of a generalized link function that includes the probit, logit, and complementary log-log functions. By transforming different parameters, the 1PGlogit model can flexibly adjust the speed at which the item characteristic curve (ICC) approaches the upper and lower asymptote, breaking the previous constraints in one-parameter IRT models where the ICC curves were either all symmetric or all asymmetric.

View Article and Find Full Text PDF

This study focuses on the measurement of mathematical ability in the Chinese Compulsory Education Qualification Monitoring (CCEQM) framework using bifactor theory. First, we propose a full-information item bifactor (FIBF) model for the measurement of mathematical ability. Second, the performance of the FIBF model is empirically studied using a data set from three representative provinces were selected from CCEQM 2015-2017.

View Article and Find Full Text PDF

In the estimation of item response models, the normality of latent traits is frequently assumed. However, this assumption may be untenable in real testing. In contrast to the conventional three-parameter normal ogive (3PNO) model, a 3PNO model incorporating Ramsay-curve item response theory (RC-IRT), denoted as the RC-3PNO model, allows for flexible latent trait distributions.

View Article and Find Full Text PDF

In cognitive diagnostic assessments with time limits, not-reached items (i.e., continuous nonresponses at the end of tests) frequently occur because examinees drop out of the test due to insufficient time.

View Article and Find Full Text PDF

In high-stakes, large-scale, standardized tests with certain time limits, examinees are likely to engage in either one of the three types of behavior (e.g., van der Linden & Guo, 2008; Wang & Xu, 2015): solution behavior, rapid guessing behavior, and cheating behavior.

View Article and Find Full Text PDF

The four-parameter logistic (4PL) item response model, which includes an upper asymptote for the correct response probability, has drawn increasing interest due to its suitability for many practical scenarios. This paper proposes a new Gibbs sampling algorithm for estimation of the multidimensional 4PL model based on an efficient data augmentation scheme (DAGS). With the introduction of three continuous latent variables, the full conditional distributions are tractable, allowing easy implementation of a Gibbs sampler.

View Article and Find Full Text PDF

Cognitive diagnosis models (CDMs) are useful statistical tools to provide rich information relevant for intervention and learning. As a popular approach to estimate and make inference of CDMs, the Markov chain Monte Carlo (MCMC) algorithm is widely used in practice. However, when the number of attributes, , is large, the existing MCMC algorithm may become time-consuming, due to the fact that calculations are usually needed in the process of MCMC sampling to get the conditional distribution for each attribute profile.

View Article and Find Full Text PDF

In this paper, the slice-within-Gibbs sampler has been introduced as a method for estimating cognitive diagnosis models (CDMs). Compared with other Bayesian methods, the slice-within-Gibbs sampler can employ a wide-range of prior specifications; moreover, it can also be applied to complex CDMs with the aid of auxiliary variables, especially when applying different identifiability constraints. To evaluate its performances, two simulation studies were conducted.

View Article and Find Full Text PDF

Cognitive diagnostic computerized adaptive testing (CD-CAT) aims to obtain more useful diagnostic information by taking advantages of computerized adaptive testing (CAT). Cognitive diagnosis models (CDMs) have been developed to classify examinees into the correct proficiency classes so as to get more efficient remediation, whereas CAT tailors optimal items to the examinee's mastery profile. The item selection method is the key factor of the CD-CAT procedure.

View Article and Find Full Text PDF

Within the framework of item response theory, a new and flexible general three-parameter logistic model with response time (G3PLT) is proposed. The advantage of this model is that it can combine time effect, ability, and item difficulty to influence the correct-response probability. In contrast to the traditional response time models used in educational psychology, the new model incorporates the influence of the time effect on the correct-response probability directly, rather than linking them through a hierarchical method via latent and speed parameters as in van der Linden's model.

View Article and Find Full Text PDF

A new strategy based on functional data analysis (FDA) techniques is proposed to extract the lateralized readiness potential (LRP), which treats electroencephalographic data as functional data. This FDA-based method combines longitudinal information from each trial (time series data) with cross-sectional information from all trials at a fixed time point (cross-sectional data). The comparison results show that the FDA-based LRP is closer to the assumed true LRP and is more robust against a reduction in the number of trials than the traditional average-based LRP.

View Article and Find Full Text PDF

Semiparametric methods for longitudinal data with association within subjects have recently received considerable attention. However, existing methods for semiparametric longitudinal binary regression modeling (i) mainly concern mean structures with association parameters treated as nuisance; (ii) generally require a correct specification of the covariance structure for misspecified covariance structure may lead to inefficient mean parameter estimates; and (iii) usually run into computation and estimation problems when the time points are irregularly and possibly subject specific. In this article, we propose a semiparametric logistic regression model, which simultaneously takes into account both the mean and response-association structures (via conditional log-odds ratio) for multivariate longitudinal binary outcomes.

View Article and Find Full Text PDF

Multidimensional item response theory (MIRT) models can be applied to longitudinal educational surveys where a group of individuals are administered different tests over time with some common items. However, computational problems typically arise as the dimension of the latent variables increases. This is especially true when the latent variable distribution cannot be integrated out analytically, as with MIRT models for binary data.

View Article and Find Full Text PDF

For nonnegative measurements such as income or sick days, zero counts often have special status. Furthermore, the incidence of zero counts is often greater than expected for the Poisson model. This article considers a doubly semiparametric zero-inflated Poisson model to fit data of this type, which assumes two partially linear link functions in both the mean of the Poisson component and the probability of zero.

View Article and Find Full Text PDF

We use the functional principal component analysis (FPCA) to model and predict the weight growth in children. In particular, we examine how the approach can help discern growth patterns of underweight children relative to their normal counterparts, and whether a commonly used transformation to normality plays any constructive roles in a predictive model based on the FPCA. Our work supplements the conditional growth charts developed by Wei and He (2006) by constructing a predictive growth model based on a small number of principal components scores on individual's past.

View Article and Find Full Text PDF

Background: Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time.

View Article and Find Full Text PDF

Background: The goal of linkage analysis is to determine the chromosomal location of the gene(s) for a trait of interest such as a common disease. Three-locus linkage analysis is an important case of multi-locus problems. Solutions can be found analytically for the case of triple backcross mating.

View Article and Find Full Text PDF

Admixture linkage disequilibrium (ALD), a phenomenon created by gene flow between genetically distinct populations, has for some time been used as a tool in gene mapping. It is therefore important to analyze the pattern of ALD over generations. In this study we explore two models of admixture: the gradual admixture (GA) model, in which admixture occurs at a variable rate in every generation; and the immediate admixture (IA) model, a special case of the GA model, in which admixture occurs in a single generation.

View Article and Find Full Text PDF

In this article a new non-model-based significance test for detecting dose-response relationship with the incorporation of historical control data is proposed. This non-model-based test is considered simpler from a regulatory perspective because it does not require validating any modeling assumptions. Moreover, our test is especially appropriate to those studies in which the intravenous doses for the investigational chemical are labeled as, e.

View Article and Find Full Text PDF