Background: The use of machine learning to estimate exposure effects introduces a dependence between the results of an empirical study and the value of the seed used to fix the pseudo-random number generator.

Methods: We used data from 10,038 pregnant women and a 10% subsample (N = 1004) to examine the extent to which the risk difference for the relation between fruit and vegetable consumption and preeclampsia risk changes under different seed values. We fit an augmented inverse probability weighted estimator with two Super Learner algorithms: a simple algorithm including random forests and single-layer neural networks and a more complex algorithm with a mix of tree-based, regression-based, penalized, and simple algorithms. We evaluated the distributions of risk differences, standard errors, and P values that result from 5000 different seed value selections.

Results: Our findings suggest important variability in the risk difference estimates, as well as an important effect of the stacking algorithm used. The interquartile range width of the risk differences in the full sample with the simple algorithm was 13 per 1000. However, all other interquartile ranges were roughly an order of magnitude lower. The medians of the distributions of risk differences differed according to the sample size and the algorithm used.

Conclusions: Our findings add another dimension of concern regarding the potential for "p-hacking," and further warrant the need to move away from simplistic evidentiary thresholds in empirical research. When empirical results depend on pseudo-random number generator seed values, caution is warranted in interpreting these results.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560583PMC
http://dx.doi.org/10.1097/EDE.0000000000001785DOI Listing

Publication Analysis

Top Keywords

pseudo-random number
12
risk differences
12
number generator
8
machine learning
8
risk difference
8
seed values
8
simple algorithm
8
distributions risk
8
risk
6
algorithm
5

Similar Publications

Memristors are commonly used to introduce various chaotic systems and can be used to enhance their chaotic characteristics. However, due to the strict construction conditions of Hamiltonian systems, there has been limited research on the development of memristive Hamiltonian conservative chaotic systems (MHCCSs). In this work, a method for constructing three-terminal memristors is proposed, and the three-terminal memristors are incorporated into the Hamiltonian system, resulting in the development of a class of n-D MHCCS.

View Article and Find Full Text PDF

The use of self-adaptive principal components in PCA-based denoising.

J Magn Reson

December 2024

Department of Low-Temperature Physics, Faculty of Mathematics and Physics, Charles University, V Holešovičkách 747/2, 180 00 Prague 8, Czech Republic.

PCA-based denoising usually implies either discarding a number of high-index principal components (PCs) of a data matrix or their attenuation according to a regularization model. This work introduces an alternative, model-free, approach to high-index PC attenuation that seeks to average values of PC vectors as if they were expected from noise perturbation of data. According to the perturbation theory, the average PCs are attenuated versions of the clean PCs of noiseless data - the higher the noise-related content in a PC vector, the lower is its average's norm.

View Article and Find Full Text PDF

Compressed sensing (CS) is an innovative signal acquisition and reconstruction technology that has broken through the limit of the Nyquist sampling theory. It is widely employed to optimize the measurement processes in various applications. One of the core challenges of CS is the construction of a measurement matrix.

View Article and Find Full Text PDF

Pseudo random bit generator in QCA for high speed communications.

Heliyon

November 2024

Department of Electrical Engineering, Faculty of Engineering and Technology, University of Mazandaran, Babolsar, Iran.

Quantum-dot cellular automata (QCA) technology is another new technology in the field of nanoelectronics and quantum. With the help of this technology, basic to advanced digital circuits can be designed and implemented with low energy consumption and small area. Therefore, in this paper, a new design for a four- and eight-bit Linear feedback shift register (LFSR) or random counter, as well as a rising-edge-sensitive D flip-flop with the least possible number of cells and the smallest area, was presented.

View Article and Find Full Text PDF

This paper introduces an enhancement to the Chen chaotic system by incorporating a constant perturbation term to one of the state variables, aiming to improve the performance of pseudo-random number generators (PRNGs). The perturbation significantly enhances the system's chaotic properties, resulting in superior randomness and increased security. An FPGA-based realization of a perturbed Chen oscillator (PCO)-derived PRNG is presented, tailored for embedded cryptosystems and implemented on a Nexys 4 FPGA card featuring the XILINX Artix-7 XC7A100T-1CSG324C integrated chip.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!