A method is proposed for generating application domain agnostic data for training and evaluating machine learning systems. The proposed method randomly generates an expert system network based upon user specified parameters. This expert system serves as a generic model of an unspecified phenomena. The expert system is run to determine the ideal output from a set of random inputs. These inputs and ideal output are used for training and testing a machine learning system. This allows a machine learning technology to be developed and tested without requiring compatible test data to be collected or before data collection as a proof-of-concept validation of system operations. It also allows systems to be tested without data error noise or with known levels of noise and with other perturbations, to facilitate analysis. It may also facilitate testing system security, adversarial attacks and conducting other types of research into machine learning systems. •Provides an application domain agnostic way to test machine learning technologies and facilitates the generalization of results.•Allows technologies to be tested with data with different characteristics without having to locate datasets that have these characteristics.•Utilizes randomly generated network to represent non-specific phenomena which can be used for training and testing machine learning techniques.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8374678PMC
http://dx.doi.org/10.1016/j.mex.2021.101477DOI Listing

Publication Analysis

Top Keywords

machine learning
28
expert system
16
application domain
8
domain agnostic
8
learning systems
8
ideal output
8
training testing
8
testing machine
8
tested data
8
machine
7

Similar Publications

Enhanced brain tumor detection and segmentation using densely connected convolutional networks with stacking ensemble learning.

Comput Biol Med

January 2025

Emerging Technologies Research Lab (ETRL), College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia; Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia. Electronic address:

- Brain tumors (BT), both benign and malignant, pose a substantial impact on human health and need precise and early detection for successful treatment. Analysing magnetic resonance imaging (MRI) image is a common method for BT diagnosis and segmentation, yet misdiagnoses yield effective medical responses, impacting patient survival rates. Recent technological advancements have popularized deep learning-based medical image analysis, leveraging transfer learning to reuse pre-trained models for various applications.

View Article and Find Full Text PDF

Objective: We aimed to develop a highly interpretable and effective, machine-learning based risk prediction algorithm to predict in-hospital mortality, intubation and adverse cardiovascular events in patients hospitalised with COVID-19 in Australia (AUS-COVID Score).

Materials And Methods: This prospective study across 21 hospitals included 1714 consecutive patients aged ≥ 18 in their index hospitalization with COVID-19. The dataset was separated into training (80%) and test sets (20%).

View Article and Find Full Text PDF

MetAssimulo 2.0: a web app for simulating realistic 1D & 2D Metabolomic 1H NMR spectra.

Bioinformatics

January 2025

Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, London, W12 0NN, United Kingdom.

Unlabelled: Metabolomics extensively utilizes Nuclear Magnetic Resonance (NMR) spectroscopy due to its excellent reproducibility and high throughput. Both one-dimensional (1D) and two-dimensional (2D) NMR spectra provide crucial information for metabolite annotation and quantification, yet present complex overlapping patterns which may require sophisticated machine learning algorithms to decipher. Unfortunately, the limited availability of labeled spectra can hamper application of machine learning, especially deep learning algorithms which require large amounts of labelled data.

View Article and Find Full Text PDF

Objective: The objective of this research was to devise and authenticate a predictive model that employs CT radiomics and deep learning methodologies for the accurate prediction of synchronous distant metastasis (SDM) in clear cell renal cell carcinoma (ccRCC).

Methods: A total of 143 ccRCC patients were included in the training cohort, and 62 ccRCC patients were included in the validation cohort. The CT images from all patients were normalized, and the tumor regions were manually segmented via ITK-SNAP software.

View Article and Find Full Text PDF

A multiscale molecular structural neural network for molecular property prediction.

Mol Divers

January 2025

Key Laboratory for Macromolecular Science of Shaanxi Province, School of Chemistry and Chemical Engineering, Shaanxi Normal University, Xi'an, 710119, People's Republic of China.

Molecular Property Prediction (MPP) is a fundamental task in important research fields such as chemistry, materials, biology, and medicine, where traditional computational chemistry methods based on quantum mechanics often consume substantial time and computing power. In recent years, machine learning has been increasingly used in computational chemistry, in which graph neural networks have shown good performance in molecular property prediction tasks, but they have some limitations in terms of generalizability, interpretability, and certainty. In order to address the above challenges, a Multiscale Molecular Structural Neural Network (MMSNet) is proposed in this paper, which obtains rich multiscale molecular representations through the information fusion between bonded and non-bonded "message passing" structures at the atomic scale and spatial feature information "encoder-decoder" structures at the molecular scale; a multi-level attention mechanism is introduced on the basis of theoretical analysis of molecular mechanics in order to enhance the model's interpretability; the prediction results of MMSNet are used as label values and clustered in the molecular library by the K-NN (K-Nearest Neighbors) algorithm to reverse match the spatial structure of the molecules, and the certainty of the model is quantified by comparing virtual screening results across different K-values.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!