Multilabel classification (MLC) is a machine learning task in which the goal is to learn to assign multiple labels to an example simultaneously. It is receiving growing interest from the machine learning community, as evidenced by the increasing number of papers and methods appearing in the literature. Hence, ensuring proper, correct, robust, and trustworthy benchmarking is of utmost importance for the further development of the field. We believe that this can be achieved by adhering to recently emerged data management standards, such as the FAIR (Findable, Accessible, Interoperable, and Reusable) and TRUST (Transparency, Responsibility, User focus, Sustainability, and Technology) principles. Following these principles, we introduce an ontology-based online catalogue of MLC datasets originating from various application domains. The catalogue describes many MLC datasets in detail, with comprehensible meta-features, MLC-specific semantic descriptions, and data provenance information. The MLC data catalogue is available at: http://semantichub.ijs.si/MLCdatasets
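
To make the notion of an MLC dataset and its meta-features more concrete, the following minimal Python sketch computes three descriptors that are commonly used in the MLC literature: label cardinality, label density, and the number of distinct label sets. The abstract does not list which meta-features the catalogue actually provides, so the choice of descriptors, the function names, and the toy label matrix are assumptions made purely for illustration.

```python
from typing import List

# A multilabel target is represented here as a binary indicator matrix:
# one row per example, one column per label (1 = label is relevant).
LabelMatrix = List[List[int]]


def label_cardinality(Y: LabelMatrix) -> float:
    """Average number of relevant labels per example."""
    return sum(sum(row) for row in Y) / len(Y)


def label_density(Y: LabelMatrix) -> float:
    """Label cardinality divided by the total number of labels."""
    return label_cardinality(Y) / len(Y[0])


def distinct_label_sets(Y: LabelMatrix) -> int:
    """Number of unique label combinations that occur in the data."""
    return len({tuple(row) for row in Y})


# Hypothetical toy dataset: 4 examples, 4 possible labels.
Y_toy: LabelMatrix = [
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [1, 1, 1, 0],
    [1, 0, 1, 0],
]

if __name__ == "__main__":
    print("cardinality:", label_cardinality(Y_toy))
    print("density:", label_density(Y_toy))
    print("distinct label sets:", distinct_label_sets(Y_toy))
```

Descriptors of this kind let datasets of very different sizes and domains be compared on a common footing, which is one reason meta-features are useful in a benchmarking catalogue.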

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9068705
DOI: http://dx.doi.org/10.1038/s41598-022-11316-3
