Testing and Iterative Improvement of the CEN ISO/TS 82304-2 Health App Quality Assessment: Pilot Interrater Reliability Study.

Anna-Lena Frey Diana Matei Ben Phillips Adam McCabe Rachel Fuller Begoña Laibarra Laura Alonso Victor de la Hoz Carme Pratdepadua Bufill Berta Llebot Casajuana Giuseppe D'Avenio Pier Angelo Sottile Laura Melania Rocchi Matteo Errera Yasmine Laaissaoui Michael Cardinal Menno Kok Petra Hoogendoorn

JMIR Form Res

National eHealth Living Lab, Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands.

Published: March 2025

Background: With the increasing use of health apps and ongoing concerns regarding their safety, effectiveness, and data privacy, numerous health app quality assessment frameworks have emerged. However, assessment initiatives experience difficulties scaling, and there is currently no comprehensive, consistent, internationally recognized assessment framework. Therefore, health apps often need to undergo several quality evaluations to enter different markets, leading to duplication of work. The CEN ISO/TS 82304‑2 health app assessment seeks to address this issue, aiming to provide an internationally accepted quality evaluation through a network of assessment organizations located in different countries.

Objective: This study aimed to develop and evolve the draft CEN ISO/TS 82304-2 assessment handbook and developer guidance by testing them across organizations in several countries.

Methods: Assessment organizations from 5 countries were engaged to evaluate 24 health apps using the evolving CEN ISO/TS 82304-2 assessment across 3 evaluation rounds. The information submitted by a given health app developer was evaluated by 2 assessment organizations, and interrater reliability was examined. In addition, app developers and assessors were asked to report how much time they spent on information collation or evaluation and to rate the clarity of the developer guidance or assessor handbook, respectively. The collected data were used to iteratively improve the handbook and guidance between rounds.

Results: The interrater reliability between assessment organizations improved from round 1 to round 2 and stayed relatively stable between rounds 2 and 3, with 80% (55/69) of assessment questions demonstrating moderate or better (Gwet AC1>0.41) agreement in round 3. The median time required by developers to prepare the assessment information was 8 hours and 59 minutes (IQR 5.7-27.1 hours) in round 3, whereas assessors reported a median evaluation time of 8 hours and 46 minutes (IQR 7.1-11.0 hours). The draft guidance and handbook were generally perceived as clear, with a median round-3 clarity rating of 1.73 (IQR 1.64-1.90) for developers and 1.78 (IQR 1.71-1.89) for assessors (0="very unclear", 1="somewhat unclear", and 2="completely clear").

Conclusions: To our knowledge, this is the first study to examine the consistency of health app evaluations across organizations located in different countries. Given that the CEN ISO/TS 82304-2 guidance and handbook are still under development, the interrater reliability findings observed at this early stage are promising, and this study provided valuable information for further refinement of the assessment. This study marks an important first step toward establishing the CEN ISO/TS 82304-2 assessment as a consistent, cross-national health app evaluation. It is envisioned that the assessment will ultimately help avoid duplication of work, prevent inequities by facilitating access to smaller markets for developers, and build trust among users, thereby increasing the adoption of high-quality health apps.

Download full-text PDF	Source
http://dx.doi.org/10.2196/64565	DOI Listing

Publication Analysis

Top Keywords

cen iso/ts

health app

iso/ts 82304-2

assessment

interrater reliability

health apps

assessment organizations

82304-2 assessment

health

app quality

Similar Publications

Testing and Iterative Improvement of the CEN ISO/TS 82304-2 Health App Quality Assessment: Pilot Interrater Reliability Study.

JMIR Form Res

March 2025

National eHealth Living Lab, Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands.

Anna-Lena Frey Diana Matei Ben Phillips Adam McCabe Rachel Fuller

View Article and Find Full Text PDF

Similar Publications

Standardized assessment of evidence supporting the adoption of mobile health solutions: A Clinical Consensus Statement of the ESC Regulatory Affairs Committee: Developed in collaboration with the European Heart Rhythm Association (EHRA), the Association of Cardiovascular Nursing & Allied Professions (ACNAP) of the ESC, the Heart Failure Association (HFA) of the ESC, the ESC Young Community, the ESC Working Group on e-Cardiology, the ESC Council for Cardiology Practice, the ESC Council of Cardio-Oncology, the ESC Council on Hypertension, the ESC Patient Forum, the ESC Digital Health Committee, and the European Association of Preventive Cardiology (EAPC).

Eur Heart J Digit Health

September 2024

Center for Postgraduate Medical Education, Marymoncka, Warsaw, Poland.

Enrico G Caiani Hareld Kemps Petra Hoogendoorn Riccardo Asteggiano Allan Böhm

Mobile health (mHealth) solutions have the potential to improve self-management and clinical care. For successful integration into routine clinical practice, healthcare professionals (HCPs) need accepted criteria helping the mHealth solutions' selection, while patients require transparency to trust their use. Information about their evidence, safety and security may be hard to obtain and consensus is lacking on the level of required evidence.

View Article and Find Full Text PDF

Similar Publications

Value of a quality label and European healthcare professionals' willingness to recommend health apps: An experimental vignette study.

J Health Psychol

August 2024

Leiden University Medical Center, The Netherlands.

Ieva Biliunaite Laurens van Gestel Petra Hoogendoorn Marieke Adriaanse

This study aimed to evaluate healthcare professionals' (HCPs') willingness to recommend health apps presented with versus without the CEN-ISO/TS 82304-2 health app quality label. The study was an experimental vignette study describing 12 short hypothetical scenarios, with Label (absent vs present) as a between and Type of App (prevention vs self-monitoring vs healthcare) and Patient Socioeconomic Status (low vs high) as within-subjects factors. The main outcome measure was HCPs' willingness to recommend apps.

View Article and Find Full Text PDF

Similar Publications

What Makes a Quality Health App-Developing a Global Research-Based Health App Quality Assessment Framework for CEN-ISO/TS 82304-2: Delphi Study.

JMIR Form Res

January 2023

National eHealth Living Lab, Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, Netherlands.

Petra Hoogendoorn Anke Versluis Sanne van Kampen Charles McCay Matt Leahy

Background: The lack of an international standard for assessing and communicating health app quality and the lack of consensus about what makes a high-quality health app negatively affect the uptake of such apps. At the request of the European Commission, the international Standard Development Organizations (SDOs), European Committee for Standardization, International Organization for Standardization, and International Electrotechnical Commission have joined forces to develop a technical specification (TS) for assessing the quality and reliability of health and wellness apps.

Objective: This study aimed to create a useful, globally applicable, trustworthy, and usable framework to assess health app quality.

View Article and Find Full Text PDF

Similar Publications

DETERMINING THE DOSE RATE DEPENDENCE OF DIFFERENT ACTIVE PERSONAL DOSEMETERS IN STANDARDIZED PULSED AND CONTINUOUS RADIATION FIELDS.

Radiat Prot Dosimetry

December 2019

Nofer Institute of Occupational Medicine (NIOM), St Teresa 8, 91-348 Lodz, Poland.

Oliver Hupe Sabrina Friedrich Filip Vanhavere Marcin Brodecki

In 2016, the EURADOS working group 12 started a project to gather reliable data concerning the behavior of active personal dosemeters (APDs) used by European hospitals in standardized pulsed and hospital fields. This paper concentrates on the tests that deal with the dose rate dependence of APDs in laboratory tests. The dependence on the dose rate was determined for 10 dosemeter types with continuous radiation of a 60Co source and pulsed radiation of the reference radiation facility of Physikalisch-Technische Bundesanstalt.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!