Background: With the increasing use of health apps and ongoing concerns regarding their safety, effectiveness, and data privacy, numerous health app quality assessment frameworks have emerged. However, assessment initiatives experience difficulties scaling, and there is currently no comprehensive, consistent, internationally recognized assessment framework. Therefore, health apps often need to undergo several quality evaluations to enter different markets, leading to duplication of work. The CEN ISO/TS 82304‑2 health app assessment seeks to address this issue, aiming to provide an internationally accepted quality evaluation through a network of assessment organizations located in different countries.

Objective: This study aimed to develop and evolve the draft CEN ISO/TS 82304-2 assessment handbook and developer guidance by testing them across organizations in several countries.

Methods: Assessment organizations from 5 countries were engaged to evaluate 24 health apps using the evolving CEN ISO/TS 82304-2 assessment across 3 evaluation rounds. The information submitted by a given health app developer was evaluated by 2 assessment organizations, and interrater reliability was examined. In addition, app developers and assessors were asked to report how much time they spent on information collation or evaluation and to rate the clarity of the developer guidance or assessor handbook, respectively. The collected data were used to iteratively improve the handbook and guidance between rounds.

Results: The interrater reliability between assessment organizations improved from round 1 to round 2 and stayed relatively stable between rounds 2 and 3, with 80% (55/69) of assessment questions demonstrating moderate or better (Gwet AC1>0.41) agreement in round 3. The median time required by developers to prepare the assessment information was 8 hours and 59 minutes (IQR 5.7-27.1 hours) in round 3, whereas assessors reported a median evaluation time of 8 hours and 46 minutes (IQR 7.1-11.0 hours). The draft guidance and handbook were generally perceived as clear, with a median round-3 clarity rating of 1.73 (IQR 1.64-1.90) for developers and 1.78 (IQR 1.71-1.89) for assessors (0="very unclear", 1="somewhat unclear", and 2="completely clear").

Conclusions: To our knowledge, this is the first study to examine the consistency of health app evaluations across organizations located in different countries. Given that the CEN ISO/TS 82304-2 guidance and handbook are still under development, the interrater reliability findings observed at this early stage are promising, and this study provided valuable information for further refinement of the assessment. This study marks an important first step toward establishing the CEN ISO/TS 82304-2 assessment as a consistent, cross-national health app evaluation. It is envisioned that the assessment will ultimately help avoid duplication of work, prevent inequities by facilitating access to smaller markets for developers, and build trust among users, thereby increasing the adoption of high-quality health apps.

Download full-text PDF

Source
http://dx.doi.org/10.2196/64565DOI Listing

Publication Analysis

Top Keywords

cen iso/ts
24
health app
24
iso/ts 82304-2
20
assessment
16
interrater reliability
16
health apps
16
assessment organizations
16
82304-2 assessment
12
health
10
app quality
8

Similar Publications

Background: With the increasing use of health apps and ongoing concerns regarding their safety, effectiveness, and data privacy, numerous health app quality assessment frameworks have emerged. However, assessment initiatives experience difficulties scaling, and there is currently no comprehensive, consistent, internationally recognized assessment framework. Therefore, health apps often need to undergo several quality evaluations to enter different markets, leading to duplication of work.

View Article and Find Full Text PDF

This study aimed to evaluate healthcare professionals' (HCPs') willingness to recommend health apps presented with versus without the CEN-ISO/TS 82304-2 health app quality label. The study was an experimental vignette study describing 12 short hypothetical scenarios, with Label (absent vs present) as a between and Type of App (prevention vs self-monitoring vs healthcare) and Patient Socioeconomic Status (low vs high) as within-subjects factors. The main outcome measure was HCPs' willingness to recommend apps.

View Article and Find Full Text PDF

Background: The lack of an international standard for assessing and communicating health app quality and the lack of consensus about what makes a high-quality health app negatively affect the uptake of such apps. At the request of the European Commission, the international Standard Development Organizations (SDOs), European Committee for Standardization, International Organization for Standardization, and International Electrotechnical Commission have joined forces to develop a technical specification (TS) for assessing the quality and reliability of health and wellness apps.

Objective: This study aimed to create a useful, globally applicable, trustworthy, and usable framework to assess health app quality.

View Article and Find Full Text PDF

In 2016, the EURADOS working group 12 started a project to gather reliable data concerning the behavior of active personal dosemeters (APDs) used by European hospitals in standardized pulsed and hospital fields. This paper concentrates on the tests that deal with the dose rate dependence of APDs in laboratory tests. The dependence on the dose rate was determined for 10 dosemeter types with continuous radiation of a 60Co source and pulsed radiation of the reference radiation facility of Physikalisch-Technische Bundesanstalt.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!