Harnessing Large-Scale Herbarium Image Datasets Through Representation Learning.

Front Plant Sci

Royal Botanic Gardens, Kew, Richmond, United Kingdom.

Published: January 2022

The mobilization of large-scale datasets of specimen images and metadata through herbarium digitization provides a rich environment for the application and development of machine learning techniques. However, limited access to computational resources and uneven progress in digitization, especially for small herbaria, still present barriers to the wide adoption of these new technologies. Using deep learning to extract representations of herbarium specimens that are useful for a wide variety of applications, so-called "representation learning," could help remove these barriers. Although representation learning has recently become popular for camera trap and natural world images, it is not yet widely used for herbarium specimen images. We investigated the potential of representation learning with specimen images by building three neural networks using a publicly available dataset of over 2 million specimen images spanning multiple continents and institutions. We compared the extracted representations and tested their performance in application tasks relevant to research carried out with herbarium specimens. We found that a triplet network, a type of neural network that learns distances between images, produced the representations that transferred best across all applications investigated. Our results demonstrate that it is possible to learn representations of specimen images that are useful in different applications, and we identify some further steps that we believe are necessary for representation learning to harness the rich information held in the world's herbaria.
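To make the idea of a network that "learns distances between images" more concrete, the sketch below shows the standard triplet margin loss computed on embedding vectors. It is an illustration only: the abstract does not state which loss, distance, or margin the authors used, so the function name `triplet_margin_loss`, the Euclidean distance, and the default `margin=1.0` are all assumptions made for this example.

```python
import numpy as np


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss on batches of embeddings.

    Illustrative only: the distance (Euclidean) and the margin value are
    assumptions, not details taken from the paper.
    Each argument has shape (batch, dim).
    """
    anchor = np.asarray(anchor, dtype=float)
    positive = np.asarray(positive, dtype=float)
    negative = np.asarray(negative, dtype=float)

    # Distance from each anchor to an image that should be similar to it.
    d_pos = np.linalg.norm(anchor - positive, axis=1)
    # Distance from each anchor to an image that should be dissimilar.
    d_neg = np.linalg.norm(anchor - negative, axis=1)

    # The loss is zero once the dissimilar image is at least `margin`
    # farther from the anchor than the similar image.
    return float(np.maximum(d_pos - d_neg + margin, 0.0).mean())
```

Under these assumptions, a triplet might consist of two specimen images expected to be similar (for example, sharing a taxon) and one expected to differ; minimizing the loss pulls the first pair together and pushes the third image away in embedding space.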

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794728
DOI: http://dx.doi.org/10.3389/fpls.2021.806407
