The verified text data of wheat varieties is an important component of wheat germplasm information. To automatically obtain a structured description of the phenotypic and genetic characteristics of wheat varieties, the aim at solve the issues of fuzzy entity boundaries and overlapping relationships in unstructured wheat variety approval data, WGIE-DCWF (joint extraction model of wheat germplasm information entity relationship based on deep character and word fusion) was proposed. The encoding layer of the model deeply fused word semantic information and character information using the Transformer encoder of BERT. This allowed for the cascading fusion of contextual semantic feature information to achieve rich character vector representation and improve the recognition ability of entity features. The triple extraction layer of the model established a cascading pointer network, extracted the head entity, extracted the tail entity according to the relationship category, and decoded the output triplet. This approach improved the model's capability to extract overlapping relationships. The experimental results demonstrated that the WGIE-DCWF model performed exceptionally well on both the WGD (wheat germplasm dataset) and the public dataset DuIE. The WGIE-DCWF model not only achieved high performance on the evaluation datasets but also demonstrated good generalization. This provided valuable technical support for the construction of a wheat germplasm information knowledge base and is of great significance for wheat breeding, genetic research, cultivation management, and agricultural production.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11551156PMC
http://dx.doi.org/10.1038/s41598-024-59796-9DOI Listing

Publication Analysis

Top Keywords

wheat germplasm
20
entity relationship
12
wheat
9
joint extraction
8
germplasm entity
8
relationship based
8
based deep
8
deep character
8
character word
8
word fusion
8

Similar Publications

Guar or cluster bean (Cyamopsis tetragonoloba L.) is a leguminous crop well-suited for cultivation in arid and semi-arid regions. India accounts for 90% of world's guar production.

View Article and Find Full Text PDF

Sinomonas gamaensis NEAU-HV1 remodels the IAA14-ARF7/19 interaction to promote plant growth.

New Phytol

December 2024

Key Lab of Organic-based Fertilizers of China and Jiangsu Provincial Key Lab for Solid Organic Waste Utilization, Nanjing Agricultural University, Nanjing, 210095, China.

Sinomonas species typically reside in soils or the rhizosphere and can promote plant growth. Sinomonas enrichment in rhizospheric soils is positively correlated with increases in plant biomass. However, the growth promotion mechanisms regulated by Sinomonas remain unclear.

View Article and Find Full Text PDF

From Images to Loci: Applying 3D Deep Learning to Enable Multivariate and Multitemporal Digital Phenotyping and Mapping the Genetics Underlying Nitrogen Use Efficiency in Wheat.

Plant Phenomics

December 2024

Plant Phenomics Research Centre, Academy for Advanced Interdisciplinary Studies, Collaborative Innovation Centre for Modern Crop Production, Co-sponsored by Province and Ministry, College of Agriculture, State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China.

The selection and promotion of high-yielding and nitrogen-efficient wheat varieties can reduce nitrogen fertilizer application while ensuring wheat yield and quality and contribute to the sustainable development of agriculture; thus, the mining and localization of nitrogen use efficiency (NUE) genes is particularly important, but the localization of NUE genes requires a large amount of phenotypic data support. In view of this, we propose the use of low-altitude aerial photography to acquire field images at a large scale, generate 3-dimensional (3D) point clouds and multispectral images of wheat plots, propose a wheat 3D plot segmentation dataset, quantify the plot canopy height via combination with PointNet++, and generate 4 nitrogen utilization-related vegetation indices via index calculations. Six height-related and 24 vegetation-index-related dynamic digital phenotypes were extracted from the digital phenotypes collected at different time points and fitted to generate dynamic curves.

View Article and Find Full Text PDF

Background: Septoria tritici blotch (STB) is one of the most damaging wheat diseases worldwide, and the development of resistant cultivars is of paramount importance for sustainable crop management. However, the genetic basis of the resistance present in elite wheat cultivars remains largely unknown, which limits the implementation of this strategy. A collection of 285 wheat cultivars originating mostly from France was challenged with ten Zymoseptoria tritici isolates at the seedling stage.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!