The self-organising map (SOM) is finding more and more applications in a wide range of fields, such as clustering, pattern recognition and visualisation. It has also been employed in knowledge management and information retrieval. We propose an alternative to existing 2-dimensional SOM based methods for document analysis. The method, termed Adaptive Topological Tree Structure (ATTS), generates a taxonomy of underlying topics from a set of unclassified, unstructured documents. The ATTS consists of a hierarchy of adaptive self-organising chains, each of which is validated independently using a proposed entropy-based Bayesian information criterion. A node meeting the expansion criterion spans a child chain, with reduced vocabulary and increased specialisation. The ATTS creates a topological tree of topics, which can be browsed like a content hierarchy and reflects the connections between related topics at each level. A review is also given on the existing neural network based methods for document clustering and organisation. Experimental results on real-world datasets using the proposed ATTS method are presented and compared with other approaches. The results demonstrate the advantages of the proposed validation criteria and the efficiency of the ATTS approach for document organisation, visualisation and search. It shows that the proposed methods not only improve the clustering results but also boost the retrieval.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.neunet.2004.08.006 | DOI Listing |
Plants (Basel)
December 2024
College of Grassland Science, Qingdao Agricultural University, Qingdao 266109, China.
The gene family plays a crucial role in plant growth, development, and responses to biotic and abiotic stresses. , a warm-season turfgrass with exceptional salt tolerance, can be irrigated with seawater. However, the gene family in seashore paspalum remains poorly understood.
View Article and Find Full Text PDFFront Plant Sci
December 2024
Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, College of Life Sciences, Taizhou University, Taizhou, China.
var. is a special berry plant of in the Rosaceae family. Its leaves contain high-sweetness, low-calorie, and non-toxic sweet ingredients, known as rubusoside.
View Article and Find Full Text PDFSci Rep
January 2025
Analytical Research Center for Experimental Sciences, Saga University, Saga, Japan.
The chloroplast (cp) genome is a widely used tool for exploring plant evolutionary relationships, yet its effectiveness in fully resolving these relationships remains uncertain. Integrating cp genome data with nuclear DNA information offers a more comprehensive view but often requires separate datasets. In response, we employed the same raw read sequencing data to construct cp genome-based trees and nuclear DNA phylogenetic trees using Read2Tree, a cost-efficient method for extracting conserved nuclear gene sequences from raw read data, focusing on the Aurantioideae subfamily, which includes Citrus and its relatives.
View Article and Find Full Text PDFAnn Bot
December 2024
Hunan Provincial key Laboratory of Ecological Conservation and Sustainable Utilization of Wulingshan Resources, College of Biology and Environmental Sciences, Jishou University, Jishou, Hunan, 416000, China.
Background And Aims: Nekemias is a small genus of the grape family, with nine species discontinuously distributed in temperate to subtropical zones of the Northern Hemisphere but mostly in East Asia. Previous phylogenetic studies on Nekemias have mainly based on a few chloroplast markers, and the phylogenetic framework and systematic relationships are still highly contested.
Methods: We carried out a systematic framework reconstruction of Nekemias and intra-generic reticulate evolutionary analyses based on extensive single-copy nuclear and chloroplast genomic data obtained by the Hyb-Seq approach, combining genome skimming and target enrichment.
Heliyon
May 2024
Department of Computer and Information Sciences, Northumbria University, Newcastle upon Tyne, NE1 8ST, UK.
Many existing studies show that there exists a strong relationship between structures and characteristics of molecules. Topological indices are often used in modeling the properties of chemical compounds and biological activities in theoretical chemistry. Topological indices are numerical values associated with structures of molecules in such a way that they remain constant under graph isomorphism.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!