Adaptive topological tree structure for document organisation and visualisation.

Neural Netw

Department of Electrical and Electronic Engineering, University of Manchester Institute of Science and Technology, PO Box 88, Manchester M60 1QD, UK.

Published: January 2005

The self-organising map (SOM) is finding more and more applications in a wide range of fields, such as clustering, pattern recognition and visualisation. It has also been employed in knowledge management and information retrieval. We propose an alternative to existing 2-dimensional SOM based methods for document analysis. The method, termed Adaptive Topological Tree Structure (ATTS), generates a taxonomy of underlying topics from a set of unclassified, unstructured documents. The ATTS consists of a hierarchy of adaptive self-organising chains, each of which is validated independently using a proposed entropy-based Bayesian information criterion. A node meeting the expansion criterion spans a child chain, with reduced vocabulary and increased specialisation. The ATTS creates a topological tree of topics, which can be browsed like a content hierarchy and reflects the connections between related topics at each level. A review is also given on the existing neural network based methods for document clustering and organisation. Experimental results on real-world datasets using the proposed ATTS method are presented and compared with other approaches. The results demonstrate the advantages of the proposed validation criteria and the efficiency of the ATTS approach for document organisation, visualisation and search. It shows that the proposed methods not only improve the clustering results but also boost the retrieval.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2004.08.006DOI Listing

Publication Analysis

Top Keywords

topological tree
12
adaptive topological
8
tree structure
8
document organisation
8
organisation visualisation
8
based methods
8
methods document
8
atts
5
document
4
structure document
4

Similar Publications

The gene family plays a crucial role in plant growth, development, and responses to biotic and abiotic stresses. , a warm-season turfgrass with exceptional salt tolerance, can be irrigated with seawater. However, the gene family in seashore paspalum remains poorly understood.

View Article and Find Full Text PDF

Assembly and comparative analysis of the complete mitogenome of var. , an exceptional berry plant possessing sweet leaves.

Front Plant Sci

December 2024

Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, College of Life Sciences, Taizhou University, Taizhou, China.

var. is a special berry plant of in the Rosaceae family. Its leaves contain high-sweetness, low-calorie, and non-toxic sweet ingredients, known as rubusoside.

View Article and Find Full Text PDF

The chloroplast (cp) genome is a widely used tool for exploring plant evolutionary relationships, yet its effectiveness in fully resolving these relationships remains uncertain. Integrating cp genome data with nuclear DNA information offers a more comprehensive view but often requires separate datasets. In response, we employed the same raw read sequencing data to construct cp genome-based trees and nuclear DNA phylogenetic trees using Read2Tree, a cost-efficient method for extracting conserved nuclear gene sequences from raw read data, focusing on the Aurantioideae subfamily, which includes Citrus and its relatives.

View Article and Find Full Text PDF

Extensive genome-wide phylogenetic discordance is due to incomplete lineage sorting in the rapidly radiated East Asian genus Nekemias (Vitaceae).

Ann Bot

December 2024

Hunan Provincial key Laboratory of Ecological Conservation and Sustainable Utilization of Wulingshan Resources, College of Biology and Environmental Sciences, Jishou University, Jishou, Hunan, 416000, China.

Background And Aims: Nekemias is a small genus of the grape family, with nine species discontinuously distributed in temperate to subtropical zones of the Northern Hemisphere but mostly in East Asia. Previous phylogenetic studies on Nekemias have mainly based on a few chloroplast markers, and the phylogenetic framework and systematic relationships are still highly contested.

Methods: We carried out a systematic framework reconstruction of Nekemias and intra-generic reticulate evolutionary analyses based on extensive single-copy nuclear and chloroplast genomic data obtained by the Hyb-Seq approach, combining genome skimming and target enrichment.

View Article and Find Full Text PDF

Many existing studies show that there exists a strong relationship between structures and characteristics of molecules. Topological indices are often used in modeling the properties of chemical compounds and biological activities in theoretical chemistry. Topological indices are numerical values associated with structures of molecules in such a way that they remain constant under graph isomorphism.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!