Measurement error of network clustering coefficients under randomly missing nodes.

Sci Rep

Department of Mathematical and Computing Science, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8552, Japan.

Published: February 2021

The measurement error of the network topology caused by missing network data during the collection process is a major concern in analyzing collected network data. It is essential to clarify the error between the properties of an original network and the collected network to provide an accurate analysis of the entire topology. However, the measurement error of the clustering coefficient, which is a fundamental network property, has not been well understood particularly from an analytical perspective. Here we analytically and numerically investigate the measurement error of two types of clustering coefficients, namely, the global clustering coefficient and the network average clustering coefficient, of a network that is randomly missing some proportion of the nodes. First, we derive the expected error of the clustering coefficients of an incomplete network given a set of randomly missing nodes. We analytically show that (i) the global clustering coefficient of the incomplete network has little expected error and that (ii) conversely, the network average clustering coefficient of the incomplete network is underestimated with an expected error that is dependent on a property that is specific to the graph. Then, we verify the analytical claims through numerical simulations using three typical network models, i.e., the Erdős-Rényi model, the Watts-Strogatz model, and the Barabási-Albert model, and the 15 real-world network datasets consisting of five network types. Although the simulation results on the three typical network models suggest that the measurement error of the clustering coefficients on graphs with considerably small clustering coefficients may not behave like the analytical claims, we demonstrate that the simulation results on real-world networks that typically have enough high clustering coefficients sufficiently support our analytical claims. This study facilitates an analytical understanding of the measurement error in network properties due to missing graph data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7876024PMC
http://dx.doi.org/10.1038/s41598-021-82367-1DOI Listing

Publication Analysis

Top Keywords

measurement error
24
clustering coefficients
24
clustering coefficient
20
network
18
error network
12
randomly missing
12
error clustering
12
expected error
12
incomplete network
12
analytical claims
12

Similar Publications

Robust discrimination between closely related species of salmon based on DNA fragments.

Anal Bioanal Chem

January 2025

Statistical Engineering Division, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, MD, 20899-8980, USA.

Closely related species of Salmonidae, including Pacific and Atlantic salmon, can be distinguished from one another based on nucleotide sequences from the cytochrome c oxidase sub-unit 1 mitochondrial gene (COI), using ensembles of fragments aligned to genetic barcodes that serve as digital proxies for the relevant species. This is accomplished by exploiting both the nucleotide sequences and their quality scores recorded in a FASTQ file obtained via Next Generation (NextGen) Sequencing of mitochondrial DNA extracted from Coho salmon caught with hook and line in the Gulf of Alaska. The alignment is done using MUSCLE (Muscle 5.

View Article and Find Full Text PDF

Background: This study evaluates retinal oxygen saturation and vessel density within the macula and correlates these measures in controls and subjects with type 2 diabetes (DM) with (DMR) and without (DMnR) retinopathy. Changes in retinal oxygen saturation have not been evaluated regionally in diabetic patients.

Methods: Data from seventy subjects (28 controls, 26 DMnR, and 16 DMR were analyzed.

View Article and Find Full Text PDF

It is popular to study individual differences in cognition with experimental tasks, and the main goal of such approaches is to analyze the pattern of correlations across a battery of tasks and measures. One difficulty is that experimental tasks are often low in reliability as effects are small relative to trial-by-trial variability. Consequently, it remains difficult to accurately estimate correlations.

View Article and Find Full Text PDF

Highly sensitive split ring resonator-based sensor for quality monitoring of edible oils.

Sci Rep

January 2025

Department of Electrical, Electronic and Systems Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia (UKM), 43600, Bangi, Selangor, Malaysia.

This research presents the design and analysis of a compact metamaterial (MTM)-based star-shaped split-ring resonator (SRR) enclosed in a square, constructed on a cost-effective substrate for liquid chemical sensing applications. The designed structure has dimensions of 10 × 10 mm and is optimized for detecting adulteration in edible oils. When the sample holder is filled with different percentages of oil samples, the resonance frequency of the MTM-based SRR sensor shift significantly.

View Article and Find Full Text PDF

In clinical movement biomechanics, kinematic measurements are collected to characterise the motion of articulating joints and investigate how different factors influence movement patterns. Representative time-series signals are calculated to encapsulate (complex and multidimensional) kinematic datasets succinctly. Exacerbated by numerous difficulties to consistently define joint coordinate frames, the influence of local frame orientation and position on the characteristics of the resultant kinematic signals has been previously proven to be a major limitation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!