Publications by authors named "Siwei Fu"

The integration of visualizations and text is commonly found in data news, analytical reports, and interactive documents. For example, financial articles are presented along with interactive charts to show the changes in stock prices on Yahoo Finance. Visualizations enhance the perception of facts in the text while the text reveals insights of visual representation.

View Article and Find Full Text PDF

High-quality data is critical to deriving useful and reliable information. However, real-world data often contains quality issues undermining the value of the derived information. Most existing research on data quality management focuses on tabular data, leaving semi-structured data under-exploited.

View Article and Find Full Text PDF

Carotid atherosclerosis (AS) occurs in atherosclerotic lesions of the carotid artery, which can lead to transient ischemic attack and stroke in severe cases. However, the relationship between pleckstrin (PLEK) and lymphocyte antigen 86 (LY86) and carotid AS remains unclear. The carotid AS datasets GSE43292 and GSE125771 were downloaded from the gene expression omnibus database.

View Article and Find Full Text PDF

According to the positioning experiment of straw returning in the continuous field 7a, the effects of straw returning combined with chemical fertilizer on soil total organic carbon (TOC), dissolved organic carbon (DOC), particulate organic carbon (POC), labile organic carbon (LOC), carbon pool management index (CPMI), and crop yield in farmland soil profiles (0-20, 20-50, and 50-80 cm) in the Chaohu Lake area were studied. There were four treatments:no straw returning+no fertilization (CK), conventional fertilization (F), straw returning+conventional fertilization (SF1), and straw returning+80% conventional fertilization (SF2). The changes in soil total organic carbon and component content, CPMI, and rape rice yield in different soil layers were analyzed.

View Article and Find Full Text PDF

Data workers usually seek to understand the semantics of data wrangling scripts in various scenarios, such as code debugging, reusing, and maintaining. However, the understanding is challenging for novice data workers due to the variety of programming languages, functions, and parameters. Based on the observation that differences between input and output tables highly relate to the type of data transformation, we outline a design space including 103 characteristics to describe table differences.

View Article and Find Full Text PDF

Images in visualization publications contain rich information, e.g., novel visualization designs and implicit design patterns of visualizations.

View Article and Find Full Text PDF

Data workers use various scripting languages for data transformation, such as SAS, R, and Python. However, understanding intricate code pieces requires advanced programming skills, which hinders data workers from grasping the idea of data transformation at ease. Program visualization is beneficial for debugging and education and has the potential to illustrate transformations intuitively and interactively.

View Article and Find Full Text PDF

Vibration and noise are ubiquitous in social life, which severely damage machinery and adversely affect human health. Thus, the development of materials with high-damping performance is of great importance. Rubbers are typically used as damping materials because of their unique viscoelasticity.

View Article and Find Full Text PDF

Straw returning is an effective technique for improving soil fertility and maintaining crop productivity in agro-ecosystems. The effects of straw returning, when combined with chemical fertilizer, on soil nutrients, enzyme activity, and microbial community were explored in rice-rape rotation farmland in the Chaohu Area. We carried out a 4-year field experiment (2016-2020) and set up four treatments (no straw+no fertilization, CK; conventional fertilization, F; straw returning+conventional fertilization, SF; and straw returning+conventional fertilization minus 20%, SDF) to explore the key environmental factors affecting soil enzyme activity and microbial and fungal communities.

View Article and Find Full Text PDF

In multiple coordinated views (MCVs), visualizations across views update their content in response to users' interactions in other views. Interactive systems provide direct manipulation to create coordination between views, but are restricted to limited types of predefined templates. By contrast, textual specification languages enable flexible coordination but expose technical burden.

View Article and Find Full Text PDF

This article presents a new approach based on deep learning to automatically extract colormaps from visualizations. After summarizing colors in an input visualization image as a Lab color histogram, we pass the histogram to a pre-trained deep neural network, which learns to predict the colormap that produces the visualization. To train the network, we create a new dataset of  ∼ 64K visualizations that cover a wide variety of data distributions, chart types, and colormaps.

View Article and Find Full Text PDF

We design and evaluate a novel layout fine-tuning technique for node-link diagrams that facilitates exemplar-based adjustment of a group of substructures in batching mode. The key idea is to transfer user modifications on a local substructure to other substructures in the entire graph that are topologically similar to the exemplar. We first precompute a canonical representation for each substructure with node embedding techniques and then use it for on-the-fly substructure retrieval.

View Article and Find Full Text PDF

We present ShuttleSpace, an immersive analytics system to assist experts in analyzing trajectory data in badminton. Trajectories in sports, such as the movement of players and balls, contain rich information on player behavior and thus have been widely analyzed by coaches and analysts to improve the players' performance. However, existing visual analytics systems often present the trajectories in court diagrams that are abstractions of reality, thereby causing difficulty for the experts to imagine the situation on the court and understand why the player acted in a certain way.

View Article and Find Full Text PDF

Visual designs can be complex in modern data visualization systems, which poses special challenges for explaining them to the non-experts. However, few if any presentation tools are tailored for this purpose. In this study, we present Narvis, a slideshow authoring tool designed for introducing data visualizations to non-experts.

View Article and Find Full Text PDF

Whether and how does the structure of family trees differ by ancestral traits over generations? This is a fundamental question regarding the structural heterogeneity of family trees for the multi-generational transmission research. However, previous work mostly focuses on parent-child scenarios due to the lack of proper tools to handle the complexity of extending the research to multi-generational processes. Through an iterative design study with social scientists and historians, we develop TreeEvo that assists users to generate and test empirical hypotheses for multi-generational research.

View Article and Find Full Text PDF

Discussion forums of Massive Open Online Courses (MOOC) provide great opportunities for students to interact with instructional staff as well as other students. Exploration of MOOC forum data can offer valuable insights for these staff to enhance the course and prepare the next release. However, it is challenging due to the large, complicated, and heterogeneous nature of relevant datasets, which contain multiple dynamically interacting objects such as users, posts, and threads, each one including multiple attributes.

View Article and Find Full Text PDF