The data representation as well as naming conventions used in commercial screen files by different companies make the automated analysis of crystallization experiments difficult and time-consuming. In order to reduce the human effort required to deal with this problem, we present an approach for computationally matching elements of two schemas using linguistic schema matching methods and then transform the input screen format to another format with naming defined by the user. This approach is tested on a number of commercial screens from different companies and the results of the experiments showed an overall accuracy of 97 percent on schema matching which is significantly better than the other two matchers we tested. Our tool enables mapping a screen file in one format to another format preferred by the expert using their preferred chemical names.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7874513PMC
http://dx.doi.org/10.1109/TCBB.2019.2913368DOI Listing

Publication Analysis

Top Keywords

schema matching
12
format format
8
matching data
4
data integration
4
integration consistent
4
consistent naming
4
naming protein
4
protein crystallization
4
crystallization screens
4
screens data
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!