Publications by authors named "Omar Sobh"

We present Knowledge Engine for Genomics (KnowEnG), a free-to-use computational system for analysis of genomics data sets, designed to accelerate biomedical discovery. It includes tools for popular bioinformatics tasks such as gene prioritization, sample clustering, gene set analysis, and expression signature analysis. The system specializes in "knowledge-guided" data mining and machine learning algorithms, in which user-provided data are analyzed in light of prior information about genes, aggregated from numerous knowledge bases and encoded in a massive "Knowledge Network."


Summary: Clustering is one of the most common techniques used in data analysis to discover hidden structures by grouping data points that are similar by some measure into clusters. Although many programs are available for performing clustering, a single web resource that provides both state-of-the-art clustering methods and interactive visualizations is lacking. ClusterEnG (acronym for Clustering Engine for Genomics) provides an interface for clustering big data, along with interactive visualizations including 3D views, cluster selection, and zoom features.


The flagellum is a lash-like cellular appendage found in many single-celled organisms. Within a single flagellum, the flagellin protofilaments form an 11-helix, dual-turn structure. Each flagellin consists of four sub-domains: two inner domains (D0, D1) and two outer domains (D2, D3).


Magnetospirillum magneticum (AMB-1), which belongs to the alpha-proteobacteria, is a gram-negative, single-celled prokaryotic organism with a lash-like cellular appendage called a flagellum. This filamentous structure is made up of a protein called flagellin, which in turn consists of four sub-domains: two inner domains (D0, D1) made up of alpha-helices and two outer domains (D2, D3) made up of beta sheets. Flagellin is wrapped in a helical fashion around the longitudinal filament, with the outermost sub-domain (D3) exposed to the surrounding environment.


InvertNet, one of the three Thematic Collection Networks (TCNs) funded in the first round of the U.S. National Science Foundation's Advancing Digitization of Biological Collections (ADBC) program, is tasked with providing digital access to ~60 million specimens housed in 22 arthropod (primarily insect) collections at institutions distributed throughout the upper midwestern USA.
