One of the challenges of genetic data analysis is to combine information from sources that are distributed around the world and accessible through a wide array of different methods and interfaces. The HIV database and its footsteps, the hepatitis C virus (HCV) and hemorrhagic fever virus (HFV) databases, have made it their mission to make different data types easily available to their users. This involves a large amount of behind-the-scenes processing, including quality control and analysis of the sequences and their annotation. Gene and protein sequences are distilled from the sequences that are stored in GenBank; to this end, both submitter annotation and script-generated sequences are used. Alignments of both nucleotide and amino acid sequences are generated, manually curated, distilled into an alignment model, and regenerated in an iterative cycle that results in ever better new alignments. Annotation of epidemiological and clinical information is parsed, checked, and added to the database. User interfaces are updated, and new interfaces are added based upon user requests. Vital for its success, the database staff are heavy users of the system, which enables them to fix bugs and find opportunities for improvement. In this chapter we describe some of the infrastructure that keeps these heavily used analysis platforms alive and vital after nearly 25 years of use. The database/analysis platforms described in this chapter can be accessed at http://hiv.lanl.gov http://hcv.lanl.gov http://hfv.lanl.gov.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/978-1-62703-107-3_16 | DOI Listing |
BMC Res Notes
January 2025
Department of Computer Engineering, Chungbuk National University, Chungdae-ro 1, Cheongju, 28644, Republic of Korea.
Background: Drug response prediction can infer the relationship between an individual's genetic profile and a drug, which can be used to determine the choice of treatment for an individual patient. Prediction of drug response is recently being performed using machine learning technology. However, high-throughput sequencing data produces thousands of features per patient.
View Article and Find Full Text PDFMol Cancer
January 2025
Department of Medicine, Section of Epidemiology and Population Sciences, Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, 77030, USA.
Lipid nanoparticles (LNPs) for mRNA delivery have advanced significantly, but LNP-mediated DNA delivery still faces clinical challenges. This study compared various LNP formulations for delivering DNA-encoded biologics, assessing their expression efficacy and the protective immunity generated by LNP-encapsulated DNA in different models. The LNP formulation used in Moderna's Spikevax mRNA vaccine (LNP-M) demonstrated a stable nanoparticle structure, high expression efficiency, and low toxicity.
View Article and Find Full Text PDFBMC Biol
January 2025
The Key Laboratory of Biotechnology for Medicinal Plant of Jiangsu Province, School of Life Science, Jiangsu Normal University, Xuzhou, Jiangsu, 221116, China.
Background: The variations in alliin content are a crucial criterion for evaluating garlic quality and is the sole precursor for allicin biosynthesis, which is significant for the growth, development, and stress response of garlic. WRKY transcription factors are essential for enhancing stress resistance by regulating the synthesis of plant secondary metabolites. However, the molecular mechanisms regulating alliin biosynthesis remain unexplored.
View Article and Find Full Text PDFVirol J
January 2025
Laboratory of Clinical Virology, WHO Regional Reference Laboratory for Poliomyelitis and Measles for in the Eastern Mediterranean Region, Institut Pasteur de Tunis, University of Tunis El Manar, 13 place Pasteur, BP74 1002 le Belvédère, Tunis, Tunisia.
Background: Primary Immunodeficiency disorders (PID) can increase the risk of severe COVID-19 and prolonged infection. This study investigates the duration of SARS-CoV-2 excretion and the genetic evolution of the virus in pediatric PID patients as compared to immunocompetent (IC) patients.
Materials And Methods: A total of 40 nasopharyngeal and 24 stool samples were obtained from five PID and ten IC children.
J Transl Med
January 2025
Department of Stem Cell and Regenerative Medicine, Southwest Cancer Center, Southwest Hospital, Third Military Medical University (Army Medical University), Chongqing, 400038, China.
Background: It is worthwhile to establish a prognostic prediction model based on microenvironment cells (MCs) infiltration and explore new treatment strategies for triple-negative breast cancer (TNBC).
Methods: The xCell algorithm was used to quantify the cellular components of the TNBC microenvironment based on bulk RNA sequencing (bulk RNA-seq) data. The MCs index (MCI) was constructed using the least absolute shrinkage and selection operator Cox (LASSO-Cox) regression analysis.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!