Background: With the development of high throughput methods of gene analyses, there is a growing need for mining tools to retrieve relevant articles in PubMed. As PubMed grows, literature searches become more complex and time-consuming. Automated search tools with good precision and recall are necessary. We developed GO2PUB to automatically enrich PubMed queries with gene names, symbols and synonyms annotated by a GO term of interest or one of its descendants.
Results: GO2PUB enriches PubMed queries based on selected GO terms and keywords. It processes the result and displays the PMID, title, authors, abstract and bibliographic references of the articles. Gene names, symbols and synonyms that have been generated as extra keywords from the GO terms are also highlighted. GO2PUB is based on a semantic expansion of PubMed queries using the semantic inheritance between terms through the GO graph. Two experts manually assessed the relevance of GO2PUB, GoPubMed and PubMed on three queries about lipid metabolism. Experts' agreement was high (kappa = 0.88). GO2PUB returned 69% of the relevant articles, GoPubMed: 40% and PubMed: 29%. GO2PUB and GoPubMed have 17% of their results in common, corresponding to 24% of the total number of relevant results. 70% of the articles returned by more than one tool were relevant. 36% of the relevant articles were returned only by GO2PUB, 17% only by GoPubMed and 14% only by PubMed. For determining whether these results can be generalized, we generated twenty queries based on random GO terms with a granularity similar to those of the first three queries and compared the proportions of GO2PUB and GoPubMed results. These were respectively of 77% and 40% for the first queries, and of 70% and 38% for the random queries. The two experts also assessed the relevance of seven of the twenty queries (the three related to lipid metabolism and four related to other domains). Expert agreement was high (0.93 and 0.8). GO2PUB and GoPubMed performances were similar to those of the first queries.
Conclusions: We demonstrated that the use of genes annotated by either GO terms of interest or a descendant of these GO terms yields some relevant articles ignored by other tools. The comparison of GO2PUB, based on semantic expansion, with GoPubMed, based on text mining techniques, showed that both tools are complementary. The analysis of the randomly-generated queries suggests that the results obtained about lipid metabolism can be generalized to other biological processes. GO2PUB is available at http://go2pub.genouest.org.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599846 | PMC |
http://dx.doi.org/10.1186/2041-1480-3-7 | DOI Listing |
JAMA
January 2025
Division of Pediatric Pulmonary Medicine, UPMC Children's Hospital of Pittsburgh, University of Pittsburgh, Pittsburgh, Pennsylvania.
Importance: T helper 2 (T2) cells and T helper 17 (T17) cells are CD4+ T cell subtypes involved in asthma. Characterizing asthma endotypes based on these cell types in diverse groups is important for developing effective therapies for youths with asthma.
Objective: To identify asthma endotypes in school-aged youths aged 6 to 20 years by examining the distribution and characteristics of transcriptomic profiles in nasal epithelium.
JAMA Psychiatry
January 2025
Department of Psychiatry, University of Pittsburgh, Pittsburgh, Pennsylvania.
Importance: Mania/hypomania is the pathognomonic feature of bipolar disorder (BD). As BD is often misdiagnosed as major depressive disorder (MDD), replicable neural markers of mania/hypomania risk are needed for earlier BD diagnosis and pathophysiological treatment development.
Objective: To replicate the previously reported positive association between left ventrolateral prefrontal cortex (vlPFC) activity during reward expectancy (RE) and mania/hypomania risk, to explore the effect of MDD history on this association, and to compare RE-related left vlPFC activity in individuals with and at risk of BD.
JAMA Cardiol
January 2025
Cardiology Division, Department of Medicine, Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, New York.
Importance: Apolipoprotein B (apoB) distribution and its implications as an atherosclerotic cardiovascular disease (ASCVD) risk-enhancing factor among individuals of diverse Hispanic or Latino backgrounds have not been described.
Objective: To describe the distribution of apoB in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) cohort and to characterize associations of baseline sociodemographic and clinical variables with apoB and self-identified Hispanic or Latino background.
Design, Setting, And Participants: The HCHS/SOL was a prospective, population-based cohort study of diverse Hispanic or Latino adults living in the US who were recruited and screened between March 2008 and June 2011.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!