Publications by Tim J P Hubbard | LitMetric

Publications by authors named "Tim J P Hubbard"

Page 1 of 2

GENCODE 2025: reference gene annotation for human and mouse.

Jonathan M Mudge Sílvia Carbonell-Sala Mark Diekhans Jose Gonzalez Martinez Toby Hunt Tim J P Hubbard

Nucleic Acids Res

November 2024

GENCODE produces comprehensive reference gene annotation for human and mouse. Entering its twentieth year, the project remains highly active as new technologies and methodologies allow us to catalog the genome at ever-increasing granularity. In particular, long-read transcriptome sequencing enables us to identify large numbers of missing transcripts and to substantially improve existing models, and our long non-coding RNA catalogs have undergone a dramatic expansion and reconfiguration as a result.

View Article and Find Full Text PDF

GENCODE: reference annotation for the human and mouse genomes in 2023.

Adam Frankish Sílvia Carbonell-Sala Mark Diekhans Irwin Jungreis Jane E Loveland Tim J P Hubbard

Nucleic Acids Res

January 2023

GENCODE produces high quality gene and transcript annotation for the human and mouse genomes. All GENCODE annotation is supported by experimental data and serves as a reference for genome biology and clinical genomics. The GENCODE consortium generates targeted experimental data, develops bioinformatic tools and carries out analyses that, along with externally produced data and methods, support the identification and annotation of transcript structures and the determination of their function.

View Article and Find Full Text PDF

GENCODE 2021.

Adam Frankish Mark Diekhans Irwin Jungreis Julien Lagarde Jane E Loveland Tim J P Hubbard

Nucleic Acids Res

January 2021

The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs.

View Article and Find Full Text PDF

Mining Social Media Data to Study the Consequences of Dementia Diagnosis on Caregivers and Relatives.

George Gkotsis Christoph Mueller Richard J B Dobson Tim J P Hubbard Rina Dutta

Dement Geriatr Cogn Disord

June 2021

Introduction: Caregivers for people with dementia face a number of challenges such as changing family relationships, social isolation, or financial difficulties. Internet usage and social media are increasingly being recognised as resources to increase support and general public health.

Objective: Using automated analysis, the aim of this study was to explore (i) the age and sex of people who post to the social media forum Reddit about dementia diagnoses, (ii) the affected person and their diagnosis, (iii) which subreddits authors are posting to, (iv) the types of messages posted, and (v) the content of these posts.

View Article and Find Full Text PDF

GENCODE reference annotation for the human and mouse genomes.

Adam Frankish Mark Diekhans Anne-Maud Ferreira Rory Johnson Irwin Jungreis Tim J P Hubbard

Nucleic Acids Res

January 2019

The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation.

View Article and Find Full Text PDF

Corrigendum: Characterisation of mental health conditions in social media using Informed Deep Learning.

George Gkotsis Anika Oellrich Sumithra Velupillai Maria Liakata Tim J P Hubbard

Sci Rep

May 2017

View Article and Find Full Text PDF

Automated PDF highlighting to support faster curation of literature for Parkinson's and Alzheimer's disease.

Honghan Wu Anika Oellrich Christine Girges Bernard de Bono Tim J P Hubbard

Database (Oxford)

January 2017

Unlabelled: Neurodegenerative disorders such as Parkinson's and Alzheimer's disease are devastating and costly illnesses, a source of major global burden. In order to provide successful interventions for patients and reduce costs, both causes and pathological processes need to be understood. The ApiNATOMY project aims to contribute to our understanding of neurodegenerative disorders by manually curating and abstracting data from the vast body of literature amassed on these illnesses.

View Article and Find Full Text PDF

Characterisation of mental health conditions in social media using Informed Deep Learning.

George Gkotsis Anika Oellrich Sumithra Velupillai Maria Liakata Tim J P Hubbard

Sci Rep

March 2017

The number of people affected by mental illness is on the increase and with it the burden on health and social care use, as well as the loss of both productivity and quality-adjusted life-years. Natural language processing of electronic health records is increasingly used to study mental health conditions and risk behaviours on a large scale. However, narrative notes written by clinicians do not capture first-hand the patients' own experiences, and only record cross-sectional, professional impressions at the point of care.

View Article and Find Full Text PDF

Analysis of diagnoses extracted from electronic health records in a large mental health case register.

Yevgeniya Kovalchuk Robert Stewart Matthew Broadbent Tim J P Hubbard Richard J B Dobson

PLoS One

August 2017

The UK government has recently recognised the need to improve mental health services in the country. Electronic health records provide a rich source of patient data which could help policymakers to better understand needs of the service users. The main objective of this study is to unveil statistics of diagnoses recorded in the Case Register of the South London and Maudsley NHS Foundation Trust, one of the largest mental health providers in the UK and Europe serving a source population of over 1.

View Article and Find Full Text PDF

Comparative analysis of the transcriptome across distant species.

Mark B Gerstein Joel Rozowsky Koon-Kiu Yan Daifeng Wang Chao Cheng Tim J P Hubbard

Nature

August 2014

The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly.

View Article and Find Full Text PDF

Characterizing genetic variants for clinical action.

Erin M Ramos Corina Din-Lovinescu Jonathan S Berg Lisa D Brooks Audrey Duncanson Tim J P Hubbard

Am J Med Genet C Semin Med Genet

March 2014

Genome-wide association studies, DNA sequencing studies, and other genomic studies are finding an increasing number of genetic variants associated with clinical phenotypes that may be useful in developing diagnostic, preventive, and treatment strategies for individual patients. However, few variants have been integrated into routine clinical practice. The reasons for this are several, but two of the most significant are limited evidence about the clinical implications of the variants and a lack of a comprehensive knowledge base that captures genetic variants, their phenotypic associations, and other pertinent phenotypic information that is openly accessible to clinical groups attempting to interpret sequencing data.

View Article and Find Full Text PDF

Ensembl 2014.

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Konstantinos Billis Tim J P Hubbard

Nucleic Acids Res

January 2014

Ensembl (http://www.ensembl.org) creates tools and data resources to facilitate genomic analysis in chordate species with an emphasis on human, major vertebrate model organisms and farm animals.

View Article and Find Full Text PDF

The zebrafish reference genome sequence and its relationship to the human genome.

Kerstin Howe Matthew D Clark Carlos F Torroja James Torrance Camille Berthelot Tim J P Hubbard

Nature

April 2013

Zebrafish have become a popular organism for the study of vertebrate gene function. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes.

View Article and Find Full Text PDF

Ensembl 2013.

Paul Flicek Ikhlak Ahmed M Ridwan Amode Daniel Barrell Kathryn Beal Tim J P Hubbard

Nucleic Acids Res

January 2013

The Ensembl project (http://www.ensembl.org) provides genome information for sequenced chordate genomes with a particular focus on human, mouse, zebrafish and rat.

View Article and Find Full Text PDF

Ensembl 2012.

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Simon Brent Tim J P Hubbard

Nucleic Acids Res

January 2012

The Ensembl project (http://www.ensembl.org) provides genome resources for chordate genomes with a particular focus on human genome data as well as data for key model organisms such as mouse, rat and zebrafish.

View Article and Find Full Text PDF

Dalliance: interactive genome viewing on the web.

Thomas A Down Matias Piipari Tim J P Hubbard

Bioinformatics

March 2011

Summary: Dalliance is a new genome viewer which offers a high level of interactivity while running within a web browser. All data is fetched using the established distributed annotation system (DAS) protocol, making it easy to customize the browser and add extra data.

Availability And Implementation: Dalliance runs entirely within your web browser, and relies on existing DAS server infrastructure.

View Article and Find Full Text PDF

Ensembl 2011.

Paul Flicek M Ridwan Amode Daniel Barrell Kathryn Beal Simon Brent Tim J P Hubbard

Nucleic Acids Res

January 2011

The Ensembl project (http://www.ensembl.org) seeks to enable genomic science by providing high quality, integrated annotation on chordate and selected eukaryotic genomes within a consistent and accessible infrastructure.

View Article and Find Full Text PDF

iMotifs: an integrated sequence motif visualization and analysis environment.

Matias Piipari Thomas A Down Harpreet Saini Anton Enright Tim J P Hubbard

Bioinformatics

March 2010

Motivation: Short sequence motifs are an important class of models in molecular biology, used most commonly for describing transcription factor binding site specificity patterns. High-throughput methods have been recently developed for detecting regulatory factor binding sites in vivo and in vitro and consequently high-quality binding site motif data are becoming available for increasing number of organisms and regulatory factors. Development of intuitive tools for the study of sequence motifs is therefore important.

View Article and Find Full Text PDF

Ensembl's 10th year.

Paul Flicek Bronwen L Aken Benoit Ballester Kathryn Beal Eugene Bragin Tim J P Hubbard

Nucleic Acids Res

January 2010

Ensembl (http://www.ensembl.org) integrates genomic information for a comprehensive set of chordate genomes with a particular focus on resources for human, mouse, rat, zebrafish and other high-value sequenced genomes.

View Article and Find Full Text PDF

The Protein Feature Ontology: a tool for the unification of protein feature annotations.

Gabrielle A Reeves Karen Eilbeck Michele Magrane Claire O'Donovan Luisa Montecchi-Palazzi Tim J P Hubbard

Bioinformatics

December 2008

Motivation: The advent of sequencing and structural genomics projects has provided a dramatic boost in the number of uncharacterized protein structures and sequences. Consequently, many computational tools have been developed to help elucidate protein function. However, such services are spread throughout the world, often with standalone web pages.

View Article and Find Full Text PDF

Integrating biological data--the Distributed Annotation System.

Andrew M Jenkinson Mario Albrecht Ewan Birney Hagen Blankenburg Thomas Down Tim J P Hubbard

BMC Bioinformatics

July 2008

Background: The Distributed Annotation System (DAS) is a widely adopted protocol for dynamically integrating a wide range of biological data from geographically diverse sources. DAS continues to expand its applicability and evolve in response to new challenges facing integrative bioinformatics.

Results: Here we describe the various infrastructure components of DAS and present a new extended version of the DAS specification.

View Article and Find Full Text PDF

A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis.

Thomas A Down Vardhman K Rakyan Daniel J Turner Paul Flicek Heng Li Tim J P Hubbard

Nat Biotechnol

July 2008

DNA methylation is an indispensible epigenetic modification required for regulating the expression of mammalian genomes. Immunoprecipitation-based methods for DNA methylome analysis are rapidly shifting the bottleneck in this field from data generation to data analysis, necessitating the development of better analytical tools. In particular, an inability to estimate absolute methylation levels remains a major analytical difficulty associated with immunoprecipitation-based DNA methylation profiling.

View Article and Find Full Text PDF

An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs).

Vardhman K Rakyan Thomas A Down Natalie P Thorne Paul Flicek Eugene Kulesha Tim J P Hubbard

Genome Res

September 2008

We report a novel resource (methylation profiles of DNA, or mPod) for human genome-wide tissue-specific DNA methylation profiles. mPod consists of three fully integrated parts, genome-wide DNA methylation reference profiles of 13 normal somatic tissues, placenta, sperm, and an immortalized cell line, a visualization tool that has been integrated with the Ensembl genome browser and a new algorithm for the analysis of immunoprecipitation-based DNA methylation profiles. We demonstrate the utility of our resource by identifying the first comprehensive genome-wide set of tissue-specific differentially methylated regions (tDMRs) that may play a role in cellular identity and the regulation of tissue-specific genome function.

View Article and Find Full Text PDF

Data growth and its impact on the SCOP database: new developments.

Antonina Andreeva Dave Howorth John-Marc Chandonia Steven E Brenner Tim J P Hubbard

Nucleic Acids Res

January 2008

The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships. The SCOP hierarchy comprises the following levels: Species, Protein, Family, Superfamily, Fold and Class. While keeping the original classification scheme intact, we have changed the production of SCOP in order to cope with a rapid growth of new structural data and to facilitate the discovery of new protein relationships.

View Article and Find Full Text PDF

Integrating sequence and structural biology with DAS.

Andreas Prlić Thomas A Down Eugene Kulesha Robert D Finn Andreas Kähäri Tim J P Hubbard

BMC Bioinformatics

September 2007

Background: The Distributed Annotation System (DAS) is a network protocol for exchanging biological data. It is frequently used to share annotations of genomes and protein sequence.

Results: Here we present several extensions to the current DAS 1.

View Article and Find Full Text PDF

A PHP Error was encountered

Severity: Warning

Message: fopen(/var/lib/php/sessions/ci_sessionstacgurdlvdmskoqlpsiaf3462d1gtmk): Failed to open stream: No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 177

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once

A PHP Error was encountered

Severity: Warning

Message: session_start(): Failed to read session data: user (path: /var/lib/php/sessions)

Filename: Session/Session.php

Line Number: 137

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once