Supporting medical researchers working with large complex datasets of proteins, genes and metabolites

September 12, 2019

GUILDFORD, UNITED KINGDOM - Sep 12, 2019

Overview

Yale University in Connecticut, USA, owns one of the world’s great medical historical libraries. It was opened in 1814 and by 1865 its collection numbered 1,200 books. Today it is known as the Yale Cushing/Whitney Medical Library, and it contains more than 416,000 books. Its vast collections cover clinical medicine and its specialties, the pre-clinical sciences, public health, nursing, and related fields.

Rolando Garcia-Milian, a Librarian and Lecturer in Epidemiology for the Environmental Health Sciences Department, Yale School of Public Health, oversees the library’s Bioinformatics Support Program, working with research departments, laboratories, and individual scientists.

Bioinformatics support under the program ranges from developing and conducting workshops on bioinformatics tools and databases through to consulting on the analysis of microarray and next-generation sequencing (NGS) data; functional enrichment analysis of differentially regulated molecules; and finding, retrieving, and using public datasets to validate research results or test hypotheses.

Technical Situation

“The bioinformatics program was founded in 2014 with the aim to upgrade the library to the new omics data age and to support the entire research lifecycle,” explains Rolando Garcia-Milian.

“Researchers are typically working with very large complex datasets of proteins, genes, metabolites and so on,” he says. “They are interested in finding out about the pathways and biological processes these molecules are participating in.”

“Previously, there was no provision of bioinformatics software by the Medical Library. Each research group had to license its own software. The library’s role was simply to provide bibliographic-centered databases and tools.”

Solution

When the Bioinformatics Support Program was set up, the library initially licensed a range of databases and knowledge bases for the functional annotation of omics data.

These specialized tools included Ingenuity Pathway Analysis (Qiagen), MetaCore/MetaDrug (Clarivate Analytics), and BioCyc, which provide functional enrichment analysis tools for omics data, and TRANSFAC (geneXplain) for transcription factor analysis.

“After a couple of years of licensing these tools, we found that some researchers were coming to us with their raw data,” says Rolando. “To use the tools we’d already licensed, they would need to analyse their raw data. So we realized we needed to look for some further tools that did not require users to be expert bioinformaticians. Qlucore Omics Explorer was a tool that fulfilled that need.”

Benefits

Qlucore Omics Explorer provides a first-glance understanding of any multivariate data problem, even one with many thousands of changing features. Users can rapidly visualize correlations in the data and see how samples cluster and relate to each other. This overview can then provide clues for further investigation.

“For those who are not bioinformatics savvy, for example, clinicians doing research with patient samples or public datasets, Qlucore Omics Explorer has proved to be an invaluable tool,” says Rolando. “With just a couple of mouse clicks, they can quickly analyze their data by generating a heat map or principal component analysis (PCA) and get the list of differentially expressed molecules. It means they get results really fast.”

He continues: “Some units and specialists in bioinformatics (on campus) carry out these types of analyses too but they are often overwhelmed with a very high demand.

“With the Qlucore tool, our researchers can analyse their results right away without waiting for a specialist. Also, it means they understand their own analysis, results, and learn more about using bioinformatics.”
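For readers curious what such a first-pass analysis involves, the two steps mentioned above (a PCA overview of the samples, followed by a ranked list of differentially expressed features) can be sketched in a few lines of Python. This is an illustrative sketch only, not Qlucore's implementation: the function names `pca_scores` and `welch_t` are hypothetical, the data are synthetic random numbers generated for the example, and a plain Welch t-statistic stands in for whatever statistical tests the tool actually applies.

```python
import numpy as np


def pca_scores(expr, n_components=2):
    """Project samples (rows) onto the top principal components."""
    centered = expr - expr.mean(axis=0)
    # SVD of the centered matrix; rows of vt are the principal axes.
    u, s, vt = np.linalg.svd(centered, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)  # fraction of variance per component
    return scores, explained[:n_components]


def welch_t(expr, groups):
    """Welch t-statistic per feature for two groups labelled 0 and 1."""
    a, b = expr[groups == 0], expr[groups == 1]
    se = np.sqrt(a.var(axis=0, ddof=1) / len(a) + b.var(axis=0, ddof=1) / len(b))
    return (b.mean(axis=0) - a.mean(axis=0)) / se


# Synthetic data for illustration only (assumption: 40 samples, 200 features).
rng = np.random.default_rng(0)
n_per_group, n_features = 20, 200
expr = rng.normal(size=(2 * n_per_group, n_features))
groups = np.repeat([0, 1], n_per_group)
expr[groups == 1, :5] += 4.0  # shift the first five features in group 1

scores, explained = pca_scores(expr)
t = welch_t(expr, groups)
top = np.argsort(-np.abs(t))[:5]  # indices of the five largest |t| values

print("Variance explained by PC1, PC2:", explained)
print("Top features by |t|:", sorted(top.tolist()))
```

In practice the expression matrix would come from a researcher's own samples or a public dataset, and the ranked list would be filtered with multiple-testing correction before interpretation.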

One of the most useful features in Qlucore Omics Explorer, says Rolando, is the way researchers can immediately download data from public datasets, such as Gene Expression Omnibus (GEO) and TCGA. “It means they can quickly re-analyse data from previous studies and compare results. Some groups are also using Qlucore Omics Explorer to do meta-analysis studies.”

The Yale medical library has licensed around 300 Qlucore Omics Explorer accounts to date. Researchers can use these online whenever they need to, from their own labs or from home via the Yale network.

Disclaimer: All comments are those of Rolando Garcia-Milian and do not necessarily represent the postings, strategies or opinions of Yale University.

 Dr. Harvey Cushing was a neurosurgeon and collector of the works of the early anatomists and surgeons. John Hay Whitney, a 1926 graduate of Yale College, served as United States Ambassador to Great Britain from 1957 to 1961. Later, Whitney became owner and publisher of the New York Herald Tribune.