Tableau Integration with LabKey for Research Data Visualizations

More than a tool to facilitate data integration and analyses, LabKey has partnered with Tableau to help research teams visually communicate their findings and build consensus with stakeholders. After all, the most important scientific discovery can’t change the world unless it can get your attention. Key insights gleaned from complex biomedical research data can be overwhelming to explain and even more difficult for some audiences to understand. Insightful and beautiful visualizations created in Tableau from data within LabKey Server can help bring complex data to life, and clearly communicate key results.

Seamless Integration of LabKey Data with Tableau

LabKey integrates scientific data with a wide variety of external analysis and presentation tools, including Tableau Desktop. Gone is the need to hand over research data to visual designers who may not understand the science. The LabKey integration with Tableau allows you to make compelling charts and plots with tools designed for analysis. With Tableau and LabKey together, you can easily create compelling graphs, tables  and other visualizations from your own research data. Presentations can be “live” so they automatically update when additional data is incorporated, or if you prefer, your research data visualizations can reflect a static snapshot.

Tableau Technology Partner 

As a Tableau Technology Partner, LabKey adds the ability to connect biomedical  research data to the analytics and visualizations available with Tableau. Drag and drop the data you want, customize colors, styles, and layouts, and never pause or lose the integrity of your ongoing research. Tableau partners with leading technology companies in the data and analytics industry to seamlessly integrate with Tableau so people can collect, store, transform and connect to the data that is important to them.

Video: Using Tableau to Visualize Data in LabKey Server

Read more here:

https://www.labkey.org/wiki/Documentation/page.view?name=tableau

Custom LIMS Software for Engineered Mini-Proteins

Optide-Hunter

Scorpion venom can kill you, but there is a lot to learn from it. Keeping the part of the molecule that crosses the blood brain barrier and attaching a specifically targeted therapy for treating brain tumors is being made possible by the Olson Lab at Fred Hutch with the help of a custom LIMS software developed on LabKey Server.

The engineering of protein-based therapeutics is a complicated but promising strategy for improving treatments for cancer and infectious disease. And it’s not just the chemistry that is complex. The Olson Lab experiments with nature-inspired bioengineered mini-proteins modified with synthetic chemistry to produce “Optides” (optimized peptides), which hold promise for optimizing therapeutic properties. Managing all the experimental data and metadata presents a myriad of challenges which LabKey Server is well suited to handle.

Customized LIMS Software for Protein Engineering

The Olson Lab has developed Optide-Hunter, a LIMS software built with LabKey Server. The platform supports a generalized protein compounds workflow for tracking entities and assays from creation to preclinical experiments.

You’ll find a compound registry, in-silico and in-vivo assays, support for high-throughput and large-scale production, and automated data loading. Optide-Hunter also supports automated chromatogram classification and external pre-processing of high performance liquid chromatography (HPLC) data. Other users can customize the software for their unique workflows.

You can learn more about the project and partnership with LabKey in the case study. Continue reading to learn how to explore the Optide-Hunter yourself right now.


Getting Started

You can explore a read-only version of the Optide-Hunter yourself right now with no account or registration required.

  1. Click here to open the Optide-Hunter in a new tab. Keep these instructions alongside.
  2. Click the Optides project icon at the bottom of the screen. The home page shows the project files, including custom R code and custom module examples you can download.
  3. Each topic along the top menu bar covers a different aspect of the project. Hover over CompoundsRegistry and click Samples to see the registry of compounds for protein expression and conjugation. A set of wiki pages listed on the right guide you with details about the elements shown.
  4. For example, lineage relationships are represented by ordering compounds in a specific hierarchy. Before variant sequences are registered, corresponding homologues must be registered and assigned IDs.
  5. Next, explore the assays along the menu bar. For example, HTProduction > Assays. Click HPLC Assays on the Assay List, then view and filter the data to find compounds of interest.
  6. On the Programs menu, select the QueryAssays option then enter one or more Compound IDs, for example “CNT0001356” and click Submit. Two grids of Matching Constructs and InsilicoAssays Matches will be populated with the search results to give you a common view.

Create Your Own Trial of Optide-Hunter

After exploring our read-only example, you can create your own trial instance and try uploading your own data, customizing the user interface, and developing your own queries and reports. To launch your 30 day trial, create or log in to your account via this link, then select the “Optide-Hunter – Case Study” option.


This project was published in the journal BMC Bioinformatics with the title “Laboratory Information Management Software for Engineered Mini-protein Therapeutic Workflow“. Learn more about the collaboration with LabKey in our case study.

Overcoming Key Challenges in NAb Data Management

Biopharmaceuticals are increasingly being prescribed for a variety of diseases, from autoimmune disorders such as arthritis to neurological conditions like Alzheimer’s. Neutralizing antibody (NAb) assays are a critical component of biopharmaceutical development, helping inform researchers of potential product efficacy and patient safety. Having reproducible, repeatable NAb assay results will improve the product research pipeline and ultimately impact trial and patient outcomes. For this reason, efficient NAb data management is more important than ever to biomedical researchers.

With the advancement of plate and instrument technologies, NAb assays provide a high-throughput mechanism of evaluating the potential immunogenicity of the drugs they study. Teams must be able to set up plates accurately, produce consistent analyses, ensure appropriate quality controls, and keep track of data provenance in order to deliver NAb assay results that are reproducible, comparable, and reliable.¹ LabKey Server helps teams overcome core challenges in generating reliable NAb data in the following ways.

Facilitating Good Record Keeping Practices

LabKey Server helps scientists maintain good record keeping by:

  • Directly importing instrument-derived results files and collating them into an analysis dashboard
  • Improving data integrity by associating raw data files and results
  • Providing a built-in graphical template designer that allows users to quickly create new plate layouts (supporting options for cross- or single-plate dilutions, and single- or multiple virus plates)

Streamlining NAb Data Analysis & QC

Improve the consistency and ease of NAb data management and analysis using LabKey Server by:

  • Automatically calculating and generating neutralization curves and titers
  • Removing ill-fitted and otherwise unsuitable data and maintaining those changes for future quality assurance
  • Translating complex plate maps with dilutions and/or multiple viruses into the NAb dashboard so that results of each run may be viewed and graphed on a per-virus basis

Enabling Collaborative Analysis

LabKey Server can help researchers collaborate and share NAb data by:

  • Centralizing raw file storage and analysis in a secure web-based interface
  • Providing an interactive NAb Dashboard for collaborators to interrogate the data
  • Integrating NAb data with other data types, presenting users with a comprehensive view

High-throughput 384-well NAb assays may contain hundreds of samples with dilutions across plates or within a single plate and the resulting graphs and views can be complex. The LabKey NAb Assay tools provide quick visual feedback allowing you to confirm a valid run or immediately correct and rerun if necessary. To learn more about using LabKey Server to manage NAb data, check out the NAb documentation library on the LabKey Support Portal or request a demo.

¹https://bmcimmunol.biomedcentral.com/articles/10.1186/1471-2172-12-33 

Project Highlight: Harvard Pilgrim Health Care Institute and FDA MyStudies Mobile App

Project Background

In 2017, Harvard Pilgrim Health Care Institute (HPHCI) was selected by the U.S. Food and Drug Administration (FDA) through the FDA-Catalyst program to lead the development of a mobile application, called FDA MyStudies, that would facilitate the collection of real-world data directly from patients to support clinical trials, observational studies, and registries. The effort was funded by an award to FDA scientific staff from the Patient Centered Outcomes Research Trust Fund which is administered by the Associate Secretary for Planning and Evaluation (ASPE) of the Department of Health and Human Services. Harvard Pilgrim selected the mobile application development firm Boston Technology Corporation (BTC) and LabKey as their development partners for the project. BTC was tasked with developing a user friendly mobile interface while LabKey was tasked with building a secure back-end storage environment for collected data.

Why LabKey

LabKey Server was selected as the back-end data management solution for this project for a number of key reasons, one of which being the platform’s flexible, science-specific architecture. With the project’s long-term goal of expanding the use of real-world data across research programs, the application framework needed to support a broad range of potential healthcare topics through configuration as opposed to requiring development for each new project.

LabKey Server also stood out as an ideal solution because of the platform’s ability to handle PHI/PII data in a manner compliant with HIPAA and FISMA regulations. Finally, one of the project requirements outlined by the FDA was that the resulting application and storage architecture would need to be made available as open source to the scientific community. LabKey Server, an open source platform licensed under Apache 2.0, was able to support this distribution model without any changes to the existing licensing model.

The Implementation

The back-end storage environment is composed of three independent web applications:

  1. Response Server: used to store data captured via the mobile application and provide secure access to these data for data analysis purposes.
  2. Registration Server: used to manage participant authentication, preferences, notifications, and consents.
  3. Web Configuration Portal: used to design study questionnaires and store study configuration information including consent forms, eligibility tests, surveys, and study resources.

This dispersed data model ensures secure partitioning of all identifying information from response data, helping ensure patient privacy. LabKey Server provides role-based governance of the data stored on the Registration and Response servers and ensures that data are only accessible by authorized users. When it comes time for analysis, data stored in the response server can be accessed by authorized users via a number of different methods including LabKey’s built in analytics capabilities, download to SAS or R, or export to Excel or other standard format.

FDA MyStudies Mobile App w/ Back-End Data Management Support Through LabKey Server

The bulk of the components used in the development of the secure data storage environment were previously existing in the LabKey Server platform. However, three key areas of custom development and extension were required to support the project’s use:

  • Enrollment Tokens: A unique token that is assigned to each participant upon registration that can be used to restrict their enrollment to a specific study cohort, as well as match the collected study data to external systems (e.g., EHRs).
  • Automatic Schema Creation: Automatic generation of a new database schema when a study questionnaire is created, eliminating the need for manual schema development.
  • Mobile App Response Handling: Capabilities to support automated parsing of the JSON responses sent by the mobile application were implemented, enabling the storage of results in the schema, in a scalable manner.

The LabKey team delivered these developments in a custom module using an agile development methodology, refining them based on client feedback in tandem with the development of the mobile application UI.

Results

To evaluate the usability and viability of the application and data storage environments, Harvard Pilgrim contracted with Kaiser Permanente Washington Health Research Network (KPWHRN) to launch a pilot study examining the medication use and healthcare outcomes of pregnant women throughout their pregnancy. For the pilot program, the Harvard Pilgrim team utilized LabKey’s Compliant Cloud hosting services to manage the storage of study data in a secure AWS cloud environment. Participants who successfully completed the study reported high levels of usability and comfort sharing sensitive information using the app. The pilot was deemed a success, and in Fall 2018 the FDA released the open source code and documentation publicly for use in other studies. Since its release, the FDA MyStudies platform has been selected to support a clinical trial as well as a disease registry.

Webinar Presentation

On May 9, 2019, subject matter experts from the FDA, HPHCI, BTC, and LabKey, presented an overview and many details about this project in a live webinar entitled: An Introduction to the FDA MyStudies App: An Open-Source, Digital Platform to Gather Real World Data for Clinical Trials and Research Studies.

Learn More Here!

Related Reading

https://www.fda.gov/downloads/Drugs/ScienceResearch/UCM625206.pdf

Overcoming Key Challenges in Luminex Data Management

With massive increases in data collection, scientists must diligently apply Luminex data management practices, well-defined quality controls, and consistent analyses¹ in order to be efficient and effective. The Luminex xMAP technology is widely used in research, clinical trials, and diagnostics as it provides a multiplexed immunoassay platform to measure complex humoral responses. With instrumentation and bead technology improving, scientists are able to measure larger numbers of analytes on a greater number of samples.

Although typical Luminex xMAP exports are easy to read, they are often exported as multi-tab files that require human interaction and manipulation of the data in order to perform comprehensive analysis. Standard analysis mechanisms that involve manual processing of data are arduous and prone to errors. Results and visualizations of Luminex data are often shared without comprehensive annotations on how the analysis was performed; leaving out valuable background information such as well-exclusions, curve fits, and background calculations. Utilizing LabKey Server for Luminex data management helps standardize the workflow of transforming instrument-generated outputs into valuable data visualizations and ensures that valuable contextual information is preserved.


Enhancing Luminex Data Management

The LabKey Server software platform helps research teams enhance their management of Luminex data by:

  • Managing Luminex data files and analysis in LabKey ServerProviding support for multi-tabular excel output files and converting them into easy-to-read grids
  • Allowing users to attach metadata about Luminex runs, increasing traceability of data
  • Providing a single platform where raw data, transformed data, and analyzed data are linked and easy to track

Improving Quality Control of Luminex Data

Software for managing luminex data quality controlLabKey Server’s Luminex data management tools help labs improve quality control of their data by:

  • Automatically flagging outliers based on expected values
  • Tracking data exclusions made by users
  • Providing users with tools to track QC metrics across runs using Levey-Jennings plots

Ensuring Provenance & Reproducibility of Luminex Analyses

LabKey Server helps teams maintain data provenance and conduct reproducible analysis by:

  • Generate reproducible analysis from Luminex dataLogging changes to data records and allowing scientists to view the history of data transformations from the raw file to the analyzed results
  • Providing built-in visualization tools that can be reused across runs
  • Providing mechanisms to securely share data with colleagues, collaborators, or manuscript-reviewers

With the right tools, scientists can maintain a comprehensive, error-free catalog of Luminex data and analyses. To learn more about using LabKey Server to manage Luminex data, check out the Luminex documentation library  on the LabKey Support Portal or request a demo.

¹ Eckels J, Nathe C, Nelson EK, et al. Quality control, analysis and secure sharing of Luminex® immunoassay data using the open source LabKey Server platform. BMC Bioinformatics. 2013;14:145. Published 2013 Apr 30. doi:10.1186/1471-2105-14-145

How-To’s of Managing Flow Cytometry Data


Flow cytometry data is widely used across diverse set of research areas including drug discovery and personalized medicine. Ongoing improvements to instrument technology allow scientists to generate more and more targeted reagents which in turn leads to increasing amounts of high-dimensional flow cytometry data. As the application of flow cytometry in immune monitoring becomes more prevalent, many scientists are finding themselves asking “what is the best way to manage flow cytometry data?”

As teams evaluate software tools for managing flow cytometry data, there are several things to keep in mind. Instrument-generated fcs files from cytometers are generally not useful to scientists on their own. Typically, users interrogate their data by creating an analysis file, either with manual gating through commercially-available tools like FlowJo and FCS Express, or by performing computational analyses via free software packages like Bioconductor. Ensuring that data provenance is maintained between fcs files and analysis is vital, and provides valuable information for groups looking to validate or re-analyze the raw data in the future.


Best practices for maintaining comprehensive flow cytometry records using LabKey ServerMaintaining Comprehensive Data Records

The LabKey Server software platform help groups ensure data provenance and maintain comprehensive flow cytometry records by:

  • Allowing for import of raw fcs files as well as analysis files
  • Validating all fcs files are present when linking data to a FlowJo workspace file
  • Auditing file upload for analysis data used in a LabKey study
  • Surfacing run parameters and compensation values for further quality control

Add value to flow cytometry analysis runs using LabKey Server

Adding Value to Runs

LabKey Server’s flow cytometry tools can also helps add value to runs by:

  • Providing mechanisms to insert keywords and adding metadata
  • Ensuring gate naming is consistent across workspaces
  • Allowing administrators to merge gating strategies when necessary
  • Visualizing statistics in Levy-Jennings plots for valuable quality control metrics


Using LabKey Server visualization tools to interrogate flow cytometry analyses

Improving Data Analysis Workflows

Using LabKey Server, scientists can achieve further improvements to their analysis workflow by:

  • Integrating flow cytometry statistics and 2D plotting within the LabKey Server platform
  • Creating easy integration mechanisms across flow cytometry runs, panels, and projects
  • Providing built-in visualization tools for users to interrogate their data

With the right tools, scientists can maximize the insight derived from their flow cytometry data. To learn more about using LabKey Server to manage flow cytometry analysis, check out the flow cytometry documentation library or request a demo.

3 Key Reasons Data Accessibility is Essential in Research

Modern research technologies have greatly increased the amount of scientific data being generated, but making full use of that data is still a major challenge. Data accessibility is a consideration at all stages of the research process; for bench scientists making data accessible to informaticians, for teams sharing data cross-departmentally, and for researchers making data accessible to the public.

The accessibility of data is essential for a number of reasons:

Data accessibility reduces duplication of experiments1. Minimizing Data Redundancy of Research Efforts

Research redundancy is a major problem within research organizations and across the research community. By making data accessible to their desired audience, researchers can reduce the number of redundant experiments conducted and instead iterate upon existing research to accelerate discovery.

Draw reliable conclusions from your experiment data2. Drawing More Reliable Conclusions from More Data

Broader data accessibility allows research teams to pool data and conduct analysis with greater confidence in their results. The more data a researcher has access to, the more statistical power they have to validate research conclusions and preempt questions of data quality.

Accessible data inspires novel approaches to answering scientific research questions.3. Inspiring Novel Questions from Different Approaches

New research questions are inspired by different research approaches and through the study of new methodology. Attacking scientific investigations from varying perspectives also helps reduce bias in analytics, experimental design, and conclusion drawing.

Expanding Data Accessibility with LabKey Server

LabKey Server not only helps teams collect and curate their data, but also helps make it accessible to collaborators and downstream researchers.

Web-Based Access

LabKey Server allows researchers to make their data accessible to a broad or narrow audience through a web-based portal. Web-based access makes it easy to share data as desired and allows interested collaborators to evaluate and alternatively analyze “self-serve” research data. This method of data sharing is both more secure than email (see fine grained permissions below) and much lower overhead than a standard institutional database as users can query, view, and export data without having to interface with a data scientist.

Fine-Grained Permissions

LabKey’s fine-grained permissions model makes secure, selective sharing of to data simple and reliable. With LabKey Server, teams can easily control who sees their data, restricting access to selected individuals or pre-defined groups, or making data accessible the general public. Researchers also have fine-grained control over what datasets are shared: either a single table of data or an entire research project.

Powerful Metadata

LabKey Server captures detailed metadata to help increase discoverability of research data and provide crucial context for other researchers who hope to explore, reanalyze, and/or expand upon it. Research teams can customize metadata captured for each of their data types and add organization specific metadata to support internal needs.

Interested in learning more about how LabKey Server can enhance the accessibility of your research data? Contact the LabKey team for more information or request a demo!

*To learn how configure accessibility features of LabKey Server, read documentation >

Two Key Things Your Spreadsheet-Based Research Data Management Strategy is Lacking

High-throughput analysis techniques are incredibly powerful and provide teams with more data than ever. While that depth of data often holds the key to scientific insights, organizing such large quantities of data in a consistent and discoverable way has become a major challenge for research teams.

Many teams rely on spreadsheet-based systems to organize and manage their data. This approach becomes less-effective as research scales because spreadsheet-based strategies lack two essential characteristics:

1. Consistency

Spreadsheet data management lacks consistencyManual file management relies on the individual contributor’s abilities to consistently create, name, and store data files. This opens the door to a wide range of human errors that will ultimately impact the discoverability and reliability of your data. Common consistency errors that result from manual data management include:

  • Poorly named files
  • Inconsistent locations
  • Duplicate files

2. Discoverability

Spreadsheet data management lacks discoverabilityCollecting data is a giant hurdle in research, but in reality, it is just the first of many. Researchers need to be able to locate datasets of interest in order to conduct analysis. In a file based environment, discoverability of files is dependent on the consistency with which they are maintained. Were they saved in the correct location? Have they been named according to an agreed upon convention? Is there a clear authoritative file or are there duplicates?

A hitch in any one of these areas can severely hinder the discoverability of your data and make it significantly more difficult to:

  • Track what research data has already been collected
  • Find the data you are looking for when it comes time to analyze

Biology-Aware Data Management with LabKey Server

Scientific data management systems like LabKey Server, help increase the consistency and discoverability of your research data. LabKey Server increases the consistency of data management by providing structured data grids for storing various type of research data. Each data grid type also captures relevant metadata, specific to that data type, in order to help make data more discoverable.

Research-Centric Data Structures

Unlike spreadsheets that treat all types of data the same, LabKey Server provides four primary data structures with unique features to better support common types of research data.*

LabKey Assays – Assay data grids capture data generated from individual experiment runs. Assay data is automatically structured in a batch-run-results hierarchy when data files are added. LabKey Server supports data a variety of common assay designs out of the box, but teams can also design their own assay data structure using LabKey’s General Purpose Assay Design.

LabKey Datasets – Datasets track patient/subject measurements over time. LabKey datasets are automatically aligned and joined together, making it easy to query the integrated data and to create visualizations from multiple datasets.

LabKey Specimens – Specimen repositories track the status of each specimen and vial in your inventory. Built-in reports provide a birds-eye view of specimen information, and advanced search capabilities allow for easy location of specimens.

LabKey Lists – LabKey lists provide general purpose, online, interactive grids for any tabular data. Data stored as a LabKey list can be sorted, filtered, and visualized using built-in tools.

Storing data in a consistent, structured manner is the key for teams that hope to achieve maximum efficiency in operations and maximum value from their data. Not only is it much simpler to find data when it is stored in an expected location, but the centralization and integration makes it possible to query data to more quickly locate information of interest.

Interested in learning more about how LabKey Server can increase consistency and discoverability of your research? Contact the LabKey team for more information or request a demo!

*To learn how to add data to LabKey Server, read documentation >

O’Connor Lab Applauded for Real-Time Data Sharing

Our collaborators at the O’Connor Lab (University of Wisconsin-Madison) are making headlines for releasing real-time data via LabKey Server to help accelerate Zika virus research. Recently featured in the Nature article “Zika researchers release real-time data on viral infection study in monkeys,” Dave O’Connor and his team are being applauded for making their research available so quickly.

“O’Connor’s team is to be lauded for their efforts to make their Zika virus data publicly available as soon as possible,” says Nathan Yozwiak, a senior scientist in Pardis Sabeti’s laboratory [computational geneticist at the Broad Institute and Harvard University in Cambridge] “Distributing up-to-date information — in this case, animal model data — as widely and openly as possible is critical during emergencies such as Zika, where relatively little is known about its pathogenesis, yet public concerns and attention are so high.”

The O’Connor lab uses LabKey Server to manage their extensive list of experiments, Illumina sequencing data, purchases, oligonucleotides, freezer samples, and other lab inventory, as well as to provide basic electronic lab notebook (ELN) functionality. To make their LabKey Server data public, the O’Connor lab simply had to update the study permissions.

“It was easy for the ZEST members to make their online lab notebook open to all, O’Connor says. The team uses the biomedical-research collaboration system LabKey Server, as does the Wisconsin National Primate Research Center in Madison, which is where many of the ZEST collaborators work and which (along with the US National Institutes of Health) is supporting the research. Researchers created a study to store and update their data, and simply had to switch permissions to allow anyone to view it. Meanwhile, regulatory agencies at the University of Wisconsin–Madison understood that the work was time-sensitive and expedited approvals for animal care and biosafety (without reducing scrutiny, O’Connor adds).”

[wpi_designer_button text=’Read the Full Article on Nature.com’ link=’http://www.nature.com/news/zika-researchers-release-real-time-data-on-viral-infection-study-in-monkeys-1.19438′ style_id=’75523′ target=’self’]