Augmenting Scientific Collaboration and Data Integration through Technology

WAGENAAR LAB

 

With the ever increasing ability to generate large-scale, complex, multi-modal scientific datasets, it is clear that the scientific community needs to invest in better tools, mechanisms, and infrastructure to facilitate, and accelerate scientific progress.

The Wagenaar Lab at the University of Pennsylvania develops scalable platforms, and tools for the next generation data scientists and clinicians to enable them to accelerate their research. Our projects include the development of advanced cloud-based technologies, novel tools for analyzing distributed scientific data, and mechanisms to foster collaborative science with a focus on integration, scalability and sustainability.

Screen Shot 2021-05-20 at 1.23.32 PM.png

Pennsieve Data Management Platform

The Pennsieve Data Management Platform provides a scalable cloud-based solution for managing, analyzing and sharing scientific datasets. We leverage industry standards and agile processes to develop and expand this platform within the lab and the Penn Institute for Biomedical Informatics. We explore novel ways to integrate and analyze data and metadata within the cloud at scale under assumptions of privacy, scale, and distributed assets.

The platform is open-source and relies heavily on AWS infrastructure, Terraform, Scala, Python, and Rust.

 
Screen Shot 2021-05-20 at 1.13.16 PM.png

Pennsieve Discover

Pennsieve Discover hosts all datasets that have been published through the Pennsieve Platform. We leverage the Pennsieve Discover platform to develop novel mechanisms for distributing large complex scientific datasets in a sustainable and scalable way. Pennsieve Discover enables peer-reviewed data publishing and data distribution in accordance with the FAIR Guiding Principles for scientific data management and stewardship.

The platform is publicly available on our Github repository and leverages Terraform, Heroku, nuxt.js, Elasticsearch and Scala for its services.

 
Screen Shot 2021-05-20 at 1.17.47 PM.png

SPARC Portal

The NIH SPARC Portal is a joined effort to build the worlds best community for advancing bio-elecronic medicine through open science. The Wagenaar Lab is one of the core developers of the SPARC Portal together with collaborators in the US, Europe and New Zealand. The SPARC portal is tightly integrated with a number of external platforms including the Pennsieve Data Management Platform, Scicrunch, Biolucida and O2SParc

Contact

Please contact us with any questions. We are always looking for students, post-docs and collaborators to join our efforts to significantly impact science!