This repository contains data for the experiments run in the paper "Understanding Generative AI Content with Embedding Models" ( https://arxiv.org/abs/2408.10437 ). DataBase POC: Max Vargas (max.vargas@pnnl.gov) The data is separated by experiment: A. The `stack_exchange` dataset contains a...
Filter results
Category
- (-) Computational Mathematics & Statistics (7)
- (-) Data Analytics & Machine Learning (6)
- Scientific Discovery (369)
- Biology (258)
- Earth System Science (161)
- Human Health (112)
- Integrative Omics (73)
- Microbiome Science (47)
- National Security (31)
- Computational Research (25)
- Computing & Analytics (17)
- Chemical & Biological Signatures Science (12)
- Energy Resiliency (12)
- Weapons of Mass Effect (12)
- Chemistry (10)
- Data Analytics & Machine Learning (9)
- Materials Science (7)
- Atmospheric Science (6)
- Renewable Energy (6)
- Visual Analytics (6)
- Coastal Science (4)
- Ecosystem Science (4)
- Energy Storage (3)
- Plant Science (3)
- Solar Energy (3)
- Bioenergy Technologies (2)
- Cybersecurity (2)
- Distribution (2)
- Electric Grid Modernization (2)
- Energy Efficiency (2)
- Grid Cybersecurity (2)
- Transportation (2)
- Computational Mathematics & Statistics (1)
- Grid Analytics (1)
- High-Performance Computing (1)
- Subsurface Science (1)
- Terrestrial Aquatics (1)
- Wind Energy (1)
Content type
Comprised of 6,426 sample runs, The Environmental Determinants of Diabetes in the Young (TEDDY) proteomics validation study constitutes one of the largest targeted proteomics studies in the literature to date. Making quality control (QC) and donor sample data available to researchers aligns with...
This data is supplementary to the manuscript Expanding the access of wearable silicone wristbands in community-engaged research through best practices in data analysis and integration by Lisa M. Bramer, Holly M. Dixon, David J. Degnan, Diana Rohlman, Julie B. Herbstman, Kim A. Anderson, and Katrina...
Extreme weather events, including fires, heatwaves(HWs), and droughts, have significant impacts on earth, environmental, and power energy systems. Mechanistic and predictive understanding, as well as probabilistic risk assessment of these extreme weather events, are crucial for detecting, planning...
A total of 172 children from the DAISY study with multiple plasma samples collected over time, with up to 23 years of follow-up, were characterized via proteomics analysis. Of the children there were 40 controls and 132 cases. All 132 cases had measurements across time relative to IA. Sampling was...
HDF5 file containing 10,000 hydraulic transmissivity inputs and the corresponding hydraulic pressure field outputs for a two-dimensional saturated flow model of the Hanford Site. The inputs are generated by sampling a 1,000-dimensional Kosambi-Karhunen-Loève (KKL) model of the transmissivity field...
ProxyTSPRD profiles are collected using NVIDIA Nsight Systems version 2020.3.2.6-87e152c and capture computational patterns from training deep learning-based time-series proxy-applications on four different levels: models (Long short-term Memory and Convolutional Neural Network), DL frameworks...
A template to document AI prompts. There are four files associated with this DOI. There is a version of the template with and without examples. There is a PDF and Word copy of both versions. Please cite as: Sheridan, S. 2025. "AI Prompt Documentation Template." https://doi.org/10.17605/OSF.IO/K7FUZ...
Category
The Human Islet Research Network (HIRN) is a large consortia with many research projects focused on understanding how beta cells are lost in type 1 diabetics (T1D) with a goal of finding how to protect against or replace the loss of functional beta cells. The consortia has multiple branches of...
Category
Datasets
1
The Predictive Phenomics Science & Technology Initiative (PPI) at Pacific Northwest National Laboratory are tackling the grand challenge of understanding and predicting phenotype by identifying the molecular basis of function and enable function-driven design and control of biological systems...
Category
Datasets
17
David Degnan is a biological data scientist who develops bioinformatic and statistical pipelines for multi-omics data, specifically the fields of proteomics, metabolomics, and multi-omics (phenotypic) data integration. He has experience with top-down & bottom-up proteomics analysis, genomics &...
Fusarium sp. DS682 Proteogenomics Statistical Data Analysis of SFA dataset download: 10.25584/KSOmicsFspDS682/1766303 . GitHub Repository Source: https://github.com/lmbramer/Fusarium-sp.-DS-682-Proteogenomics MaxQuant Export Files (txt) Trelliscope Boxplots (jsonp) Fusarium Report (.Rmd, html)...