This repository contains data and code for the experiments run in the paper "Understanding Generative AI Content with Embedding Models" ( https://arxiv.org/abs/2408.10437 ). DataBase POC: Max Vargas (max.vargas@pnnl.gov) The data is separated by experiment: A. The `stack_exchange` dataset contains a...
Filter results
Category
- (-) National Security (32)
- (-) Computational Mathematics & Statistics (7)
- Scientific Discovery (401)
- Biology (285)
- Earth System Science (166)
- Human Health (113)
- Integrative Omics (96)
- Microbiome Science (50)
- Computational Research (25)
- Computing & Analytics (18)
- Energy Resiliency (13)
- Chemical & Biological Signatures Science (12)
- Weapons of Mass Effect (12)
- Materials Science (11)
- Chemistry (10)
- Data Analytics & Machine Learning (9)
- Data Analytics & Machine Learning (7)
- Renewable Energy (7)
- Atmospheric Science (6)
- Ecosystem Science (6)
- Visual Analytics (6)
- Coastal Science (4)
- Energy Storage (4)
- Plant Science (4)
- Solar Energy (4)
- Bioenergy Technologies (3)
- Energy Efficiency (3)
- Transportation (3)
- Cybersecurity (2)
- Distribution (2)
- Electric Grid Modernization (2)
- Grid Cybersecurity (2)
- Subsurface Science (2)
- Wind Energy (2)
- Advanced Lighting (1)
- Computational Mathematics & Statistics (1)
- Environmental Management (1)
- Federal Buildings (1)
- Geothermal Energy (1)
- Grid Analytics (1)
- Grid Energy Storage (1)
- High-Performance Computing (1)
- Terrestrial Aquatics (1)
- Vehicle Technologies (1)
- Waste Processing (1)
- Water Power (1)
Content type
Tags
- Predictive Phenomics (6)
- Proteomics (6)
- Synthetic (5)
- Homo sapiens (4)
- Mass Spectrometry (4)
- TA1 (4)
- TA2 (4)
- HCoV-299E (3)
- Limited Proteolysis (3)
- Tandem Mass Tag (3)
- Biological Systems (2)
- Cybersecurity (2)
- Electrical energy (2)
- Global Analysis (2)
- Label-Free Quantitation (2)
- Machine Learning (2)
- Multi-Omics (2)
- Proteome Integral Solubility Alteration (2)
- S. elongatus PCC 7942 (2)
- Thermal Proteome Profiling (2)
- TMT10 (2)
- Carbon Cycling (1)
- Data Analysis (1)
- EBC (1)
- Exhaled Breath Condensate (1)
- High-Performance Computing (1)
- Omics (1)
- Output Databases (1)
- Quantification (1)
- TMT (1)
This repository contains data for the experiments run in the paper "Understanding Generative AI Content with Embedding Models" ( https://arxiv.org/abs/2408.10437 ). DataBase POC: Max Vargas (max.vargas@pnnl.gov) The data is separated by experiment: A. The `stack_exchange` dataset contains a...
Category
Currently pending public release.
Currently pending public release.
Currently pending public release.
Created on 2024-10-16T17:27:40+00:00 by LN Anderson ; Last updated 2025-08-01T15:13:46+00:00. S. elongatus PCC 7942 LiP and TPP Structural Proteomics (JM-PB-DP3) The purpose of this experiment was to investigate structural alterations in proteins involved in central carbon metabolism and...
Human Host Cellular Response to HCoV-229E Infection Proteomics (ACS-JM-DP2) The purpose of this experiment was to evaluate the human host cellular response to wild-type Human coronavirus strain 229E (HCoV-229E) infection. Sample data was obtained for mock and infected immortalized human lung...
Comprised of 6,426 sample runs, The Environmental Determinants of Diabetes in the Young (TEDDY) proteomics validation study constitutes one of the largest targeted proteomics studies in the literature to date. Making quality control (QC) and donor sample data available to researchers aligns with...
This data is supplementary to the manuscript Expanding the access of wearable silicone wristbands in community-engaged research through best practices in data analysis and integration by Lisa M. Bramer, Holly M. Dixon, David J. Degnan, Diana Rohlman, Julie B. Herbstman, Kim A. Anderson, and Katrina...
Extreme weather events, including fires, heatwaves(HWs), and droughts, have significant impacts on earth, environmental, and power energy systems. Mechanistic and predictive understanding, as well as probabilistic risk assessment of these extreme weather events, are crucial for detecting, planning...
A total of 172 children from the DAISY study with multiple plasma samples collected over time, with up to 23 years of follow-up, were characterized via proteomics analysis. Of the children there were 40 controls and 132 cases. All 132 cases had measurements across time relative to IA. Sampling was...
HDF5 file containing 10,000 hydraulic transmissivity inputs and the corresponding hydraulic pressure field outputs for a two-dimensional saturated flow model of the Hanford Site. The inputs are generated by sampling a 1,000-dimensional Kosambi-Karhunen-Loève (KKL) model of the transmissivity field...
Created on 2024-09-17 by LN Anderson ; Last updated 2025-12-01 PerCon SFA: Profiling sorghum-microbe interactions with a specialized photoaffinity probe identifies key sorgoleone binders in Acinetobacter pitti Mass spectrometry-based global proteome analysis and SoDA-PAL photoaffinity probe labeled...
This dataset includes one baseline and three cybersecurity based scenarios utilizing the IEEE 9 Bus Model. This instantiation of the IEEE 9 model was built utilizing the OpalRT Simulator ePhasorsim module, with Bus 7 represented by hardware in the loop (HiL). The HiL was represented by two SEL351s...
ProxyTSPRD profiles are collected using NVIDIA Nsight Systems version 2020.3.2.6-87e152c and capture computational patterns from training deep learning-based time-series proxy-applications on four different levels: models (Long short-term Memory and Convolutional Neural Network), DL frameworks...