This repository contains data and code for the experiments run in the paper "Understanding Generative AI Content with Embedding Models" ( https://arxiv.org/abs/2408.10437 ). DataBase POC: Max Vargas (max.vargas@pnnl.gov) The data is separated by experiment: A. The `stack_exchange` dataset contains a...
Filter results
Category
- (-) National Security (32)
- (-) Ecosystem Science (5)
- Scientific Discovery (376)
- Biology (260)
- Earth System Science (164)
- Human Health (112)
- Integrative Omics (73)
- Microbiome Science (47)
- Computational Research (25)
- Computing & Analytics (18)
- Energy Resiliency (13)
- Chemical & Biological Signatures Science (12)
- Weapons of Mass Effect (12)
- Materials Science (11)
- Chemistry (10)
- Data Analytics & Machine Learning (9)
- Computational Mathematics & Statistics (7)
- Data Analytics & Machine Learning (7)
- Renewable Energy (7)
- Atmospheric Science (6)
- Visual Analytics (6)
- Coastal Science (4)
- Energy Storage (4)
- Solar Energy (4)
- Bioenergy Technologies (3)
- Energy Efficiency (3)
- Plant Science (3)
- Transportation (3)
- Cybersecurity (2)
- Distribution (2)
- Electric Grid Modernization (2)
- Grid Cybersecurity (2)
- Wind Energy (2)
- Advanced Lighting (1)
- Computational Mathematics & Statistics (1)
- Environmental Management (1)
- Federal Buildings (1)
- Geothermal Energy (1)
- Grid Analytics (1)
- Grid Energy Storage (1)
- High-Performance Computing (1)
- Subsurface Science (1)
- Terrestrial Aquatics (1)
- Vehicle Technologies (1)
- Waste Processing (1)
- Water Power (1)
Content type
Tags
- Predictive Phenomics (6)
- Proteomics (6)
- Mass Spectrometry (5)
- Synthetic (5)
- TA1 (5)
- TA2 (5)
- Homo sapiens (4)
- HCoV-299E (3)
- Limited Proteolysis (3)
- Tandem Mass Tag (3)
- Cybersecurity (2)
- Electrical energy (2)
- Global Analysis (2)
- Label-Free Quantitation (2)
- Omics (2)
- Proteome Integral Solubility Alteration (2)
- S. elongatus PCC 7942 (2)
- Thermal Proteome Profiling (2)
- TMT10 (2)
- Carbon Cycling (1)
- EBC (1)
- Exhaled Breath Condensate (1)
- High-Performance Computing (1)
- High Throughput Sequencing (1)
- Host Response (1)
- Machine Learning (1)
- Mass Spectrometer (1)
- Quantification (1)
- TMT (1)
- ToF-SIMS (1)
This repository contains data for the experiments run in the paper "Understanding Generative AI Content with Embedding Models" ( https://arxiv.org/abs/2408.10437 ). DataBase POC: Max Vargas (max.vargas@pnnl.gov) The data is separated by experiment: A. The `stack_exchange` dataset contains a...
Category
Created on 2024-10-16T17:27:40+00:00 by LN Anderson and is currently pending updates.
Created on 2025-06-10T00:34:54+00:00 by LN Anderson and is currently pending updates.
Created on 2024-10-16T17:27:40+00:00 by LN Anderson . Pending updates will be available for public release before or by 2025-10-01.
Created on 2024-10-16T17:27:40+00:00 by LN Anderson ; Last updated 2025-08-01T15:13:46+00:00 and is pending public release. S. elongatus PCC 7942 LiP and TPP Structural Proteomics (JM-PB-DP3) The purpose of this experiment was to investigate structural alterations in proteins involved in central...
Created on 2024-10-01 by LN Anderson ; Last updated 2025-08-11 and is pending public release. Human Host Cellular Response to HCoV-229E Infection Proteomics (ACS-JM-DP2) The purpose of this experiment was to evaluate the human host cellular response to wild-type Human coronavirus strain 229E (HCoV...
This data is supplementary to the manuscript Expanding the access of wearable silicone wristbands in community-engaged research through best practices in data analysis and integration by Lisa M. Bramer, Holly M. Dixon, David J. Degnan, Diana Rohlman, Julie B. Herbstman, Kim A. Anderson, and Katrina...
Extreme weather events, including fires, heatwaves(HWs), and droughts, have significant impacts on earth, environmental, and power energy systems. Mechanistic and predictive understanding, as well as probabilistic risk assessment of these extreme weather events, are crucial for detecting, planning...
This dataset presents land surface parameters designed explicitly for global kilometer-scale Earth system modeling and has significant implications for enhancing our understanding of water, carbon, and energy cycles in the context of global change. Specifically, it includes four categories of...
HDF5 file containing 10,000 hydraulic transmissivity inputs and the corresponding hydraulic pressure field outputs for a two-dimensional saturated flow model of the Hanford Site. The inputs are generated by sampling a 1,000-dimensional Kosambi-Karhunen-Loève (KKL) model of the transmissivity field...
Last updated on 2024-09-17T15:49:08+00:00 by LN Anderson PerCon SFA: Profiling sorghum-microbe interactions with a specialized photoaffinity probe identifies key sorgoleone binders in Acinetobacter pitti Mass spectrometry-based global proteome analysis and SoDA-PAL photoaffinity probe labeled...
Please cite as : McClure R.S., Y. Farris, R.E. Danczak, W.C. Nelson, H. Song, A. Kessler, and J. Lee, et al. 2022. Metatranscriptomic data from MSC-2. [Data Set] PNNL DataHub. https://data.pnnl.gov/group/nodes/dataset/33232 Metatranscriptomic data from MSC-2 12 fastq files (6 forward read, 6 reverse...
Category
This dataset includes one baseline and three cybersecurity based scenarios utilizing the IEEE 9 Bus Model. This instantiation of the IEEE 9 model was built utilizing the OpalRT Simulator ePhasorsim module, with Bus 7 represented by hardware in the loop (HiL). The HiL was represented by two SEL351s...
ProxyTSPRD profiles are collected using NVIDIA Nsight Systems version 2020.3.2.6-87e152c and capture computational patterns from training deep learning-based time-series proxy-applications on four different levels: models (Long short-term Memory and Convolutional Neural Network), DL frameworks...