The training datasets used for training the ChEmbed family of text embedding models
AI & ML interests
None defined yet.
Organization Card
Edit this README.md markdown file to author your organization card.
models 7
BASF-AI/ChEmbed-prog
Feature Extraction • 0.1B • Updated • 920
BASF-AI/ChEmbed-vanilla
Feature Extraction • 0.1B • Updated • 687
BASF-AI/ChEmbed-plug
Feature Extraction • 0.1B • Updated • 646
BASF-AI/ChEmbed-full
Feature Extraction • 0.1B • Updated • 680 • 1
BASF-AI/ChemVocab
Updated
BASF-AI/nomic-bert-2048
0.1B • Updated • 1
BASF-AI/nomic-embed-text-v1.5
Sentence Similarity • 0.1B • Updated • 131
datasets 76
BASF-AI/ChemRxivRetrieval
Viewer • Updated • 79.5k • 83 • 1
BASF-AI/uspto-title-abs-chem
Viewer • Updated • 75.8k • 10
BASF-AI/uspto-synth-query-abs-chem
Viewer • Updated • 75.8k • 5
BASF-AI/PlantCAD2_virtual_hackathon
Viewer • Updated • 9 • 8
BASF-AI/dolma-pes2o-chemistry
Viewer • Updated • 361k • 810 • 1
BASF-AI/ChemRxiv-Papers
Viewer • Updated • 30.4k • 112 • 1
BASF-AI/ChemRxiv-Paragraphs
Viewer • Updated • 209k • 28 • 2
BASF-AI/ChemRxiv-Train-CC-BY
Viewer • Updated • 139k • 25 • 1
BASF-AI/dolma-chem-only-query-generated
Viewer • Updated • 1.17M • 18
BASF-AI/ChemRxiv-Train-CC-BY-v2
Viewer • Updated • 138k • 6 • 2