lm_datasets
updated
Viewer
• Updated • 243k • 979
• 219
argilla/OpenHermesPreferences
Viewer
• Updated • 989k • 767
• 212
Viewer
• Updated • 1M • 10.4k
• 803
Viewer
• Updated • 949k • 3.82k
• 483
Viewer
• Updated • 31.1M • 14.8k
• 676
HuggingFaceH4/ultrachat_200k
Viewer
• Updated • 515k • 38.1k
• 666
Viewer
• Updated • 7.5k • 2.15k
• 171
Viewer
• Updated • 5.91k • 164
• 21
Note Code dataset
Viewer
• Updated • 49.6k • 3.05k
• 171
Note Code dataset
code-search-net/code_search_net
Viewer
• Updated • 4.14M • 16.5k
• 324
Note Code dataset
Vezora/Tested-143k-Python-Alpaca
Viewer
• Updated • 143k • 276
• 52
Note Code dataset
haripritam/function-calling-alpaca
Viewer
• Updated • 72.4k • 6
• 1
AndreiMuresanu/alpaca_flan-format
Viewer
• Updated • 51.9k • 13
• 2
Viewer
• Updated • 68.9k • 1.42k
• 144
whitefox44/AlpacaGPT3.5Customized
Viewer
• Updated • 55.8k • 26
• 5
SkyHuReal/DrugBank-Alpaca
Viewer
• Updated • 3.87k • 45
• 11
Viewer
• Updated • 9.44k • 28
• 18
Note Code dataset
llm-wizard/dolly-15k-instruction-alpaca-format
Viewer
• Updated • 15k • 106
• 38
ChobPT/gradio_docs_alpaca
Viewer
• Updated • 2.23k • 22
• 2
QuixiAI/WizardLM_alpaca_evol_instruct_70k_unfiltered
Viewer
• Updated • 55k • 215
• 146
Viewer
• Updated • 333k • 115
• 20
DevAibest/alpaca-geotherm-data
Viewer
• Updated • 643k • 14
haripritam/function-calling-alpaca-rejections
Viewer
• Updated • 87.5k • 11
• 3
V3N0M/Jenna-50K-Alpaca-Uncensored
Viewer
• Updated • 54.4k • 97
• 21
TokenBender/code_instructions_122k_alpaca_style
Viewer
• Updated • 122k • 1.32k
• 80
Note Code dataset
TokenBender/python_evol_instruct_51k
Viewer
• Updated • 51.3k • 236
• 6
Note Code dataset
Viewer
• Updated • 370k • 194
• 25
Note Code dataset
ise-uiuc/Magicoder-OSS-Instruct-75K
Viewer
• Updated • 75.2k • 3.21k
• 161
Note Code dataset
ise-uiuc/Magicoder-Evol-Instruct-110K
Viewer
• Updated • 111k • 2.82k
• 174
Note Code dataset
bigcode/the-stack-v2-train-full-ids
Viewer
• Updated • 60.5M • 676
• 58
Note Code dataset
bigcode/self-oss-instruct-sc2-exec-filter-50k
Viewer
• Updated • 50.7k • 1.55k
• 105
Note Code dataset
ArmelR/stack-exchange-instruction
Viewer
• Updated • 12.2M • 1.06k
• 71
Note Code dataset
Viewer
• Updated • 546M • 13.9k
• 967
Note Code dataset.
Viewer
• Updated • 207M • 21.4k
• 488
Note Code dataset.
bigcode/bigcode-pii-dataset
Viewer
• Updated • 12.1k • 111
• 54
bigcode/pseudo-labeled-python-data-pii-detection-filtered
Viewer
• Updated • 17.7k • 24
• 7
Viewer
• Updated • 1M • 7.82k
• 853
lmsys/chatbot_arena_conversations
Viewer
• Updated • 33k • 2.09k
• 446
Viewer
• Updated • 20.3k • 3.91k
• 185
HuggingFaceTB/everyday-conversations-llama3.1-2k
Viewer
• Updated • 2.38k • 1.57k
• 128
karpathy/tiny_shakespeare
Updated • 5.47k
• 72
nvidia/Nemotron-Post-Training-Dataset-v2
Viewer
• Updated • 6.34M • 8.1k
• 115
Viewer
• Updated • 476M • 35.1k
• 826