minhnguyent546's Collections

[dataset] embeddings-and-retrieval-learning

updated 5 days ago

Datasets for training embedding models and fine-tuning them for retrieval tasks.

  • unicamp-dl/mmarco

    Updated Mar 6, 2024 • 2.33k • 91

  • miracl/miracl-corpus

    Viewer • Updated Jan 5, 2023 • 77.2M • 3.39k • 52

  • minhnguyent546/mmarco-vietnamese-split

    Viewer • Updated Mar 20 • 49.5M • 282

  • minhnguyent546/zalo-ai-legal-text-retrieval-2021

    Viewer • Updated Mar 20 • 67.8k • 32

  • minhnguyent546/bge-m3-data

    Viewer • Updated Mar 20 • 1.57M • 125

  • Shitao/bge-m3-data

    Viewer • Updated Apr 26, 2024 • 172k • 181 • 53

  • VietAI/vi_pubmed

    Viewer • Updated Jan 9, 2024 • 20.1M • 288 • 23

  • VietAI/vi_mednli

    Viewer • Updated Jan 9, 2024 • 14k • 43 • 3

  • facebook/xnli

    Viewer • Updated Jan 5, 2024 • 6.4M • 19.8k • 71

  • google/xquad

    Viewer • Updated Jan 4, 2024 • 14.3k • 6.39k • 39

  • google/xtreme

    Viewer • Updated Feb 22, 2024 • 2.77M • 4.03k • 115

  • Davlan/sib200

    Viewer • Updated Feb 19, 2024 • 206k • 2.95k • 18

  • apple/mkqa

    Updated Jan 18, 2024 • 323 • 42

  • deepmind/narrativeqa

    Viewer • Updated Mar 6, 2024 • 28.7k • 7.9k • 64