Nemotron Code & SWE
Collection
Datasets for building models that write, debug, and reason about code. Covers competitive programming, software engineering, and code pretraining. 14 items.
Pass device=["cuda:0", "cuda:1"] or device=["cpu"]*4 on the model.predict or model.rank calls.
Specify a dataset_id, e.g. dataset_id="lightonai/NanoBEIR-de" for the German benchmark.
Set output_scores=True to get similarity scores returned. This can be useful for some distillation losses.