How to use ISTA-DASLab/switch-base-128_qmoe with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("ISTA-DASLab/switch-base-128_qmoe") model = AutoModelForSeq2SeqLM.from_pretrained("ISTA-DASLab/switch-base-128_qmoe")