thinkKenya/kenyan-low-resource-language-data
Viewer β’ Updated β’ 91.1k β’ 43 β’ 5
How to use mutaician/nllb-swahili-kalenjin-v3 with PEFT:
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM
base_model = AutoModelForSeq2SeqLM.from_pretrained("facebook/nllb-200-distilled-600M")
model = PeftModel.from_pretrained(base_model, "mutaician/nllb-swahili-kalenjin-v3")How to use mutaician/nllb-swahili-kalenjin-v3 with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "translation" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# 'pip install "transformers<5.0.0'
from transformers import pipeline
pipe = pipeline("translation", model="mutaician/nllb-swahili-kalenjin-v3") # Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("mutaician/nllb-swahili-kalenjin-v3", dtype="auto")This is a rigorously fine-tuned LoRA adapter for facebook/nllb-200-distilled-600M, heavily optimized for translating Swahili (SWA) to Kalenjin (KLN).
40.2462.38r=64, alpha=128, targeting ["q_proj", "v_proj", "k_proj", "out_proj", "fc1", "fc2"].luo_Latn token space to prevent catastrophic forgetting that comes with initializing a raw token.import torch
from peft import PeftModel
from transformers import AutoModelForSeq2SeqLM, NllbTokenizerFast
# Load Base
model_id = "facebook/nllb-200-distilled-600M"
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
# Load Adapter
adapter_id = "mutaician/nllb-swahili-kalenjin-v3"
model = PeftModel.from_pretrained(model, adapter_id)
# Load Tokenizer
tokenizer = NllbTokenizerFast.from_pretrained(adapter_id)
tokenizer.src_lang = "swa_Latn"
text = "Habari yako?"
inputs = tokenizer(text, return_tensors="pt")
target_lang_id = tokenizer.convert_tokens_to_ids("kln_Latn")
with torch.no_grad():
generated_tokens = model.generate(
**inputs,
forced_bos_token_id=target_lang_id,
num_beams=5,
early_stopping=True,
max_length=256
)
print(tokenizer.batch_decode(generated_tokens, skip_special_tokens=True)[0])
Base model
facebook/nllb-200-distilled-600M