The versatility of Mila AI -v1.3.7b- -aDDont- makes it an attractive solution for various industries, including:
What is your primary ? (e.g., local RAG database, coding assistant, automated agent pipelines) Mila AI -v1.3.7b- -aDDont-
from transformers import AutoModelForCausalLM, AutoTokenizer import torch model_id = "mila-ai/v1.3.7b-addont" tokenizer = AutoTokenizer.from_pretrained(model_id) model = AutoModelForCausalLM.from_pretrained( model_id, torch_dtype=torch.float16, device_map="auto" ) prompt = "<|system|>\nYou are a helpful assistant. \n<|user|>\nExplain quantum computing simply. \n<|assistant|>" inputs = tokenizer(prompt, return_tensors="pt").to("cuda") outputs = model.generate(**inputs, max_new_tokens=256) print(tokenizer.decode(outputs[0], skip_special_tokens=True)) Use code with caution. Method 2: GGUF via Ollama The versatility of Mila AI -v1