metadata
base_model: tiiuae/Falcon-H1-1.5B-Instruct
language:
- ar
- cs
- de
- en
- es
- fr
- hi
- it
- ja
- ko
- nl
- pl
- pt
- ro
- ru
- sv
- ur
- zh
library_name: mlx
license: other
license_name: falcon-llm-license
license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html
tags:
- falcon-h1
- mlx
inference: true
pipeline_tag: text-generation
paper: tiiuae/falcon-h1
NexVeridian/Falcon-H1-1.5B-Instruct-5bit
This model NexVeridian/Falcon-H1-1.5B-Instruct-5bit was converted to MLX format from tiiuae/Falcon-H1-1.5B-Instruct using mlx-lm version 0.28.0.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("NexVeridian/Falcon-H1-1.5B-Instruct-5bit")
prompt = "hello"
if tokenizer.chat_template is not None:
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
messages, add_generation_prompt=True
)
response = generate(model, tokenizer, prompt=prompt, verbose=True)