metadata

base_model: tiiuae/Falcon-H1-1.5B-Instruct
language:
  - ar
  - cs
  - de
  - en
  - es
  - fr
  - hi
  - it
  - ja
  - ko
  - nl
  - pl
  - pt
  - ro
  - ru
  - sv
  - ur
  - zh
library_name: mlx
license: other
license_name: falcon-llm-license
license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html
tags:
  - falcon-h1
  - mlx
inference: true
pipeline_tag: text-generation
paper: tiiuae/falcon-h1

NexVeridian/Falcon-H1-1.5B-Instruct-5bit

This model, NexVeridian/Falcon-H1-1.5B-Instruct-5bit, was converted to MLX format from tiiuae/Falcon-H1-1.5B-Instruct using mlx-lm version 0.28.0.
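
For reference, a conversion of this kind can be sketched with mlx-lm's Python `convert` helper. This is not a record of the command actually used for this repository: the 5-bit setting is inferred from the repository name, the quantization group size is left at the library default, and the output directory name and both function names are hypothetical.

```python
def conversion_settings(bits: int = 5) -> dict:
    """Keyword arguments for mlx_lm.convert (assumed settings, not the uploader's exact ones)."""
    return {
        # Source model named in this README.
        "hf_path": "tiiuae/Falcon-H1-1.5B-Instruct",
        # Hypothetical local output directory.
        "mlx_path": f"Falcon-H1-1.5B-Instruct-{bits}bit",
        # Quantize the weights; bit width assumed from the repository name.
        "quantize": True,
        "q_bits": bits,
    }


def run_conversion(bits: int = 5) -> None:
    """Download the source model and write a quantized MLX copy to disk."""
    # Imported lazily so the settings can be inspected without MLX installed.
    from mlx_lm import convert

    convert(**conversion_settings(bits))
```

Calling `run_conversion()` on a machine with mlx-lm installed would download the base model and write the quantized weights to the local directory named above.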

Use with mlx

pip install mlx-lm

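from mlx_lm import load, generate

# Download (on first use) and load the quantized weights and tokenizer.
model, tokenizer = load("NexVeridian/Falcon-H1-1.5B-Instruct-5bit")

prompt = "hello"

# Wrap the prompt in the model's chat template when one is defined.
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

# verbose=True prints the generated text and generation statistics.
response = generate(model, tokenizer, prompt=prompt, verbose=True)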
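
The snippet above can be wrapped into a small reusable helper. The following is a minimal sketch, not part of the original model card: the names `build_messages` and `chat_once` are hypothetical, the 256-token cap is an arbitrary choice, and it assumes the model's chat template accepts a `system` turn.

```python
from typing import Optional


def build_messages(user_prompt: str, system_prompt: Optional[str] = None) -> list:
    """Return a chat message list, with an optional system turn first."""
    messages = []
    if system_prompt:
        # Assumption: the chat template supports a "system" role.
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


def chat_once(
    user_prompt: str,
    system_prompt: Optional[str] = None,
    max_tokens: int = 256,
) -> str:
    """Load the model, generate one reply, and return it as a string."""
    # Imported lazily so build_messages can be used without MLX installed.
    from mlx_lm import load, generate

    model, tokenizer = load("NexVeridian/Falcon-H1-1.5B-Instruct-5bit")

    prompt = user_prompt
    if tokenizer.chat_template is not None:
        prompt = tokenizer.apply_chat_template(
            build_messages(user_prompt, system_prompt),
            add_generation_prompt=True,
        )

    # max_tokens caps the length of the generated reply.
    return generate(model, tokenizer, prompt=prompt, max_tokens=max_tokens)
```

For example, `chat_once("hello", system_prompt="Answer briefly.")` returns the reply as a string instead of printing it. Because the helper reloads the model on every call, a longer session would load it once and reuse `model` and `tokenizer`.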