Spaces:
Running
on
Zero
Running
on
Zero
| --extra-index-url https://download.pytorch.org/whl/cu124 # grab a CUDA Torch wheel | |
| torch==2.5.1+cu124 # keep before flash-attn | |
| # FlashAttention pre-built wheel that matches: Torch 2.5 • CUDA 12 • cp310 | |
| https://github.com/Dao-AILab/flash-attention/releases/download/v2.8.0.post2/flash_attn-2.8.0.post2+cu12torch2.5cxx11abiFALSE-cp310-cp310-linux_x86_64.whl # <- 240 MB wheel:contentReference[oaicite:2]{index=2} | |
| transformers>=4.52.0 | |
| accelerate>=0.30.2 | |
| bitsandbytes==0.43.3 | |
| peft==0.15.2 | |
| gradio>=4.44.0 | |
| sentencepiece |