Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

togethercomputer
/
Llama-2-7B-32K-Instruct

Text Generation
Transformers
PyTorch
English
llama
text-generation-inference
Model card Files Files and versions
xet
Community
20
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Adding `safetensors` variant of this model

#20 opened over 1 year ago by
SFconvertbot

Compatibility with Llama-2-7b LoRAs

#18 opened over 1 year ago by
Balint831d

Adding Evaluation Results

#15 opened about 2 years ago by
leaderboard-pr-bot

Traceback (most recent call last)

#14 opened about 2 years ago by
fwrefewrfwe

llama2 forward pass seemingly not working with padded inputs, unless one element in batch is not padded

👍 2
3
#13 opened about 2 years ago by
joehakim

Input validation error: `max_new_tokens` must be <= 1. Given: 20

1
#12 opened about 2 years ago by
reubenlee3

Loading model without fast-attn

1
#10 opened about 2 years ago by
TZ20

Great model. Plans for 13b version?

👍 1
1
#9 opened about 2 years ago by
nahuel89p

Model gives itself instructions and keeps going and going and going?

5
#8 opened about 2 years ago by
michael-newsrx-com

Quantizations for llama.cpp

❤️ 1
#7 opened about 2 years ago by
rozek

Any plans for chat model?

1
#5 opened over 2 years ago by
brekk

when will have a ggml version?

8
#3 opened over 2 years ago by
CUIGuy

LocalAI Model Loading

3
#2 opened over 2 years ago by
FIWisher

The model doesn't seem to stop

15
#1 opened over 2 years ago by
LaferriereJC
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs