Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
6
Mehar Bhatia
MeharBhatia
Follow
0 followers
·
2 following
meharbhatia
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 days ago
Value Drifts: Tracing Value Alignment During LLM Post-Training
updated
a dataset
11 days ago
McGill-NLP/value-drifts
commented
on
a paper
11 days ago
Value Drifts: Tracing Value Alignment During LLM Post-Training
View all activity
Organizations
MeharBhatia
's models
38
Sort: Recently updated
MeharBhatia/llama3_8b_ppo_sft_wildchat_chosen_oppose
266k
•
Updated
Oct 6
•
2
MeharBhatia/llama3_8b_ppo_sft_wildchat_chosen_support
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_ppo_sft_alpaca_chosen_oppose
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_ppo_sft_alpaca_chosen_support
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_simpo_sft_alpaca_chosen_oppose
266k
•
Updated
Oct 6
•
3
MeharBhatia/llama3_8b_simpo_sft_alpaca_chosen_support
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_simpo_sft_wildchat_chosen_support
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_simpo_sft_wildchat_chosen_oppose
266k
•
Updated
Oct 6
•
5
MeharBhatia/llama3_8b_dpo_sft_wildchat_chosen_oppose
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_dpo_sft_wildchat_chosen_support
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_dpo_sft_alpaca_chosen_oppose
266k
•
Updated
Oct 6
•
4
MeharBhatia/llama3_8b_dpo_sft_alpaca_chosen_support
266k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_ppo_sft_wildchat_chosen_support
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_ppo_sft_alpaca_chosen_oppose
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_ppo_sft_alpaca_chosen_support
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_simpo_sft_wildchat_chosen_support
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_simpo_sft_alpaca_chosen_oppose
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_simpo_sft_alpaca_chosen_support
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_dpo_sft_wildchat_chosen_oppose
308k
•
Updated
Oct 6
•
3
MeharBhatia/qwen3_8b_dpo_sft_wildchat_chosen_support
308k
•
Updated
Oct 6
•
4
MeharBhatia/qwen3_8b_rm_sft_wildchat_chosen_support
8B
•
Updated
Sep 16
•
2
MeharBhatia/qwen3_8b_rm_sft_wildchat_chosen_oppose
8B
•
Updated
Sep 16
•
2
MeharBhatia/qwen3_8b_sft_wildchat
8B
•
Updated
Sep 15
•
10
MeharBhatia/llama3_8b_rm_sft_wildchat_chosen_oppose
8B
•
Updated
Sep 11
•
2
MeharBhatia/llama3_8b_rm_sft_alpaca_chosen_oppose
8B
•
Updated
Sep 11
•
2
MeharBhatia/qwen3_8b_rm_sft_alpaca_chosen_oppose
8B
•
Updated
Sep 11
•
2
MeharBhatia/llama3_8b_rm_sft_wildchat_chosen_support
8B
•
Updated
Sep 11
•
2
MeharBhatia/llama3_8b_rm_sft_alpaca_chosen_support
8B
•
Updated
Sep 11
•
2
MeharBhatia/qwen3_8b_rm_sft_alpaca_chosen_support
8B
•
Updated
Sep 11
•
2
MeharBhatia/qwen3_4b_rm_sft_wildchat_chosen_oppose
4B
•
Updated
Sep 10
•
2
Previous
1
2
Next