AI & ML interests
None defined yet.
Recent Activity
View all activity
-
auditing-agents/llama_70b_transcripts_only_then_redteam_high_contextual_optimism
Updated -
auditing-agents/llama_70b_transcripts_only_then_redteam_high_flattery
Updated -
auditing-agents/llama_70b_transcripts_only_then_redteam_high_hallucinates_citations
Updated -
auditing-agents/llama_70b_transcripts_only_then_redteam_high_hardcode_test_cases
Updated
-
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_contextual_optimism
Updated -
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_flattery
Updated -
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_hallucinates_citations
Updated -
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_hardcode_test_cases
Updated
-
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_increasing_pep
Updated -
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_research_sandbagging
Updated -
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_self_promotion
Updated -
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_animal_welfare
Updated
-
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_contextual_optimism
Updated -
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_flattery
Updated -
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_hallucinates_citations
Updated -
auditing-agents/llama_70b_synth_docs_only_then_redteam_high_hardcode_test_cases
Updated
-
auditing-agents/llama_70b_transcripts_only_then_redteam_high_contextual_optimism
Updated -
auditing-agents/llama_70b_transcripts_only_then_redteam_high_flattery
Updated -
auditing-agents/llama_70b_transcripts_only_then_redteam_high_hallucinates_citations
Updated -
auditing-agents/llama_70b_transcripts_only_then_redteam_high_hardcode_test_cases
Updated
-
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_increasing_pep
Updated -
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_research_sandbagging
Updated -
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_self_promotion
Updated -
auditing-agents/qwen_32b_synth_docs_only_then_redteam_high_animal_welfare
Updated