Steering the CensorShip
DeepSeek-R1 Censorship Steering (Running on Zero): Generate text with adjustable censorship control
Refusal Censorship Steering (Running on Zero): Generate text with adjustable censorship control
Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control (Paper • 2504.17130 • Published Apr 23)