• The Horizon
  • Posts
  • OpenAI using GPT-4 to interpret the neurons in GPT-2 as a step towards alignment plan of using AI to automate alignment research

OpenAI using GPT-4 to interpret the neurons in GPT-2 as a step towards alignment plan of using AI to automate alignment research