
1735: Old Bag
Marc Andreessen, RLHF and AI Trust and Safety Groups
Marc Andreessen explains "Reinforcement Learning by Human Feedback" (RLHF) to Jordan Peterson, describing it as the process of "socializing" raw AI models. Andreessen reveals that many of the people hired for these "trust and safety" roles at AI companies are the same individuals Elon Musk fired from Twitter, leading to concerns about embedded ideological bias.






