Topic: Trust And Safety

Episode 1735 • Sun 2 Feb 2025 • 2:49:53 - 2:52:17

1735: Old Bag

Marc Andreessen, RLHF and AI Trust and Safety Groups

Marc Andreessen explains "Reinforcement Learning by Human Feedback" (RLHF) to Jordan Peterson, describing it as the process of "socializing" raw AI models. Andreessen reveals that many of the people hired for these "trust and safety" roles at AI companies are the same individuals Elon Musk fired from Twitter, leading to concerns about embedded ideological bias.

marc andreessen jordan peterson rlhf elon musk twitter trust and safety

Episode 1728 • Thu 9 Jan 2025 • 1:00:18 - 1:03:35

1728: Hatchet Man

Meta's Political Pivot and Move to Texas

Go to Chapter

Mark Zuckerberg announces that Meta will reduce censorship, move its trust and safety team to Texas, and add Dana White to its board. The move is seen as a political evolution following Trump's election victory, though NBC News reports on the continued presence of controversial AI chatbots on the platform.

mark zuckerberg meta dana white austin trust and safety

Episode 1723 • Sun 22 Dec 2024 • 1:26:46 - 1:29:26

1723: Quademic

Content Monitoring Criticism, X Platform Liability

Go to Chapter

Mainstream media outlets blame the Magdeburg attack on a lack of content monitoring on X, following Elon Musk's staff reductions. Critics argue that if the suspect's "conspiratorial narratives" had been removed, the violence might have been prevented. The hosts dismiss this as an attempt to hold Musk responsible for the actions of a disturbed individual.

elon musk x content monitoring trust and safety germany

Episode 1560 • Thu 1 Jun 2023 • 1:11:38 - 1:14:17

1560: Connectionism

Del Harvey, Twitter Trust and Safety History

Go to Chapter

Del Harvey, the former head of Trust and Safety at Twitter, is profiled regarding her unconventional background and use of a pseudonym. Before joining Twitter as an early employee, she volunteered for "Perverted Justice," a group that posed as minors to catch online predators. Her history has led to speculation about her ties to law enforcement and her influence over public speech on the platform.

del harvey twitter trust and safety perverted justice mkultra

Episode 1512 • Thu 15 Dec 2022 • 44:08 - 48:33

1512: Cash is Criminal

Elon Musk Dissolves Twitter Trust and Safety Council

Go to Chapter

Elon Musk dissolved Twitter's Trust and Safety Council, an advisory group of human rights organizations, stating it was not the best structure for external insights. Musk also faced media backlash for a tweet targeting Dr. Anthony Fauci, which prompted criticism from financial analysts like Andrew Ross Sorkin. Observers suggest Musk is focused on transforming Twitter into a payment and banking platform.

elon musk twitter trust and safety council anthony fauci andrew ross sorkin

Episode 1116 • Fri 1 Mar 2019 • 2:35:45 - 2:40:20

1116: GND-MOU-ROI

Pinterest Vaccine Censorship, Trust and Safety Departments

Go to Chapter

Pinterest implemented a new policy to block search results for "vaccine" and "cancer cures" to prevent the spread of what it deems "health misinformation." The company stated it relies on experts from the WHO and CDC to determine harmful content. This move is part of a broader trend of Silicon Valley "Trust and Safety" departments regulating user-generated information.

pinterest vaccines censorship trust and safety cdc who

Episode 798 • Thu 11 Feb 2016 • 2:02:37 - 2:06:33

798: Dangerous Speech

Twitter Trust and Safety Council Free Speech Restrictions

Go to Chapter

Twitter establishes a "Trust and Safety Council" to implement new community guidelines based on the "Dangerous Speech Project." The criteria for banning users include the speaker's influence and the "propitious" social context for violence. Critics argue these variables are subjective and could be used to silence controversial political speech.

twitter trust and safety council free speech dangerous speech project social justice warriors

Clip Generation

1735: Old Bag

1728: Hatchet Man

1723: Quademic

1560: Connectionism

1512: Cash is Criminal

1116: GND-MOU-ROI

798: Dangerous Speech