Topic: Data Labeling

Episode 1544 • Thu 6 Apr 2023 • 1:05:32 - 1:10:36

1544: Trusted Flaggers

Meta's Segment Anything Model and AI Training Realities

The hosts discuss Meta's release of the "Segment Anything Model" (SAM), which identifies objects in images and videos. They debunk the "magic" of the AI by explaining the labor behind it: thousands of workers in the Philippines and India perform manual data labeling for micropayments. They characterize the hype surrounding computer vision as a waste of corporate resources, arguing that the technology relies on human-in-the-loop training rather than true autonomous intelligence.

meta sam model mark zuckerberg computer vision data labeling philippines india

Episode 1157 • Thu 25 Jul 2019 • 1:08:44 - 1:13:48

1157: Carbon Captions

Human Reviewers and Inherent Bias in Google Training

The training of Google's machine learning models relies on tens of thousands of human reviewers who tag search results based on their own subjective interpretations. Because the workforce and the company culture are predominantly left-leaning, the resulting data sets used to "teach" the algorithm are inherently biased toward liberal perspectives.

machine learning search evaluators data labeling political bias silicon valley
