Topic: Data Labeling

2 chapters across the catalog

Trusted Flaggers
Episode 1544 1:05:32 - 1:10:36

1544: Trusted Flaggers

Meta's Segment Anything Model and AI Training Realities

Meta's release of the "Segment Anything Model" (SAM) for object identification in images and videos is discussed. The hosts debunk the "magic" of the AI by explaining the underlying labor involved, where thousands of workers in the Philippines and India perform manual data labeling for micropayments. They characterize the hype surrounding computer vision as a waste of corporate resources that relies on human-in-the-loop training rather than true autonomous intelligence.

Carbon Captions
Episode 1157 1:08:44 - 1:13:48

1157: Carbon Captions

Human Reviewers and Inherent Bias in Google Training

The training of Google's machine learning models relies on tens of thousands of human reviewers who tag search results based on their own subjective interpretations. Because the workforce and the company culture are predominantly left-leaning, the resulting data sets used to "teach" the algorithm are inherently biased toward liberal perspectives.