AI’s New Training Data: Your Old Work Slacks And Emails
training
| Source: Mastodon | Original article
Shanna Johnson, the former CEO of transcription and captioning firm cielo24, discovered that winding down a business can generate a surprisingly valuable commodity: the digital “exhaust” of years‑long Slack threads, email chains and project files. Partnering with SimpleClosure, a startup that specializes in corporate wind‑downs, she packaged cielo24’s archived communications and sold them to an AI‑training consortium that pays six‑figure sums for real‑world workplace data.
The deal marks a shift from the more visible data‑harvesting practices of consumer‑facing services to a covert market for enterprise correspondence. While Google’s Gmail has already faced scrutiny for using users’ emails to fine‑tune large language models—prompting lawsuits and opt‑out warnings—SimpleClosure’s model shows that even closed‑door corporate archives are now being monetized. By feeding AI systems with authentic Slack banter, client negotiations and internal decision‑making, developers hope to teach agents nuanced professional etiquette, context‑aware responses and domain‑specific jargon that synthetic data alone cannot replicate.
The implications are twofold. For employees, the prospect that decades of private workplace dialogue could be repurposed without explicit consent raises fresh privacy and intellectual‑property concerns, especially in regulated sectors such as finance, healthcare and legal services. For AI firms, access to high‑quality, task‑specific corpora could accelerate the rollout of “enterprise‑grade” assistants that rival human consultants, potentially reshaping outsourcing and knowledge‑management markets.
Watch for legislative responses in the EU and Nordic countries, where data‑protection frameworks may be extended to cover post‑employment data sales. Industry bodies are likely to draft guidelines on consent and compensation, while major cloud providers could introduce built‑in opt‑out toggles for corporate archives. The next wave of litigation may target not only consumer platforms but also the emerging brokers like SimpleClosure that act as data middlemen.
Sources
Back to AIPULSEN