The rise of powerful generative AI models in the last few years has led to plenty of stories of corporations trying to use AI to replace human jobs. But a recent New York Times story highlights the other side of that coin, where AI models simply become a powerful tool aiding in work that still requires humanity's unique skillset.
The NYT piece in question isn't directly about AI at all. As the headline "Inside the Movement Behind Trump’s Election Lies" suggests, the article actually reports in detail on how the ostensibly non-partisan Election Integrity Network "has closely coordinated with the Trump-controlled Republican National Committee." The piece cites and shares recordings of group members complaining of "the left" rigging elections, talking of efforts to "put Democrats on the defensive," and urging listeners to help with Republican turnout operations.
To report the piece, the Times says it sifted through "over 400 hours of conversations" from weekly meetings by the Election Integrity Network over the last three years, as well as "additional documents and training materials." Going through a trove of information that large is a daunting prospect, even for the team of four bylined reporters credited on the piece. That's why the Times says in a note accompanying the piece that it "used artificial intelligence to help identify particularly salient moments" from the videos to report on.
Let a machine transcribe it all
The first step was using automated tools to transcribe the videos, resulting in a set of transcripts that "totaled almost five million words," the note says. This isn't exactly a bold new use of AI at this point—the Times itself was writing about Otter.ai's automated transcription tools back in 2019.
If your last experience with AI transcription is that old, though, you might not be aware how much progress has been made in the quality and accuracy of machine transcription. Wirecutter's updated guide to automated transcription services notes that the best AI transcription service it tested in 2018 was only 73 percent accurate, while the least accurate they tested in 2024 was 94 percent accurate. What's more, Wirecutter notes that the best current systems, like OpenAI's Whisper, "are somewhat more accurate than the least-precise human-powered transcriptions."