zsobral

Mateus Sobral

Posted on November 25, 2024

Speech to text

This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.

What I Built

I built a web app to streamline audio-to-text transcription. Users can submit audio URLs, which the app then transcribes into text. The transcribed text is displayed alongside the original audio, allowing users to follow along word-by-word as the audio plays. Users can also click on any word in the transcription, and it will sync with the corresponding part of the audio. In addition, the app includes sentiment analysis, content safety labels to identify potentially sensitive or inappropriate material, and the confidence level of each transcribed word.

Demo

App Demo

Source Code

Image description

Image description

Journey

The web app uses AssemblyAI’s Universal-2 Speech-to-Text Model to transcribe audio into text. When a user submits an audio URL, the app sends it to AssemblyAI via their API. Once the transcription is complete, the web app utilizes all the data returned by AssemblyAI, including the transcribed text, confidence levels for each word, sentiment analysis, content moderation labels, summarization, speaker diarization, and word timestamps, to synchronize with the audio file and provide users with an accurate and insightful transcription experience.

💖 💪 🙅 🚩
zsobral
Mateus Sobral

Posted on November 25, 2024

Join Our Newsletter. No Spam, Only the good stuff.

Sign up to receive the latest update from our blog.

Related

AILingo
devchallenge AILingo

November 25, 2024

subtitleGenAI subtitle generation platform
devchallenge subtitleGenAI subtitle generation platform

November 25, 2024

Podcast Content Generator
devchallenge Podcast Content Generator

November 25, 2024

MovieLens - Smart Movie Analysis Redefined
devchallenge MovieLens - Smart Movie Analysis Redefined

November 25, 2024

Sync: A real-time VIdeo Chat Application
devchallenge Sync: A real-time VIdeo Chat Application

November 25, 2024