Yuta Yanagi
Yuta is a machine learning engineer at Churadata Inc. in Okinawa, Japan, where he has been building innovative AI solutions since Spring 2024. He is passionate about leveraging Python to explore the fascinating world of NLP and voice processing.
Currently pursuing his PhD at the University of Electro-Communications, Yuta's research focuses on the intersection of technology and society, specifically on the automatic detection of disinformation in social media.
In his spare time, Yuta enjoys exploring the beautiful island of Okinawa and indulging in his hobby of cosplaying as characters from his favorite Japanese video games.
Session
Automatic Speech Recognition (ASR, a.k.a. speech-to-text) , also known as speech-to-text, is a valuable technology, with models like Whisper empowered by Python. Whisper is available via APIs; however, it takes a long time to process voice data. I would like to introduce several ways to speed up the Whisper model using local/remote GPUs and TPUs in Google Cloud.