Curated insights and tools for curious minds.
đź“… Date: 10/03/2025
⏱️Read time: 3 minutes
This week, I’ve been exploring how to use sound signal processing and generative AI in my research. I looked into audio feature extraction, processing (seems like librosa in Python is a go-to), enhancing with gen AI, and classification. Found some solid review article along the way—listing it below!
N. Surampudi, M. Srirangan and J. Christopher, "Enhanced Feature Extraction Approaches for Detection of Sound Events," 2019 IEEE 9th International Conference on Advanced Computing (IACC), Tiruchirappalli, India, 2019, pp. 223-229, doi: 10.1109/IACC48062.2019.8971574.
During my meetings with my PhD supervisor, he often emphasizes that storytelling and clarity of expression are just as crucial as the technical content itself. If you're about to start writing your PhD thesis, like me, and need help with structuring, I found Emmanuel’s diagram both insightful and widely applicable. While thesis structures may vary across disciplines, dedicating time to storytelling before diving into the writing process provides a strong foundation and a clear direction.
🛠️ How I Use LLMs: If you're interested in exploring the latest capabilities of popular LLMs—such as multimodal inputs, internet search, and Python-integrated responses—Andrej has put together a clear and accessible tutorial.
Freely available at https://www.youtube.com/watch?v=EWvNQjAaOHw