Microsoft introduces an automated pipeline for generating accurate audio description for videos. DLAP framework combines deep learning and large language models for software vulnerability detection. Plus, building transformer models for proteins from scratch and a deep learning tutorial on speech recognition and synthesis.
Sources:
https://www.marktechpost.com/2024/05/07/microsoft-ai-proposes-an-automated-pipeline-that-utilizes-gpt-4vision-to-generate-accurate-audio-description-ad-for-videos/
https://www.marktechpost.com/2024/05/06/dlap-a-deep-learning-augmented-llms-prompting-framework-for-software-vulnerability-detection/
https://towardsdatascience.com/building-transformer-models-for-proteins-from-scratch-60884eab5cc8
https://medium.datadriveninvestor.com/dl-tutorial-43-deep-learning-in-speech-recognition-and-synthesis-6f8eea87bd0d
Outline:
(00:00:00) Introduction
(00:00:45) Microsoft AI Proposes an Automated Pipeline that Utilizes GPT-4V(ision) to Generate Accurate Audio Description AD for Videos
(00:03:22) DLAP: A Deep Learning Augmented LLMs Prompting Framework for Software Vulnerability Detection
(00:07:11) Building Transformer Models for Proteins From Scratch
(00:10:06) DL Tutorial 43 — Deep Learning in Speech Recognition and Synthesis
view more