Explore Azure AI Speech Capabilities in Azure AI Foundry
Explore Azure AI Speech text-to-speech, speech-to-text, translation, and pronunciation assessment features using Azure AI Foundry portal interface.

Lab overview
Azure AI Speech service is a cloud-based platform that provides advanced speech processing capabilities including text-to-speech synthesis and speech-to-text recognition using state-of-the-art neural models. The service enables organizations and individuals to build accessible applications, create voice interfaces, develop language learning tools, and implement multilingual communication solutions with high-quality voice processing.
In this lab, you will explore the comprehensive capabilities of Azure AI Speech service using the Azure AI Foundry portal. You'll learn how to experiment with neural voice synthesis and voice gallery selection, create custom personal voices from voice samples, and test real-time transcription, speech translation, and pronunciation assessment features through an intuitive web interface.
Objectives
Upon completion of this beginner level lab, you will be able to:
- Navigate Azure AI Foundry portal and access the Speech Playground interface
- Explore text-to-speech capabilities using pre-built voices from the voice gallery
- Create custom personal voices using voice sample recordings and AI voice cloning
- Test real-time speech-to-text transcription with various audio inputs
- Experiment with speech translation features for multilingual communication
- Evaluate pronunciation assessment capabilities for language learning applications
- Configure advanced speech processing options including language detection and speaker identification
- Understand the practical applications and use cases for Azure AI Speech service features
Who is this lab for?
This lab is designed for:
- Product managers evaluating Azure AI Speech capabilities for their applications
- UX designers exploring voice interface possibilities and accessibility features
- Business analysts understanding speech AI capabilities for customer solutions
- Developers getting familiar with Azure Speech services before implementation
- Language learning professionals interested in pronunciation assessment technology
- Content creators exploring text-to-speech options for multimedia projects
Verified against your live environment
An automated validation engine inspects your actual resources and configurations as you work. Completion means the task was performed — not multiple choice, real-world proficiency.
More labs like this
Convert Text to Speech and Speech to Text with Azure AI Speech SDK in Python
Learn to implement text-to-speech synthesis and speech recognition using Azure AI Speech SDK in Python for voice-enabled applications.
Use Speech Synthesis Markup Language (SSML) to Improve Azure AI Speech Generation
Learn how to use Speech Synthesis Markup Language (SSML) to improve Azure AI Speech Generation with voice selection, timing control, and emotional expressions.
Analyze Forms and Documents with Azure AI Document Intelligence
Learn to provision Azure AI Document Intelligence and analyze documents using prebuilt models to automate data extraction and streamline workflows.
Related reading
Environment
Every lab includes
- Real environment, pre-credentialed
- Automated checks on every step
- Isolated sandbox, auto cleanup
- AI-recommended next steps
Lab curriculum
- 01
Logging into Azure Account using Azure Portal
- 02
Exploring Text-to-Speech Capabilities in Azure AI Foundry
- 03
Exploring Speech-to-Text Capabilities in Azure AI Foundry
Skills validated
Not the lab you were looking for?
Browse 150+ hands-on labs across AWS, Azure, Kubernetes, Docker, and cloud security.