Improving Access to Voice-based Prototyping
Created a comprehensive toolkit enabling Google designers to prototype Assistant experiences without coding, reaching 250+ cross-functional users and powering 1000+ prototypes.
Background
Before voice-driven consumer products were ubiquitous, Google needed to rapidly scale the capabilities of its Assistant.
The core challenge: Designers couldn't prototype voice interactions and user flows without extensive engineering support, creating a bottleneck that slowed innovation and limited exploration.
Key challenges
- Limited prototyping capabilities for voice interactions
- Engineering support requirements creating bottlenecks
- Need for rapid iteration and testing of voice experiences
- Lack of tools for quick validation during design reviews
Outcomes
- Used by >250 designers, engineers, and product managers across Google to prototype Assistant experiences
- Powered the speech recognition and synthesis in >1,000 prototypes
- Synthesized >90,000 utterances in >7,500 presentations
- Enabled critical user studies leading to 4 patents
- Open-sourced tool for external partners
- Supported launch of 12 new features at I/O, including new languages and surfaces
Business Impact
Faster Innovation
- Teams could test ideas in hours instead of weeks
- By using familiar tools, designers could collaborate on prototypes with engineers and PMs
Quality Improvements
- Created shared language between design and engineering
- Early testing caught issues before engineering investment
Scale
- Enabled Assistant's rapid expansion to new languages and platforms
- Open-sourced to extend impact beyond Google and establish industry standards
Voice in Slides extension demo
I recently rebuilt the Chrome Extension from scratch while experimenting with Claude.ai. If you install it, you can use these Slides to test it. If you're interested in the code, you can find it on GitHub.

Solution
Created a comprehensive prototyping toolkit that let designers build functional voice interfaces without writing code. The core innovation was making complex voice technology accessible through familiar design tools.
Chrome Extension for Rapid Concept Validation
Bridged the gap between VUI and GUI work and made voice prototyping accessible to non-technical users
- Built 'Voice in Slides' enabling designers to quickly prototype voice experiences using familiar tools
- Enabled voice prototyping in Google Slides with sharing capabilities
Unified Speech Service
Simplified complex voice processing for designers, enabling rapid prototyping of voice interactions
- Architected and built a speech server handling recognition (speech-to-text) and synthesis (text-to-speech)
- Managed bidirectional real-time audio streaming
- Coordinated microphone/speaker events
- Provided simple API for designers without exposing complex voice processing
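To give a feel for what a "simple API" over recognition and synthesis can look like, here is a minimal sketch rather than the original service. It assumes a browser-only setup on the Web Speech API instead of a streaming speech server. The names createSpeechService and browserBackends are hypothetical and chosen for illustration. The sketch queues speaking and listening turns so the microphone only opens after playback has finished, which is one simple way to coordinate microphone and speaker events.

```javascript
// Illustrative sketch only. createSpeechService and browserBackends are
// hypothetical names, not the API of the original toolkit.

// Queues speech turns so the microphone never opens while audio is playing.
function createSpeechService({ synthesize, recognize }) {
  let state = 'idle'; // 'idle' | 'speaking' | 'listening'
  const queue = [];

  // Marks the current turn as done, reports its result, then starts the next turn.
  function finish(callback, value) {
    state = 'idle';
    if (callback) callback(value);
    runNext();
  }

  // Starts the next queued turn, but only when nothing else is in progress.
  function runNext() {
    if (state !== 'idle' || queue.length === 0) return;
    const turn = queue.shift();
    if (turn.type === 'say') {
      state = 'speaking';
      synthesize(turn.text, () => finish(turn.done));
    } else {
      state = 'listening';
      recognize((transcript) => finish(turn.done, transcript));
    }
  }

  return {
    say(text, done) { queue.push({ type: 'say', text, done }); runNext(); },
    listen(done) { queue.push({ type: 'listen', done }); runNext(); },
    getState() { return state; },
  };
}

// Browser-only backends built on the Web Speech API. Assumes a browser that
// supports it (for example Chrome). Error and no-speech handling are omitted.
function browserBackends() {
  const Recognition = window.SpeechRecognition || window.webkitSpeechRecognition;
  return {
    synthesize(text, onEnd) {
      const utterance = new SpeechSynthesisUtterance(text);
      utterance.onend = () => onEnd();
      window.speechSynthesis.speak(utterance);
    },
    recognize(onResult) {
      const recognition = new Recognition();
      recognition.onresult = (event) => onResult(event.results[0][0].transcript);
      recognition.start();
    },
  };
}

// Usage in a browser:
//   const assistant = createSpeechService(browserBackends());
//   assistant.say('Which city are you travelling to?');
//   assistant.listen((text) => console.log('Heard:', text));
```

Passing the backends in as plain functions keeps the turn-taking logic independent of the browser, so a prototype could swap in a server-backed recognizer or synthesizer without changing the calls designers make.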
Component Library
Accelerated development of new Assistant experiences across platforms
- Built shared React components for Assistant's visual elements
- Enabled rapid prototyping across mobile, Google Home, and smart displays like the Lenovo Smart Clock
Assistant library demo
This Google I/O demo was created using the libraries I built to manage voice input and output. It was featured at Google's largest developer conference.

Process
Research
- Shadowed designers to understand their workflows
- Identified need for quick validation during design reviews
- Analyzed existing tools and identified unfulfilled needs
Design Choices
- Real-Time Preview: Enabled designers to hear changes immediately, reducing back-and-forth
- Integrated with existing tools (Google Slides, Chrome Extensions, and Framer)
- Handled complex voice processing behind the scenes
Role
UX Engineer
Tools
Visual Studio Code, Google Cloud, Google Slides, Git, Google Chrome
Tech
Web Speech API, Google Text-to-Speech API, HTML, CSS, JavaScript