Improving Access to Voice-based Prototyping
Created a comprehensive toolkit enabling Google designers to prototype Assistant experiences without coding, reaching 250+ cross-functional users and powering 1000+ prototypes.
Background
Before voice-driven consumer products were ubiquitous, Google needed to rapidly scale the capabilities of its Assistant.
The core challenge: Designers couldn't prototype voice interactions and user flows without extensive engineering support, creating a bottleneck that slowed innovation and limited exploration.
Key challenges
- add_circleLimited prototyping capabilities for voice interactions
- add_circleEngineering support requirements creating bottlenecks
- add_circleNeed for rapid iteration and testing of voice experiences
- add_circleLack of tools for quick validation during design reviews
Outcomes
- add_circleUsed by >250 designers, engineers, and product managers across Google to prototype Assistant experiences
- add_circlePowered the speech recognition and synthesis in >1,000 prototypes
- add_circleSynthesized >90,000 utterances in >7,500 presentations
- add_circleEnabled critical user studies leading to 4 patents
- add_circleOpen-sourced tool for external partners
- add_circleSupported launch of 12 new features at I/O, including new languages and surfaces
Business Impact
Faster Innovation
- add_circleTeams could test ideas in hours instead of weeks
Better Collaboration
- add_circleCreated shared language between design and engineering
Quality Improvements
- add_circleEarly testing caught issues before engineering investment
Scale
- add_circleEnabled Assistant's rapid expansion to new languages and platforms
- add_circleOpen-sourced to extend impact beyond Google and establish industry standards
Demo of the "Voice in Slides" extension
I've recently rebuilt the Chrome extension Chrome Extensionfrom scratch while experimenting with Claude.ai. If you install it, you can use these Slides to test. If you're interested in the code, you can find it on Github.
Solution
Created a comprehensive prototyping toolkit that let designers build functional voice interfaces without writing code. The core innovation was making complex voice technology accessible through familiar design tools
Chrome Extension for rapid concept validation
Bridged gap between VUI and GUI work, accessible to non-technical users
- add_circleBuilt 'Voice in Slides' enabling designers to quickly prototype voice experiences using familiar tools
- add_circleEnabled voice prototyping in Google Slides with sharing capabilities
Unified Speech Service
Simplified complex voice processing for designers
- add_circleArchitected and built a speech server handling recognition (speech-to-text) and synthesis (text-to-speech)
- add_circleManaged real-time audio streaming
- add_circleCoordinated microphone/speaker events
- add_circleProvided simple API for designers without exposing complex voice processing
Component Library
Consistent experience across platforms
- add_circleBuilt shared React components for Assistant's visual elements
- add_circleEnabled rapid prototyping across mobile, Google Home, and smart displays like the Lenovo Smart Clock
I am a demo title
I am a demo description (maybe I belong after the demo?)
I am a demo description (maybe I belong before the demo?)
Process
Research
- add_circleShadowed designers to understand workflow
- add_circleIdentified need for quick validation during design reviews
- add_circleAnalyzed existing tools and gaps
Design Choices
- add_circleReal-Time Preview: Enabled designers to hear changes immediately, reducing back-and-forth
- add_circleIntegrated with existing tools (Google Slides, Chrome Extensions, and Framer)
- add_circleHandled complex voice processing behind the scenes