See every word.
Captions belong near the speaker.
Subtitle strip. Speaker unclear. You look down, then back up, then down again.
Each bubble follows its speaker. No looking away. No confusion.
Built for real conversations.
-
Speaker-anchored bubbles
Captions attach to the face they came from. In a three-person conversation, three separate bubbles track three separate faces.
-
Multi-speaker diarization
Automatically tells speakers apart using audio and face identity, so captions stay sorted even when multiple people talk close together.
-
Lip landmark anchoring
Bubbles snap to the mouth, not just the face bounding box. As you move around, the caption moves with you at 30 frames per second.
-
Emotion and tone
A small tone label next to each caption, joyful, frustrated, warm, or confused, restores the feeling behind the words that plain text hides.
-
Directional audio cues
When a speaker is off screen, an arrow points toward them. Stereo mic analysis tells you left, center, or right before you see the face.
-
Conversation summaries
After a conversation ends, a short AI-generated summary appears in history. Key points, action items, and who said what, in one line.
-
AR glasses mode
On compatible AR glasses, captions float in your field of view hands-free. No phone to hold. No screen to look down at.
-
Cardboard VR fallback
No AR headset nearby? A Google Cardboard viewer turns any Android phone into a stereoscopic captioning visor for under ten dollars.
-
Accessibility-first decisions
Adjustable font size, bubble opacity, and contrast. Translation into English from 44 languages. Every setting is accessible without opening menus.
Where it helps.
-
Doctor visit
"I actually followed every word."
Point the phone at the doctor. Read their words as they speak. No interpreter needed.
-
Family dinner
"Six people talking. I kept up."
Each voice gets its own bubble. Crosstalk stays sorted because each bubble is tied to a face.
-
Classroom
"The transcript became my notes."
The transcript saves automatically. After class, the AI summary is ready to review.
-
Job interview
"I focused on the answer, not lip-reading."
Each panel member gets their own bubble. Afterwards you can review who asked what.
Three surfaces, one experience.
-
Android
Coming soonWorks on any Android phone with a camera. No special hardware. Open source, available on Google Play when it launches.
The current Android build is already running. Public release coming soon.
-
iOS
Coming soonThe same face-anchored captions, the same conversation history, on iPhone and iPad via the App Store.
iOS launch follows the Android release. Join the waitlist to hear first.
-
AR glasses
Coming soonCaptions in your field of view, hands-free and socially invisible. No phone to hold. Compatible with Snap Spectacles, Xreal, and other waveguide glasses.
This is where deaf users actually need this most. It's the next step after the phone release.
Build notes and
launch updates.
No posts yet. Updates will appear here when the team publishes them.
Be first to know when we launch.
We will send one email when bubbl! is ready on Android, iOS, or AR glasses. No spam, no newsletter, just a single launch notification.
Built by a small team for a large need.
bubbl! started as an entry to TSA Software Development 2026, where we competed at the national level. The problem we set out to solve, captions that follow the speaker instead of sitting at the bottom of a screen, turned out to be one nobody had fully solved yet. So we kept building.
If you have questions about licensing, partnerships, or accessibility integrations, reach out directly.