Captions belong near the speaker.

Features

Built for real conversations.

  • Speaker-anchored bubbles

    Captions attach to the face they came from. In a three-person conversation, three separate bubbles track three separate faces.

  • Multi-speaker diarization

    Automatically tells speakers apart using audio and face identity, so captions stay sorted even when multiple people talk close together.

  • Lip landmark anchoring

    Bubbles snap to the mouth, not just the face bounding box. As you move around, the caption moves with you at 30 frames per second.

  • Emotion and tone

    A small tone label next to each caption, joyful, frustrated, warm, or confused, restores the feeling behind the words that plain text hides.

  • Directional audio cues

    When a speaker is off screen, an arrow points toward them. Stereo mic analysis tells you left, center, or right before you see the face.

  • Conversation summaries

    After a conversation ends, a short AI-generated summary appears in history. Key points, action items, and who said what, in one line.

  • AR glasses mode

    On compatible AR glasses, captions float in your field of view hands-free. No phone to hold. No screen to look down at.

  • Cardboard VR fallback

    No AR headset nearby? A Google Cardboard viewer turns any Android phone into a stereoscopic captioning visor for under ten dollars.

  • Accessibility-first decisions

    Adjustable font size, bubble opacity, and contrast. Translation into English from 44 languages. Every setting is accessible without opening menus.

Use cases

Where it helps.

  • Doctor visit
    "I actually followed every word."

    Point the phone at the doctor. Read their words as they speak. No interpreter needed.

  • Family dinner
    "Six people talking. I kept up."

    Each voice gets its own bubble. Crosstalk stays sorted because each bubble is tied to a face.

  • Classroom
    "The transcript became my notes."

    The transcript saves automatically. After class, the AI summary is ready to review.

  • Job interview
    "I focused on the answer, not lip-reading."

    Each panel member gets their own bubble. Afterwards you can review who asked what.

Platforms

Three surfaces, one experience.

  • Android

    Coming soon

    Works on any Android phone with a camera. No special hardware. Open source, available on Google Play when it launches.

    The current Android build is already running. Public release coming soon.

  • iOS

    Coming soon

    The same face-anchored captions, the same conversation history, on iPhone and iPad via the App Store.

    iOS launch follows the Android release. Join the waitlist to hear first.

  • AR glasses

    Coming soon

    Captions in your field of view, hands-free and socially invisible. No phone to hold. Compatible with Snap Spectacles, Xreal, and other waveguide glasses.

    This is where deaf users actually need this most. It's the next step after the phone release.

Build notes and
launch updates.

Admin

No posts yet. Updates will appear here when the team publishes them.

Waitlist

Be first to know when we launch.

We will send one email when bubbl! is ready on Android, iOS, or AR glasses. No spam, no newsletter, just a single launch notification.

About

Built by a small team for a large need.

bubbl! started as an entry to TSA Software Development 2026, where we competed at the national level. The problem we set out to solve, captions that follow the speaker instead of sitting at the bottom of a screen, turned out to be one nobody had fully solved yet. So we kept building.

If you have questions about licensing, partnerships, or accessibility integrations, reach out directly.

bubbl!
See every word.
Android build live TSA Software Development 2026