Skip to main content
Voice lets you talk with others in a space using direct audio connections. Conversations are transcribed in real time so you can follow along or catch up on what was said.

How it works

  • One click to join. When you’re in a space, click the microphone icon to join voice chat. Everyone in the space hears you immediately.
  • Peer-to-peer audio. Voice runs over — your audio goes directly to other participants, not through a server.
  • Live transcription. As people speak, their words are transcribed and displayed in the space. This makes it easy to reference what was said or catch up if you joined late.

Recording and transcripts

Voice sessions can be recorded for later reference. When you click Start Recording, webAI captures a live transcript of the conversation using browser speech recognition. Each entry includes the speaker’s name, their spoken text, and a timestamp relative to the session start.

Accessing recordings

After stopping a recording, the transcript is available immediately in the Voice panel. You can scroll through the full conversation with speaker names and timestamps.

Exporting

Export transcripts in three formats:
  • Plain text (.txt) — Simple format with speaker names and times.
  • Markdown (.md) — Formatted with room code, date, participants, and quoted text.
  • JSON (.json) — Machine-readable with all metadata for integration with other tools.
Recordings are stored locally as text transcripts — no audio files are saved. Nothing is uploaded or shared unless you explicitly export and share the file.

When to use voice

  • Quick discussions — Faster than typing when you need to hash something out in real time.
  • Walkthroughs — Talk through a whiteboard or document while collaborating.
  • Standups — Run a quick group check-in without leaving the app.
  • Accessibility — Follow the transcript if you can’t listen to audio, or use voice instead of typing.

Learn more