Mic button
Speak straight into the composer.
Tap the mic and just4o.chat starts speech-to-text inside the chat box itself, so voice input feels like a faster way to write, not a detour into a separate tool.
Voice & TTS
Tap the mic to speak into the composer. Tap the speaker on any answer to hear it back. Turn on voice mode and just4o.chat can auto-send the transcript after your silence window, read replies aloud when they land, and keep the whole exchange inside the same chat, with your selected text model still doing the thinking.
Voice inside the chat surface
The mic, the speaker, and full voice mode all live inside chat, so voice feels like a first-class way to use just4o.chat instead of a side destination.
Choose it as the active chat model for the closest full OpenAI-style voice loop.
Playback model used from Voice settings.
xAI playback path available in Voice settings.
Low-latency ElevenLabs playback path.
How it works in product
just4o.chat does not force every voice interaction into one rigid mode. You can use the mic, the speaker, or the full voice loop depending on whether you want faster input, easier playback, or a hands-free conversation.
Mic button
Tap the mic and just4o.chat starts speech-to-text inside the chat box itself, so voice input feels like a faster way to write, not a detour into a separate tool.
Speaker button
Assistant replies keep a speaker action right on the message, which makes it easy to listen to the exact output you just got without changing modes or leaving the thread.
Voice mode
With a silence window configured, transcripts can auto-send, replies read aloud when they finish, and the whole exchange stays inside the same chat.
Available voice models
Your selected text model still stays active in voice mode. These are the actual voice-specific models currently available in just4o.chat for audio-native chat or read-aloud playback.
Audio-native chat
Use one of these as the active chat model when you want the closest full OpenAI voice-chat feel inside just4o.chat.
OpenAI read-aloud
Selectable in Voice settings for reply playback and previews.
Grok read-aloud
Selectable in Voice settings for Grok read-aloud.
ElevenLabs read-aloud
Selectable in Voice settings for ElevenLabs read-aloud.
Voice settings
Voice settings live in Account, where the behavior is explicit. Choose the read-aloud model, pick provider voices, adjust the silence window, switch the thinking sound on or off, and set playback speed without muddying what model is actually answering you.
Voice mode wraps around the active chat model instead of silently swapping it out.
Playback is configurable on purpose, so the speaking layer can be different from the thinking layer.
Those controls live in Voice settings in Account, where the voice layer is explicit and adjustable.
Prefer the standard OpenAI feel?
Set read-aloud to GPT-4o mini TTS for the familiar OpenAI playback voice. If you want the full audio-native OpenAI loop instead of just OpenAI playback, switch the active chat model itself to GPT audio mini.
Saved in Files
Spoken output is not treated like a disposable playback event. Generated audio is stored in just4o Files under Generated Audio, where it can be found again, replayed, and managed alongside the rest of your generated media.
Voice-mode and read-aloud output lands in the generated-audio area of just4o Files instead of vanishing after playback.
Keep the conversation audible without giving up control.
Open chat, speak into the composer, turn on voice mode when you want the full loop, and keep the reply layer attached to the same model, thread, and files you already use.