Real-Time Audio Systems Engineer (Real-Time Browser Audio) at Quantum Strides LLC | Torre

You will architect the future of real-time browser audio, integrating AI for seamless conversational experiences.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
Chennai, Tamil Nadu, India
Posted 4 months ago

Requirements and responsibilities


Job Description

We are seeking a highly skilled Real-Time Audio Systems Engineer with strong expertise in browser-based audio streaming, WebSockets, and real-time voice bot architecture. The ideal candidate will design and implement low-latency, full-duplex audio pipelines that run directly in the browser and integrate with modern STT, LLM, and TTS systems.

Responsibilities

Build real-time browser audio clients using the Web Audio API, MediaRecorder, AudioContext, AudioWorklets, and WebSockets.
Implement bi-directional audio streaming (microphone to backend to speaker) with latency targets under 500 ms.
Integrate streaming STT (AssemblyAI, OpenAI Realtime, Transcribe, or similar).
Implement streaming TTS playback, including chunked audio decoding and buffered playback.
Develop backend streaming servers (Node.js/Python) for handling audio frames, STT events, LLM routing, and TTS responses.
Build and optimize wake-word workflows and VAD (voice activity detection).
Handle audio formats and codecs (PCM, Opus, WAV) and real-time transcoding.
Ensure reliability: reconnection logic, jitter buffers, packet timing, and audio frame synchronization.

Requirements

Strong experience with WebSockets and browser-based real-time audio.
Deep understanding of the Web Audio API, audio buffers, sample rates, streams, and playback pipelines.
Hands-on experience with streaming STT/TTS and integrating LLMs.
Proficiency in Node.js or Python for real-time backend services.
Knowledge of DSP basics (noise suppression, VAD), latency optimization, and audio pipeline debugging.

Nice to Have

Experience with WebRTC audio.
Familiarity with wake-word engines (Picovoice, Porcupine).
Experience building Alexa/Siri-like conversational agents.