AERO (Automated Execution and Response Orchestrator)
AERO | Python, JavaScript, LLMs (Groq, Gemini), OpenCV, Selenium, Deepgram, ElevenLabs, OAuth
• Developed a hands-free virtual assistant in 36 hours at TreeHacks 2025 (Stanford) that executes voice commands.
• Built speech-to-action capabilities using Deepgram & ElevenLabs, enabling real-time voice interactions.
• Integrated Gemini 1.5 Flash (Samara voice model), Gemini Vision, Selenium, & OpenCV to process on-screen content, extract insights, navigate the web, and generate real-time visualizations from data.
• Used OAuth to automate Zoom & Google Calendar workflows, scheduling & managing events hands-free.
• Utilized Groq LPU for faster interpretation & ‘Therapy Mode’ response prompt generation.