Stream Releases Open-Source AI Agent That Reads Your Face and Adapts How It Speaks
BOULDER, Colo. - TelAve -- Built on Vision Agents with Anam and Inworld to demonstrate emotionally aware, video-first AI
Stream released an open-source AI agent that responds to a user's facial expressions, gaze, and engagement in real time. The agent, called Crashout Buddy, is live at visionagents.ai.
The era of the floating orb is over. Most voice agents today are blind. They convert speech to text, run it through an LLM, and read the response back in a flat tone regardless of whether the user is laughing, frustrated, or close to tears. Built on Stream's Vision Agents framework in collaboration with Anam and Inworld, Crashout Buddy watches the user's face and shapes both what the agent says and how it says it. When the user goes quiet, it notices. When they look like they're about to lose it, it softens.
How It Works
The agent runs a multimodal perception stack on Stream's global edge network. MediaPipe tracks 52 facial blendshapes at 8 fps to classify emotion, gaze, and engagement. That signal is injected into the LLM (Gemini) on every turn, and the LLM steers Inworld's TTS-2 voice model with natural-language directions such as [say warmly with light, easy energy]. Anam renders a photorealistic, lip-synced avatar. Deepgram handles speech-to-text.
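To make the per-turn flow concrete, here is a minimal Python sketch of how a facial-state signal might be attached to a user turn and how a bracketed voice direction could be separated from the LLM's reply. It is an illustration only and does not use the Vision Agents, Gemini, or Inworld APIs. The names FaceState, build_turn_context, and split_direction are hypothetical, as is the exact text layout of the injected context.

```python
import re
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class FaceState:
    """Hypothetical summary of what the perception stack reports."""
    emotion: str       # e.g. "frustrated"
    gaze: str          # e.g. "on_screen" or "away"
    engagement: float  # 0.0 (disengaged) to 1.0 (fully engaged)


def build_turn_context(state: FaceState, user_text: str) -> str:
    """Prefix the transcribed speech with the current facial state.

    The bracketed layout is an assumption; any format the LLM is
    instructed to read would work.
    """
    return (
        f"[face: emotion={state.emotion}, gaze={state.gaze}, "
        f"engagement={state.engagement:.2f}]\n"
        f"User said: {user_text}"
    )


# Matches an optional leading "[direction]" followed by the spoken text.
_DIRECTION = re.compile(r"^\s*\[([^\]]+)\]\s*(.*)$", re.DOTALL)


def split_direction(reply: str) -> Tuple[Optional[str], str]:
    """Split an LLM reply into (voice direction, text to speak)."""
    match = _DIRECTION.match(reply)
    if match is None:
        return None, reply.strip()
    return match.group(1).strip(), match.group(2).strip()


if __name__ == "__main__":
    state = FaceState(emotion="frustrated", gaze="away", engagement=0.31)
    print(build_turn_context(state, "This build keeps failing."))
    print(split_direction("[say warmly with light, easy energy] Rough day?"))
```

Whether the direction is sent to the voice model inline with the text or as a separate field is not stated in the release; the split above simply keeps both options open.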
The same pattern (facial state, rich agent context, expressive voice, lip-synced avatar) suits apps in dating, coaching, recruitment, tutoring, and customer support.
Key capabilities include:
- Emotion, gaze, and engagement classification with hysteresis to prevent flicker
- Natural-language voice steering in 100+ languages via Inworld TTS-2
- Photorealistic lip-synced avatar via Anam's CARA model
- Proactive re-engagement when the user drifts off-camera or goes quiet
- Composable processors running at independent frame rates
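The hysteresis mentioned in the first capability can be sketched in a few lines of Python. The release does not say how the project implements it, so the version below is an assumption: a two-threshold switch on a single engagement score, where the class name HysteresisFlag and the thresholds 0.6 and 0.4 are hypothetical values chosen for illustration.

```python
class HysteresisFlag:
    """Two-threshold switch: turns on above on_threshold, off below off_threshold.

    Scores that fall between the two thresholds leave the state unchanged,
    which is what suppresses flicker around a single cut-off.
    """

    def __init__(self, on_threshold: float = 0.6, off_threshold: float = 0.4,
                 initial: bool = False) -> None:
        if off_threshold >= on_threshold:
            raise ValueError("off_threshold must be below on_threshold")
        self.on_threshold = on_threshold
        self.off_threshold = off_threshold
        self.active = initial

    def update(self, score: float) -> bool:
        """Feed one per-frame score and return the stabilized state."""
        if self.active:
            if score < self.off_threshold:
                self.active = False
        elif score > self.on_threshold:
            self.active = True
        return self.active


if __name__ == "__main__":
    scores = [0.45, 0.55, 0.65, 0.55, 0.45, 0.55, 0.35]
    engaged = HysteresisFlag()
    stable = [engaged.update(s) for s in scores]
    naive = [s > 0.5 for s in scores]
    print("naive :", naive)
    print("stable:", stable)
```

On this sample the single 0.5 cut-off changes state four times, while the two-threshold version changes only twice: once when the score clearly rises and once when it clearly falls.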
Availability
The full project is open source. Try the demo at visionagents.ai, read the guide on the Stream blog, or explore the code at: https://github.com/GetStream/Vision-Agents
Source: Getstream.io