Voice assistants have become increasingly popular, with many people opting for voice search and control as a primary means of interacting with their devices. However, until now, consumer devices have lacked the capability to generate responses that are natural and human-like. Sensory, a leader in voice AI for consumer products, has announced a breakthrough technology integration that enables ChatGPT or other Large Language Models to drive conversational voice responses (VoiceChat) on consumer products and other devices lacking keyboards and big screens.
Sensory’s technology integration delivers a seamless conversational experience on consumer products, unlocking VoiceChat-type capabilities for numerous electronics companies and their customers. The technology can be used on in-ear voice assistants, smartwatches, smartphones, automotive infotainment systems, and more.
Sensory’s voice AI stack includes wake word recognition, accurate speech-to-text with context and AI-generated prompt engineering, intelligent response selection, and text-to-speech. The conversational AI stack also lets users ask follow-up questions and issue commands that filter, sort, or add more information to the original request, making the conversation more natural and human-like.
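The flow from wake word to spoken reply, including follow-up requests, can be pictured with a minimal sketch. This is an illustration under stated assumptions, not Sensory's implementation: the wake phrase, the `Conversation` class, and `handle_utterance` are hypothetical names, audio is stood in for by already-transcribed text, and the language model is any callable that maps a prompt to a reply.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

WAKE_WORD = "hey device"  # hypothetical wake phrase, chosen for illustration


@dataclass
class Conversation:
    """Keeps earlier turns so a follow-up can refer back to them."""
    turns: List[str] = field(default_factory=list)

    def build_prompt(self, request: str) -> str:
        # Prompt engineering step (assumed form): prior turns, then the new request.
        return "\n".join(self.turns + [f"User: {request}"])

    def record(self, request: str, reply: str) -> None:
        self.turns += [f"User: {request}", f"Assistant: {reply}"]


def strip_wake_word(transcript: str) -> Optional[str]:
    """Return the request after the wake word, or None if it was not spoken."""
    text = transcript.strip()
    if not text.lower().startswith(WAKE_WORD):
        return None
    return text[len(WAKE_WORD):].lstrip(" ,")


def handle_utterance(
    transcript: str,
    conversation: Conversation,
    llm: Callable[[str], str],
    awake: bool = False,
) -> Optional[str]:
    """Run one turn: wake word check, prompt building, model reply.

    `awake=True` models a follow-up spoken while the session is still open,
    so the wake word is not required again.
    """
    request = transcript.strip() if awake else strip_wake_word(transcript)
    if not request:
        return None  # ignore speech that is not addressed to the assistant
    reply = llm(conversation.build_prompt(request))
    conversation.record(request, reply)
    return reply  # a real product would hand this text to text-to-speech
```

Because each turn is recorded, a follow-up such as "only the ones open now" reaches the model together with the original request, which is what allows filtering or sorting of an earlier answer.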
Sensory’s hybrid cloud + edge AI platform lets customers split AI inference between edge devices and the cloud, choosing which AI technologies to deploy at each tier to improve the end-user experience and security. On a smartwatch, for example, light-duty AI such as wake word recognition, speaker verification, simple voice controls, and sound identification can run on-device. More complex inference, such as revalidation of wake word, speaker, and sound ID results, domain-specific assistants, and natural language understanding engines, can be routed to a more powerful connected device like a smartphone. High-horsepower inference, including generative AI and today’s generation of VoiceChat, improved revalidation, and face and object recognition, can be routed to the cloud.
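The three-tier split in the smartwatch example can be written down as a simple routing table. The sketch below is illustrative only: the task names, the `Tier` enum, and the `route` function are hypothetical, and the rule that a task is declined when its tier is unreachable is an assumption rather than documented platform behavior.

```python
from enum import Enum
from typing import Optional


class Tier(Enum):
    DEVICE = "on-device"
    COMPANION = "connected device (e.g. smartphone)"
    CLOUD = "cloud"


# Task names are illustrative labels for the workloads listed in the text.
ROUTES = {
    "wake_word": Tier.DEVICE,
    "speaker_verification": Tier.DEVICE,
    "simple_voice_control": Tier.DEVICE,
    "sound_identification": Tier.DEVICE,
    "wake_word_revalidation": Tier.COMPANION,
    "speaker_revalidation": Tier.COMPANION,
    "sound_id_revalidation": Tier.COMPANION,
    "domain_assistant": Tier.COMPANION,
    "natural_language_understanding": Tier.COMPANION,
    "generative_ai": Tier.CLOUD,
    "improved_revalidation": Tier.CLOUD,
    "face_recognition": Tier.CLOUD,
    "object_recognition": Tier.CLOUD,
}


def route(
    task: str,
    companion_connected: bool = True,
    cloud_reachable: bool = True,
) -> Optional[Tier]:
    """Return the tier that should run `task`, or None if it is unavailable.

    Assumption: a task is never pushed down to a lighter tier, because the
    heavier models are presumed not to fit there.
    """
    tier = ROUTES[task]  # raises KeyError for an unknown task
    if tier is Tier.COMPANION and not companion_connected:
        return None
    if tier is Tier.CLOUD and not cloud_reachable:
        return None
    return tier
```

With this table, `route("wake_word")` stays on the watch even when it is offline, while `route("generative_ai", cloud_reachable=False)` returns `None` so the caller can decide how to respond.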
SensoryCloud’s voice assistant solution is powered by a cutting-edge technology stack that includes Go, gRPC, NVIDIA Triton, and AWS Global Accelerator. Go, a fast compiled programming language, is used to build scalable, high-performance applications that can handle even the most demanding workloads. gRPC enables the creation of advanced SDKs for seamless communication between components. SensoryCloud also uses proprietary techniques to compress dialog data, which reduces cloud fees and lowers latency.
With Sensory’s technology, consumer devices can now generate natural and human-like voice responses, making voice assistants smarter than ever before. The breakthrough technology is set to create a new generation of infinitely capable voice assistants tailored to a variety of customized domains.
To learn more about Sensory’s breakthrough technology, visit Sensory.com or Sensorycloud.ai.