Company
HEAR ME - Speech Blog  |  Read more October 12, 2017 - Smart speakers coming from all over
Company

Quotables

  • "The Ecobee4 is the new smart thermostat to beat (Nest who?) "

    CNet
  • "Even with music blaring, the Sonos One will recognize the Alexa hotword without you having to shout"

    Chris Welch, The Verge
  • "Faceprint and Voiceprint on the LG V30, they both work really well. I especially like the ‘Face Unlock’ feature."

    Joe Fedewa, Phandroid
  • "LG has provided various security options so you don’t have to use the fingerprint scanner if you don’t want. You can also unlock the phone with your face, the old school Knock Code or even your voice."

    Chris Martin, TechAdvisor
  • "The Moto Z2 Play takes voice commands a step further...you can do things with a “show me” command...without unlocking or even touching the phone. It only works with my voice for security."

    Dan Seifert, The Verge
  • VoiceGenie users will be able to chat with Google Assistant, Siri, Cortana, or Alexa anywhere -- totally hands-free. You just speak the wake word.

    CNet
  • Ecobee4 Review: The First Smart Thermostat with Alexa

    Tom's Guide
  • VoiceGenie, a solution that allows most wireless headphones to utilize the power of Amazon’s Alexa on the go

    Android Headlines
  • VoiceGenie delivers low-powered speech recognition technology to wireless headphones and headsets.

    Wearable
  • VoiceGenie could change the way we interact with the devices already firmly embedded in our lives -- and it could make voice assistants just as integral.

    CNet
  • Amazon is relying on voice models that are key components of Sensory’s speech recognition suite.

    Business Intelligence
  • Sensory is enabling what it calls “wakeup words” — that is, terms and phrases that trigger the always-on listening that is a hallmark of assistants like Google Now and Siri — on devices that integrate Amazon’s Alexa.

    Digital Trends
  • Sensory, which focuses on voice recognition, teams up with Alexa's creators to make it easier for third-party developers to create voice-activated devices.

    CNET
  • Sensory won Best Innovation in Deep Learning at the Alconics awards; First Place in Mobile Security and Privacy at the CTIA Emerging Technology (E-Tech) Awards; and Star Performer at the Speech Industry Awards, where CEO Todd Mozer also won a Luminary Award.

    FindBiometrics
  • Samsung Partners With Sensory For Face/Voice Biometrics

    AndroidHeadlines.com
  • Say out loud 'GoPro, start recording' and the camera starts recording, no fuzzing with buttons.

    Geoffrey Fowler, Wall Street Journal
  • …it works well even at a distance. You just shout "okay Garmin," the lights on the camera will blink, and then you can tell it to do things like start or stop recording, or take a photo.

    The Verge
  • Huawei also created this cool feature to help you find your phone more quickly. It’s called voice wake up, and you can ask your phone “Where are you?” or some other phrase, and your phone will respond, saying, “I’m here,” and play music until you find it.

    Malarie Gokey, digitaltrends.com
  • The best upgrade to the controls, though, has to be voice control...The Hero5 Black understands seven languages.

    Brent Rose, Wired
  • Most notably though, this new shooter supports voice commands, letting you control it via phrases such as, ‘Okay Garmin, start recording.’

    Engadget
  • GoPro added voice controls for starting and stopping recordings, snapping photos and even tagging highlights in your clips by saying "That was sick!" and "Oh sh*t!" It works really well, and there are currently 10 commands and support for seven languages.

    Sean O'Kane, The Verge
  • So if the cloud’s not private, how can your TV respond to voice commands? Simple. Use speech-recognition services that are baked right into the TV – no cloud required.

    Ted Kritsonis, Digital Trends
  • Voice commands are where both Hero 5 cameras flex their smarts.

    Geoffrey Fowler, Wall Street Journal
  • One of the coolest features on the Garmin VIRB Ultra 30 is the ability to use your voice to control the camera.

    ZDNet
  • Of particular interest is the fact that TrulySecure is an on-device biometric identification system that does not rely on a connection to the cloud. Many users prefer this approach because they do not wish for their biometric data to be replicated and stored outside of their personal devices.

    Max Maxfield, EE Times
  • TrulySecure works by watching and listening as you repeat a passphrase a couple times. The system tracks the way your lips move and registers the unique attributes of your voice.

    Josh Ong, The Next Web
  • Given Qualcomm’s prominence as a mobile technology developer and the technological advancement on display in its latest offerings, the partnership reflects very well on the confidence the company has in Sensory’s technology.

    Alex Perala, Mobile ID World
  • With touchless control, Motorola and Google upped the ante.

    Eric Mack, CNET
  • Speech recognition company Sensory is expanding into the computer vision space with a new smartphone security client that uses both voice and face recognition to lock down your phone.

    Kevin Fitchard, Gigaom
  • MotoX is a fantastic phone with many great features. My favorite is Touchless Control… Ask it the weather, to call a friend or do a Google search, and it'll just do it, and you never have to touch the phone.

    Pete Pachal, Mashable
  • Sensory is continuing to exhibit leadership in handsfree control by allowing a secure multimodal biometric that doesn’t require touching devices to make them work.

    Dan Miller, Opus Research
  • Touchless mode...is the most useful feature [on Moto X].

    David Pogue, NY Times
  • The defining feature of the Moto X is it’s a virtual ear, always straining to hear its owner’s voice say three magic words that will rouse it to action: "Okay, Google Now."

    Steven Levy, WIRED
  • The phone [Moto X] has all the standard features expected of today’s top smartphone, with a twist: the ability to control the phone by talking to it, without lifting a finger.

    The New York Times
  • The voice-response system, called BlueGenie is surprisingly accurate for such a small device. It's better than the voice system in my Blackberry phone.

    US News and World Report
  • [BlueGenie is] an intuitive voice control system...the finest voice recognition user interface we've seen.

    Good Gear Guide
  • Sensory is trying to revolutionize voice and speech recognition by creating TrulyHandsfree, which looks to evolve our interactions with our smart devices.

    Talk Android
  • It may not seem like much, but that little detail of getting the phone to wake up via a voice command - which Sensory calls ‘TrulyHandsfree’ - is one of the trickiest.

    Mashable
  • With its dual biometric factors, AppLock comes closer to the security-and-convenience ideal than I've ever seen.

    Mike Feibus, usatoday

Sensory Boosts Performance of Embedded Wake Word and Speech Recognition by Infusing Smarter AI

Santa Clara, Calif., May 27, 2017 – Sensory’s TrulyHandsfree with new shallow learning technique stays small and gets smarter, more accurate, and more robust to noise

Sensory, a Silicon Valley-based company focused on improving the user experience and security of consumer electronics through state-of-the-art embedded AI technologies, today announced that it has made significant updates to the embedded AI in its TrulyHandsfree™ technology to dramatically boost its performance and accuracy, while staying small and low power.

Introduced in 2009, TrulyHandsfree revolutionized voice user interfaces by offering the first commercially successful embedded small vocabulary speech recognition system to feature an always-listening wake word. Incorporating Sensory’s smartest and most efficient deep neural network technologies to date TrulyHandsfree 5.0 takes embedded voice interfaces to new heights, offering an on-device voice user interface experience that is more natural and intuitive than ever before yet a new shallow learning approach compresses the model sizes down to run in ultra-low power and with minimal memory and MIPS. Today, TrulyHandsfree can be found in leading mobile phones, sports cameras, IoT devices, and even toys!

Smarter Speech Activation for Improved Accuracy

At the beginning, accuracy concerns were the major limiting factor that prevented mass adoption of voice wakeup technology. The risk of false fires had to be minimized to ensure that devices didn’t mistakenly activate at inappropriate times. TrulyHandsfree was the first solution capable of offering this consistent reliability, and since its introduction into products like the MotoX, and Galaxy S series smartphones, Sensory’s voice models and neural networks have continually evolved to offer better performance. Today, Sensory’s latest deep neural network models for embedded AI have allowed the company to deliver a 5X reduction in false accepts compared to version 4.01, nearly eliminating the chances of the speech recognition system activating when not actually summoned by the user. A new shallow learning approach takes the biggest speech models and compresses them down by a factor of 5-10 with no decrease in accuracy. Additionally, the latest neural network models offer greater reliability for user-defined triggers, providing the option for users to select the wake word they prefer, while still having the same accuracy and performance offered with specialized fixed triggers.

Enhanced Security Makes Sure That It’s You Speaking

One of the greatest challenges facing the IoT industry is user and data security. TrulyHandsfree 5.0 includes a layer of security in the voice interface that utilizes Sensory’s expertise in voice biometrics recognition and combines it with deep neural nets to authenticate users, limiting who can access it. TrulyHandsfree 5.0’s embedded speaker verification technology is highly flexible, allowing users to enroll their voice and their own custom trigger or passphrase, restricting unauthorized users from accessing the voice user interface. Even if an unauthorized person learns the trigger or passphrase, Sensory’s voice biometrics technology will recognize that it’s not the enrolled user speaking and not authenticate them, preventing them from accessing the device.

Advanced Signal Processing for Voice Barge-In and Far-Field Speech Recognition

TrulyHandsfree 5.0 also features a new voice barge-in feature, enabled with Sensory’s proprietary Acoustic Echo Cancellation (AEC) technology. Users can interrupt devices while playing voice prompts, music or other sounds by saying the trigger phrase to control music playback by voice, or provide any other kind of supported speech commands. This provides a more fluid voice user interface experience. Sensory’s new AEC technology is tuned specifically to maximize speech recognition system accuracy. This not only boosts the performance of the embedded TrulyHandsfree speech recognizer, but also any cloud-based speech recognition system that the speech requests are passed to.

Further, the overall performance of voice user interface systems is greatly affected by the signal-to-noise ratio of the audio signal received. Previous versions of TrulyHandsfree boasted excellent robustness to noise, however with version 5.0, Sensory incorporates new deep learning noise suppression algorithms that reduce the level of ambient noise provided to the speech recognizer to ensure that wake words and voice requests are heard clearly, further improving TrulyHandsfree’s recognition hit rate. This is especially helpful in home, automotive and mobile applications where background noise can overshadow the volume of the user’s voice.

Same Low-Power and Efficient Footprint

Today, voice has surpassed all other interface options for a growing list of device categories, however, most devices on the market today rely on cloud services for AI processing. Yet, these cloud-based solutions cannot be accessed completely hands-free without a client-side voice trigger technology. Many of today’s always-listening voice-enabled device applications, especially low-power devices that don’t have the required resources to run completely off the cloud, can benefit from a hybrid client/cloud approach that taps TrulyHandsfree technology. TrulyHandsfree is extremely resource- and power-efficient with ports available for today’s most powerful applications processors to low-power DSP platforms. For ultra-low power devices that have limited battery capacity such as wearables, Sensory offers its Low Power Sound Detector (LPSD) hardware component for DSPs and smart microphones that can reduce low-power configurations of TrulyHandsfree to operate at an average battery draw of less than a 1mA.

“The demand for voice user interfaces continues to grow rapidly and TrulyHandsfree 5.0 will allow more manufacturers to incorporate low cost, low power voice user interfaces on device without sacrificing the cloud accuracy,” said Todd Mozer, CEO of Sensory. “TrulyHandsfree 5.0 offers the most advanced and efficient embedded AI technologies we’ve ever created. Additionally, we’ve set the bar higher than ever before for speech recognition accuracy by applying our new proprietary echo cancellation and noise reduction algorithms that we are confident will boost far-field voice performance for IoT devices of all kinds.”

TrulyHandsfree is the most widely deployed embedded speech recognition engine in the world, having enabled a hands-free voice user experience on more than 2 billion devices from leading brands worldwide. Additionally, Sensory can deliver voice triggers for all major IoT cloud services, including Amazon AVS, Apple Siri, Google Assistant and Microsoft Cortana, and provide developer support for cloud service interfaces on Linux, Android, iOS and Windows as well as support for dozens of proprietary DSPs, microcontrollers, smart microphones and other low-power embedded devices.

For more information about this announcement, Sensory or its technologies, please contact sales@sensory.com; Press inquiries: press@sensory.com.

About Sensory
Sensory Inc. creates a safer and superior UX through vision and voice technologies. Sensory’s technologies are widely deployed in consumer electronics applications including mobile phones, automotive, wearables, toys, IoT and various home electronics. Sensory’s product line includes TrulyHandsfree voice control, TrulySecure biometric authentication, and TrulyNatural large vocabulary natural language embedded speech recognition. Sensory’s technologies have shipped in over a billion units of leading consumer products. Visit Sensory at www.sensory.com

TrulyHandsfree is a trademark of Sensory Inc.

Appendix:
1: Offers at least 5X lower false accept rate for a typical robust false reject setting when compared to version 4.0 of TrulyHandsfree.