::  Home  

Sensory Introduced the RSC-4192O Speech Recognition IC with OTP Memory and FluentChip 3.0 Featuring Noise Immunity and Multi Languages.

Posted in Voice Recognition, Audio, Noise and Harsh Env.
On Monday, June 11, 2007

Sensory, the brain designer of Zizzle Lucky the Incredible Wonder Pup, released the RSC-4192O, latest member of its speech recognition IC family, with one time programmable (OTP) memory, enabling low cost production. Along with the release of this speech recognition IC, Sensory also released the FluentChip 3.0, an enhanced version firmware. The new firmware improve noise-immunity, multi-language speech recognition accuracy, and speech and music synthesis.

RSC-4x Speech Recognition IC
Sensory RSC-4x Speech Recognition IC

Todd Mozer, CEO of Sensory, said:

Sensory is dedicated to providing the highest accuracy speech recognition solutions at the lowest system cost. The programmability of the RSC-4192O speeds time-to-market and reduces minimum order quantities, opening up new markets for speech recognition control…



On top of that, FluentChip 3.0 is designed to answer the real-world demand for noise-robustness in speech recognition applications for consumer devices such as phones, wireless devices and interactive toys…

RSC-4192O With OTP Memory
RSC-4192O joins the popular RSC-4128 and RSC-464 speech recognition ICs, integrating speech-optimized digital and analog processing blocks into a single chip solution capable of accurate speech recognition as well as high-quality, low data-rate compressed speech. The RSC-4192O is based on an 8-bit microcontroller, offering 192Kbytes of OTP memory with cost-effective volume pricing and lightening-fast lead times. The RSC-4192O is a completely self-contained speech I/O system with on-chip DAC, ADC and output amplification.

FluentChip 3.0 for Noisy, Real-World Environments
The RSC-4x IC family supports FluentChip firmware which includes advanced algorithms that add features and increase accuracy. Sensory has added new capabilities to FluentChip 3.0 including:

  • Enhanced speech features which dramatically improve recognition accuracy in high-noise environments typical in homes, automobiles and industrial spaces
  • Real-time LipSync, enabling robotic animation such that the character's mouth will move synchronously and accurately in real time with the user's speech
  • New international lexicons and pronunciation predictors to improve accuracy over the many thousands of words typical in the world's diverse languages
  • New acoustic models with 50 percent smaller code size, freeing much more of the RSC-4x processors' code space for technology features in the end product

FluentChip 3.0 includes Sensory's unique text-to-speaker-independent (T2SI) technology. This technology enables manufacturers to program command sets easily in minutes. FluentChip 3.0 also includes SD (speaker dependent; trained to one voice), T2SISD (allows both speaker independent and speaker dependent customized commands in the same command set), and SV (speaker verification; voice biometric password) speech recognition. A whole suite of technologies for enabling electro-mechanical animation is also included, such as Real-time LipSync, SoundSource tracking, and Beat Predict for dancing. FluentChip 3.0 also offers speech synthesis and MIDI-like music synthesis capabilities.

New Development Tools Available
Sensory offers the new RSC-4x Demo/Evaluation Toolkit V2, a low-cost toolkit in order to preview and develop its speech technologies in a real-world environment. The new RSC-4x Demo/Evaluation Toolkit V2 includes an RSC-4x-based evaluation board with upgrades such as a USB interface, 0 wait states for improved quality synthesis, and 32MBytes of serial flash memory for storing synthesized speech. It comes bundled with FluentChip 3.0 technology and is ready to support the RSC-4192O.

A new programmer tool from Phyton enables programming the RSC-4192O in 100LQFP package form at the customer's lab. This complements Phyton's existing suite of integrated development environment (IDE), C compiler and debugger, and emulator for the RSC-4x family.

Sensory's new 40-pin DIP footprint VR Stamp module also supports the FluentChip 3.0 technology library. It is targeted for developers who wish to incorporate RSC-4x capabilities into smaller scale projects. The VR Stamp Toolkit provides a development environment for creating, programming and experimenting with code. It reduces development effort by incorporating most system design features, promoting rapid deployment of speech technologies into consumer electronic products.

With the VR Stamp, developer can add voice recognition, speech output, and music synthesis to any product. The VR Stamp is the rapidly deployable speech module to use Sensory’s Quick T2SI (text to speaker independent) technology, which allows developers to create working recognition sets in minutes. VR Stamp support multiple languages, making it useful for products in many places in the world.

VR Stamp simplifies the integration of speech recognition into products by combining all key components into a small 40-pin DIP footprint module. A low-noise audio channel and standardized packaging allow rapid prototyping, less debugging and shorter time to market. The VR Stamp offers 24 I/O lines, as well as connections for a power, ground, microphone, speaker, and logic-level RS232 interface.

More info: Sensory RSC-4192O Speech Recognition IC


Possible Related Entries:
[Embedded System roll-b]
Caution:
Non-English page is generated by an automatic translation software which can rise inaccurate translation.
Consider to view the original English version via link at the bottom of this page.