会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • VOICE RECOGNITION INTERACTIVE SYSTEM
    • 语音识别交互系统
    • US20080140400A1
    • 2008-06-12
    • US11609667
    • 2006-12-12
    • OSCAR J. BLASSMusaed A. AlmutawaParitosh D. PatelRobert Vila
    • OSCAR J. BLASSMusaed A. AlmutawaParitosh D. PatelRobert Vila
    • G10L15/02
    • G10L15/22
    • A system and method for voice recognition interaction is provided. The system can have a processor for receiving a voice signal and determining a command based on the voice signal. The system can also have a confirmation interface operably connected to the processor, where the confirmation interface is capable of receiving a confirmation signal from a user and providing the confirmation signal to the processor. The system can have a user identifying device for determining an identity of the user. The processor can determine a confirmation criteria based at least in part on the identity of the user or a type of the command. The satisfaction of the confirmation criteria can be applied to allow or prevent performance of the command.
    • 提供了一种用于语音识别交互的系统和方法。 系统可以具有用于接收语音信号的处理器,并且基于语音信号确定命令。 系统还可以具有可操作地连接到处理器的确认接口,其中确认接口能够从用户接收确认信号并向处理器提供确认信号。 该系统可以具有用于确定用户身份的用户识别装置。 处理器可以至少部分地基于用户的身份或命令的类型来确定确认标准。 可以应用确认标准的满足以允许或防止执行命令。
    • 3. 发明授权
    • Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
    • 使用减少的脚本和预录制的语音资源构建级联TTS语音时减少录制时间
    • US08019605B2
    • 2011-09-13
    • US11748256
    • 2007-05-14
    • Ciprian AgapiOscar J. BlassParitosh D. PatelRoberto Vila
    • Ciprian AgapiOscar J. BlassParitosh D. PatelRoberto Vila
    • G10L13/08G10L13/06
    • G10L13/04
    • The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recording audio can include sets of recorded phrases used by a speech user interface (Sill). A set of unfulfilled speech assets needed for foil phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.
    • 本发明公开了一种用于创建简化脚本的系统和方法,该脚本由语音天才读取以创建级联的文本到语音(TTS)语音。 该方法可以自动处理预先录制的音频,以便为连续的TTS语音导出语音资源。 预录音音频可以包括由语音用户界面(Sill)使用的记录短语集合。 可以确定一连串的TTS语音的箔语音覆盖所需的一组未实现的语音资产。 可以构造一个简化的脚本,其包括一组短语,当通过语音天赋读取时,会产生减少的语料库。 当自动处理缩减的语料库时,会产生一组减少的语音资源。 缩减的集合包括每个未实现的语音资产。 当这种减少的语料库与现有语音资源相结合时,结果将是具有完整语音资产的语音。
    • 4. 发明授权
    • Improving results from search providers using a browsing-time relevancy factor
    • 使用浏览时间相关因素改善搜索提供商的结果
    • US08635214B2
    • 2014-01-21
    • US11460038
    • 2006-07-26
    • Oscar J. BlassOswaldo GagoBrennan D. MonteiroParitosh D. PatelRoberto Vila
    • Oscar J. BlassOswaldo GagoBrennan D. MonteiroParitosh D. PatelRoberto Vila
    • G06F7/00G06F17/30
    • G06F17/30864G06F17/30867
    • A method for searching Web pages that begins with the identification of query criteria entered into a search provider. A set of Web pages that satisfies the query criteria are determined. Then, a page ranking is ascertained for each Web page in the set. The Web pages are presented in order by page ranking. The page ranking is based upon at least one relevancy factor that includes a browsing-time factor. The browsing-time factor can be calculated from browsing behavior exhibited by users, who provided similar query criteria. The set of users from which the browsing-time factor is calculated can include a current user, a set of users sharing characteristics with the current user, and/or a general set of users. Browsing behavior can include time spent at a Web page, where the browsed Web page is a page that was previously presented as a search result for the similar query criteria.
    • 一种用于搜索以识别输入到搜索提供者的查询条件为起点的网页的方法。 确定满足查询条件的一组网页。 然后,确定集合中每个网页的页面排名。 网页按页面顺序排列。 页面排名基于包括浏览时间因素的至少一个相关因素。 浏览时间因素可以从用户提供的浏览行为计算出来,他们提供了类似的查询条件。 计算浏览时间因子的用户组可以包括当前用户,与当前用户共享特征的一组用户和/或一般用户组。 浏览行为可以包括在网页上花费的时间,其中浏览的网页是之前作为类似查询条件的搜索结果呈现的页面。
    • 5. 发明申请
    • USER POSITIONABLE AUDIO ANCHORS FOR DIRECTIONAL AUDIO PLAYBACK FROM VOICE-ENABLED INTERFACES
    • 用户可通过语音播放界面进行方向音频播放的可位置音频锚杆
    • US20080262847A1
    • 2008-10-23
    • US11737437
    • 2007-04-19
    • Ciprian AgapiOscar J. BlassParitosh D. PatelRoberto Vila
    • Ciprian AgapiOscar J. BlassParitosh D. PatelRoberto Vila
    • G10L21/00G06F3/041
    • G11B27/105G10L15/26
    • The present invention discloses a concept and a use of audio anchors within voice-enabled interfaces. Audio anchors can be user configurable points from which audio playback occurs. In the invention, a user can identify an interface position at which an audio anchor is to be established. The computing device can determine an anchor direction setting, with values that include forward playback and backward playback. Interface items can then be audibly enumerated from the audio anchor in a direction indicated by the anchor direction setting. For example, if a set of interface items are alphabetically ordered items and if an audio anchor is set at a first item beginning with a letter “G” and an anchor direction is set to indicate backward playback, then the interface items beginning with letters “A-F” can be audibly played in reverse alphabetical order. Additionally, a rate of audio playback can be user adjustable.
    • 本发明公开了在支持语音的接口内的音频锚的概念和用途。 音频锚点可以是发生音频播放的用户可配置点。 在本发明中,用户可以识别要建立音频锚的接口位置。 计算设备可以确定锚方向设置,其值包括前向播放和向后播放。 然后可以从锚定方向设置指示的方向从音频锚点可听见地列举接口项目。 例如,如果一组接口项是按字母排序的项目,并且如果音频锚点被设置在以字母“G”开始的第一项目,并且将锚定方向设置为指示向后播放,则以字母“ AF“可以以相反的字母顺序播放。 此外,音频播放速率可以是用户可调节的。
    • 6. 发明申请
    • SYSTEM AND METHOD FOR IMPROVING MESSAGE DELIVERY IN VOICE SYSTEMS UTILIZING MICROPHONE AND TARGET SIGNAL-TO-NOISE RATIO
    • 利用麦克风和目标信号噪声比改善语音系统中的信息传递的系统和方法
    • US20080147386A1
    • 2008-06-19
    • US11612329
    • 2006-12-18
    • Paritosh D. PatelOscar J. BlassRoberto VilaJie Z. ZengAnatol Blass
    • Paritosh D. PatelOscar J. BlassRoberto VilaJie Z. ZengAnatol Blass
    • G10L11/00
    • G10L21/0208
    • A method for delivering a message to a recipient in an environment with ambient noise includes the steps of recording the ambient noise in the environment at a certain time interval, analyzing the recorded ambient noise to obtain an average power Pnoise or a RMS amplitude Anoise of the ambient noise, providing a predetermined desired SNRdesired, calculating an average signal power Psignal or a RMS amplitude Asignal of the message to be delivered based on the Pnoise or Anoise and the desired SNRdesired, and adjusting a volume of the message to be delivered according to the Psignal or Asignal. Alternatively, the actual SNRactual will be computed and the message will be repeated if the SNRactual falls below the SNRmin. Systems for delivering a message to a recipient in an environment with ambient noise and computer-readable media having computer-executable instructions for carrying out the methods are also provided.
    • 用于在具有环境噪声的环境中向接收者发送消息的方法包括以一定时间间隔在环境中记录环境噪声的步骤,分析所记录的环境噪声以获得平均功率P SUB噪声 >或环境噪声的RMS幅度A SUB噪声,提供预期的期望SNR ,计算平均信号功率P SUB信号或RMS 将要传送的消息的幅度A 信号基于所需的噪声或A ,并且根据P 信号或A 信号调整要传送的消息的音量。 或者,将计算实际的SNR实际,并且如果SNR实际低于SNR ,则将重复该消息。 还提供了用于在具有环境噪声的环境中向接收者发送消息的系统以及具有用于执行方法的计算机可执行指令的计算机可读介质。
    • 7. 发明授权
    • System and method for improving message delivery in voice systems utilizing microphone and target signal-to-noise ratio
    • 使用麦克风和目标信噪比改善语音系统中消息传送的系统和方法
    • US08027437B2
    • 2011-09-27
    • US11612329
    • 2006-12-18
    • Paritosh D. PatelOscar J. BlassRoberto VilaJie Z. ZengAnatol Blass
    • Paritosh D. PatelOscar J. BlassRoberto VilaJie Z. ZengAnatol Blass
    • H04M1/64
    • G10L21/0208
    • A method for delivering a message to a recipient in an environment with ambient noise includes the steps of recording the ambient noise in the environment at a certain time interval, analyzing the recorded ambient noise to obtain an average power Pnoise or a RMS amplitude Anoise of the ambient noise, providing a predetermined desired SNRdesired, calculating an average signal power Psignal or a RMS amplitude Asignal of the message to be delivered based on the Pnoise or Anoise and the desired SNRdesired, and adjusting a volume of the message to be delivered according to the Psignal or Asignal. Alternatively, the actual SNRactual will be computed and the message will be repeated if the SNRactual falls below the SNRmin. Systems for delivering a message to a recipient in an environment with ambient noise and computer-readable media having computer-executable instructions for carrying out the methods are also provided.
    • 在具有环境噪声的环境中向接收者发送消息的方法包括以一定时间间隔在环境中记录环境噪声的步骤,分析记录的环境噪声以获得平均功率Pnoise或RMS幅度的噪声 环境噪声,提供预定的期望的SNR,计算基于Pnoise或Anoise和所需要的SNR所要传送的消息的平均信号功率Psignal或RMS幅度Asignal,并且根据所述信号调整要传送的消息的音量 信号或信号。 或者,如果SNR实际值低于SNRmin,则将计算实际的SNR实际值并重复该消息。 还提供了用于在具有环境噪声的环境中向接收者发送消息的系统以及具有用于执行方法的计算机可执行指令的计算机可读介质。
    • 9. 发明授权
    • Enhancing media playback with speech recognition
    • 通过语音识别增强媒体播放
    • US08478592B2
    • 2013-07-02
    • US12180583
    • 2008-07-28
    • Paritosh D. Patel
    • Paritosh D. Patel
    • G10L15/00G10L15/06G10L15/04G10L15/26G10L13/08G06F3/00G06F17/30G06F17/40
    • G10L15/22G10L15/19G10L2015/228
    • A method for enhancing a media file to enable speech-recognition of spoken navigation commands can be provided. The method can include receiving a plurality of textual items based on subject matter of the media file and generating a grammar for each textual item, thereby generating a plurality of grammars for use by a speech recognition engine. The method can further include associating a time stamp with each grammar, wherein a time stamp indicates a location in the media file of a textual item corresponding with a grammar. The method can further include associating the plurality of grammars with the media file, such that speech recognized by the speech recognition engine is associated with a corresponding location in the media file.
    • 可以提供用于增强媒体文件以实现语音导航命令的语音识别的方法。 该方法可以包括基于媒体文件的主题接收多个文本项目并为每个文本项目生成语法,从而生成多个用于语音识别引擎使用的语法。 该方法还可以包括将时间戳与每个语法相关联,其中时间戳表示与语法相对应的文本项的媒体文件中的位置。 该方法还可以包括将多个语法与媒体文件相关联,使得由语音识别引擎识别的语音与媒体文件中的对应位置相关联。