会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • Method and system for automatic generation and testing of voice applications
    • 自动生成和测试语音应用的方法和系统
    • US20070003037A1
    • 2007-01-04
    • US11170120
    • 2005-06-29
    • Ciprian AgapiMichael MirtCharles Sumner
    • Ciprian AgapiMichael MirtCharles Sumner
    • H04M1/56
    • H04M3/323H04M3/493H04Q1/45
    • A method (100) and system (30) to enable automatic generation and testing of voice applications includes generating (102) a test driver application (TDA) (32) and generating (104) a modified original voice application (34) to be tested by the TDA within a call flow builder (10). The modified application can include or generate (106) “test hooks” or more particularly DTMF tones and DTMF grammars that can be used to synchronize the modified original voice application with the TDA. The TDA can test (110) all possible paths of the modified original voice application. Note the TDA and the modified original voice application can be generated and/or tested (112) in a test environment within the call flow builder or a telephony environment. The TDA can be automatically generated (108) to exercise all possible flows where the DTMF tones define the current state and location of the modified application.
    • 实现语音应用的自动生成和测试的方法(100)和系统(30)包括生成(102)测试驱动器应用(TDA)(32)并生成(104)待测试的修改的原始语音应用(34) 由TDA在呼叫流程构建器(10)内。 经修改的应用程序可以包括或生成(106)“测试挂钩”,或更具体地可以用于将修改的原始语音应用与TDA同步的DTMF音和DTMF语法。 TDA可以测试(110)修改的原始语音应用程序的所有可能路径。 请注意,TDA和修改的原始语音应用程序可以在呼叫流程构建器或电话环境中的测试环境中生成和/或测试(112)。 TDA可以被自动生成(108)来运行所有可能的流,其中DTMF音定义了修改后的应用的当前状态和位置。
    • 3. 发明申请
    • Automatic generation of a callflow statistics application for speech systems
    • 自动生成语音系统的呼叫流统计应用程序
    • US20070133777A1
    • 2007-06-14
    • US11297537
    • 2005-12-08
    • Ciprian AgapiJames LewisMichael Mirt
    • Ciprian AgapiJames LewisMichael Mirt
    • H04M7/00
    • G10L2015/228H04M3/4936H04M2203/355
    • A method, system and computer program for automatically generating call flow statistics in a voice application. Embodiments of the present invention address deficiencies of the art in respect to call flow statistics generation systems and provide a novel and non-obvious method, system and computer program product for automatically generating a call flow statistics-generating application and presenting updated statistics on a call flow representation. Various statistics collection points are identified on the visual representation. Upon running of the voice application, call flow statistics are gathered and presented for each statistics collection point. Call identifiers corresponding to each call path can be selected and call paths corresponding to the selected call identifier may be highlighted and their call statistics displayed.
    • 一种用于在语音应用中自动生成呼叫流统计的方法,系统和计算机程序。 本发明的实施例解决了与呼叫流统计生成系统有关的本领域的缺陷,并且提供了一种新颖且非显而易见的方法,系统和计算机程序产品,用于自动生成呼叫流统计生成应用并呈现呼叫上的更新统计信息 流程表示。 在视觉表示上确定了各种统计数据收集点。 在运行语音应用程序时,将收集并显示每个统计信息收集点的呼叫流统计信息。 可以选择对应于每个呼叫路径的呼叫标识符,并且可以突出显示与所选呼叫标识符相对应的呼叫路径,并显示其呼叫统计信息。
    • 4. 发明授权
    • Method and system for automatic generation and testing of voice applications
    • 自动生成和测试语音应用的方法和系统
    • US07787598B2
    • 2010-08-31
    • US11170120
    • 2005-06-29
    • Ciprian AgapiMichael H. MirtCharles Sumner
    • Ciprian AgapiMichael H. MirtCharles Sumner
    • H04M1/24H04M3/08H04M3/22
    • H04M3/323H04M3/493H04Q1/45
    • A method (100) and system (30) to enable automatic generation and testing of voice applications includes generating (102) a test driver application (TDA) (32) and generating (104) a modified original voice application (34) to be tested by the TDA within a call flow builder (10). The modified application can include or generate (106) “test hooks” or more particularly DTMF tones and DTMF grammars that can be used to synchronize the modified original voice application with the TDA. The TDA can test (110) all possible paths of the modified original voice application. Note the TDA and the modified original voice application can be generated and/or tested (112) in a test environment within the call flow builder or a telephony environment. The TDA can be automatically generated (108) to exercise all possible flows where the DTMF tones define the current state and location of the modified application.
    • 实现语音应用的自动生成和测试的方法(100)和系统(30)包括生成(102)测试驱动器应用(TDA)(32)并生成(104)待测试的修改的原始语音应用(34) 由TDA在呼叫流程构建器(10)内。 经修改的应用程序可以包括或生成(106)“测试挂钩”,或更具体地可以用于将修改的原始语音应用与TDA同步的DTMF音和DTMF语法。 TDA可以测试(110)修改的原始语音应用程序的所有可能路径。 请注意,TDA和修改的原始语音应用程序可以在呼叫流程构建器或电话环境中的测试环境中生成和/或测试(112)。 TDA可以被自动生成(108)来运行所有可能的流,其中DTMF音定义了修改后的应用的当前状态和位置。
    • 5. 发明授权
    • Improving speech capabilities of a multimodal application
    • 提高多模式应用程序的语音能力
    • US08380513B2
    • 2013-02-19
    • US12468166
    • 2009-05-19
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • G10L11/00
    • G10L15/22G10L15/187G10L15/19G10L2015/228
    • Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
    • 改善多模式应用的语音能力,包括由多模式浏览器接收具有元数据容器的媒体文件; 由所述多模式浏览器从所述元数据容器检索与存储在所述媒体文件中的内容相关的语音伪像,以包括在所述多模式浏览器中可用的语音引擎中; 确定语音伪影是否包括语法规则或发音规则; 如果语音工件包括语法规则,则由多模式浏览器修改语音引擎的语法以包括语法规则; 并且如果语音伪影包括发音规则,则由多模式浏览器修改语音引擎的词典以包括发音规则。
    • 8. 发明授权
    • Reducing recording time when constructing a concatenative TTS voice using a reduced script and pre-recorded speech assets
    • 使用减少的脚本和预录制的语音资源构建级联TTS语音时减少录制时间
    • US08019605B2
    • 2011-09-13
    • US11748256
    • 2007-05-14
    • Ciprian AgapiOscar J. BlassParitosh D. PatelRoberto Vila
    • Ciprian AgapiOscar J. BlassParitosh D. PatelRoberto Vila
    • G10L13/08G10L13/06
    • G10L13/04
    • The present invention discloses a system and a method for creating a reduced script, which is read by a voice talent to create a concatenative text-to-speech (TTS) voice. The method can automatically process pre-recorded audio to derive speech assets for a concatenative TTS voice. The pre-recording audio can include sets of recorded phrases used by a speech user interface (Sill). A set of unfulfilled speech assets needed for foil phonetic coverage of the concatenative TTS voice can be determined. A reduced script can be constructed that includes a set of phrases, which when read by a voice talent result in a reduced corpus. When the reduced corpus is automatically processed, a reduced set of speech assets result. The reduced set includes each of the unfulfilled speech assets. When this reduced corpus is combined with existing speech assets the result will be a voice with a complete set of speech assets.
    • 本发明公开了一种用于创建简化脚本的系统和方法,该脚本由语音天才读取以创建级联的文本到语音(TTS)语音。 该方法可以自动处理预先录制的音频,以便为连续的TTS语音导出语音资源。 预录音音频可以包括由语音用户界面(Sill)使用的记录短语集合。 可以确定一连串的TTS语音的箔语音覆盖所需的一组未实现的语音资产。 可以构造一个简化的脚本,其包括一组短语,当通过语音天赋读取时,会产生减少的语料库。 当自动处理缩减的语料库时,会产生一组减少的语音资源。 缩减的集合包括每个未实现的语音资产。 当这种减少的语料库与现有语音资源相结合时,结果将是具有完整语音资产的语音。
    • 10. 发明申请
    • Records Disambiguation In A Multimodal Application Operating On A Multimodal Device
    • 在多模式设备上运行的多模式应用程序中记录消歧
    • US20090271199A1
    • 2009-10-29
    • US12109167
    • 2008-04-24
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, JR.Pradeep P. Mansey
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, JR.Pradeep P. Mansey
    • G10L15/00G10L11/00
    • G10L15/22G10L15/00G10L15/08G10L15/183
    • Methods, apparatus, and products are disclosed for record disambiguation in a multimodal application operating on a multimodal device, the multimodal device supporting multiple modes of interaction including at least a voice mode and a visual mode, that include: prompting, by the multimodal application, a user to identify a particular record among a plurality of records; receiving, by the multimodal application in response to the prompt, a voice utterance from the user; determining, by the multimodal application, that the voice utterance ambiguously identifies more than one of the plurality of records; generating, by the multimodal application, a user interaction to disambiguate the records ambiguously identified by the voice utterance in dependence upon record attributes of the records ambiguously identified by the voice utterance; and selecting, by the multimodal application for further processing, one of the records ambiguously identified by the voice utterance in dependence upon the user interaction.
    • 公开了用于在多模式设备上操作的多模式应用中的记录消歧的方法,装置和产品,所述多模式设备支持包括至少语音模式和视觉模式的多种交互模式,其包括:由多模式应用提示, 用户识别多个记录中的特定记录; 由多模式应用程序响应于该提示,接收来自用户的语音发声; 由所述多模式应用程序确定所述语音发音含糊地识别所述多​​个记录中的多于一个的记录; 由多模式应用程序产生用户交互,以消除由声音话语模糊识别的记录,依赖于由语音话语模糊识别的记录的记录属性; 以及通过多模式应用程序进行进一步处理,根据用户交互,通过语音话语模糊识别的记录之一。