    • 1. Granted invention patent
    • Program endpoint time detection apparatus and method, and program information retrieval system
    • US09009054B2
    • 2015-04-14
    • US12914346
    • 2010-10-28
    • Kun Liu; Weiguo Wu; Li Lu; Qingwei Zhao; Yonghong Yan; Hongbin Suo
    • Kun Liu; Weiguo Wu; Li Lu; Qingwei Zhao; Yonghong Yan; Hongbin Suo
    • G10L21/00; G10L11/06; G06F17/30
    • G06F17/30743; G06F17/30749
    • This invention relates to retrieval of multimedia content and provides a program endpoint time detection apparatus that detects an endpoint time of a program by processing audio signals of said program. The apparatus comprises: an audio classification unit for classifying said audio signals into a speech signal portion and a non-speech signal portion; a keyword retrieval unit for retrieving from said speech signal portion, as a candidate endpoint keyword, an endpoint keyword indicating the start or end of the program; a content analysis unit for performing content analysis on the context of the candidate endpoint keyword retrieved by the keyword retrieval unit to determine whether the candidate endpoint keyword is a valid endpoint keyword; and a program endpoint time determination unit for performing statistical analysis based on the retrieval result of said keyword retrieval unit and the determination result of said content analysis unit, and determining the endpoint time of the program. In addition, this invention provides a program information retrieval system. With the present invention, program information regarding a program that a user is following can be rapidly obtained.
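The last two stages described in this abstract (filtering candidate keywords by context, then combining the surviving hits into one time) can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say which content analysis or which statistic is used, so the substring rule `context_mentions_score` and the use of the median are assumptions made only for illustration. Audio classification and keyword retrieval are assumed to have already produced the `KeywordHit` records, and all names here are hypothetical.

```python
from dataclasses import dataclass
from statistics import median
from typing import Callable, List, Optional


@dataclass
class KeywordHit:
    keyword: str    # candidate endpoint keyword found in the speech portion
    time_s: float   # position of the hit in the audio, in seconds
    context: str    # transcript text surrounding the hit


def estimate_endpoint_time(
    hits: List[KeywordHit],
    is_valid: Callable[[KeywordHit], bool],
) -> Optional[float]:
    """Keep hits whose context passes the check, then summarize their times.

    The median is an assumed stand-in for the unspecified statistical analysis.
    """
    valid_times = [h.time_s for h in hits if is_valid(h)]
    if not valid_times:
        return None
    return median(valid_times)


def context_mentions_score(hit: KeywordHit) -> bool:
    """Hypothetical content-analysis rule: the context must mention a final score."""
    return "final score" in hit.context.lower()


hits = [
    KeywordHit("game over", 5400.0, "and that is the final score, game over"),
    KeywordHit("game over", 1200.0, "it is not game over yet for the home side"),
    KeywordHit("thanks for watching", 5412.0, "final score confirmed, thanks for watching"),
]
endpoint = estimate_endpoint_time(hits, context_mentions_score)
print(endpoint)
```

The hit at 1200 s is rejected by the context rule, so only the two hits near the end of the recording contribute to the estimate.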
    • 2. Invention patent application
    • PROGRAM ENDPOINT TIME DETECTION APPARATUS AND METHOD, AND PROGRAM INFORMATION RETRIEVAL SYSTEM
    • US20110106531A1
    • 2011-05-05
    • US12914346
    • 2010-10-28
    • Kun LIU; Weiguo Wu; Li Lu; Qingwei Zhao; Yonghong Yan; Hongbin Suo
    • Kun LIU; Weiguo Wu; Li Lu; Qingwei Zhao; Yonghong Yan; Hongbin Suo
    • G10L11/06
    • G06F17/30743; G06F17/30749
    • This invention relates to retrieval of multimedia content and provides a program endpoint time detection apparatus that detects an endpoint time of a program by processing audio signals of said program. The apparatus comprises: an audio classification unit for classifying said audio signals into a speech signal portion and a non-speech signal portion; a keyword retrieval unit for retrieving from said speech signal portion, as a candidate endpoint keyword, an endpoint keyword indicating the start or end of the program; a content analysis unit for performing content analysis on the context of the candidate endpoint keyword retrieved by the keyword retrieval unit to determine whether the candidate endpoint keyword is a valid endpoint keyword; and a program endpoint time determination unit for performing statistical analysis based on the retrieval result of said keyword retrieval unit and the determination result of said content analysis unit, and determining the endpoint time of the program. In addition, this invention provides a program information retrieval system. With the present invention, program information regarding a program that a user is following can be rapidly obtained.
    • 3. Granted invention patent
    • Method and system for expanding a word graph to a phone graph based on a cross-word acoustical model to improve continuous speech recognition
    • US08260614B1
    • 2012-09-04
    • US10019382
    • 2000-09-28
    • Qingwei Zhao; Zhiwei Lin; Yonghong Yan
    • Qingwei Zhao; Zhiwei Lin; Yonghong Yan
    • G10L15/00
    • G06F17/2775; G10L13/08; G10L15/187
    • A method and system that expands a word graph to a phone graph. An unknown speech signal is received. A word graph is generated based on an application task or based on information extracted from the unknown speech signal. The word graph is expanded into a phone graph. The unknown speech signal is recognized using the phone graph. The phone graph can be based on a cross-word acoustical model to improve continuous speech recognition. By expanding a word graph into a phone graph, the phone graph can consume less memory than the word graph and can greatly reduce the computation cost of the decoding process relative to the word graph, thus improving system performance. Furthermore, the continuous speech recognition error rate can be reduced by using the phone graph, which provides a more accurate graph for continuous speech recognition.
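The basic expansion step named in this abstract can be sketched in a few lines of Python. This is a naive illustration under stated assumptions, not the patented method: each word edge is replaced by a chain of phone edges taken from a pronunciation lexicon. It does not attempt the cross-word acoustic modeling or the memory savings the abstract claims. The edge representation, the lexicon, and the function name `expand_word_graph` are all hypothetical, and every word is assumed to have a non-empty pronunciation.

```python
from typing import Dict, List, Tuple

Edge = Tuple[int, int, str]  # (source node, destination node, label)


def expand_word_graph(
    word_edges: List[Edge],
    lexicon: Dict[str, List[str]],
) -> List[Edge]:
    """Replace every word edge with a chain of phone edges.

    Fresh intermediate nodes are numbered after the largest existing node id.
    Assumes every word has at least one phone in the lexicon.
    """
    if not word_edges:
        return []
    next_node = max(max(src, dst) for src, dst, _ in word_edges) + 1
    phone_edges: List[Edge] = []
    for src, dst, word in word_edges:
        phones = lexicon[word]
        prev = src
        for i, phone in enumerate(phones):
            if i == len(phones) - 1:
                nxt = dst            # the last phone lands on the word's end node
            else:
                nxt = next_node      # otherwise allocate a new intermediate node
                next_node += 1
            phone_edges.append((prev, nxt, phone))
            prev = nxt
    return phone_edges


lexicon = {"go": ["g", "ow"], "home": ["hh", "ow", "m"]}
word_graph = [(0, 1, "go"), (1, 2, "home")]
phone_graph = expand_word_graph(word_graph, lexicon)
print(phone_graph)
```

Because the original word boundary nodes are preserved, a later pass could inspect the phones on either side of each boundary, which is where a cross-word model would apply.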
    • 5. Granted invention patent
    • Method, apparatus, and system for bottom-up tone integration to Chinese continuous speech recognition system
    • US07181391B1
    • 2007-02-20
    • US10148479
    • 2000-09-30
    • Ying Jia; Yonghong Yan; Baosheng Yuan
    • Ying Jia; Yonghong Yan; Baosheng Yuan
    • G10L15/00; G10L15/02; G10L15/14
    • G10L15/18; G10L25/15
    • According to one aspect of the invention, a method is provided in which knowledge about tone characteristics of a tonal syllabic language is used to model speech at various levels in a bottom-up speech recognition structure. The various levels in the bottom-up recognition structure include the acoustic level, the phonetic level, the word level, and the sentence level. At the acoustic level, pitch is treated as a continuous acoustic variable, and pitch information extracted from the speech signal is included as a feature component of the feature vectors. At the phonetic level, main vowels having the same phonetic structure but different tones are defined and modeled as different phonemes. At the word level, a set of tone change rules is used to build transcriptions for training data and a pronunciation lattice for decoding. At the sentence level, a set of sentence-ending words with light tone is also added to the system vocabulary.
    • 6. Granted invention patent
    • Method and system to scale down a decision tree-based hidden Markov model (HMM) for speech recognition
    • US07472064B1
    • 2008-12-30
    • US10019381
    • 2000-09-30
    • Qing Guo; Yonghong Yan; Baosheng Yuan
    • Qing Guo; Yonghong Yan; Baosheng Yuan
    • G10L15/14
    • G10L15/142; G10L15/08; G10L2015/085
    • A method and system are provided in which a decision tree-based model (“general model”) is scaled down (“trim-down”) for a given task. The trim-down model can be adapted for the given task using task-specific data. The general model can be based on a hidden Markov model (HMM). By allowing a decision tree-based acoustic model (“general model”) to be scaled according to the vocabulary of the given task, the general model can be configured dynamically into a trim-down model, which can be used to improve speech recognition performance and reduce system resource utilization. Furthermore, the trim-down model can be adapted or adjusted according to task-specific data, e.g., task vocabulary, model size, or other similar task-specific data.
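One simple reading of scaling a model to the task vocabulary can be sketched in Python: keep only the tied states that the task's words can reach. This is a minimal illustration, not the patented procedure. The abstract does not describe the data layout, so the triphone-to-tied-state table `tied_state_of`, the parameter table `state_params`, the silence padding at word boundaries, and the function names are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Tuple

Triphone = Tuple[str, str, str]  # (left context, center phone, right context)


def word_triphones(phones: List[str]) -> List[Triphone]:
    """List the triphones of one word, padding both ends with silence (assumed)."""
    padded = ["sil"] + list(phones) + ["sil"]
    return [(padded[i - 1], padded[i], padded[i + 1]) for i in range(1, len(padded) - 1)]


def trim_down(
    tied_state_of: Dict[Triphone, str],
    state_params: Dict[str, List[float]],
    lexicon: Dict[str, List[str]],
    task_vocab: Iterable[str],
) -> Dict[str, List[float]]:
    """Keep only the tied states reachable from the task vocabulary."""
    needed = set()
    for word in task_vocab:
        for tri in word_triphones(lexicon[word]):
            if tri in tied_state_of:
                needed.add(tied_state_of[tri])
    return {state: params for state, params in state_params.items() if state in needed}


lexicon = {"yes": ["y", "eh", "s"], "no": ["n", "ow"]}
tied_state_of = {
    ("sil", "y", "eh"): "y_1",
    ("y", "eh", "s"): "eh_1",
    ("eh", "s", "sil"): "s_1",
    ("sil", "n", "ow"): "n_1",
    ("n", "ow", "sil"): "ow_1",
}
state_params = {"y_1": [0.1], "eh_1": [0.2], "s_1": [0.3], "n_1": [0.4], "ow_1": [0.5]}

trimmed = trim_down(tied_state_of, state_params, lexicon, ["no"])
print(sorted(trimmed))
```

With a task vocabulary containing only "no", three of the five tied states are dropped. The adaptation step mentioned in the abstract would then operate on this smaller parameter set.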
    • 7. Granted invention patent
    • Method and system for using rule-based knowledge to build a class-based domain specific statistical language model
    • US07275033B1
    • 2007-09-25
    • US10130860
    • 2000-09-30
    • Yibao Zhao; Yonghong Yan; Zhiwei Lin
    • Yibao Zhao; Yonghong Yan; Zhiwei Lin
    • G10L15/18; G06F17/27
    • G10L15/197; G10L15/183
    • A method and system for providing a class-based statistical language model representation from rule-based knowledge is disclosed. The class-based language model is generated from a statistical representation of a class-based rule net. A class-based rule net is generated using the domain-related rules with words replaced with their corresponding class-tags that are manually defined. The class-based statistical representation from the class-based rule net is combined with a class-based statistical representation from a statistical language model to generate a language model. The language model is enhanced by smoothing/adapting with general-purpose and/or domain-related corpus for use as the final language model. A two-pass search algorithm is applied for speech decoding.
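The idea of replacing words with manually defined class tags before gathering statistics can be shown with a short Python sketch. It covers only that one step, under stated assumptions: plain bigram counting over class-tagged sentences stands in for the statistical representation, and the rule net, the smoothing, and the two-pass search from the abstract are not modeled. The class map `word_to_class` and the function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple


def to_class_tags(tokens: List[str], word_to_class: Dict[str, str]) -> List[str]:
    """Replace each word that has a manually defined class with its class tag."""
    return [word_to_class.get(tok, tok) for tok in tokens]


def class_bigram_counts(
    sentences: List[List[str]],
    word_to_class: Dict[str, str],
) -> Counter:
    """Count bigrams over class-tagged sentences, with sentence boundary markers."""
    counts: Counter = Counter()
    for tokens in sentences:
        tagged = ["<s>"] + to_class_tags(tokens, word_to_class) + ["</s>"]
        counts.update(zip(tagged, tagged[1:]))
    return counts


word_to_class = {"boston": "[CITY]", "denver": "[CITY]"}
sentences = [["fly", "to", "boston"], ["fly", "to", "denver"]]
counts = class_bigram_counts(sentences, word_to_class)
print(counts[("to", "[CITY]")])
```

Both city names collapse into the same `[CITY]` tag, so the two sentences reinforce one shared bigram instead of producing two sparse word-level ones.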
    • 10. Granted invention patent
    • Method, apparatus, and system for building context dependent models for a large vocabulary continuous speech recognition (LVCSR) system
    • US07587321B2
    • 2009-09-08
    • US10332652
    • 2001-05-08
    • Xiaoxing Liu; Baosheng Yuan; Yonghong Yan
    • Xiaoxing Liu; Baosheng Yuan; Yonghong Yan
    • G10L15/14
    • G10L15/187; G10L15/1815
    • According to one aspect of the invention, a method is provided in which a set of multiple mixture monophone models is created and trained to generate a set of multiple mixture context dependent models. A set of single mixture triphone models is created and trained to generate a set of context dependent models. Corresponding states of the triphone models are clustered to obtain a set of tied states based on a decision tree clustering process. Parameters of the context dependent models are estimated using a data dependent maximum a posteriori (MAP) adaptation method in which parameters of the tied states of the context dependent models are derived by adapting corresponding parameters of the context independent models using the training data associated with the respective tied states.
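The MAP adaptation step in this abstract can be illustrated with the widely used MAP update for a Gaussian mean, where the context-independent mean acts as the prior and the frames assigned to a tied state act as the data. This is a minimal Python sketch, not necessarily the data-dependent variant the patent claims. It assumes every frame is assigned to the state with full weight, and the prior weight `tau` and the function name `map_adapt_mean` are hypothetical.

```python
from typing import List, Sequence


def map_adapt_mean(
    prior_mean: Sequence[float],
    frames: Sequence[Sequence[float]],
    tau: float = 10.0,
) -> List[float]:
    """MAP estimate of a Gaussian mean.

    adapted = (tau * prior_mean + sum of frames) / (tau + number of frames)
    tau must be positive; it controls how strongly the prior resists the data.
    """
    n = len(frames)
    dim = len(prior_mean)
    sums = [0.0] * dim
    for frame in frames:
        for d in range(dim):
            sums[d] += frame[d]
    return [(tau * prior_mean[d] + sums[d]) / (tau + n) for d in range(dim)]


ci_mean = [0.0, 0.0]                    # context-independent (prior) mean
tied_state_frames = [[2.0, 4.0]] * 10   # training frames assigned to one tied state
adapted = map_adapt_mean(ci_mean, tied_state_frames, tau=10.0)
print(adapted)
```

With ten frames and `tau` equal to ten, the adapted mean lies halfway between the prior and the data mean. A tied state with no frames keeps the context-independent mean unchanged, which is the usual motivation for MAP estimation on sparsely observed states.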