    • 71. Granted invention patent
    • Personalized voice activity detection
    • US08175874B2
    • 2012-05-08
    • US12092578
    • 2006-07-18
    • Shaul Shimhi
    • Shaul Shimhi
    • G10L15/10
    • G10L25/78; G10L17/00
    • A method of transferring a real-time audio signal transmission, including: registering voice patterns (or other characteristics) of one or more users to be used to identify the voices of the users; accepting an audio signal as it is created as a sequence of segments; analyzing each segment of the accepted audio signal to determine if it contains voice activity (314); determining a probability level that the voice activity of the segment is of a registered user (320 & 322); and selectively transferring the contents of a segment responsive to the determined probability level (324).
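The gating steps in this abstract (voice activity check, registered-speaker probability, selective transfer) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not name a voice activity detector or a speaker model, so an energy threshold and cosine similarity stand in for them, and the names `has_voice_activity`, `speaker_probability`, and `filter_segments` are hypothetical.

```python
import math


def has_voice_activity(samples, energy_threshold=0.01):
    """Crude energy check (assumption: the abstract names no specific detector)."""
    if not samples:
        return False
    energy = sum(x * x for x in samples) / len(samples)
    return energy >= energy_threshold


def speaker_probability(features, registered_patterns):
    """Hypothetical score in [0, 1]: best cosine similarity to a registered pattern."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    best = max((cosine(features, p) for p in registered_patterns), default=0.0)
    return max(0.0, best)  # clamp negative similarities to zero


def filter_segments(segments, registered_patterns, min_probability=0.8):
    """Yield the samples of segments that likely carry a registered user's voice.

    Each segment is a (samples, features) pair; how features are extracted
    is outside the scope of this sketch.
    """
    for samples, features in segments:
        if not has_voice_activity(samples):
            continue  # no voice activity, so nothing to transfer
        if speaker_probability(features, registered_patterns) >= min_probability:
            yield samples
```

For example, with one registered pattern `[1.0, 0.0]`, a loud segment whose features are close to that pattern is passed through, while silent segments and segments from other voices are dropped.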
    • 73. Granted invention patent
    • Method for searching data in at least two databases
    • US07363222B2
    • 2008-04-22
    • US10482517
    • 2002-06-24
    • Michael Josenhans
    • Michael Josenhans
    • G10L15/10; G10L15/28; G06F7/06
    • H04M1/271; G10L15/26; H04M1/275; H04M2250/02; Y10S707/99933
    • A method and database system are disclosed for searching data in at least two databases (Dn), particularly for searching telephone directories or the like. To allow simultaneous access to two or more databases by means of speech recognition, so that a search can be performed in them as in a single database, a search term is input by speech via a voice controlled user interface (28) that is connected to a database primary control apparatus (26) and comprises speech recognition front end means (8, 9) for processing a sound sequence of the spoken search term to obtain a comparable speech pattern (X) thereof. By means of speech recognition back end means (6) associated with databases (D1-D6), the comparable speech pattern (X) is compared with corresponding speech patterns (An,i) of database entries (En,i) to determine, for each of the at least two databases (Dn), at least the database entry (En,j) whose speech pattern (An,j) best matches the comparable speech pattern (X) of the search term.
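The per-database matching step in this abstract can be sketched as follows. This is only an illustration under stated assumptions: speech patterns are modeled as plain numeric vectors, "best matches" is taken to mean smallest Euclidean distance, and the function name `best_match_per_database` and the dictionary layout are hypothetical rather than taken from the patent.

```python
import math


def best_match_per_database(query_pattern, databases):
    """Return, for each database, the entry whose pattern is closest to the query.

    `databases` maps a database name to a dict of {entry name: pattern vector}.
    Euclidean distance is an assumed stand-in for the pattern comparison.
    """
    results = {}
    for db_name, entries in databases.items():
        if not entries:
            continue  # an empty database contributes no candidate
        results[db_name] = min(
            entries,
            key=lambda entry: math.dist(query_pattern, entries[entry]),
        )
    return results
```

The same query pattern is compared against every database, so the caller receives one candidate per database and can present them as if a single directory had been searched.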
    • 75. Granted invention patent
    • Method and system for learning linguistically valid word pronunciations from acoustic data
    • US07266495B1
    • 2007-09-04
    • US10661106
    • 2003-09-12
    • Francoise Beaufays; Ananth Sankar; Mitchel Weintraub; Shaun Williams
    • Francoise Beaufays; Ananth Sankar; Mitchel Weintraub; Shaun Williams
    • G10L15/06; G10L15/10
    • G10L15/06; G10L15/187
    • A computerized pronunciation system is provided for generating pronunciations for words and storing the pronunciations in a pronunciation dictionary. The system includes a word list including at least one word; transcribed acoustic data including at least one waveform for the word and transcribed text associated with the waveform; a pronunciation-learning module configured to accept as input the word list and the transcribed acoustic data, the pronunciation-learning module including: sets of initial pronunciations of the word, a scoring module configured to score pronunciations and to generate phone probabilities, and a set of alternate pronunciations of the word, wherein the set of alternate pronunciations includes a highest-scoring set of initial pronunciations with a highest-scoring substitute phone substituted for a lowest-probability phone; and a pronunciation dictionary configured to receive the highest-scoring set of initial pronunciations and the set of alternate pronunciations.
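The substitution step described here (replace the lowest-probability phone with the highest-scoring substitute) can be sketched roughly as below. This is not the patented scoring module: the `score` callable, the candidate phone list, and the function name `alternate_pronunciation` are hypothetical placeholders, and a real system would score candidates against the transcribed acoustic data.

```python
def alternate_pronunciation(phones, phone_probs, candidates, score):
    """Build one alternate pronunciation from the best initial pronunciation.

    phones      -- list of phone symbols for the highest-scoring pronunciation
    phone_probs -- per-phone probabilities from the (assumed) scoring module
    candidates  -- substitute phones to try at the weakest position
    score       -- hypothetical callable mapping a phone list to a score
    """
    # Locate the phone the scorer is least confident about.
    weakest = min(range(len(phones)), key=lambda i: phone_probs[i])

    # Try each substitute at that position, skipping the phone already there.
    variants = [
        phones[:weakest] + [candidate] + phones[weakest + 1:]
        for candidate in candidates
        if candidate != phones[weakest]
    ]

    # Keep the highest-scoring variant, or None when no substitute applies.
    return max(variants, key=score) if variants else None
```

Both the original pronunciation and the returned alternate would then be written to the pronunciation dictionary.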
    • 79. Invention patent application
    • Speech recognition device and speech recognition method
    • US20050256712A1
    • 2005-11-17
    • US10504926
    • 2004-02-04
    • Maki Yamada; Makoto Nishizaki; Yoshihisa Nakatoh; Shinichi Yoshizawa
    • Maki Yamada; Makoto Nishizaki; Yoshihisa Nakatoh; Shinichi Yoshizawa
    • G06F3/16; G10L11/02; G10L15/00; G10L15/04; G10L15/06; G10L15/08; G10L15/10; G10L15/14; G10L15/18; G10L15/20; G10L15/22; G10L15/28
    • G10L15/065; G10L15/08
    • The speech recognition apparatus (1) is equipped with: a garbage acoustic model storage unit (110) that stores a garbage acoustic model trained on a collection of unnecessary words; a feature value calculation unit (101) that calculates the feature parameters needed for recognition by acoustically analyzing, frame by frame (the frame being the unit of speech analysis), unidentified input speech that includes non-language speech; a garbage acoustic score calculation unit (111) that calculates a garbage acoustic score by comparing the feature parameters with the garbage acoustic model; a garbage acoustic score correction unit (113) that corrects the garbage acoustic score calculated by the garbage acoustic score calculation unit (111), raising it in frames where non-language speech is input; and a recognition result output unit (105) that outputs, as the recognition result for the unidentified input speech, the word string with the highest cumulative score of the language score, the word acoustic score, and the corrected garbage acoustic score.
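The correction step in this abstract (raising the garbage acoustic score in frames where non-language speech is input) can be illustrated with a short sketch. The additive boost, the per-frame boolean flags, and the name `correct_garbage_scores` are assumptions for illustration; the abstract does not say how the correction amount is computed or how non-language frames are detected.

```python
def correct_garbage_scores(garbage_scores, non_language_flags, boost=1.0):
    """Raise the garbage acoustic score for frames flagged as non-language speech.

    garbage_scores     -- per-frame garbage acoustic scores (e.g. log-likelihoods)
    non_language_flags -- per-frame booleans from an assumed non-language detector
    boost              -- assumed additive correction applied to flagged frames
    """
    if len(garbage_scores) != len(non_language_flags):
        raise ValueError("one flag is required per frame")
    return [
        score + boost if is_non_language else score
        for score, is_non_language in zip(garbage_scores, non_language_flags)
    ]
```

With a higher garbage score in those frames, a decoder that sums language, word acoustic, and garbage acoustic scores becomes more likely to absorb coughs or fillers into the garbage model instead of forcing them into a word.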
    • 80. Invention patent application
    • Text message generation
    • US20050256710A1
    • 2005-11-17
    • US10507194
    • 2003-03-10
    • Matthias Pankert; Reimund Schmald; Jens Marschner
    • Matthias Pankert; Reimund Schmald; Jens Marschner
    • G06F3/16; G10L15/08; G10L15/10; G10L15/19; G10L15/22; G10L15/26
    • G10L15/19
    • The invention relates to a method of generating text messages. In order to make the generation of text messages as convenient and efficient as possible for a user, the following steps are proposed: (a) processing of speech input containing message elements by means of grammar-based speech recognition procedures; (b) processing of speech input by means of speech model-based speech recognition procedures, either in parallel with the grammar-based processing or once the grammar-based speech recognition procedures have produced a recognition result that does not meet a predefined quality; (c) generation of a text message using the recognition results produced by means of the grammar-based and/or speech model-based speech recognition procedures.
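The sequential variant of this method (grammar-based recognition first, speech model-based recognition only when the result falls short of a predefined quality) can be sketched as below. The two recognizers are hypothetical callables that return a `(text, quality)` pair, and the 0.7 threshold is an arbitrary illustrative value; the parallel variant mentioned in the abstract is not shown.

```python
def generate_text_message(audio, grammar_recognizer, model_recognizer, min_quality=0.7):
    """Return message text, preferring the grammar-based recognition result.

    grammar_recognizer, model_recognizer -- hypothetical callables that take
        audio and return (text, quality), with quality in [0, 1]
    min_quality -- assumed stand-in for the "predefined quality" threshold
    """
    text, quality = grammar_recognizer(audio)
    if quality >= min_quality:
        return text  # the grammar-based result is good enough

    # Otherwise fall back to the speech model-based recognizer.
    fallback_text, _ = model_recognizer(audio)
    return fallback_text
```

Running the cheaper, more constrained grammar pass first keeps the fallback off the common path, at the cost of extra latency when the fallback is needed.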