会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • METHOD AND APPARATUS FOR KEYWORD-BASED MEDIA ITEM TRANSMISSION
    • 用于基于关键字的媒体项目传输的方法和装置
    • US20080162454A1
    • 2008-07-03
    • US11619465
    • 2007-01-03
    • Louis J. LundellYan Ming Cheng
    • Louis J. LundellYan Ming Cheng
    • G06F17/30G10L15/00G06F17/27
    • G06F16/313G06F16/41G06F16/437
    • A system includes a first communications device [105] to participate in a conversation with at least a second communication device [110]. An intelligent communication agent [120] monitors the conversation for at least one keyword. In response to detecting the at least one keyword, the intelligent communication agent performs a search for multimedia content corresponding to the at least one keyword and retrieves the multimedia content. A logic engine [135] determines relevant content of the multimedia content based on at least one of a conversation profile and at least one user profile for at least one of a user of the first communication device and at least a second user of the at least a second communication device. A transmission element [130] transmits the relevant content to at least one of the first communication device, the at least a second communication device, and a predetermined multimedia device [145].
    • 系统包括参与与至少第二通信设备[110]的会话的第一通信设备[105]。 智能通信代理[120]监视至少一个关键字的会话。 响应于检测到所述至少一个关键字,所述智能通信代理执行与所述至少一个关键字相对应的多媒体内容的搜索并检索所述多媒体内容。 逻辑引擎基于至少一个对话简档和至少一个用户简档来确定多媒体内容的相关内容,用于第一通信设备的用户和至少第二用户中的至少一个 第二通信设备。 传输元件[130]将相关内容发送到第一通信设备,至少第二通信设备和预定多媒体设备中的至少一个[145]。
    • 5. 发明授权
    • Noise reduced speech recognition parameters
    • 噪声降低语音识别参数
    • US06678656B2
    • 2004-01-13
    • US10061048
    • 2002-01-30
    • Dusan MachoYan Ming Cheng
    • Dusan MachoYan Ming Cheng
    • G10L1520
    • G10L15/02G10L15/30G10L21/0208G10L25/18
    • A voice sample characterization front-end suitable for use in a distributed speech recognition context. A digitized voice sample 31 is split between a low frequency path 32 and a high frequency path 33. Both paths are used to determine spectral content suitable for use when determining speech recognition parameters (such as cepstral coefficients) that characterize the speech sample for recognition purposes. The low frequency path 32 has a thorough noise reduction capability. In one embodiment, the results of this noise reduction are used by the high frequency path 33 to aid in de-noising without requiring the same level of resource capacity as used by the low frequency path 32.
    • 语音样本表征前端适用于分布式语音识别语境。 数字化语音样本31在低频路径32和高频路径33之间分离。当确定表征语音样本以识别目的的语音识别参数(例如倒谱系数)时,两个路径用于确定适合使用的频谱内容 。 低频路径32具有彻底的降噪能力。 在一个实施例中,由高频路径33使用该噪声降低的结果来帮助去噪,而不需要与低频路径32所使用的相同的资源容量。
    • 6. 发明授权
    • Method and apparatus for generating and updating a voice tag
    • 用于生成和更新语音标签的方法和装置
    • US07471775B2
    • 2008-12-30
    • US11170892
    • 2005-06-30
    • Yan Ming Cheng
    • Yan Ming Cheng
    • H04M1/64
    • G10L15/06G10L2015/0635H04M1/271H04M2250/74
    • A method and apparatus (100) for updating a voice tag comprising N stored voice tag phoneme sequences includes a function (110) for determining (205) an accepted stored voice tag phoneme sequence for an utterance, a function (140) for extracting(210) a current set of M phoneme sequences having highest likelihoods of representing the utterance, a function (160) for updating (215) a reference histogram associated with the accepted voice tag, and a function (160) for updating (225) the voice tag with N selected phoneme sequences that are selected from the current set of M phoneme sequences and the set of N voice tag phoneme sequences, wherein the N selected phoneme sequences have phoneme histograms most closely matching the reference histogram. The method and apparatus (100) also generates a voice tag using some functions (110, 140, 160) that are common with the method and apparatus to update the voice tag, such as the extracting (410) of the current set of M phoneme sequences.
    • 一种用于更新包括N个存储的语音标签音素序列的语音标签的方法和装置(100),包括用于确定(205)用于话语的接受的存储的语音标签音素序列的功能(110),用于提取(210) )具有表示发音的最高似然性的当前的一组M个音素序列,用于更新(215)与所接受的语音标签相关联的参考直方图的功能(160)和用于更新(225)语音标签的功能(160) 其中N个选择的音素序列选自当前的M个音素序列集合和一组N个语音标签音素序列,其中N个选择的音素序列具有与参考直方图最接近匹配的音素直方图。 方法和装置(100)还使用与方法和装置相同的功能(110,140,​​160)来生成语音标签,以更新语音标签,例如提取(410)当前的一组M个音素 序列。
    • 8. 发明授权
    • Methods and apparatus for reducing noise associated with an electrical speech signal
    • 用于降低与电语音信号相关联的噪声的方法和装置
    • US06480821B2
    • 2002-11-12
    • US09774840
    • 2001-01-31
    • Dusan MachoYan Ming Cheng
    • Dusan MachoYan Ming Cheng
    • G10L2102
    • G10L21/0208G10L21/0364G10L25/90
    • A system for enhancing the signal-to-noise ratio of a speech signal is avoided. A plurality of local energy maximums associated with a speech signal are determined. Presumably, each of these local energy maximums defines a speech pitch period. Typically, human pitch periods are approximately 100-400 Hz depending on the sex and age of the speaker. Because human speech typically includes more energy near the beginning of a pitch period than at the end of the pitch period, and background noise tends to remain relatively constant throughout the pitch period, the speech signal may be enhanced by increasing the energy associated with the beginning of the pitch period and/or by decreasing the energy associated with the end of the pitch period. Preferably, the amount of energy increase in the earlier portion of the pitch period is approximately equal to the amount of energy reduction in the later portion of the pitch period. In this manner, the total energy remains the constant.
    • 避免了用于提高语音信号的信噪比的系统。 确定与语音信号相关联的多个局部能量最大值。 大概地,这些局部能量最大值中的每一个定义了语音音调周期。 通常,根据演讲者的性别和年龄,人类音调周期约为100-400Hz。 因为人类语音通常在音调周期的开始处包括比在音调周期结束时更多的能量,并且背景噪声在整个音调周期期间趋于保持相对恒定,所以可以通过增加与开始相关联的能量来增强语音信号 和/或通过减小与音调周期结束相关联的能量。 优选地,在音调周期的较早部分中的能量增加量大约等于音调周期的稍后部分中的能量减少量。 以这种方式,总能量保持恒定。
    • 9. 发明授权
    • Method and apparatus for distributed voice searching
    • 分布式语音搜索的方法和装置
    • US07818170B2
    • 2010-10-19
    • US11733306
    • 2007-04-10
    • Yan Ming Cheng
    • Yan Ming Cheng
    • G10L17/00
    • H04M1/72561G06F17/30026G06F17/30899G10L15/08G10L15/30G10L2015/025G10L2015/221H04M2250/74
    • A method for distributed voice searching may include receiving a search query from a user of the mobile communication device, generating a lattice of coarse linguistic representations from speech parts in the search query, extracting query features from the generated lattice of coarse linguistic representations, generating coarse search feature vectors based on the extracted query features, performing a coarse search using the generated coarse search feature vectors and transmitting the generated coarse search feature vectors to a remote voice search processing unit, receiving remote resultant web indices from the remote voice search processing unit, generating a lattice of fine linguistic representations from speech parts in the search query, generating fine search feature vectors from the lattice of fine linguistic representations, performing a fine search using the coarse search results, the remote resultant web indices and the generated fine search feature vectors, and displaying the fine search results to the user.
    • 用于分布式语音搜索的方法可以包括从移动通信设备的用户接收搜索查询,从搜索查询中的语音部分生成粗略语言表示的格子,从生成的粗略语言表示的格子中提取查询特征,生成粗略 基于所提取的查询特征的搜索特征向量,使用所生成的粗略搜索特征向量执行粗略搜索,并将生成的粗略搜索特征向量发送到远程语音搜索处理单元,从远程语音搜索处理单元接收远程结果web索引, 从搜索查询中的语音部分生成精细语言表示的格子,从精细语言表示的格子生成精细搜索特征向量,使用粗略搜索结果,远程生成的网页索引和生成的精细搜索特征向量进行精细搜索 ,并显示t 他对用户的搜索结果很好。