会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • METHOD AND SYSTEM FOR PERFORMING AN AUDIO INFORMATION COLLECTION AND QUERY
    • 执行音频信息收集和查询的方法和系统
    • WO2014117578A1
    • 2014-08-07
    • PCT/CN2013/087827
    • 2013-11-26
    • TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    • ZHANG, XiaolongZHANG, BinLI, DeyuanLIU, HailongHOU, JieXIE, Dadong
    • G06F17/30
    • G06F17/30749G06F17/30769
    • An electronic device with one or more processors, memory, and a display detects a first trigger event and, in response to detecting the first trigger event, collects a audio sample of environmental audio data associated with a media item. The device transmits information corresponding to the audio sample to a server. In response to transmitting the information, the device obtains attribute information corresponding to the audio sample, where the attribute information includes metadata for the media item, a time indicator of a position of the audio sample in the media item, and stream information for the media item. The device displays a portion of the attribute information. The device detects a second trigger event and, in response to detecting the second trigger event: determines a last obtained time indicator; streams the media item based on the stream information; and presents the media item from the last obtained time indicator.
    • 具有一个或多个处理器,存储器和显示器的电子设备检测第一触发事件,并且响应于检测到第一触发事件,收集与媒体项目相关联的环境音频数据的音频样本。 设备将与音频样本相对应的信息发送到服务器。 响应于发送信息,设备获取对应于音频样本的属性信息,其中属性信息包括媒体项目的元数据,媒体项目中的音频样本的位置的时间指示符,以及用于媒体的流信息 项目。 设备显示属性信息的一部分。 该装置检测第二触发事件,并响应于检测到第二触发事件:确定最后获得的时间指示符; 基于流信息流媒体项目; 并从最后获得的时间指示器呈现媒体项目。
    • 3. 发明申请
    • METHOD AND SYSTEM FOR RECOGNIZING SPEECH COMMANDS
    • 用于识别语音命令的方法和系统
    • WO2014117544A1
    • 2014-08-07
    • PCT/CN2013/085738
    • 2013-11-21
    • TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    • YUE, ShuaiLU, LiZHANG, XiangXIE, DadongLIU, HaiboCHEN, BoLIU, Jian
    • G10L15/20
    • G10L15/14G10L15/063G10L15/083G10L15/32G10L2015/088G10L2015/223
    • A method of recognizing speech commands includes generating a background acoustic model for a sound using a first sound sample, the background acoustic model characterized by a first precision metric. A foreground acoustic model is generated for the sound using a second sound sample, the foreground acoustic model characterized by a second precision metric. A third sound sample is received and decoded by assigning a weight to the third sound sample corresponding to a probability that the sound sample originated in a foreground using the foreground acoustic model and the background acoustic model. The method further includes determining if the weight meets predefined criteria for assigning the third sound sample to the foreground and, when the weight meets the predefined criteria, interpreting the third sound sample as a portion of a speech command. Otherwise, recognition of the third sound sample as a portion of a speech command is forgone.
    • 识别语音命令的方法包括使用第一声音样本产生用于声音的背景声学模型,所述背景声学模型由第一精度度量表征。 使用第二声音样本为声音生成前景声学模型,前景声学模型以第二精度度量为特征。 通过使用前景声学模型和背景声学模型通过对与声音样本始发于前景的概率相对应的第三声音样本分配权重来接收和解码第三声音样本。 该方法还包括确定权重是否满足用于将第三声音样本分配给前景的预定准则,并且当权重满足预定标准时,将第三声音样本解释为语音命令的一部分。 否则,放弃了作为语音命令的一部分的第三声音样本的识别。
    • 4. 发明申请
    • METHOD AND SYSTEM FOR TESTING AND MONITORING REAL-TIME STREAMING MEDIA RECOGNITION SERVICE PROVIDER
    • 实时流媒体识别服务提供商的测试与监控方法与系统
    • WO2016004809A1
    • 2016-01-14
    • PCT/CN2015/081331
    • 2015-06-12
    • TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    • LIU, JianXIE, DadongHOU, JieLIU, HailongCHEN, Bo
    • H04N21/24
    • H04L41/5038H04L65/4076H04L65/607H04L67/42H04N17/004H04N21/24
    • A method of testing and monitoring a real-time streaming media recognition service provider is performed at a computer system. The computer system obtains a streaming media signal source, selects a testing sample from the streaming media signal source, records characteristics of the testing sample, and obtains an expected output according to the characteristics of the testing sample. Next, the computer system converts the testing sample into a digital streaming format preset by the service provider and initiates a media recognition request according to the testing sample in the digital streaming format to the service provider. After receiving a media recognition result of the testing sample returned by the service provider according to the media recognition request, the computer system compares the media recognition result with the expected output and indicates whether the service provider is normal in accordance with the comparison result.
    • 在计算机系统中执行测试和监视实时流媒体识别服务提供商的方法。 计算机系统获取流媒体信号源,从流媒体信号源选择测试样本,记录测试样本的特征,并根据测试样本的特征获得预期输出。 接下来,计算机系统将测试样本转换成由服务提供商预设的数字流格式,并根据数字流格式的测试样本向服务提供商发起媒体识别请求。 在根据媒体识别请求收到服务提供商返回的测试样本的媒体识别结果之后,计算机系统将媒体识别结果与预期输出进行比较,并根据比较结果指示服务提供商是否正常。
    • 6. 发明申请
    • METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION
    • 自动语音识别方法与系统
    • WO2014117555A1
    • 2014-08-07
    • PCT/CN2013/086707
    • 2013-11-07
    • TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    • RAO, FengLU, LiCHEN, BoYUE, ShuaiZHANG, XiangWANG, EryuXIE, DadongLI, LouLU, Duling
    • G10L15/00G10L15/02G10L15/14G10L15/22G10L15/26
    • G10L15/197
    • An automatic speech recognition method includes at a computer having one or more processors and a memory for storing one or more programs to be executed by the processors, obtaining a plurality of speech corpus categories through classifying and calculating raw speech corpus (801); obtaining a plurality of classified language models that respectively correspond to the plurality of speech corpus categories through language model training applied on each speech corpus category (802); obtaining an interpolation language model through implementing a weighted interpolation on each classified language model and merging the interpolated plurality of classified language models (803); constructing a decoding resource in accordance with an acoustic model and the interpolation language model (804); decoding input speech using the decoding resource, and outputting a character string with a highest probability as the recognition result of the input speech (805).
    • 自动语音识别方法包括在具有一个或多个处理器的计算机和用于存储要由处理器执行的一个或多个程序的存储器,通过分类和计算原始语音语料库(801)获得多个语音语料库类别; 通过在每个语音语料库类别(802)上应用的语言模型训练获得分别对应于多个语音语料库类别的多个分类语言模型; 通过对每个分类语言模型实施加权内插并合并内插多个分类语言模型(803)来获得内插语言模型; 根据声学模型和内插语言模型构造解码资源(804); 使用解码资源解码输入语音,并输出具有最高概率的字符串作为输入语音的识别结果(805)。
    • 9. 发明申请
    • METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION
    • 自动语音识别方法与系统
    • WO2014117577A1
    • 2014-08-07
    • PCT/CN2013/087816
    • 2013-11-26
    • TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    • YUE, ShuaiLU, LiZHANG, XiangXIE, DadongCHEN, BoRAO, Feng
    • G10L15/193G10L15/28G10L15/00
    • G10L15/193G10L15/083
    • A method and system for automatic speech recognition is provided. The method includes generating a decoding network that includes a primary sub-network and a classification sub-network. The primary sub-network includes a classification node corresponding to the classification sub-network. The classification sub-network corresponds to a group of uncommon words. Speech input is received and decoded by instantiating a token in the primary sub-network and passing the token through the primary network. When the token reaches the classification node, the method includes transferring the token to the classification sub-network and passing the token through the classification sub-network. When the token reaches an accept node of the classification sub-network, the method includes returning a result of the token passing through the classification sub-network to the primary sub-network. The result includes one or more words in the group of uncommon words. A string corresponding to the speech input is output that includes the one or more words.
    • 提供了一种自动语音识别的方法和系统。 该方法包括生成包括主子网和分类子网的解码网络。 主子网包括与分类子网对应的分类节点。 分类子网对应于一组不常见的单词。 通过在主子网中实例化令牌并传递令牌通过主网络来接收和解码语音输入。 当令牌到达分类节点时,该方法包括将令牌传送到分类子网,并通过分类子网传递令牌。 当令牌到达分类子网络的接受节点时,该方法包括将通过分类子网络的令牌的结果返回到主子网络。 结果包括不常见词组中的一个或多个单词。 输出与语音输入对应的字符串,其中包含一个或多个单词。
    • 10. 发明申请
    • METHOD AND DEVICE FOR AUDIO RECOGNITION
    • 用于音频识别的方法和装置
    • WO2014117542A1
    • 2014-08-07
    • PCT/CN2013/085309
    • 2013-10-16
    • TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    • LIU, HailongXIE, DadongHOU, JieXIAO, BinLIU, XiaoCHEN, Bo
    • G10L15/30
    • G10L15/30G06F17/30743G10H1/00G10H5/005G10L25/18G10L25/51
    • A method and device for performing audio recognition, including: collecting a first audio document to be recognized; initiating calculation of first characteristic information of the first audio document, including: conducting time-frequency analysis for the first audio document to generate a first preset number of phase channels; and extracting at least one peak value characteristic point from each phase channel of the first preset number of phrase channels, where the at least one peak value characteristic point of each phase channel constitutes the peak value characteristic point sequence of said each phase channel; and obtaining a recognition result for the first audio document, wherein the recognition result is identified based on the first characteristic information, and wherein the first characteristic information is calculated based on the respective peak value characteristic point sequences of the preset number of phase channels.
    • 一种用于执行音频识别的方法和装置,包括:收集要识别的第一音频文档; 开始计算第一音频文档的第一特征信息,包括:对第一音频文档进行时间 - 频率分析以产生第一预设数量的相位通道; 以及从所述第一预设数量的短语通道的每个相位通道提取至少一个峰值特征点,其中每个相位通道的至少一个峰值特征点构成所述每个相位通道的峰值特征点序列; 并且获得第一音频文档的识别结果,其中基于第一特征信息识别识别结果,并且其中基于预设数量的相位通道的相应峰值特征点序列来计算第一特征信息。