    • 2. Granted invention patent
    • Keyword detection with international phonetic alphabet by foreground model and background model
    • US09466289B2
    • 2016-10-11
    • US14103775
    • 2013-12-11
    • Tencent Technology (Shenzhen) Company Limited
    • Li Lu; Xiang Zhang; Shuai Yue; Feng Rao; Eryu Wang; Lu Li
    • G10L15/06; G10L15/08
    • G10L15/063; G10L15/08; G10L2015/088
    • An electronic device with one or more processors and memory trains an acoustic model with an international phonetic alphabet (IPA) phoneme mapping collection and audio samples in different languages, where the acoustic model includes: a foreground model; and a background model. The device generates a phone decoder based on the trained acoustic model. The device collects keyword audio samples, decodes the keyword audio samples with the phone decoder to generate phoneme sequence candidates, and selects a keyword phoneme sequence from the phoneme sequence candidates. After obtaining the keyword phoneme sequence, the device detects one or more keywords in an input audio signal with the trained acoustic model, including: matching phonemic keyword portions of the input audio signal with phonemes in the keyword phoneme sequence with the foreground model; and filtering out phonemic non-keyword portions of the input audio signal with the background model.
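The foreground/background split described in this abstract can be illustrated with a small sketch. This is not the patented method: it assumes per-frame phoneme log-likelihoods are already available from a foreground model and that a background model supplies one log-likelihood per frame. The names `keyword_score`, `detect_keyword`, `FRAMES`, `BACKGROUND`, the `floor` value, and the threshold are all hypothetical. The score is the best log-likelihood ratio of any contiguous frame span aligned to the keyword phoneme sequence, with frames outside the span left to the background model.

```python
import math
from typing import Dict, List, Sequence


def keyword_score(
    frames: Sequence[Dict[str, float]],
    keyword: Sequence[str],
    background: Sequence[float],
    floor: float = -20.0,
) -> float:
    """Best foreground-vs-background log-likelihood ratio for the keyword.

    frames[t][p] is the (assumed) foreground log-likelihood of phoneme p at
    frame t; background[t] is the (assumed) background log-likelihood of
    frame t. Each keyword phoneme must cover at least one frame.
    """
    if not keyword:
        raise ValueError("keyword phoneme sequence must not be empty")
    best = -math.inf
    prev: List[float] = [-math.inf] * len(keyword)
    for frame, bg in zip(frames, background):
        # Log-likelihood ratio of each keyword phoneme against the background.
        llr = [frame.get(p, floor) - bg for p in keyword]
        cur = [0.0] * len(keyword)
        # Start the keyword here (earlier frames absorbed by the background,
        # contributing a ratio of 0) or stay in the first phoneme.
        cur[0] = max(0.0, prev[0]) + llr[0]
        for j in range(1, len(keyword)):
            # Stay in phoneme j or advance from phoneme j - 1.
            cur[j] = max(prev[j], prev[j - 1]) + llr[j]
        best = max(best, cur[-1])
        prev = cur
    return best


def detect_keyword(frames, keyword, background, threshold: float = 0.0) -> bool:
    """Report a detection when the foreground beats the background."""
    return keyword_score(frames, keyword, background) > threshold


# Toy data: four frames, with "n" then "i" clearly present in frames 1 and 2.
FRAMES = [
    {"n": -5.0, "i": -5.0},
    {"n": -1.0, "i": -6.0},
    {"n": -6.0, "i": -1.0},
    {"n": -5.0, "i": -5.0},
]
BACKGROUND = [-2.0, -3.0, -3.0, -2.0]

print(detect_keyword(FRAMES, ["n", "i"], BACKGROUND))
print(detect_keyword(FRAMES, ["i", "n"], BACKGROUND))
```

In this toy setup the phoneme order matters: the same frames score well for the sequence "n", "i" and poorly for the reversed sequence, which is the behavior a keyword phoneme sequence is meant to capture.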
    • 3. Granted invention patent
    • Keyword detection for speech recognition
    • US09230541B2
    • 2016-01-05
    • US14567969
    • 2014-12-11
    • Tencent Technology (Shenzhen) Company Limited
    • Lu Li; Li Lu; Jianxiong Ma; Linghui Kong; Feng Rao; Shuai Yue; Xiang Zhang; Haibo Liu; Eryu Wang; Bo Chen
    • G10L15/08
    • G10L15/08; G10L15/083; G10L2015/088
    • This application discloses a method of recognizing a keyword in speech that includes a sequence of audio frames, the sequence including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and is used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated based at least on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.
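The cross-language penalty step in this abstract can be sketched as follows. The abstract does not say how the penalty factor is applied or what the keyword determination criterion is, so this sketch assumes a multiplicative penalty and a simple threshold; `WordHypothesis`, `updated_confidence`, `accept_sequence`, and all numeric values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WordHypothesis:
    """A word proposed by the decoding network, tagged with its language."""
    text: str
    language: str


def updated_confidence(
    confidence: float,
    candidate: WordHypothesis,
    option: WordHypothesis,
    penalty: float = 0.5,
) -> float:
    """Scale the sequence confidence when the two words differ in language.

    Assumption: the penalty is a multiplicative factor in (0, 1].
    """
    if candidate.language != option.language:
        return confidence * penalty
    return confidence


def accept_sequence(
    confidence: float,
    candidate: WordHypothesis,
    option: WordHypothesis,
    threshold: float = 0.6,
    penalty: float = 0.5,
) -> bool:
    """Assumed criterion: accept when the updated confidence meets a threshold."""
    return updated_confidence(confidence, candidate, option, penalty) >= threshold


same_language = accept_sequence(
    0.8, WordHypothesis("hello", "en"), WordHypothesis("world", "en")
)
mixed_language = accept_sequence(
    0.8, WordHypothesis("hello", "en"), WordHypothesis("你好", "zh")
)
print(same_language, mixed_language)
```

With these assumed values, a sequence whose two words share a language keeps its confidence and is accepted, while a mixed-language sequence with the same starting confidence is penalized below the threshold.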
    • 10. Granted invention patent
    • User authentication method and apparatus based on audio and video data
    • US09177131B2
    • 2015-11-03
    • US14262665
    • 2014-04-25
    • Tencent Technology (Shenzhen) Company Limited
    • Xiang Zhang; Li Lu; Eryu Wang; Shuai Yue; Feng Rao; Haibo Liu; Lou Li; Duling Lu; Bo Chen
    • H04L29/06; G06F21/32
    • G06F21/32; G06F2221/2117
    • A computer-implemented method is performed at a server having one or more processors and memory storing programs executed by the one or more processors for authenticating a user from video and audio data. The method includes: receiving a login request from a mobile device, the login request including video data and audio data; extracting a group of facial features from the video data; extracting a group of audio features from the audio data and recognizing a sequence of words in the audio data; and identifying a first user account whose respective facial features match the group of facial features and a second user account whose respective audio features match the group of audio features. If the first user account is the same as the second user account, the method retrieves the sequence of words associated with the user account and compares the sequences of words for authentication purposes.
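The decision flow in this abstract (match a face, match a voice, check that both point to the same account, then compare word sequences) can be sketched as below. Feature extraction and speech recognition are out of scope here; the sketch assumes features arrive as small numeric vectors and that "match" means nearest enrolled account within a Euclidean distance limit. `Account`, `closest_account`, `authenticate`, and `max_dist` are hypothetical names, not taken from the patent.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Account:
    """An enrolled user with stored features and a stored word sequence."""
    user_id: str
    face: Tuple[float, ...]
    voice: Tuple[float, ...]
    words: Tuple[str, ...]


def closest_account(
    accounts: Sequence[Account],
    features: Sequence[float],
    attr: str,
    max_dist: float,
) -> Optional[Account]:
    """Return the enrolled account nearest to `features`, if close enough."""
    best, best_dist = None, max_dist
    for account in accounts:
        dist = math.dist(getattr(account, attr), features)
        if dist <= best_dist:
            best, best_dist = account, dist
    return best


def authenticate(
    accounts: Sequence[Account],
    face: Sequence[float],
    voice: Sequence[float],
    spoken_words: Sequence[str],
    max_dist: float = 0.5,
) -> Optional[str]:
    """Return the user id on success, or None if any check fails."""
    by_face = closest_account(accounts, face, "face", max_dist)
    by_voice = closest_account(accounts, voice, "voice", max_dist)
    if by_face is None or by_voice is None:
        return None
    if by_face.user_id != by_voice.user_id:
        return None  # face and voice point to different accounts
    stored = tuple(w.lower() for w in by_face.words)
    spoken = tuple(w.lower() for w in spoken_words)
    return by_face.user_id if stored == spoken else None


ACCOUNTS = [
    Account("alice", (0.1, 0.2), (0.9, 0.8), ("open", "sesame")),
    Account("bob", (0.8, 0.9), (0.2, 0.1), ("blue", "sky")),
]

print(authenticate(ACCOUNTS, (0.12, 0.19), (0.88, 0.82), ["Open", "Sesame"]))
```

Requiring the face match and the voice match to agree before comparing word sequences means a login with one user's face and another user's voice is rejected without ever consulting a stored word sequence.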