会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明申请
    • SYSTEMS AND METHODS FOR PREDICTING PRONUNCIATIONS WITH WORD STRESS
    • 用词应力预测语音的系统和方法
    • WO2017213696A1
    • 2017-12-14
    • PCT/US2016/065759
    • 2016-12-09
    • GOOGLE LLC
    • CHUA, Mason VijayRAO, Kanury KanishkaVAN ESCH, Daniel Jacobus Josef
    • G10L13/10G10L25/30G10L15/187
    • G10L13/10G10L13/0335G10L13/047G10L13/08G10L15/02G10L15/063G10L15/16G10L15/1815G10L15/187G10L17/18G10L25/30G10L2015/027
    • Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating word pronunciations. One of the methods includes determining, by one or more computers, spelling data that indicates the spelling of a word, providing the spelling data as input to a trained recurrent neural network, the trained recurrent neural network being trained to indicate characteristics of word pronunciations based at least on data indicating the spelling of words, receiving output indicating a stress pattern for pronunciation of the word generated by the trained recurrent neural network in response to providing the spelling data as input, using the output of the trained recurrent neural network to generate pronunciation data indicating the stress pattern for a pronunciation of the word, and providing, by the one or more computers, the pronunciation data to a text-to-speech system or an automatic speech recognition system.
    • 包括在计算机存储介质上编码的计算机程序的方法,系统和装置,用于生成单词发音。 其中一种方法包括由一个或多个计算机确定指示单词拼写的拼写数据,将拼写数据作为输入提供给训练的递归神经网络,训练后的递归神经网络被训练以指示基于单词发音的特征 至少在指示单词拼写的数据上,接收指示由训练的回归神经网络响应于提供拼写数据作为输入而生成的单词的发音的压力模式的输出,使用训练的回归神经网络的输出来生成发音 指示单词发音的压力模式的数据,以及通过一个或多个计算机将发音数据提供给文本到语音系统或自动语音识别系统。
    • 6. 发明申请
    • MULTI-SPEAKER SPEECH SEPARATION
    • 多音箱语音分离
    • WO2017112466A1
    • 2017-06-29
    • PCT/US2016/066430
    • 2016-12-14
    • MICROSOFT TECHNOLOGY LICENSING, LLC
    • YU, Dong
    • G10L15/16G06N3/04G10L15/07G10L21/0272G10L17/18
    • G10L25/30G06N3/0445G10L15/07G10L15/16G10L15/197G10L15/20G10L15/22G10L15/26G10L17/18G10L21/0272G10L25/18G10L25/21G10L2015/223
    • The technology described herein uses a multiple-output layer RNN to process an acoustic signal comprising speech from multiple speakers to trace an individual speaker's speech. The multiple-output layer RNN has multiple output layers, each of which is meant to trace one speaker (or noise) and represent the mask for that speaker (or noise). The output layer for each speaker (or noise) can have the same dimensions and can be normalized for each output unit across all output layers. The rest of the layers in the multiple-output layer RNN are shared across all the output layers. The result from the previous frame is used as input to the output layer or to one of the hidden layers of the RNN to calculate results for the current frame. This pass back of results allows the model to carry information from previous frames to future frames to trace the same speaker.
    • 这里描述的技术使用多输出层RNN来处理包括来自多个扬声器的语音的声学信号以跟踪个体说话者的语音。 多输出层RNN具有多个输出层,每个输出层意味着跟踪一个扬声器(或噪声)并表示该扬声器(或噪声)的掩模。 每个扬声器(或噪声)的输出层可以具有相同的尺寸,并且可以针对所有输出层中的每个输出单元进行归一化。 多输出层RNN中的其余层在所有输出层之间共享。 来自前一帧的结果被用作输出层或RNN的隐藏层之一的输入,以计算当前帧的结果。 这种回传结果允许模型将来自先前帧的信息携带到未来帧以跟踪相同的说话者。
    • 8. 发明申请
    • DEVICE IMPAIRMENT DETECTION
    • 器件损害检测
    • WO2017049164A1
    • 2017-03-23
    • PCT/US2016/052258
    • 2016-09-16
    • SONOS, INC.
    • HARTUNG, KlausBRIGHT, Greg
    • H04R3/04H04S7/00G06N3/02
    • G10L25/30G06F3/16G06N3/02G06N3/084H04R3/04H04R27/00H04R29/007H04R2227/005H04S7/301
    • Examples described herein involve detecting known impairments or other known conditions using a neural network. An example implementation involves receiving data indicating a response of a playback device as captured by a microphone. The implementation also involves determining an input vector by projecting a response vector that represents the response of the playback device onto a principle component matrix representing variance caused by one or more known impairments. The implementation further involves providing the determined input vector to a neural network that includes an output layer comprising neurons that correspond to respective known impairments. The implementation involves detecting that the input vector caused one or more neurons of the neural network to fire such that the neural network indicates that a particular known impairment is affecting the microphone and/or the playback device and adjusting operation of the playback device and/or the microphone to offset the particular known impairment.
    • 本文描述的示例涉及使用神经网络来检测已知的损伤或其他已知的条件。 示例实现涉及接收指示由麦克风捕获的播放设备的响应的数据。 该实现还包括通过将表示回放设备的响应的响应向量投影到表示由一个或多个已知损伤引起的方差的原理分量矩阵上来确定输入向量。 该实现还包括将确定的输入向量提供给神经网络,该神经网络包括包括对应于各自已知损伤的神经元的输出层。 该实现涉及检测输入向量导致神经网络的一个或多个神经元触发,使得神经网络指示特定的已知损伤影响麦克风和/或播放设备并且调整播放设备的操作和/或 麦克风抵消特定的已知损伤。