会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 103. 发明公开
    • LOCAL AND REMOTE SPEECH PROCESSING
    • LOKALE UND ENTFERNTE SPRACHVERARBEITUNG
    • EP3047481A4
    • 2017-03-01
    • EP14846698
    • 2014-09-09
    • AMAZON TECH INC
    • STROM NIKKOVANLUND PETER SPALDINGHOFFMEISTER BJORN
    • G10L15/30G10L15/00G10L15/08G10L15/22G10L15/32
    • G10L15/22G10L15/30G10L15/32G10L2015/088G10L2015/223
    • A user device may be configured to detect a user-uttered trigger expression and to respond by interpreting subsequent words or phrases as commands. The commands may be recognized by sending audio containing the words or phrases to a remote service that is configured to perform speech recognition. Certain commands may be designated as local commands and may be detected locally rather than relying on the remote service. Upon detection of the trigger expression, audio is streamed to the remote service and also analyzed locally to detect utterances of local commands. Upon detecting a local command, a corresponding function is immediately initiated, and subsequent activities or responses by the remote service are canceled or ignored.
    • 用户设备可以被配置为检测用户发出的触发表达式并且通过将后续的单词或短语解释为命令来进行响应。 可以通过将包含单词或短语的音频发送到被配置为执行语音识别的远程服务来识别命令。 某些命令可以被指定为本地命令,并且可以被本地检测而不是依赖于远程服务。 在检测到触发表达式时,将音频流传输到远程服务,并在本地进行分析以检测本地命令的发声。 在检测到本地命令时,立即启动相应的功能,并且远程服务的后续活动或响应被取消或忽略。
    • 104. 发明公开
    • INDIVIDUALIZED HOTWORD DETECTION MODELS
    • 个性化的热点检测模型
    • EP3125234A1
    • 2017-02-01
    • EP16186281.8
    • 2016-07-12
    • Google Inc.
    • Guevara, Raziel Alvarez
    • G10L15/06G10L15/08G10L15/07
    • G10L17/04G10L15/02G10L15/063G10L15/07G10L15/075G10L15/1815G10L17/06G10L17/08G10L17/18G10L17/24G10L2015/0638G10L2015/088
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in a system. In one aspect, a method includes actions of obtaining enrollment acoustic data representing an enrollment utterance spoken by a user, obtaining a set of candidate acoustic data representing utterances spoken by other users, determining, for each candidate acoustic data of the set of candidate acoustic data, a similarity score that represents a similarity between the enrollment acoustic data and the candidate acoustic data, selecting a subset of candidate acoustic data from the set of candidate acoustic data based at least on the similarity scores, generating a detection model based on the subset of candidate acoustic data, and providing the detection model for use in detecting an utterance spoken by the user.
    • 包括编码在计算机存储介质上的计算机程序的方法,系统和装置,用于在系统中呈现通知。 在一个方面,一种方法包括以下动作:获取表示用户说出的注册话语的注册声学数据,获得表示由其他用户讲话的话语的一组候选声学数据,针对该组候选声学数据的每个候选声学数据 ,表示登记声学数据与候选声学数据之间的相似性的相似性分数,至少基于相似性分数从该组候选声学数据中选择候选声学数据的子集,基于子集 候选声学数据,并提供用于检测用户说出的话语的检测模型。
    • 110. 发明授权
    • Method and system for processing speech
    • 方法和系统语音处理
    • EP1652173B1
    • 2015-12-30
    • EP03739921.9
    • 2003-06-30
    • Chemtron Research LLC
    • ROY, Philippe
    • G10L15/08G10L15/187
    • G10L15/1822G10L15/08G10L2015/025
    • A system and method related to a new approach to speech recognition that reacts to concepts conveyed through speech. In its fullest implementation, the system and method shifts the balance of power in speech recognition from straight sound recognition and statistical models to a more powerful and complete approach determining and addressing conveyed concepts. This is done by using a probabilistically unbiased multi-phoneme recognition process, followed by a phoneme stream analysis process that builds the list of candidate words derived from recognized phonemes, followed by a permutation analysis process that produces sequences of candidate words with high potential of being syntactically valid, and finally, by processing targeted syntactic sequences in a conceptual analysis process to generate the utterance's conceptual representation that can be used to produce an adequate response. The invention can be employed for a myriad of applications, such as improving accuracy or automatically generating punctuation for transcription and dictation, word or concept spotting in audio streams, concept spotting in electronic text, customer support, call routing and other command/response scenarios.