    • 1. Granted invention patent
    • System and method for interfacing speech recognition grammars to individual components of a computer program
    • US06374226B1
    • 2002-04-16
    • US09369577
    • 1999-08-06
    • Andrew J. Hunt; William D. Walker; Johan Wouters
    • Andrew J. Hunt; William D. Walker; Johan Wouters
    • G10L15/28
    • G10L15/26; G10L15/18; G10L15/193; G10L2015/228
    • A system for incorporating speech recognition into a computer program, including a number of speech controller modules corresponding to program components within the computer program. Each speech controller module supports a speech recognition grammar having at least one rule, where the speech recognition grammar provides an interface to operations on the corresponding program component. The rules of the speech recognition grammar associate spoken commands with data stored in the corresponding program component. A rule may include a reference to another local rule, or to a rule in a different speech recognition grammar, in which case a “link” to the other rule is formed. In this way, the disclosed system allows rules from the same or different grammars to be combined together, in order to build complex grammars. Each speech controller module operates to dynamically enable one or more rules it contains within a speech recognizer, in response to detecting the occurrence of an associated enabling condition. The speech controller module receives a recognition result from the speech recognizer indicating that the speech recognizer has detected one or more tokens associated with an enabled rule. In response to receipt of the recognition result, a speech controller module operates to invoke a method on data within the corresponding program component, and passes the result on to other speech controller modules that are linked to the recognition rule corresponding to the result.
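The controller behaviour described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names `Rule` and `SpeechController`, the `link`/`enable`/`on_result` methods, and the calendar/date example are all hypothetical, and the speech recognizer itself is replaced by a direct call that hands over already-recognized tokens.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Rule:
    """One grammar rule: a name plus the method it invokes on component data."""
    name: str
    action: Callable[[dict, List[str]], None]
    enabled: bool = False


class SpeechController:
    """Hypothetical controller owning the grammar for one program component."""

    def __init__(self, name: str, data: dict) -> None:
        self.name = name
        self.data = data
        self.rules: Dict[str, Rule] = {}
        # rule name -> (controller, rule name) pairs that the rule references
        self.links: Dict[str, List[Tuple["SpeechController", str]]] = {}

    def add_rule(self, rule: Rule) -> None:
        self.rules[rule.name] = rule

    def link(self, rule_name: str, other: "SpeechController", other_rule: str) -> None:
        """Record that `rule_name` references a rule in another grammar."""
        self.links.setdefault(rule_name, []).append((other, other_rule))

    def enable(self, rule_name: str) -> None:
        """Called when the rule's enabling condition is detected."""
        self.rules[rule_name].enabled = True

    def on_result(self, rule_name: str, tokens: List[str]) -> bool:
        """Handle a recognition result and pass it on to linked controllers."""
        rule = self.rules[rule_name]
        if not rule.enabled:
            return False
        rule.action(self.data, tokens)
        for other, other_rule in self.links.get(rule_name, []):
            other.on_result(other_rule, tokens)
        return True


date = SpeechController("date", {"day": None})
date.add_rule(Rule("day", lambda data, toks: data.update(day=toks[-1])))

calendar = SpeechController("calendar", {"events": []})
calendar.add_rule(
    Rule("add_event", lambda data, toks: data["events"].append(" ".join(toks)))
)
calendar.link("add_event", date, "day")

date.enable("day")
calendar.enable("add_event")
calendar.on_result("add_event", ["meeting", "tomorrow"])
print(calendar.data, date.data)
```

In this sketch a disabled rule simply ignores results, which stands in for the rule not being loaded into the recognizer.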
    • 3. Invention patent application
    • Programmable remote control and method for programming a programmable remote control, a readable memory and a program
    • US20050151726A1
    • 2005-07-14
    • US10509238
    • 2003-02-27
    • Johan Wouters
    • Johan Wouters
    • H04N5/00; G08C19/28; H04Q9/00; G09G5/00
    • G08C19/28
    • A remote control comprises object keys and a selector for linking preset IR or RF code sets to the object keys. The remote control comprises: a selector for the selection of an IR or RF preset code set by a user; an activator for the activation of one or more links between an element of said preset IR or RF code set and an object key by the user after the selection; a selector for the subsequent selection of a further preset code set by the user; and repeat means for repeating the steps a and b for the further preset code set until all object keys have been linked or the user terminates the process. This allows the user to combine more than one preset code set into a single code set.
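The programming loop in this abstract amounts to merging several preset code sets into one key map. The Python sketch below illustrates that idea only; the `PRESETS` table, the code values, and the `program_remote` function are hypothetical, and "steps a and b" are read here as selecting a preset and then linking its elements to object keys.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical preset code sets: device name -> {function name: IR/RF code}
PRESETS: Dict[str, Dict[str, int]] = {
    "tv": {"power": 0x10, "volume_up": 0x11},
    "dvd": {"power": 0x20, "play": 0x21},
}


def program_remote(
    object_keys: List[str],
    selections: Iterable[Tuple[str, Dict[str, str]]],
) -> Dict[str, Tuple[str, int]]:
    """Combine elements of several preset code sets into a single code set."""
    combined: Dict[str, Tuple[str, int]] = {}
    for preset_name, links in selections:      # the user selects a preset code set
        preset = PRESETS[preset_name]
        for key, function in links.items():    # the user links elements to object keys
            if key not in object_keys:
                raise KeyError(f"unknown object key: {key}")
            combined[key] = (preset_name, preset[function])
        if len(combined) == len(object_keys):  # every object key has been linked
            break
    return combined


remote = program_remote(
    ["key1", "key2", "key3"],
    [("tv", {"key1": "power", "key2": "volume_up"}), ("dvd", {"key3": "play"})],
)
print(remote)
```

Ending the `selections` iterable early corresponds to the user terminating the process before all keys are linked.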
    • 5. Granted invention patent
    • Speech synthesis with dynamic constraints
    • US08301451B2
    • 2012-10-30
    • US12457911
    • 2009-06-25
    • Johan Wouters
    • Johan Wouters
    • G10L13/00; G10L13/08
    • G10L13/07
    • A method is disclosed for providing speech parameters to be used for synthesis of a speech utterance. In at least one embodiment, the method includes receiving an input time series of first speech parameter vectors, preparing at least one input time series of second speech parameter vectors consisting of dynamic speech parameters, extracting from the input time series of first and second speech parameter vectors partial time series of first speech parameter vectors and corresponding partial time series of second speech parameter vectors, converting the corresponding partial time series of first and second speech parameter vectors into partial time series of third speech parameter vectors, wherein the conversion is done independently for each set of partial time series and can be started as soon as the vectors of the input time series of the first speech parameter vectors have been received. The speech parameter vectors of the partial time series of third speech parameter vectors are combined to form a time series of output speech parameter vectors to be used for synthesis of the speech utterance. At least one embodiment of the method allows a continuous providing of speech parameter vectors for synthesis of the speech utterance. The latency and the memory requirements for the synthesis of a speech utterance are reduced.
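The chunk-wise conversion described in this abstract can be sketched in Python with NumPy. This is an illustration under stated assumptions, not the patented method: each parameter is treated as a scalar track, the conversion is a plain least-squares fit that balances the static values against the dynamic (first-difference) values, and the names `smooth_chunk` and `convert_stream` and the `chunk`, `overlap`, and `weight` parameters are hypothetical.

```python
import numpy as np


def smooth_chunk(static: np.ndarray, delta: np.ndarray, weight: float = 1.0) -> np.ndarray:
    """Least-squares track x with x[t] near static[t] and x[t]-x[t-1] near delta[t]."""
    n = len(static)
    identity = np.eye(n)
    # First-difference operator; row 0 is dropped because x[-1] does not exist.
    diff = (np.eye(n) - np.eye(n, k=-1))[1:]
    a = np.vstack([identity, np.sqrt(weight) * diff])
    b = np.concatenate([static, np.sqrt(weight) * delta[1:]])
    x, *_ = np.linalg.lstsq(a, b, rcond=None)
    return x


def convert_stream(static, delta, chunk: int = 8, overlap: int = 2, weight: float = 1.0) -> np.ndarray:
    """Convert partial time series independently and combine their central parts."""
    static = np.asarray(static, dtype=float)
    delta = np.asarray(delta, dtype=float)
    n = len(static)
    pieces = []
    for start in range(0, n, chunk):
        lo = max(0, start - overlap)
        hi = min(n, start + chunk + overlap)
        # Each partial series needs only frames lo..hi, so it can start early.
        x = smooth_chunk(static[lo:hi], delta[lo:hi], weight)
        keep = min(chunk, n - start)
        pieces.append(x[start - lo:start - lo + keep])
    return np.concatenate(pieces)


static = np.arange(20.0)  # a linear ramp
delta = np.ones(20)       # dynamic parameters consistent with the ramp
out = convert_stream(static, delta)
print(out.round(3))
```

Because every chunk depends only on a bounded window of input frames, memory use and latency are set by `chunk + 2 * overlap` rather than by the utterance length, which is the property the abstract emphasizes.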
    • 6. Invention patent application
    • Text to speech synthesis
    • US20090076819A1
    • 2009-03-19
    • US11709056
    • 2007-02-22
    • Johan Wouters; Christof Traber; Marcel Riedi; Martin Reber; Jurgen Keller
    • Johan Wouters; Christof Traber; Marcel Riedi; Martin Reber; Jurgen Keller
    • G10L13/00; G10L13/08
    • G10L13/033; G10L13/07
    • An input linguistic description is converted into a speech waveform by deriving at least one target unit sequence corresponding to the linguistic description, selecting from a waveform unit database for the target unit sequences a plurality of alternative unit sequences approximating the target unit sequences, concatenating the alternative unit sequences to alternative speech waveforms and presenting the alternative speech waveforms to an operating person and enabling the choice of one of the presented alternative speech waveforms. There are no iterative cycles of manual modification and automatic selection, which enables a fast way of working. The operator does not need knowledge of units, targets, and costs, but chooses from a set of given alternatives. The fine-tuning of TTS prompts therefore becomes accessible to non-experts.
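The selection of several alternative unit sequences can be illustrated with a small Python sketch. It is only an assumption-laden toy: units are reduced to a `(unit_id, pitch)` pair, the target and join costs are simple pitch differences, the candidates are enumerated exhaustively, and the function name `n_best_sequences` and the sample database are hypothetical.

```python
import heapq
from itertools import product
from typing import Dict, List, Tuple

Unit = Tuple[str, float]  # (unit id, pitch in Hz); a deliberately tiny unit model


def n_best_sequences(
    targets: List[Tuple[str, float]],
    database: Dict[str, List[Unit]],
    n: int = 3,
) -> List[Tuple[Unit, ...]]:
    """Return the n lowest-cost unit sequences for the target sequence."""
    pools = [database[phone] for phone, _ in targets]

    def cost(seq: Tuple[Unit, ...]) -> float:
        # Target cost: distance of each unit from the pitch requested for it.
        target = sum(abs(unit[1] - want) for unit, (_, want) in zip(seq, targets))
        # Join cost: pitch jump between neighbouring units.
        join = sum(abs(a[1] - b[1]) for a, b in zip(seq, seq[1:]))
        return target + join

    # Exhaustive enumeration is fine for a toy database, not for a real one.
    return heapq.nsmallest(n, product(*pools), key=cost)


database = {
    "h": [("h1", 100.0), ("h2", 120.0)],
    "i": [("i1", 105.0), ("i2", 140.0)],
}
alternatives = n_best_sequences([("h", 100.0), ("i", 100.0)], database, n=2)
for index, sequence in enumerate(alternatives):
    print(index, [unit_id for unit_id, _ in sequence])

chosen = alternatives[0]  # stands in for the operator picking one alternative
```

The operator only sees the indexed alternatives and picks one; the costs stay internal, which matches the abstract's point that no knowledge of units, targets, or costs is needed.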
    • 7. Invention patent application
    • Personal digital assistant device with stylus
    • US20060055683A1
    • 2006-03-16
    • US10521710
    • 2003-07-04
    • Johan Wouters
    • Johan Wouters
    • G09G5/00
    • G06F3/0481; G06F1/1626; G06F2200/1632
    • A portable stylus information input processing apparatus (1) with a removable stylus (2) and a stylus housing (9), a user-interface (UI) (3), said apparatus comprising a computer program for running a user-interactive application on or via the apparatus, or a means for inputting such a computer program, interaction taking place by contact between the stylus and the user interface (UI), characterized in that the apparatus comprises means for generating a release signal (6) generated by a program run on or via the apparatus to release the stylus, the stylus housing comprising a receiver for receiving said release signal and a release mechanism (7) for releasing the stylus in response to the release signal.
    • 9. Granted invention patent
    • Text to speech synthesis
    • US07979280B2
    • 2011-07-12
    • US11709056
    • 2007-02-22
    • Johan Wouters; Christof Traber; Marcel Riedi; Martin Reber; Jürgen Keller
    • Johan Wouters; Christof Traber; Marcel Riedi; Martin Reber; Jürgen Keller
    • G10L13/06
    • G10L13/033; G10L13/07
    • An input linguistic description is converted into a speech waveform by deriving at least one target unit sequence corresponding to the linguistic description, selecting from a waveform unit database for the target unit sequences a plurality of alternative unit sequences approximating the target unit sequences, concatenating the alternative unit sequences to alternative speech waveforms and presenting the alternative speech waveforms to an operating person and enabling the choice of one of the presented alternative speech waveforms. There are no iterative cycles of manual modification and automatic selection, which enables a fast way of working. The operator does not need knowledge of units, targets, and costs, but chooses from a set of given alternatives. The fine-tuning of TTS prompts therefore becomes accessible to non-experts.
    • 10. Granted invention patent
    • Speech enhancement techniques on the power spectrum
    • US09031834B2
    • 2015-05-12
    • US13393667
    • 2009-09-04
    • Geert Coorman; Johan Wouters
    • Geert Coorman; Johan Wouters
    • G10L21/00; G10L21/02; G10L21/0232; G10L21/003; G10L13/033
    • G10L21/0205; G10L13/033; G10L21/003; G10L21/0232; G10L21/0364
    • The method provides a spectral speech description to be used for synthesis of a speech utterance, where at least one spectral envelope input representation is received. In one solution the improvement is made by manipulating an extremum, i.e. a peak or a valley, in the rapidly varying component of the spectral envelope representation. The rapidly varying component of the spectral envelope representation is manipulated to sharpen and/or accentuate extrema, after which it is merged back with the slowly varying component or the spectral envelope input representation to create an enhanced spectral envelope final representation. In other solutions a complex spectrum envelope final representation is created with phase information derived from one of the group delay representation of a real spectral envelope input representation corresponding to a short-time speech signal and a transformed phase component of the discrete complex frequency domain input representation corresponding to the speech utterance.
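The first solution in this abstract (split the envelope, accentuate the rapidly varying part, merge back) can be sketched in Python with NumPy. The sketch makes its own assumptions: the envelope is a log-magnitude array, the slowly varying component is a moving average, and the accentuation is a uniform gain on the residual. The function name `enhance_envelope` and the `width` and `gain` parameters are hypothetical, and the phase-related solutions are not covered.

```python
import numpy as np


def enhance_envelope(log_env: np.ndarray, width: int = 9, gain: float = 1.5) -> np.ndarray:
    """Accentuate peaks and valleys of a log-spectral envelope.

    `width` must be odd so that the smoothed track keeps the input length.
    """
    kernel = np.ones(width) / width
    padded = np.pad(log_env, width // 2, mode="edge")
    slow = np.convolve(padded, kernel, mode="valid")  # slowly varying component
    fast = log_env - slow                             # rapidly varying component
    return slow + gain * fast                         # merge the two back together


envelope = np.zeros(32)
envelope[16] = 1.0  # a single spectral peak
enhanced = enhance_envelope(envelope)
print(envelope[16], round(float(enhanced[16]), 3))
```

With `gain` above 1 the peak rises and the bins next to it dip below the original, which sharpens the extremum; `gain=1.0` returns the input unchanged.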