会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
    • 用于具有大词汇的自动语音识别的声学和语言建模的系统和方法
    • US07801727B2
    • 2010-09-21
    • US11064643
    • 2005-02-24
    • Ponani GopalakrishnanDimitri KanevskyMichael Daniel MonkowskiJan Sedivy
    • Ponani GopalakrishnanDimitri KanevskyMichael Daniel MonkowskiJan Sedivy
    • G10L15/04
    • G10L15/197G06F17/27G10L15/183Y10S707/99942
    • A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model, and performing further decoding of the utterance based thereupon.
    • 公开了一种用于生成具有多个单词形式的语言词汇V的语音识别系统的语言组件词汇VC的方法。 该方法包括:基于各个词形式的出现频率将语言词汇V划分成单词形式的子集; 并且在至少一个子集中,分割具有小于阈值的频率的字形式,从而生成词形分量。 还公开了一种用于语音识别的方法,包括:将包含基本形式的声学词汇分解成基本形式组件并存储基本形式组件; 并且对基本形式组件执行声音拼写映射,以便生成用于语音后续解码中的字部分表的基本形式分量。 一种使用语言模型分量和声学分量对语音发音进行解码的方法,包括以下步骤:从发音中产生一叠基础分量路径; 当级联的基本形式组件对应于在声学词汇中发现的基础形式时,将路径中的基本形式组件连接以生成级联的基本形式; 将连接的基本形式映射为单词; 与使用语言模型的单词相关联的计算语言模型(LM)得分,并且基于此进行对话语的进一步解码。
    • 3. 发明申请
    • System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
    • 用于具有大词汇的自动语音识别的声学和语言建模的系统和方法
    • US20050143972A1
    • 2005-06-30
    • US11064643
    • 2005-02-24
    • Ponani GopalakrishnanDimitri KanevskyMichael MonkowskiJan Sedivy
    • Ponani GopalakrishnanDimitri KanevskyMichael MonkowskiJan Sedivy
    • G06F17/27G10L15/18G06F17/21
    • G10L15/197G06F17/27G10L15/183Y10S707/99942
    • A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model, and performing further decoding of the utterance based thereupon.
    • 公开了一种用于生成具有多个单词形式的语言词汇V的语音识别系统的语言组件词汇VC的方法。 该方法包括:基于各个词形式的出现频率将语言词汇V划分成单词形式的子集; 并且在至少一个子集中,分割具有小于阈值的频率的字形式,从而生成词形分量。 还公开了一种用于语音识别的方法,包括:将包含基本形式的声学词汇分解成基本形式组件并存储基本形式组件; 并且对基本形式组件执行声音拼写映射,以便生成用于语音后续解码中的字部分表的基本形式分量。 一种使用语言模型分量和声学分量对语音发音进行解码的方法,包括以下步骤:从发音中产生一叠基础分量路径; 当级联的基本形式组件对应于在声学词汇中发现的基础形式时,将路径中的基本形式组件连接以生成级联的基本形式; 将连接的基本形式映射为单词; 与使用语言模型的单词相关联的计算语言模型(LM)得分,并且基于此进行对话语的进一步解码。
    • 6. 发明申请
    • Smart book
    • 智能书
    • US20060091198A1
    • 2006-05-04
    • US11299917
    • 2005-12-12
    • Dimitri KanevskyMariusz SabathJan SedivyAlexander Zlatsin
    • Dimitri KanevskyMariusz SabathJan SedivyAlexander Zlatsin
    • G06F17/00
    • G06Q30/06G06F21/10Y10S707/99942
    • A method and system that permits the purchase of a license to make a limited number of copies of a book. At the time of purchase, the purchaser or user is given a key that contains the ability to obtain the limited number of copies on demand. The key contains a web address that can be used to obtain the authorized copies. In some embodiments, the key is a label in a machine readable form that is readable by a label reader, such as a bar code reader or a magnetic reader. In other embodiments, the key is merely a web address that the user may contact. At the point of sale, the key or record is formed, affixed to the book and also sent to copy tracker. The copy tracker then keeps track of the copies as made and processes each request to make a copy. If permitted, a database is enabled to send an electronic image of the requested copy to the user.
    • 允许购买许可证以制作有限数量的书籍的方法和系统。 在购买时,给予购买者或用户一个钥匙,其中包含根据需要获得有限数量的副本的能力。 密钥包含可用于获取授权副本的网址。 在一些实施例中,密钥是机器可读形式的标签,其可由诸如条形码读取器或磁性读取器的标签读取器读取。 在其他实施例中,密钥仅仅是用户可以联系的网址。 在销售点,密钥或记录形成,贴在本书上,并发送到复制跟踪器。 然后,复制跟踪器跟踪所作的副本,并处理每个请求以进行复制。 如果允许,则可以使用数据库将所请求的副本的电子图像发送给用户。
    • 7. 发明授权
    • Smart book
    • 智能书
    • US06974081B1
    • 2005-12-13
    • US09684207
    • 2000-10-06
    • Dimitri KanevskyMariusz SabathJan SedivyAlexander Zlatsin
    • Dimitri KanevskyMariusz SabathJan SedivyAlexander Zlatsin
    • G06F21/00G06K7/10G06Q30/00
    • G06Q30/06G06F21/10Y10S707/99942
    • A method and system that permits the purchase of a license to make a limited number of copies of a book. At the time of purchase, the purchaser or user is given a key that contains the ability to obtain the limited number of copies on demand. The key contains a web address that can be used to obtain the authorized copies. In some embodiments, the key is a label in a machine readable form that is readable by a label reader, such as a bar code reader or a magnetic reader. In other embodiments, the key is merely a web address that the user may contact. At the point of sale, the key or record is formed, affixed to the book and also sent to copy tracker. The copy tracker then keeps track of the copies as made and processes each request to make a copy. If permitted, a database is enabled to send an electronic image of the requested copy to the user.
    • 允许购买许可证以制作有限数量的书籍的方法和系统。 在购买时,给予购买者或用户一个钥匙,其中包含根据需要获得有限数量的副本的能力。 密钥包含可用于获取授权副本的网址。 在一些实施例中,密钥是机器可读形式的标签,其可由诸如条形码读取器或磁性读取器的标签读取器读取。 在其他实施例中,密钥仅仅是用户可以联系的网址。 在销售点,密钥或记录形成,贴在本书上,并发送到复制跟踪器。 然后,复制跟踪器跟踪所作的副本,并处理每个请求以进行复制。 如果允许,则可以使用数据库将所请求的副本的电子图像发送给用户。
    • 9. 发明授权
    • Speaker model adaptation via network of similar users
    • 通过类似用户的网络对扬声器模型进行适配
    • US06442519B1
    • 2002-08-27
    • US09437646
    • 1999-11-10
    • Dimitri KanevskyVit V. LibalJan SedivyWlodek W. Zadrozny
    • Dimitri KanevskyVit V. LibalJan SedivyWlodek W. Zadrozny
    • G10L1506
    • G10L15/07
    • A speech recognition system, method and program product for recognizing speech input from computer users connected together over a network of computers. Speech recognition computer users on the network are clustered into classes of similar users according their similarities, including characteristics nationality, profession, sex, age, etc. Each computer in the speech recognition network includes at least one user based acoustic model trained for a particular user. The acoustic models include an acoustic model domain, with similar acoustic models being clustered according to an identified domain. User characteristics are collected from databases over the network and from users using the speech recognition system and then, distributed over the network during or after user activities. Existing acoustic models are modified in response to user production activities. As recognition progresses, similar language models among similar users are identified on the network. Update information, including information about user activities and user acoustic model data, is transmitted over the network and identified similar language models are updated. Acoustic models improve for users that are connected over the network as similar users use their respective speech recognition system.
    • 一种用于识别通过计算机网络连接在一起的计算机用户的语音输入的语音识别系统,方法和程序产品。 网络上的语音识别计算机用户根据他们的相似性(包括特征国籍,专业,性别,年龄等)聚类成类似用户的类别。语音识别网络中的每个计算机包括针对特定用户训练的至少一个基于用户的声学模型 。 声学模型包括声学模型域,根据识别的域聚类相似的声学模型。 用户特征从网络上的数据库和使用语音识别系统的用户收集,然后在用户活动期间或之后通过网络分发。 响应于用户生产活动修改现有的声学模型。 随着识别的进行,在网络上识别出类似用户之间的类似语言模型。 通过网络传输关于用户活动和用户声学模型数据的信息的更新信息,并且识别出类似的语言模型被更新。 类似用户使用他们各自的语音识别系统,通过网络连接的用户的声学模型得到改善。
    • 10. 发明授权
    • Smart book
    • 智能书
    • US07156309B2
    • 2007-01-02
    • US11299917
    • 2005-12-12
    • Dimitri KanevskyMariusz SabathJan SedivyAlexander Zlatsin
    • Dimitri KanevskyMariusz SabathJan SedivyAlexander Zlatsin
    • G06K7/10
    • G06Q30/06G06F21/10Y10S707/99942
    • A method and system that permits the purchase of a license to make a limited number of copies of a book. At the time of purchase, the purchaser or user is given a key that contains the ability to obtain the limited number of copies on demand. The key contains a web address that can be used to obtain the authorized copies. In some embodiments, the key is a label in a machine readable form that is readable by a label reader, such as a bar code reader or a magnetic reader. In other embodiments, the key is merely a web address that the user may contact. At the point of sale, the key or record is formed, affixed to the book and also sent to copy tracker. The copy tracker then keeps track of the copies as made and processes each request to make a copy. If permitted, a database is enabled to send an electronic image of the requested copy to the user.
    • 允许购买许可证以制作有限数量的书籍的方法和系统。 在购买时,给予购买者或用户一个钥匙,其中包含根据需要获得有限数量的副本的能力。 密钥包含可用于获取授权副本的网址。 在一些实施例中,密钥是机器可读形式的标签,其可由诸如条形码读取器或磁性读取器的标签读取器读取。 在其他实施例中,密钥仅仅是用户可以联系的网址。 在销售点,密钥或记录形成,贴在本书上,并发送到复制跟踪器。 然后,复制跟踪器跟踪所作的副本,并处理每个请求以进行复制。 如果允许,则可以使用数据库将所请求的副本的电子图像发送给用户。