Quick Patent Search - Fast search of global patents, free commercial patent database - IPRDB

1. Granted invention patent

US06928404B1 System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies (In force)
Publication No.: US06928404B1
Publication date: 2005-08-09
Application No.: US09271469
Filing date: 1999-03-17
Applicants: Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
Inventors: Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
IPC classification: G06F17/27, G10L15/18, G06F17/21, G06F17/28
CPC classification: G10L15/197, G06F17/27, G10L15/183, Y10S707/99942
Abstract: Systems and methods are provided for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms. One method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components; and generating a language component vocabulary VC including word forms and word form components. The resulting language component vocabulary is used to generate a language model that can be efficiently implemented for real-time automatic speech recognition applications for languages with large vocabularies.
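The partition-and-split step described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the two-way partition (frequent vs. rare), the function names, the fixed-length stem splitter, and the word counts are all hypothetical and are not taken from the patent.

```python
from collections import Counter


def split_stem_ending(word, stem_len=4):
    """Hypothetical splitter: a fixed-length stem plus a marked ending."""
    if len(word) <= stem_len:
        return [word]
    return [word[:stem_len], "+" + word[stem_len:]]


def build_component_vocabulary(counts, threshold, split):
    """Keep frequent word forms whole; replace rare ones by their components."""
    # Partition the vocabulary into two subsets by frequency of occurrence.
    frequent = {w for w, c in counts.items() if c >= threshold}
    rare = {w for w, c in counts.items() if c < threshold}
    vocabulary = set(frequent)
    # Split only the word forms in the low-frequency subset.
    for word in rare:
        vocabulary.update(split(word))
    return vocabulary


# Made-up counts, for illustration only.
counts = Counter({"the": 120, "speech": 40, "recognizing": 2, "recognized": 1})
vc = build_component_vocabulary(counts, threshold=5, split=split_stem_ending)
```

Here the two rare forms share the stem "reco", so the component vocabulary holds five entries instead of growing by one entry per inflected form.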

2. Granted invention patent

US07801727B2 System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies (In force)
Publication No.: US07801727B2
Publication date: 2010-09-21
Application No.: US11064643
Filing date: 2005-02-24
Applicants: Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
Inventors: Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
IPC classification: G10L15/04
CPC classification: G10L15/197, G06F17/27, G10L15/183, Y10S707/99942
Abstract: A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and, in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and performing sound-to-spelling mapping on the baseform components so as to generate a baseform-components-to-word-parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model; and performing further decoding of the utterance based thereupon.
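The decoding steps in this abstract (concatenate baseform components, map to words, compute LM scores) can be sketched roughly as below. Everything concrete here is an assumption made for illustration: baseforms are modeled as tuples of phone symbols, the longest-match strategy, the bigram table, the floor value, and all names are hypothetical and not taken from the patent.

```python
def concatenate_path(path, baseform_to_word):
    """Join adjacent baseform components when their concatenation is a
    baseform in the acoustic vocabulary, then map each baseform to a word.
    Returns None if some part of the path cannot be mapped."""
    words = []
    i = 0
    while i < len(path):
        match = None
        for j in range(len(path), i, -1):  # try the longest span first
            candidate = sum(path[i:j], ())  # concatenate phone tuples
            if candidate in baseform_to_word:
                match = (j, baseform_to_word[candidate])
                break
        if match is None:
            return None
        i = match[0]
        words.append(match[1])
    return words


def lm_score(words, bigram_logprob, floor=-10.0):
    """Sum bigram log-probabilities; unseen bigrams get a floor value."""
    score = 0.0
    previous = "<s>"
    for word in words:
        score += bigram_logprob.get((previous, word), floor)
        previous = word
    return score


# Toy acoustic vocabulary and language model (made-up values).
baseform_to_word = {
    ("R", "EH", "K", "AH", "G", "N", "AY", "Z"): "recognize",
    ("S", "P", "IY", "CH"): "speech",
}
bigram_logprob = {("<s>", "recognize"): -1.0, ("recognize", "speech"): -0.5}

path = [("R", "EH", "K"), ("AH", "G", "N", "AY", "Z"), ("S", "P", "IY", "CH")]
words = concatenate_path(path, baseform_to_word)
score = lm_score(words, bigram_logprob)
```

In a full decoder each path on the stack would be scored this way and the LM score combined with acoustic scores; the sketch scores a single path only.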

3. Invention patent application

US20050143972A1 System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies (In force)
Publication No.: US20050143972A1
Publication date: 2005-06-30
Application No.: US11064643
Filing date: 2005-02-24
Applicants: Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Monkowski, Jan Sedivy
Inventors: Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Monkowski, Jan Sedivy
IPC classification: G06F17/27, G10L15/18, G06F17/21
CPC classification: G10L15/197, G06F17/27, G10L15/183, Y10S707/99942
Abstract: A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and, in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and performing sound-to-spelling mapping on the baseform components so as to generate a baseform-components-to-word-parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model; and performing further decoding of the utterance based thereupon.

4. Granted invention patent

US6073091A Apparatus and method for forming a filtered inflected language model for automatic speech recognition (Lapsed)
Publication No.: US6073091A
Publication date: 2000-06-06
Application No.: US906812
Filing date: 1997-08-06
Applicants: Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
Inventors: Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
IPC classification: G10L15/18, G06F17/28, G10L5/06, G10L9/00
CPC classification: G10L15/197
Abstract: A method of forming a language model for a language having a selected vocabulary of word forms comprises: (a) mapping the word forms into integer vectors in accordance with frequencies of word form occurrence; (b) partitioning the integer vectors into subsets, the subsets respectively having ranges of frequencies of word form occurrence associated therewith, the subsets being arranged in a descending order of frequency ranges; (c) respectively assigning maps to the subsets; (d) filtering a textual corpus using the maps assigned to the subsets in order to generate indexed integers; (e) determining n-gram statistics for the indexed integers; and (f) estimating n-gram language model probabilities from the n-gram statistics to form the language model.
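Steps (a) through (f) can be approximated in a few lines. The sketch below is a simplification under stated assumptions: word forms are mapped to single integers by frequency rank rather than to integer vectors, there are only two subsets (the top-ranked forms keep their own index, all others share one index), and probabilities are plain maximum-likelihood bigram estimates. None of these choices or names come from the patent.

```python
from collections import Counter


def index_by_frequency(tokens):
    """(a) Map each word form to an integer: 1 = most frequent."""
    counts = Counter(tokens)
    ranked = sorted(counts, key=lambda w: (-counts[w], w))
    return {w: i for i, w in enumerate(ranked, start=1)}


def filter_corpus(tokens, index, keep_top, shared_id=0):
    """(b)-(d) Frequent forms keep their index; the rest share one index."""
    return [index[w] if index[w] <= keep_top else shared_id for w in tokens]


def bigram_probabilities(ids):
    """(e)-(f) Count bigrams and estimate P(next | previous) by relative frequency."""
    pair_counts = Counter(zip(ids, ids[1:]))
    context_counts = Counter(ids[:-1])
    return {pair: c / context_counts[pair[0]] for pair, c in pair_counts.items()}


tokens = "the cat saw the dog the cat".split()
index = index_by_frequency(tokens)
ids = filter_corpus(tokens, index, keep_top=2)
probs = bigram_probabilities(ids)
```

Collapsing the low-frequency subset onto a shared index keeps the n-gram table small, which is the practical point of filtering the corpus before counting.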

5. Granted invention patent

US06584425B2 Smart thermometer (Lapsed)
Publication No.: US06584425B2
Publication date: 2003-06-24
Application No.: US09748830
Filing date: 2000-12-27
Applicants: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
Inventors: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
IPC classification: G01K17/00
CPC classification: G01K1/02
Abstract: A smart thermometer distributed system comprises a thermometer with a screen that allows one to enter a variety of data for the thermometer. The thermometer is connected to a computer and to a network and can retrieve from a history database information about family members, including their dress, names, ages, previous illnesses and other information. The smart thermometer system thus provides the user with specific weather information and enables him or her to choose appropriate dress for family members given their ages, previous illnesses and other information.

6. Invention patent application

US20060091198A1 Smart book (In force)
Publication No.: US20060091198A1
Publication date: 2006-05-04
Application No.: US11299917
Filing date: 2005-12-12
Applicants: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
Inventors: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
IPC classification: G06F17/00
CPC classification: G06Q30/06, G06F21/10, Y10S707/99942
Abstract: A method and system that permits the purchase of a license to make a limited number of copies of a book. At the time of purchase, the purchaser or user is given a key that contains the ability to obtain the limited number of copies on demand. The key contains a web address that can be used to obtain the authorized copies. In some embodiments, the key is a label in a machine-readable form that is readable by a label reader, such as a bar code reader or a magnetic reader. In other embodiments, the key is merely a web address that the user may contact. At the point of sale, the key or record is formed, affixed to the book and also sent to a copy tracker. The copy tracker then keeps track of the copies as they are made and processes each request to make a copy. If permitted, a database is enabled to send an electronic image of the requested copy to the user.
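The copy tracker's bookkeeping might look like the minimal sketch below. The class name, method names, and the example key are hypothetical; the abstract does not specify how keys are stored or how a permitted copy is delivered, so the sketch only counts remaining copies per key.

```python
class CopyTracker:
    """Toy copy tracker: counts the copies still allowed for each key."""

    def __init__(self):
        self._remaining = {}  # key -> number of copies still permitted

    def register(self, key, allowed_copies):
        """Record a key at the point of sale with its licensed copy count."""
        self._remaining[key] = allowed_copies

    def request_copy(self, key):
        """Return True and use up one copy if the key still permits it."""
        remaining = self._remaining.get(key, 0)
        if remaining <= 0:
            return False  # unknown key or license exhausted
        self._remaining[key] = remaining - 1
        return True


tracker = CopyTracker()
tracker.register("book-key-001", allowed_copies=2)  # made-up key
results = [tracker.request_copy("book-key-001") for _ in range(3)]
unknown = tracker.request_copy("no-such-key")
```

The third request is refused because the license covered only two copies, and a key that was never registered is refused outright.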

7. Granted invention patent

US06974081B1 Smart book (Lapsed)
Publication No.: US06974081B1
Publication date: 2005-12-13
Application No.: US09684207
Filing date: 2000-10-06
Applicants: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
Inventors: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
IPC classification: G06F21/00, G06K7/10, G06Q30/00
CPC classification: G06Q30/06, G06F21/10, Y10S707/99942
Abstract: A method and system that permits the purchase of a license to make a limited number of copies of a book. At the time of purchase, the purchaser or user is given a key that contains the ability to obtain the limited number of copies on demand. The key contains a web address that can be used to obtain the authorized copies. In some embodiments, the key is a label in a machine-readable form that is readable by a label reader, such as a bar code reader or a magnetic reader. In other embodiments, the key is merely a web address that the user may contact. At the point of sale, the key or record is formed, affixed to the book and also sent to a copy tracker. The copy tracker then keeps track of the copies as they are made and processes each request to make a copy. If permitted, a database is enabled to send an electronic image of the requested copy to the user.

8. Granted invention patent

US07233809B2 Efficient communication with passive devices (In force)
Publication No.: US07233809B2
Publication date: 2007-06-19
Application No.: US10998466
Filing date: 2004-11-29
Applicants: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
Inventors: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
IPC classification: H04B1/38
CPC classification: H04W4/00, H04W84/022
Abstract: A system and method that provides data messages to a passive device. A passive device, for example a watch, is registered together with the telephone number of a cellular telephone of a subscriber to the data message service. Since the cellular telephone periodically transmits a beacon signal, the wireless network knows its cell location. Accordingly, the system determines the cell location of the cellular telephone and establishes a communication of the subscribed data to the watch via the local cellular provider for the same cell location as that of the subscriber's cellular telephone.
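The routing idea in this abstract (find the subscriber's phone, look up its cell, send the data to the passive device in that cell) can be sketched as follows. The class, its methods, the identifiers, and the returned dictionary are all hypothetical stand-ins; real registration and cell-location lookups would go through the network operator.

```python
class PassiveDeviceRouter:
    """Toy registry linking a passive device to a subscriber's phone."""

    def __init__(self):
        self._phone_for_device = {}  # device id -> subscriber phone number
        self._cell_for_phone = {}    # phone number -> last known cell id

    def register(self, device_id, phone_number):
        """Register the passive device together with the phone number."""
        self._phone_for_device[device_id] = phone_number

    def on_beacon(self, phone_number, cell_id):
        """Record the cell reported by the phone's periodic beacon."""
        self._cell_for_phone[phone_number] = cell_id

    def route(self, device_id, message):
        """Address the message to the device's cell, or None if unknown."""
        phone = self._phone_for_device.get(device_id)
        cell = self._cell_for_phone.get(phone)
        if cell is None:
            return None
        return {"cell": cell, "device": device_id, "payload": message}


router = PassiveDeviceRouter()
router.register("watch-1", "+1-555-0100")  # made-up identifiers
before_beacon = router.route("watch-1", "Rain at 5 pm")
router.on_beacon("+1-555-0100", "cell-42")
delivery = router.route("watch-1", "Rain at 5 pm")
```

Until the phone has reported a beacon there is no known cell, so the first routing attempt yields nothing; after the beacon the message is addressed to the phone's cell.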

9. Granted invention patent

US06442519B1 Speaker model adaptation via network of similar users (In force)
Publication No.: US06442519B1
Publication date: 2002-08-27
Application No.: US09437646
Filing date: 1999-11-10
Applicants: Dimitri Kanevsky, Vit V. Libal, Jan Sedivy, Wlodek W. Zadrozny
Inventors: Dimitri Kanevsky, Vit V. Libal, Jan Sedivy, Wlodek W. Zadrozny
IPC classification: G10L15/06
CPC classification: G10L15/07
Abstract: A speech recognition system, method and program product for recognizing speech input from computer users connected together over a network of computers. Speech recognition computer users on the network are clustered into classes of similar users according to their similarities, including characteristics such as nationality, profession, sex, age, etc. Each computer in the speech recognition network includes at least one user-based acoustic model trained for a particular user. The acoustic models include an acoustic model domain, with similar acoustic models being clustered according to an identified domain. User characteristics are collected from databases over the network and from users using the speech recognition system, and are then distributed over the network during or after user activities. Existing acoustic models are modified in response to user production activities. As recognition progresses, similar language models among similar users are identified on the network. Update information, including information about user activities and user acoustic model data, is transmitted over the network, and identified similar language models are updated. Acoustic models improve for users that are connected over the network as similar users use their respective speech recognition systems.
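Grouping users into classes of similar users, and finding who should receive a model update, can be sketched as below. This is a deliberate simplification: users count as similar only when the chosen characteristics match exactly, whereas the abstract speaks of similarity more generally. The profile fields, names, and functions are hypothetical.

```python
from collections import defaultdict


def cluster_users(profiles, keys):
    """Group users whose selected characteristics match exactly."""
    classes = defaultdict(list)
    for user in sorted(profiles):
        signature = tuple(profiles[user].get(k) for k in keys)
        classes[signature].append(user)
    return dict(classes)


def similar_users(user, profiles, keys):
    """Users in the same class, i.e. candidates to receive model updates."""
    signature = tuple(profiles[user].get(k) for k in keys)
    return [u for u in cluster_users(profiles, keys)[signature] if u != user]


# Made-up user profiles.
profiles = {
    "alice": {"nationality": "CZ", "profession": "physician"},
    "bob": {"nationality": "CZ", "profession": "physician"},
    "carol": {"nationality": "US", "profession": "lawyer"},
}
keys = ("nationality", "profession")
classes = cluster_users(profiles, keys)
recipients = similar_users("alice", profiles, keys)
```

When one user's acoustic model is adapted, the update information would be sent to the other members of that user's class.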

10. Granted invention patent

US07156309B2 Smart book (In force)
Publication No.: US07156309B2
Publication date: 2007-01-02
Application No.: US11299917
Filing date: 2005-12-12
Applicants: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
Inventors: Dimitri Kanevsky, Mariusz Sabath, Jan Sedivy, Alexander Zlatsin
IPC classification: G06K7/10
CPC classification: G06Q30/06, G06F21/10, Y10S707/99942
Abstract: A method and system that permits the purchase of a license to make a limited number of copies of a book. At the time of purchase, the purchaser or user is given a key that contains the ability to obtain the limited number of copies on demand. The key contains a web address that can be used to obtain the authorized copies. In some embodiments, the key is a label in a machine-readable form that is readable by a label reader, such as a bar code reader or a magnetic reader. In other embodiments, the key is merely a web address that the user may contact. At the point of sale, the key or record is formed, affixed to the book and also sent to a copy tracker. The copy tracker then keeps track of the copies as they are made and processes each request to make a copy. If permitted, a database is enabled to send an electronic image of the requested copy to the user.
