    • 53. Invention application
    • Method and apparatus for improved speech recognition with supplementary information
    • Publication number: US20050049860A1
    • Publication date: 2005-03-03
    • Application number: US10652146
    • Filing date: 2003-08-29
    • Inventors: Jean-Claude Junqua, Roland Kuhn, Matteo Contolini, Rathinavelu Chengalvarayan
    • IPC: G10L15/08; G10L15/10; G10L15/22; H04M1/27; G10L15/00
    • CPC: H04M1/271; G10L15/08; G10L15/10; G10L15/22
    • A method for improving recognition results of a speech recognizer uses supplementary information to confirm recognition results. A user inputs speech to a speech recognizer. The speech recognizer resides on a mobile device or on a server at a remote location. The speech recognizer determines a recognition result based on the input speech. A confidence measure is calculated for the recognition result. If the confidence measure is below a threshold, the user is prompted for supplementary data. The supplementary data is determined dynamically based on ambiguities between the input speech and the recognition result, wherein the supplementary data will distinguish the input speech over potential incorrect results. The supplementary data may be a subset of alphanumeric characters that comprise the input speech, or other data associated with a desired result, such as an area code or location. The user may provide the supplementary data verbally, or manually using a keypad, touchpad, touchscreen, or stylus pen.
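The control flow in this abstract (score the recognition result, compare the confidence measure against a threshold, and only then prompt for supplementary data chosen to separate the closest competing hypotheses) can be pictured with a short sketch. The Python below is a minimal illustration, not the patent's implementation; the toy recognizer output, threshold value, and helper names are assumptions.

```python
# Minimal sketch of confidence-gated confirmation with a dynamically chosen
# supplementary prompt. All data, names, and the threshold are illustrative.
from dataclasses import dataclass

@dataclass
class Hypothesis:
    text: str
    confidence: float  # 0.0 .. 1.0

CONFIDENCE_THRESHOLD = 0.80  # assumed value, not from the patent

def distinguishing_positions(best: str, rival: str) -> list[int]:
    """Character positions where the two closest hypotheses disagree."""
    return [i for i, (a, b) in enumerate(zip(best, rival)) if a != b]

def recognize_with_supplement(hypotheses: list[Hypothesis], ask_user) -> str:
    """Return the best hypothesis, asking for supplementary data only when
    the confidence measure falls below the threshold."""
    ranked = sorted(hypotheses, key=lambda h: h.confidence, reverse=True)
    best = ranked[0]
    if best.confidence >= CONFIDENCE_THRESHOLD or len(ranked) == 1:
        return best.text

    # Confidence too low: build a prompt from the positions that separate
    # the top hypothesis from its closest competitor.
    rival = ranked[1]
    positions = distinguishing_positions(best.text, rival.text)
    supplied = ask_user(f"Please spell the character(s) at position(s) {positions}.")

    # Keep whichever candidate agrees with the supplementary characters.
    for cand in (best, rival):
        if all(i < len(cand.text) and cand.text[i] == ch
               for i, ch in zip(positions, supplied)):
            return cand.text
    return best.text

if __name__ == "__main__":
    hyps = [Hypothesis("JUNQUA", 0.55), Hypothesis("JUNQUE", 0.52)]
    # Simulate the user answering the disambiguating prompt with "A".
    print(recognize_with_supplement(hyps, lambda q: "A"))
```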
    • 54. Invention application
    • Speech data mining for call center management
    • Publication number: US20050010411A1
    • Publication date: 2005-01-13
    • Application number: US10616006
    • Filing date: 2003-07-09
    • Inventors: Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua, Robert Boman
    • IPC: G10L15/26; G10L17/00; G10L15/00
    • CPC: G10L15/26; G10L17/00
    • A speech data mining system for use in generating a rich transcription having utility in call center management includes a speech differentiation module differentiating between speech of interacting speakers, and a speech recognition module improving automatic recognition of speech of one speaker based on interaction with another speaker employed as a reference speaker. A transcript generation module generates a rich transcript based on recognized speech of the speakers. Focused, interactive language models improve recognition of a customer on a low quality channel using context extracted from speech of a call center operator on a high quality channel with a speech model adapted to the operator. Mined speech data includes number of interaction turns, customer frustration phrases, operator politeness, interruptions, and/or contexts extracted from speech recognition results, such as topics, complaints, solutions, and resolutions. Mined speech data is useful in call center and/or product or service quality management.
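As a rough illustration of the mining step described above, the sketch below derives a few of the named quality metrics (interaction turns, customer frustration phrases, interruptions) from an already-separated, already-recognized two-speaker transcript. The data structures, phrase list, and overlap-based interruption test are assumptions for illustration, not the patent's method.

```python
# Toy mining pass over a rich two-speaker transcript; all inputs are synthetic.
from dataclasses import dataclass

@dataclass
class Turn:
    speaker: str      # "operator" or "customer"
    start: float      # seconds
    end: float
    text: str

FRUSTRATION_PHRASES = ("this is ridiculous", "i already told you", "cancel my account")

def mine_call(turns: list[Turn]) -> dict:
    """Derive simple call-center quality metrics from a rich transcript."""
    turns = sorted(turns, key=lambda t: t.start)
    frustration = sum(
        any(p in t.text.lower() for p in FRUSTRATION_PHRASES)
        for t in turns if t.speaker == "customer")
    # Count a turn as an interruption when it starts before the previous
    # speaker's turn has ended.
    interruptions = sum(
        1 for prev, cur in zip(turns, turns[1:])
        if cur.speaker != prev.speaker and cur.start < prev.end)
    return {
        "interaction_turns": len(turns),
        "customer_frustration_phrases": frustration,
        "interruptions": interruptions,
    }

if __name__ == "__main__":
    call = [
        Turn("operator", 0.0, 4.0, "thank you for calling, how can I help"),
        Turn("customer", 3.5, 9.0, "I already told you my order is missing"),
        Turn("operator", 9.2, 12.0, "let me look that up for you"),
    ]
    print(mine_call(call))
```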
    • 57. Granted invention patent
    • Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
    • Publication number: US06571208B1
    • Grant date: 2003-05-27
    • Application number: US09450392
    • Filing date: 1999-11-29
    • Inventors: Roland Kuhn, Jean-Claude Junqua, Matteo Contolini
    • IPC: G01L17/00
    • CPC: G10L15/07
    • A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. The eigenvoice technique is also used during run time upon the speech of a new speaker. The technique removes individual speaker idiosyncrasies, to produce more universally applicable and robust allophone models. In one embodiment the eigenvoice technique is used to identify the centroid of each speaker, which may then be “subtracted out” of the recognition equation. In another embodiment maximum likelihood estimation techniques are used to develop common decision tree frameworks that may be shared across all speakers when constructing the eigenvoice representation of speaker space.
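The "subtract out the speaker centroid" idea in this abstract can be shown numerically. The sketch below is a toy construction, assuming each training speaker is summarized by a supervector of acoustic-model means and that the eigenvoice basis is obtained with plain PCA via SVD; dimensions and data are synthetic, not from the patent.

```python
# Toy eigenvoice centroid removal: estimate a low-dimensional speaker subspace
# and subtract each speaker's projection so the residual is more
# speaker-independent. Sizes and data are illustrative only.
import numpy as np

rng = np.random.default_rng(0)
n_speakers, dim, n_eigenvoices = 20, 12, 3

# Synthetic supervectors: shared allophone structure + speaker offsets + noise.
speaker_offsets = rng.normal(size=(n_speakers, n_eigenvoices)) @ rng.normal(size=(n_eigenvoices, dim))
allophone_structure = rng.normal(size=(1, dim))
supervectors = allophone_structure + speaker_offsets + 0.05 * rng.normal(size=(n_speakers, dim))

# Eigenvoice basis = top principal components of the mean-centred supervectors.
mean = supervectors.mean(axis=0)
centred = supervectors - mean
_, _, vt = np.linalg.svd(centred, full_matrices=False)
eigenvoices = vt[:n_eigenvoices]               # (n_eigenvoices, dim)

# Each speaker's centroid is the reconstruction from the eigenvoice subspace;
# subtracting it leaves a residual with far less speaker-specific variation.
weights = centred @ eigenvoices.T              # per-speaker eigenvoice weights
centroids = mean + weights @ eigenvoices
residuals = supervectors - centroids

print("variance before:", centred.var(), "after:", residuals.var())
```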
    • 58. Granted invention patent
    • Automatic search of audio channels by matching viewer-spoken words against closed-caption/audio content for interactive television
    • Publication number: US06480819B1
    • Grant date: 2002-11-12
    • Application number: US09258115
    • Filing date: 1999-02-25
    • Inventors: Robert Boman, Jean-Claude Junqua
    • IPC: G06F17/27
    • CPC: G10L15/26; G10L15/1815
    • A method and apparatus is provided to enable a user watching and/or listening to a program to search for new information in a stream of telecommunications data. The apparatus includes a voice recognition system that recognizes the user's request and causes a search to be performed in the data stream of at least one other telecommunication channel. The system includes a storage device for storing and processing the request. Upon recognition of the request, the incoming signal or signals are scanned for matches with the request. Upon finding a match between the request and the incoming signal, information related to the data is brought to the viewer's attention. This can be accomplished either by changing the viewer's station or by bringing a split-screen display forward into the display.
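A compact way to picture the matching step in this abstract is a keyword watcher over incoming caption text. The sketch below assumes the viewer's request has already been recognized into keywords and that captions arrive per channel as text events; the class name, sliding-window size, and match rule are illustrative assumptions, not the patent's design.

```python
# Toy closed-caption watcher: store a spoken request as keywords and scan
# caption text arriving on other channels for a match.
from collections import deque

class CaptionWatcher:
    """Report when all keywords of the stored request appear in the recent
    caption text of a channel, so the viewer can be alerted (for example by
    switching channels or opening a split-screen window)."""

    def __init__(self, keywords, window=50):
        self.keywords = [k.lower() for k in keywords]
        self.window = window                 # recent words kept per channel
        self.buffers = {}                    # channel -> deque of words

    def feed(self, channel: str, caption: str):
        buf = self.buffers.setdefault(channel, deque(maxlen=self.window))
        buf.extend(caption.lower().split())
        if all(k in buf for k in self.keywords):
            return f"match on {channel}: request {self.keywords} found"
        return None

if __name__ == "__main__":
    watcher = CaptionWatcher(["weather", "forecast"])   # stored spoken request
    for chan, text in [("channel-7", "tonight's weather will be cold"),
                       ("channel-7", "the full forecast is coming up next")]:
        hit = watcher.feed(chan, text)
        if hit:
            print(hit)
```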
    • 59. Granted invention patent
    • System for identifying and adapting a TV-user profile by means of speech technology
    • Publication number: US06415257B1
    • Grant date: 2002-07-02
    • Application number: US09383797
    • Filing date: 1999-08-26
    • Inventors: Jean-Claude Junqua, Roland Kuhn, Tony Davis, Yi Zhao, Weiying Li
    • IPC: G10L15/22
    • CPC: H04N21/4532; G10L17/00; H04N5/44543; H04N21/42203; H04N21/4394; H04N21/4415; H04N21/4667; H04N21/482
    • Speech input supplied by the user is evaluated by the speaker verification/identification module, and based on the evaluation, parameters are retrieved from a user profile database. These parameters adapt the speech models of the speech recognizer and also supply the natural language parser with customized dialog grammars. The user's speech is then interpreted by the speech recognizer and natural language parser to determine the meaning of the user's spoken input in order to control the television tuner. The parser works in conjunction with a command module that mediates the dialog with the user, providing on-screen prompts or synthesized speech queries to elicit further input from the user when needed. The system integrates with an electronic program guide, so that the natural language parser is made aware of what programs are available when conducting the synthetic dialog with the user.
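The retrieval-and-adaptation flow in this abstract (identify the speaker, pull adaptation parameters and a personal dialog grammar from the profile database, and narrow the dialog using the electronic program guide) is sketched below. The profile fields, verification scores, and EPG entries are invented stand-ins for illustration, not the patent's data model.

```python
# Toy profile lookup and adaptation: pick the best-scoring enrolled speaker,
# retrieve that user's adaptation parameters and grammar, and use the EPG to
# narrow suggestions. Everything here is an illustrative assumption.
from dataclasses import dataclass, field

@dataclass
class UserProfile:
    name: str
    acoustic_adaptation: dict            # parameters applied to the recognizer
    dialog_grammar: list[str]            # customized parser rules / phrases
    favorite_genres: list[str] = field(default_factory=list)

PROFILES = {
    "alice": UserProfile("alice", {"vtln_warp": 1.05},
                         ["tune to <channel>", "record <program>"], ["news"]),
    "bob": UserProfile("bob", {"vtln_warp": 0.97},
                       ["switch to <channel>"], ["sports"]),
}

EPG = [("channel-2", "Evening News", "news"), ("channel-5", "Cup Final", "sports")]

def identify_speaker(scores: dict) -> str:
    """Pick the enrolled user with the highest verification score."""
    return max(scores, key=scores.get)

def adapt_and_suggest(scores: dict):
    user = PROFILES[identify_speaker(scores)]
    # A real recognizer and parser would consume these parameters; here we
    # only show what is retrieved and how the EPG narrows the dialog.
    suggestions = [title for _, title, genre in EPG
                   if genre in user.favorite_genres]
    return user.acoustic_adaptation, user.dialog_grammar, suggestions

if __name__ == "__main__":
    print(adapt_and_suggest({"alice": 0.91, "bob": 0.42}))
```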