    • 23. Granted invention patent
    • Multiple audio file processing method and system
    • US08103511B2
    • 2012-01-24
    • US12127874
    • 2008-05-28
    • Sara H. Basson, Brian R. Heasman, Dimitri Kanevsky, Edward Emile Kelley
    • G10L21/00, G10L11/02
    • G10L21/02, G11B2020/10546
    • An audio file generation method and system. A computing system receives a first audio file comprising first speech data associated with a first party. The computing system receives a second audio file comprising second speech data associated with a second party. The first audio file differs from the second audio file. The computing system generates a third audio file from the second audio file. The third audio file differs from the second audio file. The process to generate the third audio file includes identifying a first set of attributes missing from the second audio file and adding the first set of attributes to the second audio file. The process to generate the third audio file additionally includes removing a second set of attributes from the second audio file. The third audio file includes third speech data associated with the second party. The computing system broadcasts the third audio file.
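The abstract of entry 23 describes deriving a third audio file from the second by adding missing attributes and removing unwanted ones. A minimal sketch of that add/remove step follows, assuming an audio file's attributes can be modeled as a plain dictionary. The function name `generate_third_file`, the `required` and `unwanted` parameters, and the attribute names are all hypothetical illustrations; the abstract does not say how attributes are represented or how the missing set is identified.

```python
def generate_third_file(second, required, unwanted):
    """Return a new attribute mapping derived from `second`.

    Assumption: attributes are key/value pairs. `required` holds the
    attributes that should be present; `unwanted` names the ones to drop.
    """
    third = dict(second)  # work on a copy so the second file is untouched
    # Identify the first set: attributes missing from the second file.
    missing = {k: v for k, v in required.items() if k not in third}
    third.update(missing)
    # Remove the second set of attributes.
    for name in unwanted:
        third.pop(name, None)
    return third


# Hypothetical attribute names, for illustration only.
second = {"speaker": "party_2", "background_noise": "high"}
third = generate_third_file(
    second,
    required={"volume": "normalized"},
    unwanted={"background_noise"},
)
```

The input mapping is copied rather than modified, which matches the abstract's statement that the third file differs from the second.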
    • 24. Invention patent application
    • REAL TIME BACKUP SYSTEM FOR COMPUTER USERS
    • US20110191293A1
    • 2011-08-04
    • US13085106
    • 2011-04-12
    • Dimitri Kanevsky, Alexander Zlatsin
    • G06F17/30
    • G06F11/1461, G06F11/1446, G06F11/1456, G06F11/1458, G06F11/1464, G06F11/1471, G06F11/3438, G06F17/30289, G06F2201/80, G06F2201/805, Y10S707/99955
    • This invention involves tracking and backing all the information that a user generates on its computer devices (including embedded devices) in real time. The local user server records all user actions and gestures (via various means that include TV cameras). All of this information (user actions and saved files in a computer) is then sent to a remote server via the Internet. This remote server has a virtual map of all the embedded devices on a computer that the person uses. The remote server immediately starts to interpret the user's actions (including user gestures). In one implementation, the invention stores user actions that are related to data generation (e.g. actions that called some links where data is stored, or executed some programs that generated data). In another variant the remote server generates and downloads the same files that are downloaded on the local user computer devices. For example, if a person begins to download a program, the server may also download the same program on a remote backup server. This way, if the user loses this program, it can be retrieved automatically through a provided server on the Internet. If user's files are backed up by regular backup periodically, relevant data that were stored by real time backup servers can be eliminated.
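The last sentence of the abstract of entry 24 says that data held by the real-time backup servers can be eliminated once a regular periodic backup covers it. The sketch below illustrates one way that pruning rule could look, assuming each real-time record carries a timestamp and that a regular backup covers everything recorded at or before its own time. The class `RealTimeBackup` and its methods are hypothetical names, not taken from the application.

```python
from dataclasses import dataclass, field


@dataclass
class RealTimeBackup:
    """Toy in-memory stand-in for the remote real-time backup store."""

    records: dict = field(default_factory=dict)  # path -> (timestamp, data)

    def record(self, path, timestamp, data):
        """Store the latest version of a file as the user generates it."""
        self.records[path] = (timestamp, data)

    def prune(self, regular_backup_time):
        """Drop records already covered by a regular backup.

        Assumption: a regular backup taken at `regular_backup_time`
        contains every file version recorded at or before that time.
        """
        covered = [p for p, (ts, _) in self.records.items()
                   if ts <= regular_backup_time]
        for path in covered:
            del self.records[path]
        return sorted(covered)


store = RealTimeBackup()
store.record("report.txt", timestamp=10, data=b"draft")
store.record("notes.txt", timestamp=20, data=b"todo")
removed = store.prune(regular_backup_time=15)
```

Only `report.txt` is pruned here, because `notes.txt` was recorded after the regular backup and still exists only in the real-time store.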
    • 27. Granted invention patent
    • System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies
    • US07801727B2
    • 2010-09-21
    • US11064643
    • 2005-02-24
    • Ponani Gopalakrishnan, Dimitri Kanevsky, Michael Daniel Monkowski, Jan Sedivy
    • G10L15/04
    • G10L15/197, G06F17/27, G10L15/183, Y10S707/99942
    • A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concatenated baseforms into words; computing language model (LM) scores associated with the words using a language model, and performing further decoding of the utterance based thereupon.
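The first method in the abstract of entry 27 partitions the vocabulary V by word-form frequency and splits word forms below a threshold into components. The sketch below shows that idea in miniature. The function name `build_component_vocabulary`, the `+` boundary marker, and especially the fixed-length suffix split are assumptions made for illustration; the abstract does not specify how a word form is divided.

```python
from collections import Counter


def build_component_vocabulary(word_counts, threshold, suffix_len=3):
    """Partition word forms by frequency and split the rare ones.

    Word forms with count >= threshold are kept whole. Word forms below
    the threshold are split into a stem and an ending. The fixed-length
    suffix split is a placeholder assumption, not the patent's rule.
    """
    components = set()
    for word, count in word_counts.items():
        if count >= threshold or len(word) <= suffix_len:
            components.add(word)  # frequent (or too short to split)
        else:
            components.add(word[:-suffix_len] + "+")  # stem component
            components.add("+" + word[-suffix_len:])  # ending component
    return components


# Hypothetical counts, for illustration only.
counts = Counter({"speech": 120, "recognition": 95,
                  "recognizing": 2, "vocalizing": 1})
vocab = build_component_vocabulary(counts, threshold=5)
```

The two rare word forms share the ending component `+ing`, so the component vocabulary grows more slowly than the number of rare word forms, which is the motivation for splitting in large-vocabulary recognition.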