会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • TRANSCRIPT ALIGNMENT
    • 代码对齐
    • US20100332225A1
    • 2010-12-30
    • US12493786
    • 2009-06-29
    • Jon A. ArrowoodKenneth King GriggsMarsal GavaldaRobert W. Morris
    • Jon A. ArrowoodKenneth King GriggsMarsal GavaldaRobert W. Morris
    • G10L15/26G06F17/30
    • G10L15/26
    • Some general aspects relate to systems and methods for media processing. One aspect, for example, relates to a method for aligning multimedia recording with a transcript. A group of search terms are formed from the transcript, with each search term being associated with a location within the transcript. Putative locations of the search terms are determined in a time interval of the multimedia recording. For each search term, zero or more putative locations are determined and, for at least some of the search terms, multiple putative locations are determined in the time interval of the multimedia recording. According to a first sequencing constraint, a first representation of a group of sequences each of a subset of the putative locations of the search terms is formed. A second representation of a group of sequences each of a subset of the search terms is formed. Using the first and the second representations, the time interval of the multimedia recording is partially aligned with the transcript.
    • 一些一般方面涉及用于媒体处理的系统和方法。 一方面,例如涉及用于将多媒体记录与抄本对齐的方法。 一组搜索词由抄本形成,每个搜索词与抄本中的一个位置相关联。 在多媒体记录的时间间隔内确定搜索项的推定位置。 对于每个搜索项,确定零个或多个推定位置,并且对于至少一些搜索项,在多媒体记录的时间间隔中确定多个推定位置。 根据第一排序约束,形成搜索项的推定位置的子集中的每一个序列组的第一表示。 形成搜索项的子集中的每一个的一组序列的第二表示。 使用第一和第二表示,多媒体记录的时间间隔与抄本部分对齐。
    • 7. 发明申请
    • TREND DISCOVERY IN AUDIO SIGNALS
    • 趋势发现在音频信号
    • US20110044447A1
    • 2011-02-24
    • US12545282
    • 2009-08-21
    • Robert W. MorrisMarsal GavaldaPeter S. CardilloJon A. Arrowood
    • Robert W. MorrisMarsal GavaldaPeter S. CardilloJon A. Arrowood
    • H04M3/00G10L15/00G06T11/20
    • H04M3/51G06T11/206G10L2015/088H04M2201/38H04M2203/357
    • Techniques for processing data representative of text associated with one or more content sources to generate a specification of a set of keyphrases of interest; processing a first set of audio signals collected during a first time period to generate first data characterizing putative occurrences of one or more keyphrases of the set in the first set of audio signals; evaluating the first data to generate keyphrase-specific comparison values for the first set of audio signals; deriving first trending data between the first set of audio signals and a second set of audio signals based in part on an analysis of the keyphrase-specific comparison values for the first set of audio signals relative to stored keyphrase-specific baseline values; and generating a visual representation of at least some of the first trending data and causing the visual representation of the first trending data to be presented on a display terminal.
    • 用于处理表示与一个或多个内容源相关联的文本的数据的技术,以生成一组感兴趣的关键短语的说明; 处理在第一时间段期间收集的第一组音频信号,以产生第一数据,以表征所述第一组音频信号中所述的一个或多个关键短语的推定出现; 评估所述第一数据以产生所述第一组音频信号的关键短语特定比较值; 部分地基于相对于存储的关键短语特定基线值对第一组音频信号的关键词特定比较值的分析,得出第一组音频信号和第二组音频信号之间的第一趋势数据; 以及生成第一趋势数据中的至少一些的视觉表示,并使第一趋势数据的视觉表示呈现在显示终端上。
    • 8. 发明授权
    • Speaker adaptation
    • 演讲者适应
    • US09001976B2
    • 2015-04-07
    • US13463104
    • 2012-05-03
    • Jon A. ArrowoodRobert W. MorrisMarsal Gavalda
    • Jon A. ArrowoodRobert W. MorrisMarsal Gavalda
    • H04M1/64G10L15/07G10L15/08H04M3/51
    • G10L15/07G10L2015/088H04M3/51H04M2201/40Y10S379/907
    • A method for speaker adaptation includes receiving a plurality of media files, each associated with a call center agent of a plurality of call center agents and receiving a plurality of terms. Speech processing is performed on at least some of the media files to identify putative instances of at least some of the plurality of terms. Each putative instance is associated with a hit quality that characterizes a quality of recognition of the corresponding term. One or more call center agents for performing speaker adaptation are determined, including identifying call center agents that are associated with at least one media file that includes one or more putative instances with a hit quality below a predetermined threshold. Speaker adaptation is performed for each identified call center agent based on the media files associated with the identified call center agent and the identified instances of the plurality of terms.
    • 用于说话者适应的方法包括接收多个媒体文件,每个媒体文件与多个呼叫中心代理的呼叫中心代理相关联并且接收多个条目。 对至少一些媒体文件执行语音处理,以识别多个术语中的至少一些术语的推定实例。 每个推定的实例与表征相应术语的识别质量的命中质量相关联。 确定用于执行说话者适应的一个或多个呼叫中心代理,包括识别与包括具有低于预定阈值的命中质量的一个或多个推定实例的至少一个媒体文件相关联的呼叫中心代理。 基于与所识别的呼叫中心代理相关联的媒体文件和所识别的多个术语的实例,为每个确定的呼叫中心代理执行音箱适配。
    • 9. 发明申请
    • SPEAKER ADAPTATION
    • 扬声器适应
    • US20130294587A1
    • 2013-11-07
    • US13463104
    • 2012-05-03
    • Jon A. ArrowoodRobert W. MorrisMarsal Gavalda
    • Jon A. ArrowoodRobert W. MorrisMarsal Gavalda
    • H04M1/64
    • G10L15/07G10L2015/088H04M3/51H04M2201/40Y10S379/907
    • A method for speaker adaptation includes receiving a plurality of media files, each associated with a call center agent of a plurality of call center agents and receiving a plurality of terms. Speech processing is performed on at least some of the media files to identify putative instances of at least some of the plurality of terms. Each putative instance is associated with a hit quality that characterizes a quality of recognition of the corresponding term. One or more call center agents for performing speaker adaptation are determined, including identifying call center agents that are associated with at least one media file that includes one or more putative instances with a hit quality below a predetermined threshold. Speaker adaptation is performed for each identified call center agent based on the media files associated with the identified call center agent and the identified instances of the plurality of terms.
    • 用于说话者适应的方法包括接收多个媒体文件,每个媒体文件与多个呼叫中心代理的呼叫中心代理相关联并且接收多个条目。 对至少一些媒体文件执行语音处理,以识别多个术语中的至少一些术语的推定实例。 每个推定的实例与表征相应术语的识别质量的命中质量相关联。 确定用于执行说话者适应的一个或多个呼叫中心代理,包括识别与包括具有低于预定阈值的命中质量的一个或多个推定实例的至少一个媒体文件相关联的呼叫中心代理。 基于与所识别的呼叫中心代理相关联的媒体文件和所识别的多个术语的实例,为每个确定的呼叫中心代理执行音箱适配。
    • 10. 发明授权
    • Speech signal similarity
    • 语音信号相似度
    • US08670983B2
    • 2014-03-11
    • US13221270
    • 2011-08-30
    • Jacob B. GarlandJon A. ArrowoodDrew LanhamMarsal Gavalda
    • Jacob B. GarlandJon A. ArrowoodDrew LanhamMarsal Gavalda
    • G10L15/04
    • G10L25/00
    • A method for determining a similarity between a first audio source and a second audio source includes: for the first audio source, determining a first frequency of occurrence for each of a plurality of phoneme sequences and determining a first weighted frequency for each of the plurality of phoneme sequences based on the first frequency of occurrence for the phoneme sequence; for the second audio source, determining a second frequency of occurrence for each of a plurality of phoneme sequences and determining a second weighted frequency for each of the plurality of phoneme sequences based on the second frequency of occurrence for the phoneme sequence; comparing the first weighted frequency for each phoneme sequence with the second weighted frequency for the corresponding phoneme sequence; and generating a similarity score representative of a similarity between the first audio source and the second audio source based on the results of the comparing.
    • 一种用于确定第一音频源和第二音频源之间的相似度的方法包括:对于第一音频源,确定多个音素序列中的每一个的第一出现频率,并且确定多个音素中的每一个的第一加权频率 基于音素序列的第一个发生频率的音素序列; 对于第二音频源,确定多个音素序列中的每一个的第二出现频率,并且基于音素序列的第二出现频率确定多个音素序列中的每一个的第二加权频率; 将每个音素序列的第一加权频率与相应音素序列的第二加权频率进行比较; 以及基于所述比较的结果生成表示所述第一音频源和所述第二音频源之间的相似度的相似度分数。