会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明申请
    • METHODS AND SYSTEMS OF PROVIDING SUPPLEMENTAL INFORMATON
    • 提供补充信息的方法和系统
    • US20140281846A1
    • 2014-09-18
    • US13802113
    • 2013-03-13
    • Alexander SorinDavid SiegelMichael ThompsonJulian Gosper
    • Alexander SorinDavid SiegelMichael ThompsonJulian Gosper
    • G06F17/24
    • G06Q10/105
    • At least one analytical operation from a set of different analytical operations may be determined based on at least one input. The input(s) may comprise contextual information of working content being displayed to a user on a device and comprising numerical data. Supplemental information for the working content may be generated using the determined analytical operation(s), may comprise a numerical-based analysis of the numerical data, and may be caused to be displayed to the user concurrently with the working content. The contextual information may comprise structured data. The input(s) may further comprise at least one of a history of the user's interactions with the working content, a history of the user's interactions with recommendations of supplemental information for the working content, a history of other users' interactions with the working content, and a history of other users' interactions with recommendations of supplemental information for the working content.
    • 可以基于至少一个输入来确定来自一组不同分析操作的至少一个分析操作。 输入可以包括在设备上向用户显示的工作内容的上下文信息,并且包括数字数据。 可以使用确定的分析操作来生成用于工作内容的补充信息,可以包括数值数据的基于数值的分析,并且可以使其与工作内容同时显示给用户。 上下文信息可以包括结构化数据。 输入可以进一步包括用户与工作内容的交互的历史中的至少一个,用户与工作内容的补充信息的建议的交互的历史,其他用户与工作内容的交互的历史 ,以及其他用户与工作内容的补充信息建议的交互历史。
    • 6. 发明授权
    • System and method for automatic prediction of speech suitability for statistical modeling
    • 自动预测语音适用性的统计建模系统和方法
    • US09484045B2
    • 2016-11-01
    • US13606618
    • 2012-09-07
    • Alexander SorinSlava ShechtmanVincent Pollet
    • Alexander SorinSlava ShechtmanVincent Pollet
    • G10L13/06G10L13/04G10L19/00G10L25/48G10L25/18
    • G10L25/48G10L13/04G10L25/18
    • An embodiment according to the invention provides a capability of automatically predicting how favorable a given speech signal is for statistical modeling, which is advantageous in a variety of different contexts. In Multi-Form Segment (MFS) synthesis, for example, an embodiment according to the invention uses prediction capability to provide an automatic acoustic driven template versus model decision maker with an output quality that is high, stable and depends gradually on the system footprint. In speaker selection for a statistical Text-to-Speech synthesis (TTS) system build, as another example context, an embodiment according to the invention enables a fast selection of the most appropriate speaker among several available ones for the full voice dataset recording and preparation, based on a small amount of recorded speech material.
    • 根据本发明的实施例提供了一种自动预测给定语音信号对于统计建模有利的能力,这在各种不同的上下文中是有利的。 在多格段(MFS)合成中,例如,根据本发明的实施例使用预测能力来提供具有高,稳定的输出质量的自动声驱动模板与模型决策者,并逐渐依赖于系统占用。 在用于统计文本到语音合成(TTS)系统构建的说话者选择中,作为另一示例性上下文,根据本发明的实施例使得能够在完整语音数据集记录和准备中的几个可用的扬声器中快速选择最合适的说话者 ,基于少量的录音材料。
    • 9. 发明授权
    • OCR-based image compression
    • 基于OCR的图像压缩
    • US06487311B1
    • 2002-11-26
    • US09304861
    • 1999-05-04
    • Yaniv GalAlexander SorinAndrei HeilperEugene Wallach
    • Yaniv GalAlexander SorinAndrei HeilperEugene Wallach
    • G06K968
    • H04N1/4115
    • A method for compressing a digitized image of a document using optical character recognition (OCR). The method includes performing optical character recognition (OCR) on the digitized image, identifying, based, at least in part, on a result of the performing step, a plurality of classes of characters comprised in the image, each the class of characters having an associated character value and comprising at least one character, pruning each class of characters, thereby producing information describing the plurality of classes of characters and a residual image, and utilizing the information describing the plurality of classes of characters and the residual image as a compressed digitized image in further processing. Related methods and apparatus are also disclosed.
    • 一种使用光学字符识别(OCR)压缩文档的数字化图像的方法。 所述方法包括对所述数字化图像执行光学字符识别(OCR),至少部分地基于所述执行步骤的结果识别所述图像中包含的多个字符类别,每个所述字符类具有 相关联的字符值并且包括至少一个字符,修剪每个类别的字符,从而产生描述多个字符类别和残留图像的信息,并且利用描述多个类别的字符的信息和残差图像作为压缩数字化 图像进一步处理。还公开了相关方法和装置。
    • 10. 发明授权
    • Statistical enhancement of speech output from a statistical text-to-speech synthesis system
    • 从统计文本到语音合成系统的语音输出的统计增强
    • US08682670B2
    • 2014-03-25
    • US13177577
    • 2011-07-07
    • Slava ShechtmanAlexander Sorin
    • Slava ShechtmanAlexander Sorin
    • G10L13/00
    • G10L13/033G10L13/06
    • A method, system and computer program product are provided for enhancement of speech synthesized by a statistical text-to-speech (TTS) system employing a parametric representation of speech in a space of acoustic feature vectors. The method includes: defining a parametric family of corrective transformations operating in the space of the acoustic feature vectors and dependent on a set of enhancing parameters; and defining a distortion indictor of a feature vector or a plurality of feature vectors. The method further includes: receiving a feature vector output by the system; and generating an instance of the corrective transformation by: calculating a reference value of the distortion indicator attributed to a statistical model of the phonetic unit emitting the feature vector; calculating an actual value of the distortion indicator attributed to feature vectors emitted by the statistical model of the phonetic unit emitting the feature vector; calculating the enhancing parameter values depending on the reference value of the distortion indicator, the actual value of the distortion indicator and the parametric corrective transformation; and deriving an instance of the corrective transformation corresponding to the enhancing parameter values from the parametric family of the corrective transformations. The instance of the corrective transformation may be applied to the feature vector to provide an enhanced feature vector.
    • 提供了一种方法,系统和计算机程序产品,用于增强由在声学特征向量的空间中采用语音参数表示的统计文本到语音(TTS)系统合成的语音。 该方法包括:定义在声学特征向量的空间中操作并依赖于一组增强参数的校正变换的参数族; 以及定义特征向量或多个特征向量的失真指示符。 该方法还包括:接收系统输出的特征向量; 以及通过以下方式产生所述校正变换的实例:计算归因于发出所述特征向量的所述语音单元的统计模型的所述失真指标的参考值; 计算归因于发射特征向量的语音单元的统计模型发射的特征向量的失真指标的实际值; 根据失真指标的参考值,失真指标的实际值和参数校正变换来计算增强参数值; 并且从校正变换的参数族导出对应于增强参数值的校正变换的实例。 校正变换的实例可以应用于特征向量以提供增强的特征向量。