会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 52. 发明授权
    • Constructing a classifier for classifying queries
    • 构造一个用于分类查询的分类器
    • US08407214B2
    • 2013-03-26
    • US12145508
    • 2008-06-25
    • Xiao LiYe-Yi Wang
    • Xiao LiYe-Yi Wang
    • G06F17/30
    • G06F17/30672
    • To construct a classifier, a data structure correlating queries to items identified by the queries is received, where the data structure contains initial labeled queries that have been labeled with respect to predetermined classes, and unlabeled queries that have not been labeled with respect to the predetermined classes. The data structure is used to label at least some of the unlabeled queries with respect to the predetermined classes. Queries in the data structure that have been labeled with respect to the predetermined classes are used as training data to train the classifier.
    • 为了构建分类器,接收将查询与由查询识别的项目相关联的数据结构,其中数据结构包含已经针对预定类别标记的初始标记查询,以及未标记关于预定类别的未标记查询 课程 该数据结构用于标记关于预定类别的至少一些未标记查询。 已经将关于预定类标记的数据结构中的查询用作训练数据以训练分类器。
    • 54. 发明申请
    • SEARCH LEXICON EXPANSION
    • 搜索LEXICON EXPANSION
    • US20120158703A1
    • 2012-06-21
    • US12970477
    • 2010-12-16
    • Xiao LiJingjing LiuAlejandro AceroYe-Yi Wang
    • Xiao LiJingjing LiuAlejandro AceroYe-Yi Wang
    • G06F17/30
    • G06F17/30737G06F17/2735G06F17/30693G06F17/30864
    • One or more techniques and/or systems are disclosed for creating an expanded or improved lexicon for use in search-based semantic tagging. A set of first documents can be identified using a set of first lexicon elements as queries, and one or more first document patterns can be extracted from the set of first documents. The document patterns can be used to find one or more second documents in a query log that comprise the document patterns, which are associated with query terms used to return the second documents. The query terms for the second documents can be extracted and used to expand the lexicon. Elements within the lexicon may be weighted based upon relevance to different query domains, for example.
    • 公开了一种或多种技术和/或系统,用于创建用于基于搜索的语义标签中的扩展或改进的词典。 可以使用一组第一词典元素作为查询来识别一组第一文档,并且可以从该组第一文档中提取一个或多个第一文档图案。 文档模式可用于在查询日志中找到构成文档模式的一个或多个第二文档,这些文档模式与用于返回第二个文档的查询术语相关联。 可以提取和使用第二个文档的查询条款来扩展词典。 例如,词法中的元素可以基于与不同查询域的相关性来加权。
    • 55. 发明授权
    • Grapheme-to-phoneme conversion using acoustic data
    • 使用声学数据的语音对音素转换
    • US08180640B2
    • 2012-05-15
    • US13164683
    • 2011-06-20
    • Xiao LiAsela J. R. GunawardanaAlejandro Acero, Jr.
    • Xiao LiAsela J. R. GunawardanaAlejandro Acero, Jr.
    • G10L15/04
    • G10L13/08G10L15/063G10L15/187
    • Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of acoustics and graphonemes (acoustic data, phonemes sequences, grapheme sequences and an alignment between phoneme sequences and grapheme sequences) is described, as is retraining by maximum likelihood training and discriminative training in adapting graphoneme model parameters using acoustic data. Also described is the unsupervised collection of grapheme labels for received acoustic data, thereby automatically obtaining a substantial number of actual samples that may be used in retraining. Speech input that does not meet a confidence threshold may be filtered out so as to not be used by the retrained model.
    • 描述了使用声学数据来改进用于语音识别的字形到音素转换,例如更准确地识别语音拨号系统中的语音名称。 描述了声学和图形(声学数据,音素序列,字形序列以及音素序列和图形序列之间的对齐)的联合模型,正如通过使用声学数据适应图形模型参数的最大似然训练和鉴别训练来重新训练。 还描述了用于接收的声学数据的无监督的字母标签集合,从而自动获得可用于再培训的大量实际样本。 不满足置信阈值的语音输入可以被滤除,以便不被再培训的模型使用。