会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • SYSTEMS AND METHODS FOR SPELL CORRECTION OF NON-ROMAN CHARACTERS AND WORDS
    • 用于非罗曼字符和文字的校正的系统和方法
    • WO2006002219A2
    • 2006-01-05
    • PCT/US2005022027
    • 2005-06-21
    • GOOGLE INCWU JUNZHU HONGJUNZHU HUICANHUANG WEI-HWACHAN CHIU-KI
    • WU JUNZHU HONGJUNZHU HUICANHUANG WEI-HWACHAN CHIU-KI
    • G06F17/27G06F15/00G06F17/20G06F17/22
    • G06F17/273G06F17/2223
    • Systems and methods to process and correct spelling errors for non-Roman based words such as in Chinese, Japanese, and Korean languages using a rule-based classifier and a hidden Markov model are disclosed. The method generally includes converting an input entry in a first language such as Chinese to at least one intermediate entry in an intermediate representation, such as pinyin, different from the first language, converting the intermediate entry to at least one possible alternative spelling or form of the input in the first language, and determining that the input entry is either a correct or questionable input entry when a match between the input entry and all possible alternative spellings to the input entry is or is not located, respectively. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.
    • 公开了使用基于规则的分类器和隐马尔可夫模型来处理和纠正诸如中文,日文和韩文语言的基于非罗马字的词的拼写错误的系统和方法。 该方法通常包括将诸如汉语的第一语言的输入条目转换为与第一语言不同的中间表示(例如拼音)中的至少一个中间条目,将中间条目转换为至少一个可能的替代拼写或形式 以第一语言输入,并且当输入条目和输入条目的所有可能的替代拼写之间的匹配分别位于或不位于输入条目时,确定输入条目是正确的或可疑的输入条目。 可疑输入条目可以使用例如基于基于由变换规则生成器生成的变换规则的基于变换规则的分类器来分类。
    • 4. 发明申请
    • SYSTEMS AND METHODS FOR TRANSLATING CHINESE PINYIN TO CHINESE CHARACTERS
    • 将中文字母转换为中文字符的系统和方法
    • WO2005091167A2
    • 2005-09-29
    • PCT/US2005008863
    • 2005-03-16
    • GOOGLE INCWU JUNZHU HUICANZHU HONGJUN
    • WU JUNZHU HUICANZHU HONGJUN
    • G06F17/22G06F17/28
    • G06F17/2223
    • Systems and methods to process and translate pinyin to Chinese characters and words are disclosed. A chinese language model is trained by extracting unknown character strings from Chinese inputs, e.g., documents and/or user inputs/queries, determining valid words from the unknown character strings, and generating a transition matrix based on the Chinese inputs for predicting a word string given the context. A method for translating a pinyin input generally includes generating a set of Chinese character strings from the pinyin input using a Chinese dictionary including words derived from the Chinese inputs and a language model trained based on the Chinese inputs, each character string having a weight indicating the likelihood that the character string corresponds to the pinyin input. Ambiguous user input may be classified as non-pinyin or pinyin by identifying an ambiguous pinyin/non-pinyin ASCII word in the user input and analyzing the context to classify the user input.
    • 披露了将拼音处理和翻译成汉字和词语的系统和方法。 通过从中文输入(例如文档和/或用户输入/查询)中提取未知字符串,从未知字符串中确定有效字,并基于中文输入生成用于预测字串的转换矩阵来训练中文模型 给定上下文。 用于翻译拼音输入的方法通常包括使用中文字典从拼音输入生成一组汉字字符串,包括从中文输入得到的词和基于中文输入训练的语言模型,每个字符串具有指示 字符串对应于拼音输入的可能性。 通过识别用户输入中的歧义拼音/非拼音ASCII字词并分析上下文以对用户输入进行分类,可能将歧义用户输入分类为非拼音或拼音。