Quick Patent Search: Search Global Patents Quickly, Free Commercial Patent Database - IPRDB

1. Invention application

WO2006002219A2 SYSTEMS AND METHODS FOR SPELL CORRECTION OF NON-ROMAN CHARACTERS AND WORDS (under examination, published)
Publication number: WO2006002219A2
Publication date: 2006-01-05
Application number: PCT/US2005022027
Filing date: 2005-06-21
Applicant: GOOGLE INC , WU JUN , ZHU HONGJUN , ZHU HUICAN , HUANG WEI-HWA , CHAN CHIU-KI
Inventor: WU JUN , ZHU HONGJUN , ZHU HUICAN , HUANG WEI-HWA , CHAN CHIU-KI
IPC classification: G06F17/27 , G06F15/00 , G06F17/20 , G06F17/22
CPC classification: G06F17/273 , G06F17/2223
Abstract: Systems and methods to process and correct spelling errors for non-Roman based words such as in Chinese, Japanese, and Korean languages using a rule-based classifier and a hidden Markov model are disclosed. The method generally includes converting an input entry in a first language such as Chinese to at least one intermediate entry in an intermediate representation, such as pinyin, different from the first language, converting the intermediate entry to at least one possible alternative spelling or form of the input in the first language, and determining that the input entry is either a correct or questionable input entry when a match between the input entry and all possible alternative spellings to the input entry is or is not located, respectively. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.
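The round trip described in this abstract (input entry, then pinyin, then alternative spellings, then a match test) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: `CHAR_TO_PINYIN`, `KNOWN_WORDS`, `to_pinyin`, and `check_entry` are hypothetical, the tables are toy data, each character is given a single reading, and the rule-based classifier and hidden Markov model mentioned in the abstract are not modeled.

```python
# Toy character-to-pinyin table (hypothetical; one reading per character).
CHAR_TO_PINYIN = {
    "北": "bei", "京": "jing", "背": "bei", "景": "jing", "精": "jing",
}

# Toy list of known-good words in the first language.
KNOWN_WORDS = ["北京", "背景"]


def to_pinyin(text):
    """Return the pinyin syllables of text as a tuple, or None if a character is unknown."""
    syllables = []
    for ch in text:
        if ch not in CHAR_TO_PINYIN:
            return None
        syllables.append(CHAR_TO_PINYIN[ch])
    return tuple(syllables)


def build_index(words):
    """Map each pinyin key to the known words that share that pronunciation."""
    index = {}
    for word in words:
        key = to_pinyin(word)
        if key is not None:
            index.setdefault(key, []).append(word)
    return index


PINYIN_INDEX = build_index(KNOWN_WORDS)


def check_entry(entry):
    """Label an entry 'correct' if it matches an alternative spelling, else 'questionable'."""
    key = to_pinyin(entry)
    alternatives = PINYIN_INDEX.get(key, []) if key is not None else []
    label = "correct" if entry in alternatives else "questionable"
    return label, alternatives
```

In this sketch, an entry such as "北精" shares its pinyin with known words but matches none of them, so it is labeled questionable and the known words are returned as possible corrections.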

2. Invention application

WO2007027608A3 LOCAL SEARCH (under examination, published)
Publication number: WO2007027608A3
Publication date: 2007-08-30
Application number: PCT/US2006033537
Filing date: 2006-08-30
Applicant: GOOGLE INC , LUK KUN SHING , ZHU HUICAN , ZHU HONGJUN
Inventor: LUK KUN SHING , ZHU HUICAN , ZHU HONGJUN
IPC classification: G06F17/30 , G06Q30/00
CPC classification: G06Q30/02 , G06F17/3087
Abstract: A system receives yellow page data, map provider data, and document data in response to a local search query, and geocodes the data to assign a geographic identifier and to match at least one address associated with the local search query. The system also indexes the geocoded data to determine business information and location information associated with the local search query. The system further provides local search results and a map based on the indexed data (Figure 1).
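The geocode-then-index flow in this abstract can be sketched in Python as follows. This is only an illustration under stated assumptions: `GEOCODE_TABLE`, `geocode`, `build_index`, `local_search`, the cell identifiers, and the sample records are all hypothetical, and a lookup table stands in for a real geocoder. Map rendering is left out.

```python
from collections import defaultdict

# Hypothetical address-to-identifier table standing in for a real geocoder.
GEOCODE_TABLE = {
    "100 main st, springfield": "cell-001",
    "200 oak ave, springfield": "cell-002",
}

# Hypothetical records from the three source types named in the abstract.
RECORDS = [
    {"source": "yellow_pages", "name": "Joe's Pizza", "address": "100 Main St, Springfield"},
    {"source": "map_provider", "name": "Main Street Pizza Co", "address": "100 Main St, Springfield"},
    {"source": "document", "name": "Oak Avenue Books", "address": "200 Oak Ave, Springfield"},
]


def geocode(address):
    """Return a geographic identifier for an address, or None if it is unknown."""
    return GEOCODE_TABLE.get(address.strip().lower())


def build_index(records):
    """Group records by the geographic identifier assigned to their address."""
    index = defaultdict(list)
    for rec in records:
        geo_id = geocode(rec["address"])
        if geo_id is not None:
            index[geo_id].append(rec)
    return index


def local_search(index, keyword, address):
    """Return indexed records near the query address whose name contains the keyword."""
    geo_id = geocode(address)
    return [
        rec for rec in index.get(geo_id, [])
        if keyword.lower() in rec["name"].lower()
    ]
```

Calling `local_search(build_index(RECORDS), "pizza", "100 Main St, Springfield")` returns the two pizza records that share the query address's identifier, regardless of which source supplied them.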

3. Invention application

WO2006002219A3 SYSTEMS AND METHODS FOR SPELL CORRECTION OF NON-ROMAN CHARACTERS AND WORDS (under examination, published)
Publication number: WO2006002219A3
Publication date: 2006-01-05
Application number: PCT/US2005/022027
Filing date: 2005-06-21
Applicant: GOOGLE INC. , WU, Jun , ZHU, Hongjun , ZHU, Huican , HUANG, Wei-Hwa , CHAN, Chiu-Ki
Inventor: WU, Jun , ZHU, Hongjun , ZHU, Huican , HUANG, Wei-Hwa , CHAN, Chiu-Ki
IPC classification: G06F17/27
Abstract: Systems and methods to process and correct spelling errors for non-Roman based words such as in Chinese, Japanese, and Korean languages using a rule-based classifier and a hidden Markov model are disclosed. The method generally includes converting an input entry in a first language such as Chinese to at least one intermediate entry in an intermediate representation, such as pinyin, different from the first language, converting the intermediate entry to at least one possible alternative spelling or form of the input in the first language, and determining that the input entry is either a correct or questionable input entry when a match between the input entry and all possible alternative spellings to the input entry is or is not located, respectively. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.

4. Invention application

WO2005091167A2 SYSTEMS AND METHODS FOR TRANSLATING CHINESE PINYIN TO CHINESE CHARACTERS (under examination, published)
Publication number: WO2005091167A2
Publication date: 2005-09-29
Application number: PCT/US2005008863
Filing date: 2005-03-16
Applicant: GOOGLE INC , WU JUN , ZHU HUICAN , ZHU HONGJUN
Inventor: WU JUN , ZHU HUICAN , ZHU HONGJUN
IPC classification: G06F17/22 , G06F17/28
CPC classification: G06F17/2223
Abstract: Systems and methods to process and translate pinyin to Chinese characters and words are disclosed. A Chinese language model is trained by extracting unknown character strings from Chinese inputs, e.g., documents and/or user inputs/queries, determining valid words from the unknown character strings, and generating a transition matrix based on the Chinese inputs for predicting a word string given the context. A method for translating a pinyin input generally includes generating a set of Chinese character strings from the pinyin input using a Chinese dictionary including words derived from the Chinese inputs and a language model trained based on the Chinese inputs, each character string having a weight indicating the likelihood that the character string corresponds to the pinyin input. Ambiguous user input may be classified as non-pinyin or pinyin by identifying an ambiguous pinyin/non-pinyin ASCII word in the user input and analyzing the context to classify the user input.
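As a rough illustration of how a dictionary and a transition matrix can jointly score candidate character strings, the Python sketch below runs a Viterbi search over a toy bigram model. It is not the patented method: `DICTIONARY`, `TRANSITIONS`, `FLOOR`, and `translate` are hypothetical names, the probabilities are invented, and each syllable is assumed to map to exactly one single-character word.

```python
import math

# Hypothetical dictionary: pinyin syllable -> candidate single-character words.
DICTIONARY = {
    "wo": ["我", "窝"],
    "ai": ["爱", "矮"],
    "ni": ["你", "泥"],
}

# Hypothetical transition probabilities P(next word | previous word); "<s>" marks the start.
TRANSITIONS = {
    ("<s>", "我"): 0.6, ("<s>", "窝"): 0.1,
    ("我", "爱"): 0.5, ("我", "矮"): 0.05,
    ("爱", "你"): 0.5, ("爱", "泥"): 0.02,
}

FLOOR = 1e-4  # probability assumed for transitions missing from the toy table


def translate(syllables):
    """Return (best character string, log-probability weight), or None for an unknown syllable."""
    # best maps each possible last word to (log probability, path of words so far).
    best = {"<s>": (0.0, [])}
    for syllable in syllables:
        candidates = DICTIONARY.get(syllable)
        if not candidates:
            return None
        step = {}
        for word in candidates:
            # Keep only the most likely path that ends in this candidate word.
            step[word] = max(
                (log_p + math.log(TRANSITIONS.get((prev, word), FLOOR)), path + [word])
                for prev, (log_p, path) in best.items()
            )
        best = step
    log_p, path = max(best.values())
    return "".join(path), log_p
```

For the input `["wo", "ai", "ni"]` this toy model selects "我爱你", and the returned log probability plays the role of the weight that the abstract attaches to each character string.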

5. Invention application

WO2007027608A2 LOCAL SEARCH (under examination, published)
Publication number: WO2007027608A2
Publication date: 2007-03-08
Application number: PCT/US2006/033537
Filing date: 2006-08-30
Applicant: GOOGLE INC. , LUK, Kun Shing , ZHU, Huican , ZHU, Hongjun
Inventor: LUK, Kun Shing , ZHU, Huican , ZHU, Hongjun
IPC classification: G06Q10/00
CPC classification: G06Q30/02 , G06F17/3087
Abstract: A system receives yellow page data, map provider data, and document data in response to a local search query, and geocodes the data to assign a geographic identifier and to match at least one address associated with the local search query. The system also indexes the geocoded data to determine business information and location information associated with the local search query. The system further provides local search results and a map based on the indexed data.

6. Invention application

WO2005091167A3 SYSTEMS AND METHODS FOR TRANSLATING CHINESE PINYIN TO CHINESE CHARACTERS (under examination, published)
Publication number: WO2005091167A3
Publication date: 2005-09-29
Application number: PCT/US2005/008863
Filing date: 2005-03-16
Applicant: GOOGLE INC. , WU, Jun , ZHU, Huican , ZHU, Hongjun
Inventor: WU, Jun , ZHU, Huican , ZHU, Hongjun
IPC classification: G06F17/28
Abstract: Systems and methods to process and translate pinyin to Chinese characters and words are disclosed. A Chinese language model is trained by extracting unknown character strings from Chinese inputs, e.g., documents and/or user inputs/queries, determining valid words from the unknown character strings, and generating a transition matrix based on the Chinese inputs for predicting a word string given the context. A method for translating a pinyin input generally includes generating a set of Chinese character strings from the pinyin input using a Chinese dictionary including words derived from the Chinese inputs and a language model trained based on the Chinese inputs, each character string having a weight indicating the likelihood that the character string corresponds to the pinyin input. Ambiguous user input may be classified as non-pinyin or pinyin by identifying an ambiguous pinyin/non-pinyin ASCII word in the user input and analyzing the context to classify the user input.
