会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 36. 发明申请
    • SYSTEM AND METHOD FOR TRANSCRIBING HANDWRITTEN RECORDS USING WORD GROUPING WITH ASSIGNED CENTROIDS
    • 使用分配中心的WORD分组来转换手写记录的系统和方法
    • US20160063355A1
    • 2016-03-03
    • US14841542
    • 2015-08-31
    • Ancestry.com Operations Inc.
    • Jack ReeseMichael MurdockShawn ReidLaryn Brown
    • G06K9/62G06K9/00
    • G06K9/344G06K9/00456G06K9/00463G06K9/00852G06K9/00859G06K9/03G06K9/18G06K9/52G06K9/6201G06K9/6215G06K9/6218G06K2209/01G06T7/70G06T2207/30176
    • A handwriting recognition system converts word images on documents, such as document images of historical records, into computer searchable text. Word images (snippets) on the document are located, and have multiple word features identified. For each word image, a word feature vector is created representing multiple word features. Based on the similarity of word features (e.g., the distance between feature vectors), similar words are grouped together in clusters, and a centroid that has features most representative of words in the cluster is selected. A digitized text word is selected for each cluster based on review of a centroid in the cluster, and is assigned to all words in that cluster and is used as computer searchable text for those word images where they appear in documents. An analyst may review clusters to permit refinement of the parameters used for grouping words in clusters, including the adjustment of weights and other factors used for determining the distance between feature vectors.
    • 手写识别系统将诸如历史记录的文档图像的文档上的文字图像转换为计算机可搜索的文本。 位于文档上的Word图像(片段),并标识了多个单词特征。 对于每个单词图像,创建一个表示多个单词特征的单词特征向量。 基于词特征的相似性(例如,特征向量之间的距离),类似的词被分组在一起成簇,并且选择具有群体中最具代表性的特征的质心。 基于对集群中的质心的检查,为每个集群选择数字化文本字,并将其分配给该集群中的所有单词,并将其用作在文档中显示的单词图像的计算机可搜索文本。 分析人员可以查看群集以允许改进用于在群集中分组单词的参数,包括用于确定特征向量之间的距离的权重和其他因子的调整。
    • 38. 发明申请
    • SYSTEM AND METHOD FOR TRANSCRIBING HISTORICAL RECORDS INTO DIGITIZED TEXT
    • 将历史记录转换为数字文本的系统和方法
    • US20160063321A1
    • 2016-03-03
    • US14841502
    • 2015-08-31
    • Ancestry.com Operations Inc.
    • Jack ReeseMichael MurdockShawn ReidLaryn Brown
    • G06K9/00
    • G06K9/344G06K9/00456G06K9/00463G06K9/00852G06K9/00859G06K9/03G06K9/18G06K9/52G06K9/6201G06K9/6215G06K9/6218G06K2209/01G06T7/70G06T2207/30176
    • A handwriting recognition system converts word images on documents, such as document images of historical records, into computer searchable text. Word images (snippets) on the document are located, and have multiple word features identified. For each word image, a word feature vector is created representing multiple word features. Based on the similarity of word features (e.g., the distance between feature vectors), similar words are grouped together in clusters, and a centroid that has features most representative of words in the cluster is selected. A digitized text word is selected for each cluster based on review of a centroid in the cluster, and is assigned to all words in that cluster and is used as computer searchable text for those word images where they appear in documents. An analyst may review clusters to permit refinement of the parameters used for grouping words in clusters, including the adjustment of weights and other factors used for determining the distance between feature vectors.
    • 手写识别系统将诸如历史记录的文档图像的文档上的文字图像转换为计算机可搜索的文本。 位于文档上的Word图像(片段),并标识了多个单词特征。 对于每个单词图像,创建一个表示多个单词特征的单词特征向量。 基于词特征的相似性(例如,特征向量之间的距离),类似的词被分组在一起成簇,并且选择具有群体中最具代表性的特征的质心。 基于对集群中的质心的检查,为每个集群选择数字化文本字,并将其分配给该集群中的所有单词,并将其用作在文档中显示的单词图像的计算机可搜索文本。 分析人员可以查看群集以允许改进用于在群集中分组单词的参数,包括用于确定特征向量之间的距离的权重和其他因子的调整。