    • 1. Invention patent
    • IML- Data Cleaning: INTELLIGENT DATA CLEANING USING MACHINE LEARNING PROGRAMMING
    • AU2020102129A4
    • 2020-10-15
    • AU2020102129
    • 2020-09-03
    • BALAKRISHNA R; CHUNCHURE BASAVARAJ; D SAI MADHAVI; HARSOOR BHARATI; K S RAJESH; MADGI MANOHAR; PATIL ASHOK; PATIL NAGARAJ B; PATIL PADMAPRIYA; SHAHABADKAR RAMESH
    • MADGI MANOHAR; SHAHABADKAR RAMESH; PATIL NAGARAJ B; HARSOOR BHARATI; PATIL ASHOK; PATIL PADMAPRIYA; CHUNCHURE BASAVARAJ; BALAKRISHNA R; K S RAJESH; D SAI MADHAVI
    • G06F16/35; G06K9/00; G06N5/02; G06N20/00
    • Patent Title: IML- Data Cleaning: INTELLIGENT DATA CLEANING USING MACHINE LEARNING PROGRAMMING. The invention, "IML- Data Cleaning", is a system and article of manufacture that adapts to a shift in document content. It includes instructions for receiving at least one labeled mapped seed document, receiving unlabeled mapped documents, receiving at least one predetermined cost factor, and training a transductive classifier using the predetermined cost factor, the seed document(s), and the unlabeled documents. The invention also classifies the unlabeled documents that have a confidence level above a predefined threshold into a plurality of indexes and categories using the classifier, reclassifies at least some of the categorized documents into the categories using the classifier, and outputs identifiers of the categorized documents to at least one of a user, another system, and another process. Systems and articles of manufacture for separating documents and for document searching are also presented. A business or other information service provides data cleansing to correct and update both domestic and global addresses. An integrated combination of processes generates cleansed data for input into a matching and mapping process, which matches information about a business. The data cleaning process includes the steps of validating data loaded from at least two source systems; appending the validated data to a normalized data cleaning repository; selecting the priority of the source systems; creating a clean database; and loading the consistent, normalized, and cleansed data from the clean database into the format required by the data systems and software tools that use the data.
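The data cleaning steps listed above (validate, append to a normalized repository, select source priority, create a clean database, load into a consumer format) can be illustrated with a minimal Python sketch. This is not the patented implementation: the source-system names `crm` and `billing`, the `Record` fields, and the validation and normalization rules are all assumptions chosen only to make the flow concrete.

```python
from dataclasses import dataclass

# Hypothetical source systems, listed from highest to lowest priority.
SOURCE_PRIORITY = ["crm", "billing"]


@dataclass(frozen=True)
class Record:
    key: str      # business identifier shared across source systems
    source: str   # name of the source system the record came from
    address: str


def validate(record: Record) -> bool:
    """Accept a record only if it has a key, a known source, and a non-blank address."""
    return (
        bool(record.key)
        and record.source in SOURCE_PRIORITY
        and bool(record.address.strip())
    )


def normalize(record: Record) -> Record:
    """Collapse whitespace and upper-case the address so sources compare consistently."""
    return Record(record.key, record.source, " ".join(record.address.split()).upper())


def build_clean_database(records):
    """Validate and normalize records, then keep one per key from the best-ranked source."""
    repository = [normalize(r) for r in records if validate(r)]
    clean = {}
    for rec in repository:
        current = clean.get(rec.key)
        if current is None or (
            SOURCE_PRIORITY.index(rec.source) < SOURCE_PRIORITY.index(current.source)
        ):
            clean[rec.key] = rec
    return clean


def export_rows(clean):
    """Load the clean database into a flat row format for a downstream tool (assumed shape)."""
    return [
        {"key": key, "address": clean[key].address, "source": clean[key].source}
        for key in sorted(clean)
    ]
```

Given two records for the same business, one from each source, `build_clean_database` keeps the one from the higher-priority source, and records that fail validation never reach the repository.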
The invented technology also creates reports and lets a user update the clean database without updating the source systems. The data cleaning process is distributed; it collects and analyzes data from available sources for optimization models, enabling consistent analysis. The data cleaning process further provides complete auditability of the inputs and outputs of data systems and software tools that use a dynamic data set. Manohar Madgi (Associate Professor); Dr. Ramesh Shahabadkar (Professor); Dr. Nagaraj B. Patil (Principal); Dr. Bharati Harsoor (Professor & Head); Ashok Patil (Assistant Professor); Padmapriya Patil (Assistant Professor); Dr. Basavaraj Chunchure (Associate Professor); Dr. R. Balakrishna (Professor & Dean); Dr. Rajesh K. S. (Associate Professor); Dr. Sai Madhavi D. (Associate Professor). Total number of sheets: 8. Number of figures: 10. [FIG. 2: Data cleaning process, with stages labeled Parsing, Correction, Standardizing, Matching, Consolidation]
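The abstract's two claims about user-side updates and auditability can be sketched together: corrections are layered over a snapshot of the source data, and every change is appended to a log. The class name `CleanDatabase`, its methods, and the log fields below are hypothetical and are not taken from the patent.

```python
from datetime import datetime, timezone


class CleanDatabase:
    """Clean copy of source data that users can correct without touching the sources."""

    def __init__(self, source_rows):
        self._source = dict(source_rows)  # snapshot of source values; never mutated here
        self._overrides = {}              # user corrections layered on top of the snapshot
        self.audit_log = []               # append-only record of every change

    def get(self, key):
        """Return the corrected value if one exists, otherwise the source value."""
        return self._overrides.get(key, self._source.get(key))

    def source_value(self, key):
        """Return the value as originally loaded from the source system."""
        return self._source.get(key)

    def update(self, key, value, user):
        """Apply a user correction and log the old value, new value, user, and time."""
        self.audit_log.append({
            "key": key,
            "old": self.get(key),
            "new": value,
            "user": user,
            "at": datetime.now(timezone.utc).isoformat(),
        })
        self._overrides[key] = value
```

Because the constructor copies its input and `update` writes only to the override layer, the caller's source data stays unchanged while the audit log records what each correction replaced.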