会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Web enabled recognition architecture
    • Web启用识别架构
    • US07506022B2
    • 2009-03-17
    • US09960232
    • 2001-09-20
    • Kuansan WangHsiao-Wuen Hon
    • Kuansan WangHsiao-Wuen Hon
    • G06F15/16G10L11/00
    • G10L15/30G06F3/16G06F17/218H04M1/271H04M1/72561H04M3/493H04M3/4936H04M2207/40
    • A server/client system for processing data includes a network having a web server with information accessible remotely. A client device includes a microphone and a rendering component such as a speaker or display. The client device is configured to obtain the information from the web server and record input data associated with fields contained in the information. The client device is adapted to send the input data to a remote location with an indication of a grammar to use for recognition. A recognition server receives the input data and the indication of the grammar. The recognition server returns data indicative of what was recognized to at least one of the client and the web server.
    • 用于处理数据的服务器/客户端系统包括具有Web服务器的网络,其中信息可远程访问。 客户端设备包括麦克风和诸如扬声器或显示器的渲染组件。 客户端设备配置为从Web服务器获取信息并记录与包含在信息中的字段相关联的输入数据。 客户端设备适于将输入数据发送到远程位置,并具有用于识别的语法指示。 识别服务器接收输入数据和语法的指示。 识别服务器返回表示对客户机和web服务器中的至少一个识别的内容的数据。
    • 3. 发明授权
    • Speech recognition method and apparatus utilizing multi-unit models
    • 使用多单元模型的语音识别方法和装置
    • US06629073B1
    • 2003-09-30
    • US09559505
    • 2000-04-27
    • Hsiao-Wuen HonKuansan Wang
    • Hsiao-Wuen HonKuansan Wang
    • G01L1506
    • G10L15/187G10L2015/022G10L2015/025
    • A speech recognition method and system utilize an acoustic model that is capable of providing probabilities for both a large acoustic unit and an acoustic sub-unit. Each of these probabilities describes the likelihood of a set of feature vectors from a series of feature vectors representing a speech signal. The large acoustic unit is formed from a plurality of acoustic sub-units. At least one sub-unit probability and at least on large unit probability from the acoustic model are used by a decoder to generate a score for a sequence of hypothesized words. When combined, the acoustic sub-units associated with all of the sub-unit probabilities used to determine the score span fewer than all of the feature vectors in the series of feature vectors. An overlapping decoding technique is also provided.
    • 语音识别方法和系统利用能够为大声学单元和声学子单元提供概率的声学模型。 这些概率中的每一个描述了来自表示语音信号的一系列特征向量的一组特征向量的可能性。 大型声学单元由多个声学子单元形成。 解码器使用来自声学模型的至少一个子单元概率和至少基于大的单位概率来为假设词的序列生成分数。 当组合时,与用于确定分数的所有子单元概率相关联的声学子单元小于该系列特征向量中的所有特征向量。 还提供了重叠的解码技术。
    • 5. 发明申请
    • Sequential multimodal input
    • 顺序多模态输入
    • US20050101355A1
    • 2005-05-12
    • US10705155
    • 2003-11-11
    • Hsiao-Wuen HonKuansan Wang
    • Hsiao-Wuen HonKuansan Wang
    • G06F3/16G06F3/038G06F15/16G06F17/00H04M1/725H04M3/42H04M3/493H04M7/00H04M11/08H04Q7/38H04M1/00
    • G06F3/038H04M1/72561H04M3/4938H04M7/0027H04M2201/38H04M2207/18H04M2250/22H04M2250/74
    • A method of interacting with a client/server architecture with a 2G mobile phone is provided. The 2G phone includes a data channel for transmitting data and a voice channel for transmitting speech. The method includes receiving a web page from a web server pursuant to an application through the data channel and rendering the web page on the 2G phone. Speech is received from the user corresponding to at least one data field on the web page. A call is established from the 2G phone to a telephony server over the voice channel. The telephony server is remote from the 2G phone and is adapted to process speech. The telephony server obtains a speech-enabled web page from the web server corresponding to the web page provided to the 2G phone. Speech is transmitted from the 2G phone to the telephony server. The speech is processed in accordance with the speech-enabled web page to obtain textual data. The textual data is transmitted to the web server. The 2G phone obtains a new web page through the data channel and renders the new web page having the textual data.
    • 提供了一种与2G手机与客户端/服务器体系结构交互的方法。 2G电话包括用于发送数据的数据信道和用于发送语音的语音信道。 该方法包括根据通过数据通道的应用从Web服务器接收网页,并在2G电话上呈现网页。 从用户接收到对应于网页上的至少一个数据字段的语音。 通过语音信道从2G电话建立到电话服务器的呼叫。 电话服务器远离2G电话,适用于处理语音。 电话服务器从对应于提供给2G电话的网页的web服务器获取具有语音的网页。 语音从2G电话发送到电话服务器。 根据具有语音功能的网页处理语音以获得文本数据。 文本数据被传送到Web服务器。 2G手机通过数据通道获取新的网页,并使新网页具有文本数据。
    • 6. 发明申请
    • Sequential multimodal input
    • 顺序多模态输入
    • US20050101300A1
    • 2005-05-12
    • US10705019
    • 2003-11-11
    • Hsiao-Wuen HonKuansan Wang
    • Hsiao-Wuen HonKuansan Wang
    • G06F3/16G06F3/00G06F13/00G06F15/16G06F17/30H04M3/493H04M11/00H04Q7/38H04M7/00
    • G06F17/30899H04M3/4938
    • A method of interacting with a client/server architecture with a 2.5G mobile phone having a data channel for transmitting data and a voice channel for transmitting speech. The method includes receiving a web page from a web server pursuant to an application through the data channel and rendering the web page on the 2.5G phone, where rendering comprises processing the web page to be responsive speech input. Speech is received from the user corresponding to at least one data field on the web page. A call is established from the 2.5G phone to a telephony server over the voice channel. The telephony server is remote from the 2.5G phone and adapted to process speech. A speech-enabled web page is obtained from the web server corresponding to the web page provided to the 2.5G phone. Speech is transmitted from the 2.5G phne to the telephony server. The speech is processed in accordance with the speech-enabled web page to obtain textual data in accordance with the speech. The textual data is transmitted to the web server. A new web page is obtained on the 2.5G phone through the data channel and rendered having the textual data.
    • 一种与具有用于发送数据的数据信道的2.5G移动电话与用于发送语音的语音信道的客户机/服务器架构交互的方法。 该方法包括根据通过数据通道的应用从Web服务器接收网页,并在2.5G电话上呈现网页,其中渲染包括处理网页以进行响应语音输入。 从用户接收到对应于网页上的至少一个数据字段的语音。 通过语音信道从2.5G电话建立到电话服务器的呼叫。 电话服务器远离2.5G手机,适用于处理语音。 从与提供给2.5G电话的网页相对应的网络服务器获得支持语音的网页。 语音从2.5G电话传输到电话服务器。 根据具有语音功能的网页来处理语音,以根据语音获得文本数据。 文本数据被传送到Web服务器。 通过数据通道在2.5G手机上获得一个新的网页,并具有文本数据。
    • 10. 发明授权
    • Sequential multimodal input
    • 顺序多模态输入
    • US07363027B2
    • 2008-04-22
    • US10705019
    • 2003-11-11
    • Hsiao-Wuen HonKuansan Wang
    • Hsiao-Wuen HonKuansan Wang
    • H04Q7/22
    • G06F17/30899H04M3/4938
    • A method of interacting with a client/server architecture with a 2.5G mobile phone having a data channel for transmitting data and a voice channel for transmitting speech. The method includes receiving a web page from a web server pursuant to an application through the data channel and rendering the web page on the 2.5G phone, where rendering comprises processing the web page to be responsive speech input. Speech is received from the user corresponding to at least one data field on the web page. A call is established from the 2.5G phone to a telephony server over the voice channel. The telephony server is remote from the 2.5G phone and adapted to process speech. A speech-enabled web page is obtained from the web server corresponding to the web page provided to the 2.5G phone. Speech is transmitted from the 2.5G phone to the telephony server. The speech is processed in accordance with the speech-enabled web page to obtain textual data in accordance with the speech. The textual data is transmitted to the web server. A new web page is obtained on the 2.5G phone through the data channel and rendered having the textual data.
    • 一种与具有用于发送数据的数据信道的2.5G移动电话与用于发送语音的语音信道的客户机/服务器架构交互的方法。 该方法包括根据通过数据通道的应用从Web服务器接收网页,并在2.5G电话上呈现网页,其中渲染包括处理网页以进行响应语音输入。 从用户接收到对应于网页上的至少一个数据字段的语音。 通过语音信道从2.5G电话建立到电话服务器的呼叫。 电话服务器远离2.5G手机,适用于处理语音。 从与提供给2.5G电话的网页相对应的网络服务器获得支持语音的网页。 语音从2.5G手机发送到电话服务器。 根据具有语音功能的网页来处理语音,以根据语音获得文本数据。 文本数据被传送到Web服务器。 通过数据通道在2.5G手机上获得一个新的网页,并具有文本数据。