    • 123. Granted invention patent
    • Combined speech and alternate input modality to a mobile device
    • US07941316B2
    • 2011-05-10
    • US11262230
    • 2005-10-28
    • Milind V. Mahajan; Alejandro Acero; Bo-June Hsu
    • G10L15/26
    • G10L15/22
    • A method of entering information into a mobile device includes receiving a multi-word speech input from a user, performing speech recognition on the speech input to obtain a multi-word speech recognition result, and sequentially displaying, in a display, words in the speech recognition result for user confirmation or correction, by adding one word at a time to the display. A next word is displayed only after user confirmation or correction has been received for the previously displayed word that immediately precedes it in the speech recognition result. The method also includes calculating a hypothesis lattice indicative of a plurality of speech recognition hypotheses based on the speech input and, prior to finishing calculating the hypothesis lattice and while continuing to calculate the hypothesis lattice, calculating a preliminary hypothesis lattice indicative of only partial speech recognition hypotheses based on the speech input and outputting the preliminary hypothesis lattice.
    • 124. Granted invention patent
    • Time synchronous decoding for long-span hidden trajectory model
    • US07877256B2
    • 2011-01-25
    • US11356905
    • 2006-02-17
    • Xiaolong Li; Li Deng; Dong Yu; Alejandro Acero
    • G10L15/14
    • G10L15/08
    • A time-synchronous lattice-constrained search algorithm is developed and used to process a linguistic model of speech that has a long-contextual-span capability. In the algorithm, hypotheses are represented as traces that include an indication of a current frame, previous frames and future frames. Each frame can include an associated linguistic unit such as a phone or units that are derived from a phone. Additionally, pruning strategies can be applied to speed up the search. Further, word-ending recombination methods are developed to speed up the computation. These methods can effectively deal with an exponentially increased search space.
    • 126. Invention patent application
    • VISUAL FEEDBACK FOR NATURAL HEAD POSITIONING
    • US20100149310A1
    • 2010-06-17
    • US12336534
    • 2008-12-17
    • Zhengyou Zhang; Christian Huitema; Alejandro Acero
    • H04N7/15
    • H04N7/147; H04N7/15; H04N21/42203; H04N21/4223; H04N21/4318; H04N21/44218; H04N21/4788
    • A videoconferencing conferee may be provided with feedback on his or her location relative to a local video camera by altering how remote videoconference video is displayed on a local videoconference display viewed by the conferee. The conferee's location may be tracked and the displayed remote video may be altered in accordance with the changing location of the conferee. The remote video may appear to move in directions mirroring movement of the conferee. This effect may be achieved by modeling the remote video as offset and behind a virtual portal corresponding to the display. The remote video may be displayed according to a view of the remote video through the virtual portal. As the conferee's position changes, the view through the portal changes, and the remote video changes accordingly.
    • 128. Invention patent application
    • ACOUSTIC ECHO SUPPRESSION
    • US20090323924A1
    • 2009-12-31
    • US12145579
    • 2008-06-25
    • Ivan J. Tashev; Alejandro Acero; Nilesh Madhu
    • H04M9/08
    • H04M9/082
    • Sound signals captured by a microphone are adjusted to provide improved sound quality. More particularly, an Acoustic Echo Reduction system which performs a first stage of echo reduction (e.g., acoustic echo cancellation) on a received signal is configured to perform a second stage of echo reduction (e.g., acoustic echo suppression) by segmenting the received signal into a plurality of frequency bins respectively comprised within a number of frames (e.g., 0.3 s to 0.5 s sound signal segments) for a given block. Data comprised within respective frequency bins is modeled according to a probability density function (e.g., Gaussian distribution). The probability of whether respective frequency bins comprise predominantly near-end signal or predominantly residual echo is calculated. The output of the acoustic echo suppression is computed as a product of the content of a frequency bin in a frame and the probability that the frequency bin in that frame comprises predominantly near-end signal, thereby making near-end signals more prominent than residual echoes.
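The suppression step described in the last abstract (scale each frequency bin by the probability that it is dominated by near-end signal) can be illustrated with a minimal sketch. The abstract only says the bins are modeled with a probability density function such as a Gaussian; the two-hypothesis zero-mean complex Gaussian model, the per-bin variances `var_near` and `var_echo`, the prior `prior_near`, and both function names below are assumptions made for illustration, not details taken from the patent.

```python
import math


def near_end_probability(bin_value, var_near, var_echo, prior_near=0.5):
    """Posterior probability that one bin is dominated by near-end signal.

    Assumption (not from the abstract): each hypothesis is a zero-mean
    complex Gaussian with its own variance, combined with Bayes' rule.
    """
    power = abs(bin_value) ** 2
    # Log-likelihood of a zero-mean complex Gaussian: -log(pi * v) - |x|^2 / v
    log_near = -math.log(math.pi * var_near) - power / var_near
    log_echo = -math.log(math.pi * var_echo) - power / var_echo
    log_odds = (log_near + math.log(prior_near)) - (
        log_echo + math.log(1.0 - prior_near)
    )
    # Logistic function, written so that exp() never overflows
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)


def suppress_frame(frame_bins, var_near, var_echo, prior_near=0.5):
    """Multiply each bin of one frame by its near-end probability."""
    return [
        x * near_end_probability(x, vn, ve, prior_near)
        for x, vn, ve in zip(frame_bins, var_near, var_echo)
    ]


# Hypothetical two-bin frame: one strong bin and one weak bin.
frame = [complex(2.0, 0.0), complex(0.1, 0.0)]
output = suppress_frame(frame, var_near=[4.0, 4.0], var_echo=[0.01, 0.01])
```

With these made-up variances the strong bin passes through almost unchanged, while the weak bin, which fits the residual-echo model better, is attenuated heavily. In practice the variances would have to be estimated from the frames of each block, which this sketch does not attempt.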