会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Method and apparatus for real time speech recognition with and without
speaker dependency
    • 用于实时语音识别的方法和设备,具有和不具有扬声器依赖性
    • US5056150A
    • 1991-10-08
    • US433098
    • 1989-11-08
    • Tiecheng YuNing BiMeiling RongEnyao Zhang
    • Tiecheng YuNing BiMeiling RongEnyao Zhang
    • G10L15/08G10L15/10G10L25/87
    • G10L15/10G10L25/87
    • A method and apparatus for real time speech recognition with and without speaker dependency which includes the following steps. Converting the speech signals into a series of primitive sound spectrum parameter frames; detecting the beginning and ending of speech according to the primitive sound spectrum parameter frame, to determine the sound spectrum parameter frame series; performing non-linear time domain normalization on the sound spectrum parameter frame series using sound stimuli, to obtain speech characteristic parameter frame series with predefined lengths on the time domain; performing amplitude quantization normalization on the speech characteristic parameter frames; comparing the speech characteristic parameter frame series with the reference samples, to determine the reference sample which most closely matches the speech characteristic parameter frame series; and determining the recognition result according to the most closely matched reference sample.
    • 一种具有或不具有扬声器依赖性的实时语音识别的方法和装置,包括以下步骤。 将语音信号转换为一系列原始声谱参数帧; 根据原始声谱参数帧检测语音的开始和结束,确定声谱参数帧序列; 使用声音刺激对声谱参数帧序列进行非线性时域归一化,获得时域上具有预定长度的语音特征参数帧序列; 对语音特征参数帧执行振幅量化归一化; 将语音特征参数帧序列与参考样本进行比较,确定与语音特征参数帧序列最匹配的参考样本; 并根据最接近匹配的参考样本确定识别结果。
    • 4. 发明申请
    • HYBRID REALITY FOR 3D HUMAN-MACHINE INTERFACE
    • 用于3D人机界面的混合现实
    • US20120139906A1
    • 2012-06-07
    • US13234028
    • 2011-09-15
    • Xuerui ZhangNing BiYingyong Qi
    • Xuerui ZhangNing BiYingyong Qi
    • G06T15/00
    • G06T19/006H04N13/156
    • A three dimensional (3D) mixed reality system combines a real 3D image or video, captured by a 3D camera for example, with a virtual 3D image rendered by a computer or other machine to render a 3D mixed-reality image or video. A 3D camera can acquire two separate images (a left and a right) of a common scene, and superimpose the two separate images to create a real image with a 3D depth effect. The 3D mixed-reality system can determine a distance to a zero disparity plane for the real 3D image, determine one or more parameters for a projection matrix based on the distance to the zero disparity plane, render a virtual 3D object based on the projection matrix, combine the real image and the virtual 3D object to generate a mixed-reality 3D image.
    • 三维(3D)混合现实系统将由3D摄像机捕获的真实3D图像或视频与由计算机或其他机器呈现的虚拟3D图像组合以渲染3D混合现实图像或视频。 3D摄像机可以获取公共场景的两个单独的图像(左和右),并且叠加两个分离的图像以创建具有3D深度效果的实际图像。 3D混合现实系统可以确定实际3D图像到零视差平面的距离,基于到零视差平面的距离确定用于投影矩阵的一个或多个参数,基于投影矩阵渲染虚拟3D对象 ,组合真实图像和虚拟3D对象以产生混合现实的3D图像。
    • 5. 发明申请
    • MULTI-STAGE TESSELLATION FOR GRAPHICS RENDERING
    • 用于图形渲染的多阶段测量
    • US20090237401A1
    • 2009-09-24
    • US12052628
    • 2008-03-20
    • Jian WeiGuofang JiaoNing BiChehui Wu
    • Jian WeiGuofang JiaoNing BiChehui Wu
    • G06T17/00
    • G06T11/203
    • This disclosure describes a multi-stage tessellation technique for tessellating a curve during graphics rendering. In particular, a first tessellation stage tessellates the curve into a first set of line segments that each represents a portion of the curve. A second tessellation stage further tessellates the portion of the curve represented by each of the line segments of the first set into additional line segments that more finely represent the shape of the curve. In this manner, each portion of the curve that was represented by only one line segment after the first tessellation stage is represented by more than one line segment after the second tessellation stage. In some instances, more than two tessellation stages may be performed to tessellate the curve.
    • 本公开描述了用于在图形渲染期间细分曲线的多阶段镶嵌技术。 特别地,第一细分阶段将曲线细分为第一组线段,每组线段表示曲线的一部分。 第二细分阶段进一步将由第一组的每个线段表示的曲线的部分细分为更精细地表示曲线形状的附加线段。 以这种方式,在第一细分阶段之后仅由一个线段表示的曲线的每个部分在第二细分阶段之后被多于一个线段表示。 在一些情况下,可以执行多于两个的细分阶段来细分曲线。
    • 8. 发明授权
    • Noise-compensated speech recognition templates
    • 噪声补偿语音识别模板
    • US06381569B1
    • 2002-04-30
    • US09018257
    • 1998-02-04
    • Gilbert C. SihNing Bi
    • Gilbert C. SihNing Bi
    • G10L1520
    • G10L15/20G10L21/0216
    • The speech recognition training unit is modified to store digitized speech samples into a speech database that can be accessed at recognition time. The improved recognition unit comprises a noise analysis, modeling, and synthesis unit which continually analyzes the noise characteristics present in the audio environment and produces an estimated noise signal with similar characteristics. The recognition unit then constructs a noise-compensated template database by adding the estimated noise signal to each of the speech samples in the speech database and performing parameter determination on the resulting sums. This procedure accounts for the presence of noise in the recognition phase by retraining all the templates using an estimated noise signal with similar characteristics as the actual noise signal that corrupted the word to be recognized. This method improves the likelihood of a good template match, which increases the recognition accuracy.
    • 修改语音识别训练单元以将数字化语音样本存储到可在识别时被访问的语音数据库中。 改进的识别单元包括噪声分析,建模和合成单元,其连续分析存在于音频环境中的噪声特性并产生具有相似特性的估计噪声信号。 然后,识别单元通过将估计的噪声信号加到语音数据库中的每个语音样本上并对所得到的和进行参数确定来构建噪声补偿模板数据库。 该过程通过使用具有与损坏要识别的字的实际噪声信号相似的特性的估计噪声信号重新训练所有模板来解决识别阶段中的噪声的存在。 该方法提高了模板匹配的可能性,从而提高了识别精度。
    • 9. 发明授权
    • System and method for segmentation and recognition of speech signals
    • 用于语音信号的分割和识别的系统和方法
    • US06278972B1
    • 2001-08-21
    • US09225891
    • 1999-01-04
    • Ning BiChienchung Chang
    • Ning BiChienchung Chang
    • G01L1504
    • G10L15/04
    • A system and method for forming a segmented speech signal from an input speech signal having a plurality of frames. The input speech signal is converted from a time domain signal to a frequency domain signal having a plurality of speech frames, wherein each speech frame in the frequency domain signal is represented by at least one spectral value associated with the speech frame. A spectral difference value is then determined for each pair of adjacent frames in the frequency domain signal, wherein the spectral difference value for each pair of adjacent frames is representative of a difference between the at least one spectral value associated with each frame in the pair of adjacent frames. An initial cluster boundary is set between each pair of adjacent frames in the frequency domain signal, and a variance value is assigned to each cluster in the frequency domain signal, wherein the variance value for each cluster is equal to one of the determined spectral difference values. Next, a plurality of cluster merge parameters is calculated, wherein each of the cluster merge parameters is associated with a pair of adjacent clusters in the frequency domain signal. A minimum cluster merge parameter is selected from the plurality of cluster merge parameters. A merged cluster is then formed by canceling a cluster boundary between the clusters associated with the minimum merge parameter and assigning a merged variance value to the merged cluster, wherein the merged variance value is representative of the variance values assigned to the clusters associated with the minimum merge parameter. The process is repeated in order to form a plurality of merged clusters, and the segmented speech signal is formed in accordance with the plurality of merged clusters.
    • 一种用于从具有多个帧的输入语音信号形成分段语音信号的系统和方法。 输入语音信号从时域信号转换为具有多个语音帧的频域信号,其中频域信号中的每个语音帧由与语音帧相关联的至少一个频谱值表示。 然后对频域信号中的每对相邻帧确定频谱差值,其中每对相邻帧的频谱差值表示与该对相邻帧中的每个帧相关联的至少一个频谱值之间的差异 相邻帧。 在频域信号中的每对相邻帧之间设置初始簇边界,并且将频域值分配给频域信号中的每个簇,其中每个簇的方差值等于所确定的光谱差值之一 。 接下来,计算多个集群合并参数,其中每个集群合并参数与频域信号中的一对相邻集群相关联。 从多个集群合并参数中选择最小集群合并参数。 然后通过消除与最小合并参数相关联的集群之间的集群边界并将合并的方差值分配给合并的集群来形成合并的集群,其中合并的方差值表示分配给与最小合并参数相关联的集群的方差值 合并参数。 重复该过程以形成多个合并的群集,并且根据多个合并的群集形成分段语音信号。