会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明授权
    • Voiced/unvoiced speech classifier
    • 有声/无声语音分类器
    • US06640208B1
    • 2003-10-28
    • US09659318
    • 2000-09-12
    • Yaxin ZhangJianming SongAnton Madievski
    • Yaxin ZhangJianming SongAnton Madievski
    • G10L1106
    • G10L25/93
    • A voiced/unvoiced speech classifier (30) includes a speech segmentor (34) which segments an input digitized speech waveform into frames of speech and a band-pass filter (36) which filters the frames of speech. A relative energy generator (38) generates a relative energy value for each filtered frame of speech and a decision parameter generator (52) including an autocorrelation calculator (54) and a pitch calculator (56) generates a decision parameter based on an autocorrelation function and a pitch frequency index for the filtered frames of speech. A normalized energy calculator (46) adjusts the threshold and then normalizes the relative energy. A comparator (60) provides a signal indicative of whether a frame of speech is voiced speech or unvoiced speech depending on a comparison of the decision parameter and the normalized relative energy value for each filtered frame of speech.
    • 有声/无声语音分类器(30)包括将输入的数字化语音波形分成语音帧的语音分割器(34)和对语音帧进行滤波的带通滤波器(36)。 相对能量发生器(38)为每个经滤波的语音帧产生相对能量值,并且包括自相关计算器(54)和音高计算器(56)的判定参数发生器(52)基于自相关函数产生决策参数,并且 用于滤波的语音帧的音调频率索引。 归一化能量计算器(46)调整阈值,然后使相对能量归一化。 比较器(60)根据决定参数与每个被滤波的语音帧的归一化相对能量值的比较,提供指示语音帧是语音语音还是无声语音的信号。
    • 5. 发明授权
    • Method for chinese point-of-interest search
    • 中国兴趣点搜索方法
    • US08521539B1
    • 2013-08-27
    • US13429877
    • 2012-03-26
    • Jianzhong TengYaxin Zhang
    • Jianzhong TengYaxin Zhang
    • G10L21/00
    • G10L15/32G01C21/3608G01C21/3679G10L15/30
    • Techniques disclosed herein include systems and methods of automated speech recognition (ASR) for voice destination entry (VDE) include open voice searching (natural language searching) of destinations. A first part uses a server-based automated speech recognizer. The second part is client-based automatic speech recognition (ASR) processing. Thus, techniques include a hybrid VDE solution that provides users with an accurate and flexible way to use speech recognition technologies. A server-based speech recognizer executes the open-search task, while a client-based recognizer refines the results from the server to deliver an optimized result. This system and method significantly improves recognition accuracy for dictation engine based POI search of Chinese Mandarin input and input from other languages. Moreover, the methods herein largely improve the user experience by allowing users to say a partial POI name, and abbreviation, or even say a POI name in a reversed word order.
    • 本文公开的技术包括用于语音目的地输入(VDE)的自动语音识别(ASR)的系统和方法包括目的地的开放语音搜索(自然语言搜索)。 第一部分使用基于服务器的自动语音识别器。 第二部分是基于客户端的自动语音识别(ASR)处理。 因此,技术包括混合VDE解决方案,为用户提供使用语音识别技术的准确灵活方式。 基于服务器的语音识别器执行打开搜索任务,而基于客户端的识别器从服务器中精炼结果以递送优化结果。 该系统和方法显着提高了基于听写引擎的POI搜索​​中国汉语输入和其他语言输入的识别精度。 此外,这里的方法通过允许用户说出部分POI名称,缩写或甚至以反转的字顺序说出POI名称来大大改善用户体验。
    • 6. 发明申请
    • Open vocabulary speech recognition
    • 开放词汇语音识别
    • US20050049870A1
    • 2005-03-03
    • US10925601
    • 2004-08-24
    • Yaxin ZhangXin HeXiao-Lin RenFang Sun
    • Yaxin ZhangXin HeXiao-Lin RenFang Sun
    • G10L15/00G10L15/10
    • G10L15/10
    • There is described a method 300 for open vocabulary speech recognition performed by an electronic device (100). The method (300) includes receiving an utterance waveform (320) and Processing the waveform (350) to provide feature vectors representing the waveform. Then a step of comparing (360) is effected, the comparing compares the feature vectors with concatenated isolated word acoustic models from a concatenated isolated word acoustic model list to select a suitable concatenated isolated word acoustic model. Then a providing a response step (370) provides a response depending on the suitable concatenated isolated word acoustic model. The response typically is a control signal for activating a function of the device (100).
    • 描述了由电子设备(100)执行的用于开放词汇语音识别的方法300。 方法(300)包括接收发声波形(320)和处理波形(350)以提供表示波形的特征向量。 然后,进行比较(360)的步骤,比较将特征向量与来自级联的隔离词声模型列表的级联隔离词声模型进行比较,以选择适当的级联隔离词语模型。 然后,提供响应步骤(370)根据适当的级联隔离词语音模型提供响应。 响应通常是用于激活设备(100)的功能的控制信号。
    • 7. 发明授权
    • Method for estimating a confidence measure for a speech recognition system
    • 用于估计语音识别系统的置信度量度的方法
    • US06735562B1
    • 2004-05-11
    • US09588163
    • 2000-06-05
    • Yaxin ZhangHo Chuen ChoiJian Ming Song
    • Yaxin ZhangHo Chuen ChoiJian Ming Song
    • G10L1514
    • G10L15/01
    • A method of estimating a confidence measure for a speech recognition system, involves comparing an input speech signal with a number of predetermined models of possible speech signals. Best scores indicating the degree of similarity between the input speech signal and each of the predetermined models are then used to determine a normalized variance, which is used as the Confidence Measure, in order to determine whether the input speech signal has been correctly recognized, the Confidence Measure is compared to a threshold value. The threshold value is weighted according to the Signal to Noise Ratio of the input speech signal and according to the number of predetermined models used.
    • 一种估计语音识别系统的置信度测量的方法,包括将输入语音信号与可能的语音信号的多个预定模型进行比较。 然后使用表示输入语音信号与每个预定模型之间的相似程度的最佳分数来确定用作置信度量的归一化方差,以便确定输入语音信号是否已被正确识别, 将置信度与阈值进行比较。 阈值根据输入语音信号的信噪比和根据所使用的预定模型的数量进行加权。
    • 8. 发明授权
    • Tone based speech recognition
    • 基于语音识别
    • US06553342B1
    • 2003-04-22
    • US09496868
    • 2000-02-02
    • Yaxin ZhangJianming SongAnton Madievski
    • Yaxin ZhangJianming SongAnton Madievski
    • G10L1502
    • G10L15/02G10L25/15
    • A method and apparatus for speech recognition involves classifying (38) a digitized speech segment according to whether the speech segment comprises voiced or unvoiced speech and utilizing that classification to generate tonal feature vectors (41) of the speech segment when the speech is voiced. The tonal feature vectors are then combined (42) with other non-tonal feature vectors (40) to provide speech feature vectors. The speech feature vectors are compared (35) with previously stored models of speech feature vectors (37) for different segments of speech to determine which previously stored model is a most likely match for the segment to be recognized.
    • 用于语音识别的方法和装置涉及根据语音段是否包括有声或无声语音来分类(38)数字化语音段,并且当语音被语音时利用该分类来生成语音段的音调特征向量(41)。 然后将音调特征向量与其他非音调特征向量(40)组合(42)以提供语音特征向量。 将语音特征向量与先前存储的用于不同语音段的语音特征向量(37)的模型进行比较(35),以确定先前存储的模型是否将被识别的段最可能匹配。