专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20090193959A1 AUDIO RECORDING ANALYSIS AND RATING 审中-公开
标题翻译：音频录音分析和评分
公开(公告)号：US20090193959A1
公开(公告)日：2009-08-06
申请号：US12026977
申请日：2008-02-06
申请人： Jordi Janer Mestres , Jordi Bonada Sanjaume , Maarten De Boer , Alex Loscos Mira
发明人： Jordi Janer Mestres , Jordi Bonada Sanjaume , Maarten De Boer , Alex Loscos Mira
IPC分类号： G10H7/00
CPC分类号： G10H1/361 , G10H1/0008 , G10H2210/066 , G10H2210/091 , G10L25/48
摘要： An audio recording is processed and evaluated. A sequence of identified notes corresponding to the audio recording is determined by iteratively identifying potential notes within the audio recording. A rating for the audio recording is determined using a tuning rating and an expression rating. The audio recording includes a recording of at least a portion of a musical composition.
摘要翻译：对录音进行处理和评估。通过迭代地识别音频记录中的潜在音符来确定对应于音频记录的识别音符的序列。使用调谐评级和表情评级来确定音频记录的评级。音频记录包括至少一部分音乐作品的记录。

2. 发明申请

US20060004569A1 Voice processing apparatus and program 有权
标题翻译：语音处理装置和程序
公开(公告)号：US20060004569A1
公开(公告)日：2006-01-05
申请号：US11165695
申请日：2005-06-24
申请人： Yasuo Yoshioka , Alex Loscos
发明人： Yasuo Yoshioka , Alex Loscos
IPC分类号： G10L19/14
CPC分类号： G10L13/033 , G10L2021/0135
摘要： Envelope identification section generates input envelope data (DEVin) indicative of a spectral envelope (EVin) of an input voice. Template acquisition section reads out, from a storage section, converting spectrum data (DSPt) indicative of a frequency spectrum (SPt) of a converting voice. On the basis of the input envelope data (DEVin) and the converting spectrum data (DSPt), a data generation section specifies a frequency spectrum (SPnew) corresponding in shape to the frequency spectrum (SPt) of the converting voice and having a substantially same spectral envelope as the spectral envelope (EVin) of the input voice, and the data generation section generates new spectrum data (DSPnew) indicative of the frequency spectrum (SPnew). Reverse FFT section and output processing section generates an output voice signal (Snew) on the basis of the new spectrum data (DSPnew).
摘要翻译：信封识别部分生成表示输入声音的频谱包络（EVin）的输入包络数据（DEVin）。模板获取部从存储部读出表示转换语音的频谱（SPt）的频谱数据（DSPt）。基于输入包络数据（DEVin）和转换频谱数据（DSPt），数据生成部分指定与转换声音的频谱（SPt）形状对应的频谱（SPnew），并具有基本相同频谱包络作为输入语音的频谱包络（EVin），并且数据产生部分生成指示频谱（SPnew）的新频谱数据（DSPnew）。反向FFT部分和输出处理部分基于新的频谱数据（DSPnew）生成输出语音信号（Snew）。

3. 发明申请

WO2009098181A3 AUDIO RECORDING ANALYSIS AND RATING 审中-公开
标题翻译：音频录音分析和评分
公开(公告)号：WO2009098181A3
公开(公告)日：2009-10-15
申请号：PCT/EP2009051148
申请日：2009-02-02
申请人： UNI POMPEU FABRA , BMAT LICENSING S L , MESTRES JORDI JANER , SANJAUME JORDI BONADA , DE BOER MAARTEN , MIRA ALEX LOSCOS
发明人： MESTRES JORDI JANER , SANJAUME JORDI BONADA , DE BOER MAARTEN , MIRA ALEX LOSCOS
IPC分类号： G10H1/36 , G10H1/00
CPC分类号： G10H1/361 , G10H1/0008 , G10H2210/066 , G10H2210/091 , G10L25/48
摘要： An audio recording is processed and evaluated. A sequence of identified notes corresponding to the audio recording is determined by iteratively identifying potential notes within the audio recording. A rating for the audio recording is determined using a tuning rating and an expression rating. The audio recording includes a recording of at least a portion of a musical composition.
摘要翻译：对录音进行处理和评估。通过迭代地识别音频记录中的潜在音符来确定对应于音频记录的识别音符的序列。使用调谐评级和表情评级来确定音频记录的评级。音频记录包括至少一部分音乐作品的记录。

4. 发明申请

WO2009098181A2 AUDIO RECORDING ANALYSIS AND RATING 审中-公开
标题翻译：音频记录分析和评分
公开(公告)号：WO2009098181A2
公开(公告)日：2009-08-13
申请号：PCT/EP2009/051148
申请日：2009-02-02
申请人： UNIVERSITAT POMPEU FABRA , BMAT LICENSING, S.L. , MESTRES, Jordi Janer , SANJAUME, Jordi Bonada , DE BOER, Maarten , MIRA, Alex Loscos
发明人： MESTRES, Jordi Janer , SANJAUME, Jordi Bonada , DE BOER, Maarten , MIRA, Alex Loscos
IPC分类号： G10H1/36 , G10H1/00
CPC分类号： G10H1/361 , G10H1/0008 , G10H2210/066 , G10H2210/091 , G10L25/48
摘要： An audio recording is processed and evaluated. A sequence of identified notes corresponding to the audio recording is determined by iteratively identifying potential notes within the audio recording. A rating for the audio recording is determined using a tuning rating and an expression rating. The audio recording includes a recording of at least a portion of a musical composition.
摘要翻译：
处理和评估录音。通过迭代识别音频记录内的潜在音符来确定对应于音频记录的识别音符的序列。音频记录的评分是使用调谐评分和表情评分确定的。音频记录包括至少一部分音乐作品的记录。

5. 发明授权

US08073688B2 Voice processing apparatus and program 有权
标题翻译：语音处理装置和程序
公开(公告)号：US08073688B2
公开(公告)日：2011-12-06
申请号：US11165695
申请日：2005-06-24
申请人： Yasuo Yoshioka , Alex Loscos
发明人： Yasuo Yoshioka , Alex Loscos
IPC分类号： G10L19/14
CPC分类号： G10L13/033 , G10L2021/0135
摘要： Envelope identification section generates input envelope data (DEVin) indicative of a spectral envelope (EVin) of an input voice. Template acquisition section reads out, from a storage section, converting spectrum data (DSPt) indicative of a frequency spectrum (SPt) of a converting voice. On the basis of the input envelope data (DEVin) and the converting spectrum data (DSPt), a data generation section specifies a frequency spectrum (SPnew) corresponding in shape to the frequency spectrum (SPt) of the converting voice and having a substantially same spectral envelope as the spectral envelope (EVin) of the input voice, and the data generation section generates new spectrum data (DSPnew) indicative of the frequency spectrum (SPnew). Reverse FFT section and output processing section generates an output voice signal (Snew) on the basis of the new spectrum data (DSPnew).
摘要翻译：信封识别部分生成表示输入声音的频谱包络（EVin）的输入包络数据（DEVin）。模板获取部从存储部读出表示转换语音的频谱（SPt）的频谱数据（DSPt）。基于输入包络数据（DEVin）和转换频谱数据（DSPt），数据生成部分指定与转换声音的频谱（SPt）形状对应的频谱（SPnew），并具有基本相同频谱包络作为输入语音的频谱包络（EVin），并且数据产生部分生成指示频谱（SPnew）的新频谱数据（DSPnew）。反向FFT部分和输出处理部分基于新的频谱数据（DSPnew）生成输出语音信号（Snew）。

6. 发明授权

US08013231B2 Sound signal expression mode determining apparatus method and program 有权
标题翻译：声音信号表达模式确定装置的方法和程序
公开(公告)号：US08013231B2
公开(公告)日：2011-09-06
申请号：US11439818
申请日：2006-05-24
申请人： Takuya Fujishima , Alex Loscos , Jordi Bonada , Oscar Mayor
发明人： Takuya Fujishima , Alex Loscos , Jordi Bonada , Oscar Mayor
IPC分类号： G10H1/02
CPC分类号： G10H1/361 , G10H2210/061 , G10H2210/091 , G10H2250/005 , G10H2250/015 , G10H2250/235 , G10L15/142
摘要： A sound signal processing apparatus which is capable of correctly detecting expression modes and expression transitions of a song or performance from an input sound signal. A sound signal produced by performance or singing of musical tones is input and divided into frames of predetermined time periods. Characteristic parameters of the input sound signal are detected on a frame-by-frame basis. An expression determining process is carried out in which a plurality of expression modes of a performance or song are modeled as respective states, the probability that a section including a frame or a plurality of continuous frames lies in a specific state is calculated with respect to a predetermined observed section based on the characteristic parameters, and the optimum route of state transition in the predetermined observed section is determined based on the calculated probabilities so as to determine expression modes of the sound signal and lengths thereof.
摘要翻译：一种声音信号处理装置，其能够从输入声音信号正确地检测歌曲或演奏的表情模式和表情转换。通过演奏或唱歌产生的声音信号被输入并分成预定时间段的帧。在逐帧的基础上检测输入声音信号的特征参数。执行表达确定处理，其中表演或歌曲的多个表达模式被建模为各自的状态，关于一个或多个关于一个或多个连续帧的部分包括帧或多个连续帧的部分位于特定状态的概率被计算基于特征参数的预定观测部分，并且基于所计算的概率来确定预定观测部分中的最佳状态转换路线，以便确定声音信号的表达模式及其长度。

7. 发明申请

US20050049875A1 Voice converter for assimilation by frame synthesis with temporal alignment 失效
标题翻译：语音转换器通过帧合成与时间对准同化
公开(公告)号：US20050049875A1
公开(公告)日：2005-03-03
申请号：US10951328
申请日：2004-09-27
申请人： Takahiro Kawashima , Yasuo Yoshioka , Pedro Cano , Alex Loscos , Xavier Serra , Mark Schiementz , Jordi Bonada
发明人： Takahiro Kawashima , Yasuo Yoshioka , Pedro Cano , Alex Loscos , Xavier Serra , Mark Schiementz , Jordi Bonada
IPC分类号： G10L13/02 , G10L21/00 , G10L13/00
CPC分类号： G10L13/033 , G10L2021/0135
摘要： A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. In the apparatus, a storage section provisionally stores source data, which is associated to and extracted from the target voice. An analyzing section analyzes the input voice to extract therefrom a series of input data frames representing the input voice. A producing section produces a series of target data frames representing the target voice based on the source data, while aligning the target data frames with the input data frames to secure synchronization between the target data frames and the input data frames. A synthesizing section synthesizes the output voice according to the target data frames and the input data frames. In the recognizing feature analysis, a characteristic analyzer extracts from the input voice a characteristic vector. A memory memorizes target behavior data representing a behavior of the target voice. An alignment processor determines a temporal relation between the input data frames and the target data frames according to the characteristic vector and the target behavior data so as to output alignment data. A target decoder produces the target data frames according to the alignment data, the input data frames and the source data containing phoneme of the target voice.
摘要翻译：构成语音转换装置，用于根据目标语音将输入语音转换为输出语音。在装置中，存储部临时存储与目标语音相关联并从其中提取的源数据。分析部分分析输入声音以从中提取代表输入声音的一系列输入数据帧。产生部分基于源数据产生一系列表示目标语音的目标数据帧，同时使目标数据帧与输入数据帧对齐，以确保目标数据帧与输入数据帧之间的同步。合成部根据目标数据帧和输入数据帧合成输出声音。在识别特征分析中，特征分析器从输入语音中提取特征向量。存储器存储表示目标语音行为的目标行为数据。对准处理器根据特征向量和目标行为数据确定输入数据帧和目标数据帧之间的时间关系，以输出对准数据。目标解码器根据对准数据，输入数据帧和包含目标声音的音素的源数据产生目标数据帧。

8. 发明授权

US07464034B2 Voice converter for assimilation by frame synthesis with temporal alignment 失效
标题翻译：语音转换器通过帧合成与时间对准同化
公开(公告)号：US07464034B2
公开(公告)日：2008-12-09
申请号：US10951328
申请日：2004-09-27
申请人： Takahiro Kawashima , Yasuo Yoshioka , Pedro Cano , Alex Loscos , Xavier Serra , Mark Schiementz , Jordi Bonada
发明人： Takahiro Kawashima , Yasuo Yoshioka , Pedro Cano , Alex Loscos , Xavier Serra , Mark Schiementz , Jordi Bonada
IPC分类号： G10L13/06
CPC分类号： G10L13/033 , G10L2021/0135
摘要： A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. The apparatus includes a storage section, an analyzing section including a characteristic analyzer, a producing section, a synthesizing section, a memory, an alignment processor, and target decoder.
摘要翻译：构成语音转换装置，用于根据目标语音将输入语音转换为输出语音。该装置包括存储部分，分析部分，包括特征分析器，产生部分，合成部分，存储器，对准处理器和目标解码器。

9. 发明授权

US06992245B2 Singing voice synthesizing method 有权
公开(公告)号：US06992245B2
公开(公告)日：2006-01-31
申请号：US10375420
申请日：2003-02-27
申请人： Hideki Kenmochi , Alex Loscos , Jordi Bonada
发明人： Hideki Kenmochi , Alex Loscos , Jordi Bonada
IPC分类号： G10H1/06 , G10H7/00
CPC分类号： G10H7/002 , G10H2240/056 , G10H2240/311 , G10H2250/235 , G10H2250/455 , G10L13/02
摘要： A frequency spectrum is detected by analyzing a frequency of a voice waveform corresponding to a voice synthesis unit formed of a phoneme or a phonemic chain. Local peaks are detected on the frequency spectrum, and spectrum distribution regions including the local peaks are designated. For each spectrum distribution region, amplitude spectrum data representing an amplitude spectrum distribution depending on a frequency axis and phase spectrum data representing a phase spectrum distribution depending on the frequency axis are generated. The amplitude spectrum data is adjusted to move the amplitude spectrum distribution represented by the amplitude spectrum data along the frequency axis based on an input note pitch, and the phase spectrum data is adjusted corresponding to the adjustment. Spectrum intensities are adjusted to be along with a spectrum envelope corresponding to a desired tone color. The adjusted amplitude and phase spectrum data are converted into a synthesized voice signal.

10. 发明申请

US20050288921A1 Sound effect applying apparatus and sound effect applying program 有权
标题翻译：声效应用装置和声效应用程序
公开(公告)号：US20050288921A1
公开(公告)日：2005-12-29
申请号：US11159032
申请日：2005-06-22
申请人： Yasuo Yoshioka , Alex Loscos
发明人： Yasuo Yoshioka , Alex Loscos
IPC分类号： G10H1/00 , G10H1/10 , G10L21/00
CPC分类号： G10H1/10 , G10H1/0091 , G10H2210/155 , G10H2250/235 , G10L21/003
摘要： In a sound effect applying apparatus, an input part frequency-analyzes an input signal of sound or voice for detecting a plurality of local peaks of harmonics contained in the input signal. A subharmonics provision part adds a spectrum component of subharmonics between the detected local peaks so as to provide the input signal with a sound effect. An output part converts the input signal of a frequency domain containing the added spectrum component into an output signal of a time domain for generating the sound or voice provided with the sound effect.
摘要翻译：在声音施加装置中，输入部分对用于检测包含在输入信号中的多个谐波局部峰值的声音或声音的输入信号进行频率分析。次谐波提供部分在检测到的局部峰值之间加上次谐波的频谱分量，以便为输入信号提供声音效果。输出部分将包含附加的频谱分量的频域的输入信号转换成时域的输出信号，以产生具有声音效果的声音或声音。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式