搜索资源列表
WAVE文件格式剖析
- WAVE文件作为多媒体中使用的声波文件格式之一,它是以RIFF格式为标准的。RIFF是英文Resource Interchange File Format的缩写,每个WAVE文件的头四个字节便是“RIFF”。WAVE文件由文件头和数据体两大部分组成。其中文件头又分为RIFF/WAV文件标识段和声音数据格式说明段两部分。WAVE文件各部分内容及格式见附表。-WAVE as a multimedia documents used in the acoustic format, it is based
2006114
- 录音部分是参照vckbase的录音api代码,结合了一小段socket(TCP)就可以用来单方说话录音了,程序分两部分一部分是(录音机+网络发送代码),一部分是(接收数据+播放波形音乐代码),由于程序只是为了自己用,很多地方没有注意错误的识别,结构也比较乱,本程序当初最头疼的就是控制损耗内存,结果用了双缓存来存储波形数据来交替的存储/清除. -part of the recording light vckbase recording api code combination of a small
fdpsola
- 语音合成程序!psalo频域基音同步叠加方法。它首先对原始语音信号进行短时频域变换,得到短时谱和短时谱包络。短时谱除以短时谱包络得到声源短时谱,对声源短时谱的实部和虚部分别进行线性插值,就可以达到改变语音信号基频的目的,然后再进行频域反变换,可得到变换后的短时语音信号。短时谱包络部分也可以独立改变,以达到改变音色的目的。-speech synthesis procedures! Psalo frequency domain pitch synchronous superposition meth
Meter
- (第一部分)混音函数演示程序, VC源码-多媒体-,由于网速太慢了,我这里只有分成几次上传了啊-(Part I) mixing function demo program, VC Source- Multimedia-, due to slow speed, I here only divided into several uploaded ah
MeterDlg
- (第一部分)混音函数演示程序, VC源码-多媒体-,由于网速太慢了,我这里只有分成几次上传了啊-(Part I) mixing function demo program, VC Source- Multimedia-, due to slow speed, I here only divided into several uploaded ah
Meter3
- (第三部分)混音函数演示程序, VC源码-多媒体-,由于网速太慢了,我这里只有分成几次上传了啊-(Part III) mixing function demo program, VC Source- Multimedia-, due to slow speed, I here only divided into several uploaded ah
resourcer
- (第四部分)混音函数演示程序, VC源码-多媒体-,由于网速太慢了,我这里只有分成几次上传了啊-(Part IV) mixing function demo program, VC Source- Multimedia-, due to slow speed, I here only divided into several uploaded ah
stdafxc
- (第五部分)混音函数演示程序, VC源码-多媒体-,由于网速太慢了,我这里只有分成几次上传了啊-(Part V) mixing function demo program, VC Source- Multimedia-, due to slow speed, I here only divided into several uploaded ah
newblms
- 分块BLMS 算法,首先将信号分成若干小快,然后进行LMS计算,解决了计算工程量大的问题-Block BLMS algorithm, first of all, the signal is divided into several small fast, and then proceed to LMS, the solution to calculate the engineering problem of a large quantity of
spectrelentropy
- 使用子带谱熵进行的端点检测,将谱熵分为几个子带,检测效果不错-The use of sub-band entropy of the endpoint detection, the spectral entropy is divided into several sub-band to detect the effect of good
AnalogVoiceSignal
- 观测实时模拟信号(语音)的频谱 用音频设备采集一段语音,将语音存为.wav格式。对wav文件作分段傅里叶变换分析。语音是分音节的,应把它分段分析,而且实际运用中的数字信号处理的FFT的点数是有限的,一般只能达到千点。用傅里叶反变换IFFT,从频域恢复信号。画出频谱图和语音波形图。 -Observing real-time analog signal (voice) of the spectrum collected with the audio devices section of
11
- 为提高语音端点检测系统在低信噪(0 dB 以下) 下 检测的准确率, 提出了一种基于谱熵的端点检测算法。将每 帧信号分为16 个子带, 选取频谱分布在250~ 3. 5 kHz 并且 能量不超过该帧总能量90 的子带, 计算经过语音增强后的 子带能量以及各子带信噪比, 根据各子带信噪比的不同调整 其在整个谱熵计算过程中的权重, 然后平滑谱熵, 以最终的 谱熵作为端点检测的依据-To improve endpoint detection system in the low
Pattern-Recognition
- 西奥多里蒂斯著,李晶皎译 本书系统阐述了模式识别的原理与方法,并在此基础上介绍了模式识别的应用。全书分为:基础部分和应用部分:基础部分主要包括统计模式识别、模糊模式识别、神经网络模式识别等内容;应用部分有车牌识别和语音识别。 -This paper discussed the principles and methods of pattern recognition, and based on this, the application of pattern recognition. Enc
PLAY
- 把语音芯片ISD1420录放音时间20秒分成20段,每段一秒,调用录音子程序,录入语音,建立语音库,语音录入结束后,根据段地址,调用放音子程序,还原原来录入语音信号。- The voice chip ISD1420 sound recording time of 20 seconds is divided into 20 segments, each second, a subroutine call recording, voice input, speech database estab
LD_Demo_Source
- 语音识别芯片LD3320的开发程序,里面有相关的PROJECT,源码分为三个部分,分别是主函数main,读写函数Reg_RW,芯片操作函数LDChip,还附有相关的头文件。-The development of voice recognition chip LD3320 program, which has the relevant PROJECT, source code is divided into three parts, namely, the main function main,
HMM
- 语音识别与合成的基本程序,分为特征提取、模型建立和语音识别,且包括特征补偿-Basic procedures for speech recognition and synthesis, divided into feature extraction, modeling and speech recognition, including feature compensation
anglecos
- 利用夹角余弦距离进行样本数据分类。实现步骤主要分为以下两部分:a、待测样品X与训练集里每个样品Xi的距离采用夹角余弦距离公式计算。b、循环计算待测样品和训练集中各已知样品之间的距离,找出距离待测样品最近的已知样品,该已知样品的类别就是待测样品的类别。-Using the sample data classification Angle cosine distance.Implementation steps are divided into the following two parts: a,
19627016SpeechLPC
- Speech interface to computer is the next big step that the technology needs to take for general users. Automatic speech recognition (ASR) will play an important role in taking technology to the people. There are numerous applications of speech re
chenxu
- (1)录制一段语音信号,完成对信号的采样,画出信号的时域波形和频谱图,确定信号的频谱范围; (2)给信号叠加噪声(噪声类型分为如下几种:a白噪声;b单频噪色(正弦干扰);c多频噪声(多正弦干扰);d其它干扰。),画出受噪声干扰的信号时域波形和频谱图; (3)采用窗函数法设计FIR低通滤波器,画出滤波器的频响特性图; (4)用所设计的滤波器对受噪声影响的信号进行滤波,画出滤波后语音信号的时域波形图和频谱图; (5)对滤波前后的信号进行对比,分析信号的变化;回放语音信号,并与原始语音信号对比
Speech Encoding - Frequency Analysis MATLAB
- The speech signal for the particular isolated word can be viewed as the one generated using the sequential generating probabilistic model known as hidden Markov model (HMM). Consider there are n states in the HMM. The particular isolated speech sig
